langlab.core.ngrams

Module contains n-gram generation function.

gen-ngrams

(gen-ngrams tokens)(gen-ngrams n tokens)(gen-ngrams n m tokens)

Function generates n-grams from a given ‘tokens’ sequence.

The following invocations are possible:

  • (gen-n-grams n tokens) - generates all n-grams
  • (gen-n-grams n m tokens) - generates all n-grams,(n+1)-grams,…,(m)-grams
  • (gen-ngrams tokens) - generates 1 .. (count tokens) n-grams