Langlab 1.3.0
langlab library
langlab.algs.readability
Module contains functions for computing readability indices.
Public variables and functions:
langlab.algs.tagging
Module contains functionality related to tagging with dictionary tags. The implemented functionality can be divided into three main areas:
langlab.core.characters
Module contains string utilities operating on characters.
Public variables and functions:
- contains-digits-only?
- contains-digits?
- contains-letters-only?
- contains-letters-or-digits-only?
- contains-letters-or-digits?
- contains-letters?
- contains-non-bmp?
- contains-punct-only?
- contains-punct?
- contains-whitespace-only?
- contains-whitespace?
- count-latin-vowel-groups
- count-latin-vowel-groups-without-final
- en-count-chars-bi
- en-count-chars-icu-bi
- lg-count-chars-bi
- lg-count-chars-icu-bi
- remove-bmp
- remove-diacritics
- remove-non-bmp
langlab.core.comparators
Module contains various tools for fuzzy strings comparison.
Public variables and functions:
langlab.core.detectors
Module contains language and encoding detection utilities.
Public variables and functions:
langlab.core.multi-stemmers
Module contains stemming algorithms returning multiple results.
Public variables and functions:
langlab.core.parsers
Module contains tools for parsing text into sentences and words.
Public variables and functions:
langlab.core.stemmers
Module contains stemming algorithms.
Public variables and functions:
- ca-stem-snowball
- da-stem-snowball
- de-2-stem-snowball
- de-stem-snowball
- en-lovins-stem-snowball
- en-morpha-stemmer
- en-porter-stem-snowball
- en-stem-snowball
- es-stem-snowball
- eu-stem-snowball
- fi-stem-snowball
- fr-stem-snowball
- ga-stem-snowball
- hu-stem-snowball
- hy-stem-snowball
- it-stem-snowball
- nl-kp-stem-snowball
- nl-stem-snowball
- no-stem-snowball
- pl-stem-light-clef
- pl-stem-stempel
- pt-stem-snowball
- ro-stem-snowball
- ru-stem-snowball
- sv-stem-snowball
- tr-stem-snowball
langlab.core.stopwords
Module contains predefined sets of stopwords/articles for various languages and functions to operate on them.
Public variables and functions:
- ar-get-stopwords-lucene
- bg-get-stopwords-clef
- bg-get-stopwords-lucene
- br-get-stopwords-lucene
- ca-get-stopwords-lucene
- ca-get-stopwords-ranks
- cz-get-stopwords-clef
- cz-get-stopwords-lucene
- cz-get-stopwords-ranks
- da-get-stopwords-lucene
- da-get-stopwords-ranks
- de-get-articles
- de-get-stopwords-clef
- de-get-stopwords-lucene
- de-get-stopwords-ranks
- el-get-stopwords-lucene
- en-get-articles
- en-get-stopwords-clef
- en-get-stopwords-lucene
- en-get-stopwords-ranks-long
- en-get-stopwords-ranks-short
- en-get-stopwords-ranks-vlong
- es-get-articles
- es-get-stopwords-long-clef
- es-get-stopwords-lucene
- es-get-stopwords-ranks
- es-get-stopwords-short-clef
- eu-get-stopwords-lucene
- fa-get-stopwords-lucene
- fi-get-stopwords-lucene
- fi-get-stopwords-ranks
- fr-get-articles
- fr-get-stopwords-clef
- fr-get-stopwords-lucene
- fr-get-stopwords-ranks
- ga-get-stopwords-lucene
- gl-get-stopwords-lucene
- hi-get-stopwords-lucene
- hu-get-stopwords-clef
- hu-get-stopwords-lucene
- hu-get-stopwords-ranks
- hy-get-stopwords-lucene
- id-get-stopwords-lucene
- it-get-articles
- it-get-stopwords-clef
- it-get-stopwords-lucene
- it-get-stopwords-ranks
- lv-get-stopwords-lucene
- nl-get-articles
- nl-get-stopwords-ranks
- no-get-stopwords-lucene
- no-get-stopwords-ranks
- pl-get-stopwords-long-clef
- pl-get-stopwords-lucene
- pl-get-stopwords-ranks
- pl-get-stopwords-short-clef
- pl-get-stopwords-wiki-long
- pl-get-stopwords-wiki-short
- pt-get-articles
- pt-get-stopwords-long-clef
- pt-get-stopwords-lucene
- pt-get-stopwords-ranks
- pt-get-stopwords-short-clef
- ro-get-stopwords-clef
- ro-get-stopwords-lucene
- ru-get-stopwords-clef
- ru-get-stopwords-lucene
- ru-get-stopwords-ranks
- sv-get-stopwords-clef
- sv-get-stopwords-lucene
- sv-get-stopwords-ranks
- th-get-stopwords-lucene
- tr-get-stopwords-lucene
- tr-get-stopwords-ranks
langlab.core.transformers
Module contains utilities for transforming tokens.