Safe Haskell | Safe |
---|---|
Language | Haskell2010 |
- stem :: Text -> Text
- stemCaseInsensitive :: Text -> Text
- data Segmentation = Segmentation {}
- segment' :: Text -> Segmentation
- segment :: Text -> (Text, Text)
- segment'CaseInsensitive :: Text -> Segmentation
- segmentCaseInsensitive :: Text -> (Text, Text)
Documentation
Guess the word stem. This module uses the CISTEM algorithm, published by L. Weißweiler and A. Fraser in "Developing a Stemmer for German Based on a Comparative Analysis of Publicly Available Stemmers" (2017).
stemCaseInsensitive :: Text -> Text Source #
A case insensitive variant. Use only if the text may be incorrectly upper case.
data Segmentation Source #
segment' :: Text -> Segmentation Source #
Split the word into a prefix, the stem and a suffix. In contrast to the stem
function umlauts remain unchanged.
segment :: Text -> (Text, Text) Source #
Split the word into stem and suffix. This is supposed to be compatible to the segment
function from the reference implementation.
segment'CaseInsensitive :: Text -> Segmentation Source #
A case insensitive variant. Use only if the text may be incorrectly upper case.