devanagari-transliterations-0.1.0.0: Conversion between Devanagari Unicode, Harvard-Kyoto, IAST and ISO15919.
Safe HaskellSafe-Inferred
LanguageGHC2021

Data.Char.Devanagari.Tokenizer

Synopsis

Documentation

selectTokenizerByContent :: Text -> Tokenizer Source #

select the correct tokenizer based on the content of the input string.

tokenize :: Tokenizer Source #

tokenize a string of Text into a sequence of DevanagariTokens. The actual tokenizer is selected based on the content of the input string. This tokenizer is then applied to the input string.

fromDevanagari :: Tokenizer Source #

a tokenizer function that parses a Text containing Devanagari script into a Sequence of DevanagariToken instances.

fromIso :: Tokenizer Source #

a tokenizer function that parses a Text containing ISO15919 encoded Devanagari script into a Sequence of DevanagariToken instances.

fromHarvard :: Tokenizer Source #

a tokenizer function that parses a Text containing Harvard-Kyoto encoded Devanagari script into a Sequence of DevanagariToken instances.

fromIast :: Tokenizer Source #

a tokenizer function that parses a Text containing IAST encoded Devanagari script into a Sequence of DevanagariToken instances.

type Tokenizer = Text -> Seq DevanagariToken Source #

A Tokenizer is a function that takes a Text as input and produces a Sequence of DevanagariToken instances as output.