WordAlignment-0.1.0.0: Bigram word pair alignments.

Safe HaskellNone
LanguageHaskell2010

Linguistics.Word

Description

A single word in a language. Uses a MultiChar encoding for the actual characters. MultiChar encodings need to be decoded for printing on screen.

Synopsis

Documentation

data Word Source

A single word we want to align to another word. It comes with an id (here 9), the language name (which we intern), a word class (interned as well), the length of the word (so that we don't have to check wordWord length and check for word delims), and finally the word itself. Indivitual MultiChar characters are interned to reduce memory cost (and we might want to do stuff with the Id's).

9 Albanian_Tosk 1.214 6 ' b a lʸ t ə

Constructors

Word 

Fields

wordID :: !Int
 
wordClass :: !BTI
 
wordLang :: !BTI
 
wordLength :: !Int
 
wordWord :: !(Vector BTI)