chatter-0.4.0.0: A library of simple NLP algorithms.

Safe HaskellNone
LanguageHaskell2010

NLP.Corpora.Conll

Description

Data types representing the POS tags and Chunk tags derived from the Conll2000 training corpus.

Synopsis

Documentation

data Chunk Source

Constructors

ADJP 
ADVP 
CONJP 
INTJ 
LST 
NP

Noun Phrase.

PP

Prepositional Phrase.

PRT 
SBAR 
UCP 
VP

Verb Phrase.

O

"out"; not a chunk.

tagTxtPatterns :: [(Text, Text)] Source

Order matters here: The patterns are replaced in reverse order when generating tags, and in top-to-bottom when generating tags.

data Tag Source

Constructors

START

START tag, used in training.

END

END tag, used in training.

Hash

#

Dollar

$

CloseDQuote

''

OpenDQuote

``

Op_Paren

(

Cl_Paren

)

Comma

,

Term

. Sentence Terminator

Colon

:

CC 
CD 
DT 
EX 
FW 
IN 
JJ 
JJR 
JJS 
MD 
NN 
NNP 
NNPS 
NNS 
PDT 
POS 
PRP 
PRPdollar 
RB 
RBR 
RBS 
RP 
SYM 
TO 
UH 
VB 
VBD 
VBG 
VBN 
VBP 
VBZ 
WDT 
WP 
WPdollar 
WRB 
Unk