alpino-tools-0.2.0: Alpino data manipulation tools

Safe Haskell: Safe-Inferred

Data.Alpino.Model.Conduit

Documentation

bestScore :: Monad m => Conduit [TrainingInstance] m Double

Retrieve the best score from a list of training instances.
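As a rough illustration of the per-list computation, the following list-based sketch assumes that "best" means the highest score. The `Instance` type and `bestScoreOf` are hypothetical stand-ins, not the library's `TrainingInstance` or its conduit.

```haskell
-- Hypothetical stand-in for the library's TrainingInstance; only the
-- fields needed for this sketch are modelled.
data Instance = Instance
  { instKey   :: String
  , instScore :: Double
  } deriving (Show, Eq)

-- Highest score within one list of instances.
-- Assumption: "best" means maximum; the list must be non-empty.
bestScoreOf :: [Instance] -> Double
bestScoreOf = maximum . map instScore

example :: Double
example = bestScoreOf [Instance "1" 0.4, Instance "1" 0.9, Instance "1" 0.7]
```

The real conduit would emit one such value for every list that flows through it.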

concat :: Monad m => Conduit [a] m a

filterFeatures :: Monad m => (Bool -> Bool) -> Set ByteString -> Conduit TrainingInstance m TrainingInstance

Filter features by exact name. A modifier function can be applied; for instance, the not function would exclude the specified features.
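The role of the modifier can be sketched on plain lists. This is a simplified model under stated assumptions: features are represented as hypothetical `(name, value)` pairs with `String` names instead of `ByteString`, and `filterByName` is an illustrative helper, not part of the library.

```haskell
import qualified Data.Set as Set

-- Hypothetical feature representation: a name paired with a value.
type Feature = (String, Double)

-- Keep a feature when the modifier, applied to the result of the
-- set-membership test on the feature name, yields True.
filterByName :: (Bool -> Bool) -> Set.Set String -> [Feature] -> [Feature]
filterByName modifier names =
  filter (\(name, _) -> modifier (name `Set.member` names))

features :: [Feature]
features = [("f1(a)", 1.0), ("f2(b)", 2.0), ("f1(c)", 3.0)]

kept, dropped :: [Feature]
kept    = filterByName id  (Set.fromList ["f2(b)"]) features  -- only the listed name
dropped = filterByName not (Set.fromList ["f2(b)"]) features  -- everything else
```

In this model, passing `id` keeps only the listed features and passing `not` removes them.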

filterFeaturesFunctor :: Monad m => (Bool -> Bool) -> Set ByteString -> Conduit TrainingInstance m TrainingInstance

Filter features by their functor. A modifier function can be applied; for instance, the not function would exclude the specified features.
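A list-based sketch of functor filtering follows. It assumes that the functor of a feature is the part of its name before the first opening parenthesis (so `f1(a)` has functor `f1`); that definition, the `Feature` type, and the helper names are assumptions made for illustration and are not confirmed by this page.

```haskell
import qualified Data.Set as Set

-- Hypothetical feature representation: a name paired with a value.
type Feature = (String, Double)

-- Assumed definition: the functor is the name up to the first '('.
functorOf :: String -> String
functorOf = takeWhile (/= '(')

-- Keep a feature when the modifier, applied to the membership test on
-- the feature's functor, yields True.
filterByFunctor :: (Bool -> Bool) -> Set.Set String -> [Feature] -> [Feature]
filterByFunctor modifier functors =
  filter (\(name, _) -> modifier (functorOf name `Set.member` functors))

withoutF1 :: [Feature]
withoutF1 = filterByFunctor not (Set.fromList ["f1"])
              [("f1(a)", 1.0), ("f2(b)", 2.0), ("f1(c)", 3.0)]
```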

groupByKey :: Monad m => Conduit TrainingInstance m [TrainingInstance]

Group training instances by key.
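The grouping can be pictured with `Data.List.groupBy`. This sketch assumes that only adjacent instances with equal keys end up in the same group, which is the natural behaviour for a streaming component but is not stated on this page; `Instance` and `groupAdjacentByKey` are hypothetical names.

```haskell
import Data.Function (on)
import Data.List (groupBy)

-- Hypothetical stand-in for the library's TrainingInstance.
data Instance = Instance
  { instKey   :: String
  , instScore :: Double
  } deriving (Show, Eq)

-- Group runs of adjacent instances that share a key.
groupAdjacentByKey :: [Instance] -> [[Instance]]
groupAdjacentByKey = groupBy ((==) `on` instKey)

grouped :: [[Double]]
grouped = map (map instScore) $
  groupAdjacentByKey [Instance "1" 0.4, Instance "1" 0.9, Instance "2" 0.5]
```

The resulting lists have the shape that bestScore and the score-rewriting conduits take as input.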

scoreToBinary :: Monad m => Conduit [TrainingInstance] m [TrainingInstance]

Conduit recalculating scores as binary scores (1.0 for the best, 0.0 for the rest).
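The rescoring can be sketched on a bare list of scores. The helper `toBinary` is hypothetical, and it assumes that every instance tied for the highest score counts as "best".

```haskell
-- Map the highest score(s) to 1.0 and all other scores to 0.0.
-- Assumption: ties for the maximum all receive 1.0.
toBinary :: [Double] -> [Double]
toBinary scores = map (\s -> if s == best then 1.0 else 0.0) scores
  where
    best = maximum scores  -- only evaluated when the list is non-empty

binaryExample :: [Double]
binaryExample = toBinary [0.4, 0.9, 0.9]
```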

scoreToBinaryNorm :: Monad m => Conduit [TrainingInstance] m [TrainingInstance]

Conduit recalculating scores, dividing a score of 1.0 uniformly over instances with the highest quality score.
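A small sketch on a bare list of scores shows the uniform division: if n instances share the highest score, each receives 1/n and every other instance receives 0.0. The helper `toBinaryNorm` is a hypothetical name used only for illustration.

```haskell
-- Divide a total score of 1.0 evenly over the instances that have the
-- highest score; all other instances get 0.0.
toBinaryNorm :: [Double] -> [Double]
toBinaryNorm scores = map (\s -> if s == best then share else 0.0) scores
  where
    best  = maximum scores
    share = 1.0 / fromIntegral (length (filter (== best) scores))

binaryNormExample :: [Double]
binaryNormExample = toBinaryNorm [0.4, 0.9, 0.9]
```

The rewritten scores of one list sum to 1.0, whereas plain binary scores sum to the number of best instances.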

scoreToNorm :: Monad m => Conduit [TrainingInstance] m [TrainingInstance]

Conduit that normalizes instance scores over all instances in the list.
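One plausible reading of this normalization, sketched on a bare list of scores, is to divide each score by the sum of all scores so that the results sum to 1.0. That interpretation, the zero-sum guard, and the name `toNorm` are assumptions; the page does not spell out the formula.

```haskell
-- Divide every score by the total, so the normalized scores sum to 1.0.
-- Assumption: a list whose scores sum to zero is returned unchanged.
toNorm :: [Double] -> [Double]
toNorm scores
  | total == 0 = scores
  | otherwise  = map (/ total) scores
  where
    total = sum scores

normExample :: [Double]
normExample = toNorm [1.0, 1.0, 2.0]
```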