snowball: Bindings to the Snowball library.
The Snowball library is used to compute the stems of words in natural languages.
Compared to the older stemmer package, this one:
Correctly handles unicode without relying on the system locale
Takes greater care to avoid memory leaks and to be thread safe
Uses Text rather than String
Gets rid of the need for
stemWords
by using rewrite rules to makemap stem
efficientIncludes a more recent release of Snowball
Attempts to comply with the Snowball licensing terms
Supports
SafeHaskell
when available and is otherwiseHaskell2010
Comes with a test suite
However, although this code is written from scratch, it is heavily modeled after the code of the "stemmer" package.
Downloads
- snowball-1.0.0.1.tar.gz [browse] (Cabal source package)
- Package description (as included in the package)
Maintainer's Corner
For package maintainers and hackage trustees
Candidates
- No Candidates
Versions [RSS] | 0.1.0, 0.1.0.1, 0.1.0.2, 1.0.0, 1.0.0.1 |
---|---|
Dependencies | base (>=4 && <5), bytestring, text, text-icu [details] |
License | BSD-3-Clause |
Copyright | (c) 2012 Dag Odenhall, (c) 2008 Tupil, (c) 2002 Richard Boulton |
Author | |
Maintainer | dag.odenhall@gmail.com |
Category | Natural Language Processing, Text |
Home page | http://hub.darcs.net/dag/snowball |
Bug tracker | http://hub.darcs.net/dag/snowball/issues |
Source repo | head: darcs get http://hub.darcs.net/dag/snowball |
Uploaded | by DagOdenhall at 2012-12-22T18:52:19Z |
Distributions | NixOS:1.0.0.1 |
Reverse Dependencies | 1 direct, 0 indirect [details] |
Downloads | 5758 total (14 in the last 30 days) |
Rating | (no votes yet) [estimated by Bayesian average] |
Your Rating | |
Status | Docs uploaded by user Build status unknown [no reports yet] |