language-spelling: Various tools to detect/correct mistakes in words

[ library, natural-language-processing, public-domain ]

A Haskell library providing a set of tools for correcting spelling mistakes, homophone confusions, and OCR errors.

Sample session:

ghci> :m + Language.Distance.Search.BK
ghci> distance "foo" "bar" :: Distance DamerauLevenshtein
3
ghci> let bk = foldr insert empty ["foo", "foa", "fooa", "ofo", "arstu", "nana", "faa"] :: BKTree String DamerauLevenshtein
ghci> query 0 "foo" bk
[("foo",Distance 0)]
ghci> query 2 "foo" bk
[("faa",Distance 2),("foa",Distance 1),("fooa",Distance 1),("foo",Distance 0),("ofo",Distance 1)]
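
To illustrate the idea behind the session above, here is a minimal, self-contained sketch of a BK-tree. It is not the library's implementation: the names `levenshtein`, `bkInsert`, and `bkQuery` are hypothetical, and it uses plain Levenshtein distance rather than the `DamerauLevenshtein` distance from the session, so transpositions such as "ofo" vs "foo" cost 2 here instead of 1. It needs only the `array` and `containers` packages.

```haskell
import Data.Array (Array, array, listArray, (!))
import qualified Data.Map.Strict as M
import Data.Maybe (fromMaybe)

-- | Plain Levenshtein distance via a lazily filled dynamic-programming table.
-- (Assumption: stands in for the library's Damerau-Levenshtein distance.)
levenshtein :: String -> String -> Int
levenshtein s t = table ! (m, n)
  where
    m = length s
    n = length t
    sa = listArray (1, m) s :: Array Int Char
    ta = listArray (1, n) t :: Array Int Char
    -- table ! (i, j) is the distance between the first i characters of s
    -- and the first j characters of t.
    table = array ((0, 0), (m, n))
              [ ((i, j), cell i j) | i <- [0 .. m], j <- [0 .. n] ]
              :: Array (Int, Int) Int
    cell i 0 = i
    cell 0 j = j
    cell i j = minimum
      [ table ! (i - 1, j) + 1                                      -- deletion
      , table ! (i, j - 1) + 1                                      -- insertion
      , table ! (i - 1, j - 1) + (if sa ! i == ta ! j then 0 else 1) -- substitution
      ]

-- | A BK-tree: each child is keyed by its distance to the word at the node.
data BKTree = Empty | Node String (M.Map Int BKTree)

-- | Insert a word, descending into the child whose key equals the distance
-- between the new word and the current node's word.
bkInsert :: String -> BKTree -> BKTree
bkInsert w Empty = Node w M.empty
bkInsert w node@(Node v children)
  | d == 0    = node  -- word already present
  | otherwise = Node v (M.alter (Just . bkInsert w . fromMaybe Empty) d children)
  where
    d = levenshtein w v

-- | All stored words within maxDist of w, paired with their distance.
-- By the triangle inequality, only children keyed within maxDist of d
-- can contain matches, so the other subtrees are skipped.
bkQuery :: Int -> String -> BKTree -> [(String, Int)]
bkQuery _ _ Empty = []
bkQuery maxDist w (Node v children) = here ++ below
  where
    d = levenshtein w v
    here = [ (v, d) | d <= maxDist ]
    below = concat
      [ bkQuery maxDist w child
      | (k, child) <- M.toList children
      , abs (k - d) <= maxDist
      ]
```

Loaded into ghci, `levenshtein "foo" "bar"` evaluates to 3, and a tree can be built with `foldr bkInsert Empty` over a word list, mirroring the `foldr insert empty` call in the session. The pruning step relies on the distance being a true metric, which is why plain Levenshtein is used in this sketch.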

TODO:

  • Phonetic algorithms: metaphone, double metaphone, maybe others

  • Tests and better benchmarking

  • Cost tuning when searching
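
As a rough illustration of what cost tuning could look like, the sketch below computes an edit distance with configurable per-character costs. This is only one possible reading of the TODO item, not a planned or existing API: the `Costs` record, `weightedDistance`, `unitCosts`, `ocrCosts`, and the list of confusable character pairs are all hypothetical.

```haskell
import Data.List (foldl')

-- | Hypothetical cost table; not part of language-spelling.
data Costs = Costs
  { insertCost :: Char -> Double
  , deleteCost :: Char -> Double
  , substCost  :: Char -> Char -> Double
  }

-- | Every edit costs 1, which gives ordinary Levenshtein distance.
unitCosts :: Costs
unitCosts = Costs (const 1) (const 1) (\_ _ -> 1)

-- | Example tuning for OCR output: visually similar characters are cheap
-- to substitute. The pairs below are illustrative assumptions.
ocrCosts :: Costs
ocrCosts = unitCosts { substCost = ocrSubst }
  where
    confusable = [('1', 'l'), ('0', 'O'), ('5', 'S')]
    ocrSubst a b
      | (a, b) `elem` confusable || (b, a) `elem` confusable = 0.2
      | otherwise = 1

-- | Weighted edit distance from s to t, computed one table row at a time.
weightedDistance :: Costs -> String -> String -> Double
weightedDistance c s t = last (foldl' nextRow firstRow s)
  where
    -- Row for the empty prefix of s: insert each character of t in turn.
    firstRow = scanl (\acc y -> acc + insertCost c y) 0 t
    -- Build the row for one more character x of s from the previous row.
    nextRow [] _ = []
    nextRow prev@(p0 : rest) x =
      scanl step (p0 + deleteCost c x) (zip3 t prev rest)
      where
        step left (y, diag, up) = minimum
          [ left + insertCost c y
          , up + deleteCost c x
          , diag + (if x == y then 0 else substCost c x y)
          ]
```

With `unitCosts` this reduces to plain Levenshtein distance, while `ocrCosts` ranks "he11o" much closer to "hello" than an arbitrary two-character substitution would be. Note that uneven costs can break the metric properties a BK-tree search depends on, so a tuned distance like this may need a different search strategy.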

Versions: 0.1, 0.1.1, 0.1.2, 0.2, 0.3, 0.3.1, 0.3.2
Dependencies: array, base (>=3 && <5), bk-tree, bytestring, containers, ListLike, listlike-instances, text, tst, vector (>=0.5)
License: LicenseRef-PublicDomain
Author: Francesco Mazzoli (f@mazzo.li)
Maintainer: Francesco Mazzoli (f@mazzo.li)
Category: Natural Language Processing
Home page: https://github.com/bitonic/language-spelling
Bug tracker: https://github.com/bitonic/language-spelling/issues
Source repo (head): git clone git://github.com/bitonic/language-spelling.git
Uploaded: by FrancescoMazzoli at 2013-01-20T10:29:27Z
Reverse Dependencies: 1 direct, 0 indirect
Downloads: 4727 total (10 in the last 30 days)
Rating: no votes yet (estimated by Bayesian average)
Status: Docs uploaded by user
Build status: unknown (no reports yet)