full-text-search: In-memory full text search engine
This is a package candidate release! Here you can preview how this package release will appear once published to the main package index (which can be accomplished via the 'maintain' link below). Please note that once a package has been published to the main package index it cannot be undone! Please consult the package uploading documentation for more information.
An in-memory full text search engine library. It lets you run full-text queries on a collection of your documents.
Features:
Keyword queries and auto-complete/auto-suggest queries.
Can search over any type of "document". (You explain how to extract search terms from them.)
Supports documents with multiple fields (e.g. title, body)
Supports documents with non-term features (e.g. quality score, page rank)
Uses the state of the art BM25F ranking function
Adjustable ranking parameters (including field weights and non-term feature scores)
In-memory but quite compact. It does not keep a copy of your original documents.
Quick incremental index updates, making it possible to keep your text search in-sync with your data.
It is independent of the document type, so you have to write the document-specific parts: extracting search terms and any stop words, case-normalisation or stemming. This is quite easy using libraries such as tokenize and snowball.
The source package includes a demo to illustrate how to use the library. The demo is a simplified version of how the library is used in the hackage-server where it provides the backend for the package search feature.
Properties
Versions | 0.2.0.0, 0.2.1.0, 0.2.1.0, 0.2.1.1, 0.2.1.3, 0.2.1.4, 0.2.2.0, 0.2.2.1, 0.2.2.2 |
---|---|
Change log | changelog |
Dependencies | array (>=0.4 && <0.5), base (>=4.5 && <4.7), bytestring, Cabal (>=1.14 && <1.20), containers (>=0.4 && <0.6), directory, filepath, full-text-search, mtl, snowball (>=1.0 && <1.1), split (>=0.2 && <0.3), tar, text (>=0.11 && <1.2), time, tokenize (>=0.1 && <0.2), transformers, vector (>=0.10 && <0.11) [details] |
License | BSD-3-Clause |
Copyright | 2013-2014 Duncan Coutts, 2014 Well-Typed LLP, 2014 IRIS Connect Ltd. |
Author | Duncan Coutts |
Maintainer | Duncan Coutts <duncan@well-typed.com> |
Category | Data, Text, Search |
Source repo | head: darcs get http://code.haskell.org/full-text-search/ |
Uploaded | by DuncanCoutts at 2014-03-11T13:41:17Z |
Modules
- Data
- Data.SearchEngine
- Data.SearchEngine.BM25F
- Data.SearchEngine
Flags
Automatic Flags
Name | Description | Default |
---|---|---|
build-search-demo | Build a little program illustrating the use of the library | Disabled |
Use -f <flag> to enable a flag, or -f -<flag> to disable that flag. More info
Downloads
- full-text-search-0.2.1.0.tar.gz [browse] (Cabal source package)
- Package description (as included in the package)
Maintainer's Corner
Package maintainers
For package maintainers and hackage trustees