bio: A bioinformatics library

[ bioinformatics, library ] [ Propose Tags ]

This is a collection of data structures and algorithms I've found useful when building various bioinformatics-related tools and utilities.

Current list of features includes: a Sequence data type supporting protein and nucleotide sequences and conversion between them, quality data, reading and writing Fasta formatted files, reading TwoBit and phd formats. Rudimentary support for doing alignments - including dynamic adjustment of scores based on sequence quality - and Blast output parsing. Partly implemented single linkage clustering, and multiple alignment.

The Darcs repository is at: http://malde.org/~ketil/bio.


[Skip to Readme]

Modules

[Last Documentation]

  • Bio
    • Alignment
      • Bio.Alignment.AAlign
      • Bio.Alignment.ACE
      • Bio.Alignment.AlignData
      • Bio.Alignment.Blast
      • Bio.Alignment.BlastData
      • Bio.Alignment.BlastFlat
      • Bio.Alignment.BlastXML
      • Bio.Alignment.Matrices
      • Bio.Alignment.Multiple
      • Bio.Alignment.QAlign
      • Bio.Alignment.SAlign
    • Bio.Clustering
    • Bio.Sequence
      • Bio.Sequence.Entropy
      • Bio.Sequence.Fasta
      • Bio.Sequence.GOA
      • Bio.Sequence.HashWord
      • Bio.Sequence.Phd
      • Bio.Sequence.SeqData
      • Bio.Sequence.TwoBit
    • Bio.Util
      • Bio.Util.Parsex

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

  • No Candidates
Versions [RSS] 0.3.3, 0.3.3.1, 0.3.3.2, 0.3.3.4, 0.3.5, 0.4, 0.4.4, 0.4.5, 0.4.6, 0.4.7, 0.4.8, 0.5, 0.5.0.1, 0.5.1, 0.5.2, 0.5.3
Dependencies array, base (>3), binary, bytestring, containers, interlude, parallel, parsec, QuickCheck (<2), tagsoup (>=0.5) [details]
License LicenseRef-LGPL
Author Ketil Malde
Maintainer ketil@ii.uib.no
Category Bioinformatics
Home page http://malde.org/~ketil/
Uploaded by GwernBranwen at 2008-03-04T20:02:42Z
Distributions FreeBSD:0.5.3
Reverse Dependencies 2 direct, 0 indirect [details]
Downloads 14150 total (30 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs not available [build log]
All reported builds failed as of 2017-01-03 [all 8 reports]

Readme for bio-0.3.3

[back to package description]
biolib - a Haskell library for bioinformatics

This is a collection of data structures and algorithms
I've found useful when building various bioinformatics-related tools
and utilities.

Current list of features includes: a Sequence data type supporting
protein and nucleotide sequences and conversion between them, quality
data, reading and writing FASTA formatted files, reading TwoBit and
phd formats.  Rudimentary support for doing alignments - including
dynamic adjustment of scores based on sequence quality - and Blast
output parsing.  Partly implemented single linkage clustering, and
multiple alignment.

To install, you need to acquire a working GHC (possibly other Haskell
system).  You also need the following external libraries:

  QuickCheck   - for unit tests
  binary       - mainly for dealing with the TwoBit sequence format
  tagsoup      - for parsing XML output from Blast

You should be able to get what you need from <http://hackage.haskell.org/>.

You can then build with 'make', doing either 'make install' if you can sudo, or 'make user_install' if you can not.  Of course, the Makefile just proxies for
the regular Cabal routine, which will work just as well:

    runhaskell Setup configure
    runhaskell Setup build
    sudo runhaskell Setup install

(Use --prefix=$HOME and remove the sudo, if you don't want to install as root.)

If that didn't work, mail me at <ketil@malde.org>, and we'll try to
work things out.