word2vec-model: Reading word2vec binary models

[ bsd3, library, program, unclassified ] [ Propose Tags ] [ Report a vulnerability ]

Modules

  • Data
    • Word2Vec
      • Data.Word2Vec.Model

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

  • No Candidates
Versions [RSS] 0.1.0.0
Change log ChangeLog.md
Dependencies attoparsec, base (>=4.7 && <5), binary, binary-ieee754, bytestring, conduit, conduit-combinators, conduit-extra, text, unordered-containers, vector, word2vec-model [details]
License BSD-3-Clause
Copyright BSD3
Author Filip Graliński
Maintainer filipg@amu.edu.pl
Home page https://gonito.net/gitlist/word2vec-model.git
Source repo head: git clone git://gonito.net/word2vec-model.git
Uploaded by filipg at 2017-12-30T11:44:43Z
Distributions
Executables word2vec-model-word-analogy, word2vec-model-similarity
Downloads 752 total (6 in the last 30 days)
Rating 2.0 (votes: 1) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs not available [build log]
All reported builds failed as of 2017-12-30 [all 3 reports]

Readme for word2vec-model-0.1.0.0

[back to package description]

word2vec-model

Reading word2vec binary models (generated with the original tool by Mikolov).

This simple module is only for reading word2vec models (it cannot be used to generate a word2vec model, for this the original word2vec tools should be used).

Note that word2vec binary format is not a proper serialisation format (as it is mostly a raw dump of C data. Caveat emptor, it might be risky to read a model generated on a host with a different architecture.

Example:

{-# LANGUAGE OverloadedStrings #-}
model <- readWord2VecModel "binary.bin"
let theMostSimilar = findKNearestToWord w2v 30 "polska"