Safe Haskell | None |
---|---|
Language | Haskell2010 |
Citeseer document classification dataset, from :
Qing Lu, and Lise Getoor. "Link-based classification." ICML, 2003.
Synopsis
- stash :: FilePath -> IO ()
- sourceCiteseerGraphEdges :: (MonadResource m, MonadThrow m) => FilePath -> Map String (Seq Int16, CiteSeerDoc) -> ConduitT i (Maybe (Graph (ContentRow CiteSeerDoc))) m ()
- loadCiteseerGraph :: FilePath -> IO (Graph (ContentRow CiteSeerDoc))
- data CiteSeerDoc
1. Download the dataset
2. Reconstruct the citation graph
sourceCiteseerGraphEdges Source #
:: (MonadResource m, MonadThrow m) | |
=> FilePath | directory of data files |
-> Map String (Seq Int16, CiteSeerDoc) |
|
-> ConduitT i (Maybe (Graph (ContentRow CiteSeerDoc))) m () |
See sourceGraphEdges
:: FilePath | directory where the data files were saved |
-> IO (Graph (ContentRow CiteSeerDoc)) |
See loadGraph
Types
data CiteSeerDoc Source #
document classes of the Citeseer dataset