datasets-0.2.4: Classical data sets for statistics and machine learning

Safe HaskellNone
LanguageHaskell2010

Numeric.Datasets.Netflix

Contents

Description

Netflix prize dataset

From the README :

The movie rating files contain over 100 million ratings from 480 thousand randomly-chosen, anonymous Netflix customers over 17 thousand movie titles. The data were collected between October, 1998 and December, 2005 and reflect the distribution of all ratings received during this period. The ratings are on a scale from 1 to 5 (integral) stars. To protect customer privacy, each customer id has been replaced with a randomly-assigned id. The date of each rating and the title and year of release for each movie id are also provided.

The competition ended on September, 2009, and the dataset was subsequently removed from the public domain by the company.

We include in this repository a tiny subset of the original dataset for development purposes.

For further information, see http://netflixprize.com/.

Synopsis

Dataset files. The directories are scanned recursively and their contents are presented as (FilePath, ByteString) pairs

Data types

newtype UserId Source #

Training set

Constructors

UserId 

Fields

Instances

data Train Source #

Constructors

Train 

Instances

Eq Train Source # 

Methods

(==) :: Train -> Train -> Bool #

(/=) :: Train -> Train -> Bool #

Show Train Source # 

Methods

showsPrec :: Int -> Train -> ShowS #

show :: Train -> String #

showList :: [Train] -> ShowS #

newtype MovieId Source #

Movies file

Constructors

MovieId 

Fields

data Movie Source #

Constructors

Movie 

Instances

Eq Movie Source # 

Methods

(==) :: Movie -> Movie -> Bool #

(/=) :: Movie -> Movie -> Bool #

Show Movie Source # 

Methods

showsPrec :: Int -> Movie -> ShowS #

show :: Movie -> String #

showList :: [Movie] -> ShowS #

newtype Test Source #

Qualifying file (test set)

Constructors

Test 

Instances

Eq Test Source # 

Methods

(==) :: Test -> Test -> Bool #

(/=) :: Test -> Test -> Bool #

Show Test Source # 

Methods

showsPrec :: Int -> Test -> ShowS #

show :: Test -> String #

showList :: [Test] -> ShowS #

Additional types and helper functions

data RD a Source #

Constructors

RD 

Fields

Instances

Eq a => Eq (RD a) Source # 

Methods

(==) :: RD a -> RD a -> Bool #

(/=) :: RD a -> RD a -> Bool #

Show a => Show (RD a) Source # 

Methods

showsPrec :: Int -> RD a -> ShowS #

show :: RD a -> String #

showList :: [RD a] -> ShowS #

Netflix dataset parsers

Netflix dataset row type parsers

Attoparsec parser combinators

Attoparsec helpers