Frames-streamly: A streamly layer for Frames I/O

[ bsd3, data, library, program ] [ Propose Tags ]

More information is available in the readme.'


[Skip to Readme]

Modules

[Last Documentation]

  • Frames
    • Streamly
      • Frames.Streamly.CSV
      • Frames.Streamly.Categorical
      • Frames.Streamly.ColumnTypeable
      • Frames.Streamly.ColumnUniverse
      • Frames.Streamly.InCore
      • Frames.Streamly.LoadInCore
      • Frames.Streamly.OrMissing
      • Streaming
        • Frames.Streamly.Streaming.Class
        • Frames.Streamly.Streaming.Common
        • Frames.Streamly.Streaming.Pipes
        • Frames.Streamly.Streaming.Streamly
      • Frames.Streamly.TH
      • Frames.Streamly.Transform
  • Paths_Frames_streamly

Flags

Manual Flags

NameDescriptionDefault
streamly9

require streamly >= 0.9

Disabled
Automatic Flags
NameDescriptionDefault
dump-core

Dump HTML for the core generated by GHC during compilation

Disabled

Use -f <flag> to enable a flag, or -f -<flag> to disable that flag. More info

Downloads

Maintainer's Corner

Package maintainers

For package maintainers and hackage trustees

Candidates

Versions [RSS] 0.1.0.0, 0.1.0.1, 0.1.0.2, 0.1.0.3, 0.1.1.0, 0.1.1.1, 0.1.2.0, 0.3.1.0 (info)
Change log Changelog.md
Dependencies base (>=4.12.0 && <5), bytestring (>=0.10.0 && <0.12), containers (>=0.6.0 && <0.7), deepseq (>=1.4 && <1.5), dlist (>=1.0 && <1.1), exceptions (>=0.10.0 && <0.11), foldl (>=1.4.0 && <1.5), Frames (>=0.6 && <0.8), Frames-streamly, fusion-plugin (>=0.2 && <0.3), ghc-prim (>=0.6 && <0.11), monad-control (>=1.0.1 && <1.1), pipes (>=4.3.14 && <4.4), pipes-safe (>=2.3.1 && <2.4), pipes-text (>=1.0 && <1.1), primitive (>=0.7 && <0.9), readable (>=0.3 && <0.4), relude (>=1.0.0 && <1.3), streamly (>=0.8 && <0.9), streamly-bytestring (>=0.1.0 && <0.2), strict (>=0.4 && <0.6), template-haskell (>=2.14 && <2.21), text (>=1.2.3 && <2.2), text-builder (>=0.6.6 && <0.7), vector (>=0.12.1.2 && <0.14), vector-th-unbox (>=0.2.1.3 && <0.3), vinyl (>=0.12 && <0.15), word8 (>=0.1.3 && <0.2) [details]
License BSD-3-Clause
Copyright 2020 Adam Conner-Sax
Author Adam Conner-Sax
Maintainer adam_conner_sax@yahoo.com
Category Data
Home page https://github.com/adamConnerSax/Frames-streamly#readme
Bug tracker https://github.com/adamConnerSax/Frames-streamly/issues
Source repo head: git clone https://github.com/adamConnerSax/Frames-streamly
Uploaded by adamCS at 2023-05-02T15:38:05Z
Distributions
Executables Demo
Downloads 1230 total (16 in the last 30 days)
Rating (no votes yet) [estimated by Bayesian average]
Your Rating
  • λ
  • λ
  • λ
Status Docs not available [build log]
All reported builds failed as of 2023-05-02 [all 2 reports]

Readme for Frames-streamly-0.3.1.0

[back to package description]

Frames-streamly- v 0.3.1

Build Status Hackage Hackage Dependencies

  • Added Flag “streamly9”. To use streamly >= 0.9, set that flag to true. The flag will put a lower-bound of 0.9 on streamly. When the flag is false (the default), an upper bound of < 0.9 will be in place for streamly. Streamly changed the library structure between 0.8.x and 0.9.x in such a way that it is difficult to write this package in a way compatible with both sets of versions.

  • Breaking Change (from 0.1.0.2): Some streaming functions use the StrictReadRec class, a stricter version of ReadRec from Frames. This class is located in Frames.Streamly.CSV.

This library contains some useful functions for using the Frames package with streamly.

More generally, it abstracts the streaming layer in Frames into a class and implements that class for Pipes and Streamly.

Frames has some built-in dependencies on the Pipes package, a few of which--primarily file I/O-- require users of Frames to use the Pipes package explicitly. Streamly provides much of the same functionality as Pipes and may be some users preferred streaming interface.

This package also fleshes out the Frames API in a couple of places:

  1. It adds some flexibility to the functions to write CSV files. Frames supported formatting of fields for CSV via a typeclass ShowCSV. That is supported here as well. But this package also supports using the Show instance, and, for more customized formatting, creating field-by-field formatting on the fly via a Vinyl record of functions. Helpful combinators are provided for formatting any single field with a Show instance or a ShowCSV instance or a user provided function from the field type to Text.

  2. It adds some (experimental) support for Frame transformations using Streamly streams as an intermediate state for transformations which may benefit from the concurrency available in streamly. Such transformations first make any foldable of Records (including a Frame) into a stream, apply a streamly transformation to a stream of some other records and then transforms those into a frame. So the result is a frame -> frame function but one that can take advantage of streamly's features at the cost of the transformation into a stream and then back in to a Frame. This allows use of the concurrent features of Streamly for functions like mapM or mapMaybeM.

  3. It adds Streamly folds for the various stream to in-core transformations in case users want to use them directly in stream to Frame transformations. Frames exposed only the functions to transform an entire stream (a pipe producer) into Frames "AoS" structure. This library provides that functionality as well, in this case using streamly streams as the input. But here we also expose streamly folds from streams of Records to Frames so that more complex stream to Frame transformations can be done by the user. For example, suppose you are doing a map/reduce on a large data set and you want to store the grouped subsets as Frames for memory-efficiency. These folds make that simpler.

  4. There is some experimental support for more flexible loading of data from CSV. New features include:

  • Choosing of specific columns to load (by position or header text)
  • Renaming of columns before the header text is used to create a column type.
  • Improved handling of type-inference of columns with possibly missing data, allowing the user a choice between inference based on non-missing values leading to loading failure if missing values are encountered; inferring Maybe a where a is inferred from the non-missing values, thus succesfully loading data where some values in the column are missing; or an option to choose between the above depending on whether any missing data is encountered in the sample Frames uses for inference. Please see some examples here.

More examples using some of the utilities is here.


LICENSE (BSD-3-Clause)


Copyright (c) 2020, Adam Conner-Sax, All rights reserved.

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:

* Redistributions of source code must retain the above copyright
  notice, this list of conditions and the following disclaimer.

* Redistributions in binary form must reproduce the above
  copyright notice, this list of conditions and the following
  disclaimer in the documentation and/or other materials provided
  with the distribution.

* Neither the name of Adam Conner-Sax nor the names of other
  contributors may be used to endorse or promote products derived
  from this software without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.