Support external CSV tokenizers
Internal functionality is now defined more cleanly atop a stream of rows already broken into columns (rather than a stream of rows that we quietly break into columns ourself). This permits the use of external parsers such as provided by the new Frames-dsv package that supplies a CSV parser built atop hw-dsv
.
The built-in CSV parser remains for ease of installation.
GHC 8.6 compatibility
-
Renamed the
rgetf
andrputf
exported by theFrames
module torgetField
andrputField
. This avoids clashing with the same names exported byvinyl
and further advances the process of eliminating the oldFrames
Col
type in favor ofvinyl
'sElField
. -
Add a
ShowCSV
class rather than leaning on overburdenedShow
instances. -
Add support for categorical column types: values of these types are one of a small number of textual values. Because they can only take on a small number of different text values, we can compactly represent values of these types as standard Haskell sum types.
-
Added table joins in
Data.Vinyl.Joins
(Chris Hammill) -
Changed types of
mapMethod
andmapMethodV
These now rely on explicit TypeApplications
rather than Proxy
values.
- Pervasive use of
pipes
for CSV data loading
This provides better exception handling (file handles should be closed more reliably), and offers an interface point for customized handling of input texts. An example of this latter point is working with particular file encodings.
A breaking change is that operations that previously returned IO
values now return MonadSafe
constrained values.
- Adaptation of
Data.Vinyl.Curry.runcurry
to the FramesRecord
type This simply strips the column name information from a row before applying the function fromvinyl
.
-
Refactored to use the
CoRec
type provided byvinyl
>= 0.6.0 -
Fixed bug in typing mostly-numeric columns Such columns must be represented as
Text
. Previously, we strove a bit too hard to avoid falling back toText
resulting in dropping rows containing non-numeric values for columns we crammed into a numeric type. -
Minor optimization of CSV parsing In particular, dealing with RFC4180 style quoting
-
GHC-8.2.1 compatibility
- Added CSV output functions:
produceCSV
andwriteCSV
- Added an Eq instance for the
Frame
type
Fixed column type inference bug that led the inferencer to prefer Bool
too strongly.
This was fallout from typing columns whose values are all 0 or 1 as Bool
.
Re-export Frames.CSV.declareColumn
from Frames
. This makes it much
easier to manually define column types.
Use microlens
instead of lens-family-core
for demos.
GHC-8.0.1 compatibility
Improved documentation based on suggestions by Alexander Kjeldaas
Fixed bug in Monoid
instance of Frame
(@dalejordan)
Added frameConsA
, frameSnoc
, and RecordColumns
to help with
changing row types.
Initial version pushed to hackage.