Feature request: Handle multiple, different delimiters in a file #956

patrickboehnke · 2021-12-29T20:52:19Z

I frequently work with data that uses different delimiters. I would like to be able to store this data as a CSV file without first having to go through and replace the delimiters with just a single choice. The enhancements that I would like to see are for CSV.File as follows:

The 'delim' argument also accepts an Array of Char or String entries to use as delimiters.
The 'ignorerepeated' argument could be updated to consider a three state system: 1) All duplicate delimiters ignored 2) Only duplicate delimiters that are the same are ignored and 3) Each delimiter treated as unique

Thank you for your consideration!

PallHaraldsson · 2022-05-15T14:54:55Z

Pandas allows Regexes, so it's something to consider for feature-compatibility with them.

ryofurue · 2024-07-15T13:57:55Z

As mentioned in the thread which I quote at the end, readdlm() is "deprecated" (actually, I don't know what that means in practice) and CSV.jl is recommended as an alternative.

But, currently, CSV.jl isn't able to treat a string of consecutive "space" characters as a single delimiter (whereas readdlm() is). Because of this, you very often have to preprocess text input files to use them with CSV.jl.

JuliaData/DelimitedFiles.jl#1

nickrobinson251 added new feature question and removed question labels Oct 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: Handle multiple, different delimiters in a file #956

Feature request: Handle multiple, different delimiters in a file #956

patrickboehnke commented Dec 29, 2021

PallHaraldsson commented May 15, 2022

ryofurue commented Jul 15, 2024

Feature request: Handle multiple, different delimiters in a file #956

Feature request: Handle multiple, different delimiters in a file #956

Comments

patrickboehnke commented Dec 29, 2021

PallHaraldsson commented May 15, 2022

ryofurue commented Jul 15, 2024