
Something regarding the performance #164

Answered by d99kris
KRJackLee asked this question in Q&A

Hi @KRJackLee - rapidcsv was mainly designed to be easy to use and to enable rapid development. It provides simple high-level access for reading and modifying CSV data, and this comes at some performance cost: the whole CSV file is read into a vector of vectors of strings, so you can imagine it's not super fast. The number you shared is probably reasonable for rapidcsv. For comparison, I downloaded a random large CSV file from https://www.stats.govt.nz/large-datasets/csv-files-for-download/. Unzipped, it was 818 MB (6 columns, 34959673 rows), and rapidcsv took 7 seconds to read it on my MacBook Pro (M2 Pro), built with clang at the O2 optimization level.
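To make the cost of the "vector of vectors of strings" layout concrete, here is a minimal sketch that loads a CSV stream into that kind of structure and times it. It is not rapidcsv's actual implementation: the names `Table`, `loadCsv` and `timeLoadSeconds` are hypothetical, and the parser is deliberately naive (no quoted fields, no escapes). For scale, the numbers above work out to roughly 818 MB / 7 s ≈ 117 MB/s, or about 5 million rows per second.

```cpp
#include <chrono>
#include <cstddef>
#include <istream>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

// Hypothetical alias: one std::string per cell, one inner vector per row.
using Table = std::vector<std::vector<std::string>>;

// Naive parser for illustration only: splits on ',' and '\n'.
// It does not handle quoted fields, escapes, or "\r\n" line endings.
Table loadCsv(std::istream& in) {
    Table table;
    std::string line;
    while (std::getline(in, line)) {
        std::vector<std::string> row;
        std::size_t start = 0;
        while (true) {
            const std::size_t comma = line.find(',', start);
            if (comma == std::string::npos) {
                row.emplace_back(line, start);  // last cell on the line
                break;
            }
            row.emplace_back(line, start, comma - start);  // copy one cell
            start = comma + 1;
        }
        table.push_back(std::move(row));
    }
    return table;
}

// Wall-clock seconds spent loading the whole stream into memory.
double timeLoadSeconds(std::istream& in, Table& out) {
    const auto t0 = std::chrono::steady_clock::now();
    out = loadCsv(in);
    const auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double>(t1 - t0).count();
}
```

To try it, pass any `std::istream` (for example a `std::ifstream` opened on the CSV file, or a `std::istringstream` for a quick check) to `timeLoadSeconds`. Every cell becomes its own `std::string` and every row its own vector, so a file with tens of millions of rows means a very large number of small allocations and copies, which is the main reason this layout trades speed for convenience.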

There are currently no performance-improving flags to use with…

Answer selected by KRJackLee