Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Streaming iterator read #29

Open
xiaodaigh opened this issue Sep 27, 2017 · 2 comments
Open

Streaming iterator read #29

xiaodaigh opened this issue Sep 27, 2017 · 2 comments

Comments

@xiaodaigh
Copy link

One of the most important features is to be able to read a chunk of SAS and then process it because in the wild there are a lot of large SAS datasets which are 50G~100G in size and they can't fit into memory.

What's stopping wider move away from SAS I think is due to not being able to read the data in chunks and then process it and then read the next chunk.

@kylebarron
Copy link

Fyi, pandas includes support for streaming iterator reads for Stata, though it doesn't have such SAS support:
https://pandas.pydata.org/pandas-docs/stable/generated/pandas.read_stata.html

@davidanthoff
Copy link
Member

Somewhere on my todo list is a plan to add streaming support here and integrate it properly with IterableTables.jl, at which point this should be feasible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants