Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support loading a dataset from SQLite files #3094

Closed
albertvillanova opened this issue Oct 15, 2021 · 2 comments · Fixed by #4928
Closed

Support loading a dataset from SQLite files #3094

albertvillanova opened this issue Oct 15, 2021 · 2 comments · Fixed by #4928
Labels
enhancement New feature or request good second issue Issues a bit more difficult than "Good First" issues

Comments

@albertvillanova
Copy link
Member

As requested by @julien-c, we could eventually support loading a dataset from SQLite files, like it is the case for JSON/CSV files.

@albertvillanova albertvillanova added the enhancement New feature or request label Oct 15, 2021
@mariosasko mariosasko added the good second issue Issues a bit more difficult than "Good First" issues label Jun 1, 2022
@julien-c
Copy link
Member

for reference Kaggle has a good number of open source datasets stored in sqlite

Alternatively a tutorial or tool on how to convert from sqlite to parquet would be cool too

@Dref360
Copy link
Contributor

Dref360 commented Aug 30, 2022

Hello, could we leverage pandas.read_sql for this?

This would be basically the same as CSVBuilder
, but uses pandas.read_sql(..., chunksize=1) instead of pandas.read_csv(..., iterator=True)

I'm happy to work on this :)

self-assign

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good second issue Issues a bit more difficult than "Good First" issues
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants