Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

discuss data storage priorities #11

Open
2 tasks
berteh opened this issue Jan 31, 2023 · 1 comment
Open
2 tasks

discuss data storage priorities #11

berteh opened this issue Jan 31, 2023 · 1 comment

Comments

@berteh
Copy link
Collaborator

berteh commented Jan 31, 2023

some thoughts relating to data storage

  • be able to use reference plants data (eg from scrapers you already made) loaded by the app.
  • all permadesigner to define/refine his own pet plants, local varieties or local calendars that vary a bit from the reference. They should not replace but rather 'superseed' the reference data (like an 'inheritance' mechanism). That should be usable/copyable/refinable in the app in some fast way, maybe with an ability to 'suggest for curation' to the reference dataset, either automatically (as a 'give back to community' mechanism) or manually.

About the data storage technology : how big do you expect it will become? How many plants and designs would be stored in the user's app?

If the sum is less than 2000 then the technology doesn't matter much I think.

Do you need instant online sync amongst all users? I think an update every once in a while is enough and thus wouldn't go for online CRUD like restdb, but I'd rather stick to offline & local file with some import/update mechanism.

In case of local files : SQLite is easy enough but really low level, opt. use sqlalchemy for ORM or MVC or later backend change... but I wouldn't invest too much time there in now. Parquet file is a good alternative to CSV, that natively loads to pandas data frame.

@berteh berteh converted this from a draft issue Jan 31, 2023
@berteh
Copy link
Collaborator Author

berteh commented Jan 31, 2023

from discussions:

  • size database ~ 10k plants
  • size designs ~ 30-200 pants

pandas dataframe for storing the design
sqlite to store the plants

@berteh berteh moved this from Todo to In Progress in @jwnigel's PermaProject Feb 1, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

No branches or pull requests

1 participant