You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The factgenie repository has almost 100 MBs when cloned from scratch. Majority of that are example datasets and outputs.
I realized that while the example datasets can be helpful, they can also be percieved as bloatware.
We should probably distribute them separately from the main repository, so that the default factgenie installation is as lightweight as can be.
Update: I found out that majority of the bloat was in fact caused by Factgenie.mp4 that has been once uploaded to the main repository (and remained in the history even after deletion). I fixed it using git-filter-repo:
The factgenie repository has almost 100 MBs when cloned from scratch.
Majority of that are example datasets and outputs.I realized that while the example datasets can be helpful, they can also be percieved as bloatware.
We should probably distribute them separately from the main repository, so that the default factgenie installation is as lightweight as can be.
Update: I found out that majority of the bloat was in fact caused by
Factgenie.mp4
that has been once uploaded to the main repository (and remained in the history even after deletion). I fixed it using git-filter-repo:The repository is now only several MBs large 💪
We tried to preserve git history as much as possible, but please let us know if this cause you some issues with your local git branches.
The main point of the issue still holds, though: we should make downloading the example datasets optional.
The text was updated successfully, but these errors were encountered: