@s-JoL s-JoL released this 28 Apr 07:22
· 63 commits to main since this release
f3c664b

Refactored the code. Data is now loaded with the datasets library, the share of padding is reduced from 30% to 5%, the language model training logic is unified into a trainer to remove redundant code, and a more reasonable config format is supported.
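
To make the padding figure concrete, here is a minimal sketch of how a padding share can be measured. The release note does not say how the reduction was achieved. Packing (concatenating samples into one token stream and cutting it into fixed-length chunks) is an assumption used here for comparison only, and the function names and sample lengths are hypothetical.

```python
from typing import List


def padding_fraction(lengths: List[int], max_len: int) -> float:
    """Share of token slots that are padding when every sequence
    is truncated or padded to exactly max_len tokens."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    total_slots = len(lengths) * max_len
    real_tokens = sum(min(n, max_len) for n in lengths)
    return 1.0 - real_tokens / total_slots


def packed_padding_fraction(lengths: List[int], max_len: int) -> float:
    """Share of padding when all tokens are concatenated into one stream
    and split into max_len chunks; only the last chunk is padded.
    (Assumed technique, not confirmed by the release note.)"""
    total_tokens = sum(lengths)
    if total_tokens == 0:
        raise ValueError("need at least one token")
    n_chunks = -(-total_tokens // max_len)  # ceiling division
    return 1.0 - total_tokens / (n_chunks * max_len)


if __name__ == "__main__":
    sample_lengths = [700, 900, 512, 1024, 448]  # hypothetical token counts
    print(f"pad each sample: {padding_fraction(sample_lengths, 1024):.1%}")
    print(f"packed stream:   {packed_padding_fraction(sample_lengths, 1024):.1%}")
```

The numbers this prints depend entirely on the made-up sample lengths and are not meant to reproduce the 30% and 5% figures above.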