Includes a custom vocabulary, along with self-implemented tokenize and detokenize functions.
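To illustrate what a custom vocabulary with hand-written tokenize and detokenize can look like, here is a minimal sketch. It is not the repository's code: the class name SimpleVocabTokenizer, the `<unk>` token, and the character-level splitting are all assumptions made for the example.

```python
from typing import Dict, List


class SimpleVocabTokenizer:
    """Hypothetical character-level tokenizer backed by a custom vocabulary."""

    def __init__(self, tokens: List[str], unk_token: str = "<unk>") -> None:
        # Put the unknown token first (id 0) and drop duplicates while
        # keeping the original order of the remaining tokens.
        vocab = list(dict.fromkeys([unk_token] + tokens))
        self.token_to_id: Dict[str, int] = {t: i for i, t in enumerate(vocab)}
        self.id_to_token: Dict[int, str] = {i: t for i, t in enumerate(vocab)}
        self.unk_id = self.token_to_id[unk_token]

    def tokenize(self, text: str) -> List[int]:
        # Map each character to its id; unseen characters map to unk_id.
        return [self.token_to_id.get(ch, self.unk_id) for ch in text]

    def detokenize(self, ids: List[int]) -> str:
        # Map ids back to tokens and join them into a string.
        return "".join(self.id_to_token[i] for i in ids)


if __name__ == "__main__":
    tok = SimpleVocabTokenizer(list("abc"))
    ids = tok.tokenize("abcz")
    print(ids)
    print(tok.detokenize(ids))
```

A real T5 vocabulary would normally use subword units and sentinel tokens for span corruption; the sketch only shows the two-way mapping between text and ids.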
pretrain_pipeline.py feeds the input data in as a stream.
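Streaming input generally means reading examples lazily instead of loading the whole corpus into memory. The sketch below shows one way to do that with plain Python generators. It is an assumption about the general technique, not the contents of pretrain_pipeline.py, and the helper names stream_lines and batched are hypothetical.

```python
from typing import Iterable, Iterator, List


def stream_lines(path: str) -> Iterator[str]:
    """Yield non-empty lines from a text file one at a time."""
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                yield line


def batched(items: Iterable[str], batch_size: int) -> Iterator[List[str]]:
    """Group a stream of items into lists of at most batch_size."""
    batch: List[str] = []
    for item in items:
        batch.append(item)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # Emit the final, possibly shorter, batch.
        yield batch


if __name__ == "__main__":
    # An in-memory iterable stands in for stream_lines("corpus.txt").
    for batch in batched(["a", "b", "c"], batch_size=2):
        print(batch)
```

In a PyTorch training loop, the same idea is commonly wrapped in a torch.utils.data.IterableDataset so that a DataLoader can consume the stream.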
Each program can be run directly with Python; adjust the specific configuration in the code.
couldn/t5_pretrain
T5 pretraining: a Transformer implementation in torch.