transformer_courses/Transformer_Machine_Translation: cannot find the file zh-en/train.tags.zh-en.zh.cut.txt #913

Open
NewComer00 opened this issue Mar 5, 2023 · 0 comments

Problem location

https://github.com/PaddlePaddle/awesome-DeepLearning/tree/master/transformer_courses/Transformer_Machine_Translation

Symptom

When following the README to run BPE tokenization, the sh subword.sh command fails because it cannot find the file zh-en/train.tags.zh-en.zh.cut.txt.

Steps to reproduce

After installing the dependencies and downloading the dataset, run the BPE tokenization steps:

python data_process.py
sh subword.sh  # this command fails
python bpe_process2.py

Solution

After reading through the example code, I found that zh-en/train.tags.zh-en.zh.cut.txt is generated by the bpe_process.py script. Running that script before sh subword.sh fixes the problem.

python data_process.py
python bpe_process.py
sh subword.sh
python bpe_process2.py
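
The corrected order can also be wrapped in a small script that stops with a clear message when the intermediate file is missing. This is only a sketch: the helper names require_file and run_pipeline are hypothetical (they are not part of the repository), and it assumes the commands are run from the Transformer_Machine_Translation directory and that bpe_process.py writes the file at the path shown in the error.

```shell
#!/bin/sh
# Sketch only: require_file and run_pipeline are hypothetical helpers,
# not part of the awesome-DeepLearning repository.

# Intermediate file that subword.sh expects (path taken from the error above).
CUT_FILE="zh-en/train.tags.zh-en.zh.cut.txt"

# require_file PATH: return non-zero with a message on stderr
# if PATH is not an existing regular file.
require_file() {
    if [ ! -f "$1" ]; then
        echo "missing: $1 (run bpe_process.py before subword.sh)" >&2
        return 1
    fi
}

# run_pipeline: run the four steps in the corrected order and stop at the
# first failure. bpe_process.py is assumed to generate $CUT_FILE, as
# described in this issue.
run_pipeline() {
    python data_process.py &&
    python bpe_process.py &&
    require_file "$CUT_FILE" &&
    sh subword.sh &&
    python bpe_process2.py
}
```

Calling run_pipeline after these definitions executes the steps; the require_file check turns the original "file not found" failure inside subword.sh into an earlier, more explicit error.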