-
Notifications
You must be signed in to change notification settings - Fork 914
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
训练488万张图片时中间突然出现OS.ERROR #1385
Comments
Full Logs:
|
显然是图片读取失败啊,检查下数据集,最好都过一下能不能用 pil打开 |
The image might be corrupted. I've updated dev branch to show the file name if OSError occurs, so please try with dev branch. |
你看看你那日志 with Image.open(image_file_path) as img: 如果你想跳过检查潜空间这个费时的操作 |
* Final implementation * Skip the final 1 step * fix alpha mask without disk cache closes kohya-ss#1351, ref kohya-ss#1339 * update for corner cases * Bump crate-ci/typos from 1.19.0 to 1.21.0, fix typos, and updated _typos.toml (Close kohya-ss#1307) * set static graph flag when DDP ref kohya-ss#1363 * make forward/backward pathes same ref kohya-ss#1363 * update README * add grad_hook after restore state closes kohya-ss#1344 * fix to work cache_latents/text_encoder_outputs * show file name if error in load_image ref kohya-ss#1385 --------- Co-authored-by: Kohya S <ykumeykume@gmail.com> Co-authored-by: Kohya S <52813779+kohya-ss@users.noreply.github.com> Co-authored-by: Yuta Hayashibe <yuta@hayashibe.jp>
When I was training model on 2x A100 80G machine, a few time later afrer start, there's an error occurred:
I hope the author can find the reason of this problem, thanks!!
The text was updated successfully, but these errors were encountered: