-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cache: checksum collision if there is dos2unix in the pipeline #992
Comments
@efiop , @shcheklein , should we this to our documentation? or is there any other way to address this issue? |
It is also related to this one iterative/dvc.org#68 |
@shcheklein well, the problem with crlf would arise in any checksum, so it won't be solved by migrating to sha1 or any other algorithm. Most likely, we'll simply need to introduce an additional suffix that would mark that that file is dos2unix one. Need to take a closer look at it sometime. @MrOutis As @shcheklein noticed, we already have an issue for it at iterative/dvc.org#68 , so we might eventually get to writing a doc for it in the future. |
@efiop yep, I understand the crlf issue. My point was if we go with this change (switch to a regular |
Closing in favor of #4658 |
As @shcheklein noticed, if user will have
dvc add data && dvc run -d data -o data.unix dos2unix data data.unix
in his pipeline, we will get a cache collision since we use dos2unix to compute checksum in the first place.The text was updated successfully, but these errors were encountered: