You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When syncing a local copy to a bucket I would like to sync (upload) only files that changed. This is supported today, but does not work for some use cases.
Use Case
We hold a number of CSVs in a repository synced to S3 by Github workflow. Using the --size-only flag works as long as the file size has changed, but if a typo is made and a fix is pushed (say hacc is the typo and the fix pushed is hack) then this change is not picked up by the CLI and that file is not being synced.
Proposed Solution
Introduce a flag (--md5 for example) that calculate MD5 for all files and compare each of them to the remote copy's MD5. Only those with different hashes should be synced.
Other Information
This may not be very efficient for large files and should be probably mentioned (and discouraged) in the documentation.
Acknowledgements
I may be able to implement this feature request
This feature might incur a breaking change
CLI version used
2.3.2
Environment details (OS name and version, etc.)
Ubuntu latest (Github workflow default runner)
The text was updated successfully, but these errors were encountered:
Thanks for your comment. This feature has been requested quite some time ago, but as you note there are performance issues - not only for large files, but if you have many files. See the issue here: #599
There is a possibility to support the new S3 checksum feature in AWS CLI high level S3 commands:
Comments on closed issues are hard for our team to see.
If you need more assistance, please open a new issue that references this one. If you wish to keep having a conversation with other community members under this issue feel free to do so.
Describe the feature
When syncing a local copy to a bucket I would like to sync (upload) only files that changed. This is supported today, but does not work for some use cases.
Use Case
We hold a number of CSVs in a repository synced to S3 by Github workflow. Using the
--size-only
flag works as long as the file size has changed, but if a typo is made and a fix is pushed (sayhacc
is the typo and the fix pushed ishack
) then this change is not picked up by the CLI and that file is not being synced.Proposed Solution
Introduce a flag (
--md5
for example) that calculate MD5 for all files and compare each of them to the remote copy's MD5. Only those with different hashes should be synced.Other Information
This may not be very efficient for large files and should be probably mentioned (and discouraged) in the documentation.
Acknowledgements
CLI version used
2.3.2
Environment details (OS name and version, etc.)
Ubuntu latest (Github workflow default runner)
The text was updated successfully, but these errors were encountered: