You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The profiler shows that cudaStreamSynchronize happens in display_if_exist.
In display_if_exist, find_property is expected to be float. However, it is tensor(1., device='cuda:0'), a float32 tensor on the GPU, causing the synchronization.
make 'find_' to be float in get data, fix#3991 .
On my device, the profiler indicates that `cudaStreamSynchronize` takes
negligible time, resulting in minimal speedup.
<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit
- **New Features**
- Enhanced data loading by adding a `collate_fn` parameter for more
flexible data collation.
- Improved data filtering by excluding keys containing "find_" in
addition to existing filters.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
make 'find_' to be float in get data, fixdeepmodeling#3991 .
On my device, the profiler indicates that `cudaStreamSynchronize` takes
negligible time, resulting in minimal speedup.
<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit
- **New Features**
- Enhanced data loading by adding a `collate_fn` parameter for more
flexible data collation.
- Improved data filtering by excluding keys containing "find_" in
addition to existing filters.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
Bug summary
The profiler shows that
cudaStreamSynchronize
happens indisplay_if_exist
.In
display_if_exist
,find_property
is expected to befloat
. However, it istensor(1., device='cuda:0')
, a float32 tensor on the GPU, causing the synchronization.deepmd-kit/deepmd/pt/loss/loss.py
Lines 32 to 43 in 0c0878e
DeePMD-kit Version
0c0878e
Backend and its version
PyTorch 2.3.1
How did you download the software?
Built from source
Input Files, Running Commands, Error Log, etc.
Use
examples/water/se_atten_compressible
to debug.Steps to Reproduce
cd examples/water/se_atten_compressible dp --pt train input.json
Further Information, Files, and Links
No response
The text was updated successfully, but these errors were encountered: