Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

prod_force: support multiple frames in parallel #2600

Merged
merged 6 commits into from
Jun 12, 2023

Conversation

njzjz
Copy link
Member

@njzjz njzjz commented Jun 10, 2023

The previous prod_force did not support multiple frames in parallel, which was slow when the batch size was large.

This PR adds support so that prod_force can be parallelized in the dimension of the samples.

When the batch size is about 70, the prod_force op is 10x faster than before on GPU cards.

The previous `prod_force` did not support multiple frames in parallel, which was slow when batch size is large.

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>
Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>
@codecov
Copy link

codecov bot commented Jun 10, 2023

Codecov Report

Patch coverage: 87.75% and project coverage change: +0.02 🎉

Comparison is base (4b822b8) 76.66% compared to head (b7787ee) 76.68%.

Additional details and impacted files
@@            Coverage Diff             @@
##            devel    #2600      +/-   ##
==========================================
+ Coverage   76.66%   76.68%   +0.02%     
==========================================
  Files         233      233              
  Lines       24177    24182       +5     
  Branches     1711     1695      -16     
==========================================
+ Hits        18536    18545       +9     
  Misses       4518     4518              
+ Partials     1123     1119       -4     
Impacted Files Coverage Δ
source/op/prod_force_multi_device.cc 91.40% <33.33%> (+0.82%) ⬆️
source/lib/src/prod_force.cc 83.58% <95.34%> (+6.65%) ⬆️

... and 1 file with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

njzjz added 3 commits June 9, 2023 21:18
Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>
Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>
Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>
@njzjz njzjz marked this pull request as ready for review June 10, 2023 02:23
Copy link
Member

@denghuilu denghuilu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>
wanghan-iapcm pushed a commit that referenced this pull request Jun 12, 2023
@wanghan-iapcm wanghan-iapcm merged commit bb0d02b into deepmodeling:devel Jun 12, 2023
@njzjz njzjz mentioned this pull request Jul 17, 2023
wanghan-iapcm pushed a commit that referenced this pull request Jul 20, 2023
Add an experimental model called pairwise DPRc, which is fragment-based
and integrated with QM/MM. Compression inference and training are
supported.
Unit tests and documentation have been added.

Some features or bugfix to implement this PR have been merged in #2549,
#2600, #2601, #2604, #2631, #2635, #2665, #2666, #2667, and #2679.
This PR makes some changes to `model.build_descrpt` additionally:
- fix errors when the suffix is not empty
- fix errors when `fparam` or `aparam` are given
- support model-customized `input_map`

---------

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants