feat: Add initial benchmarking setup #3120

epochcoder · 2024-03-25T18:13:43Z

Will execute on pull requests, and run all benchmarks against
master and post a comparison as a PR comment

This will allow the maintainers of MyBatis to have concrete results for important flows on every pull request measured against master, I believe this would greatly help with PR's that say, "I've improved performance"

Thanks to @gavlyukovskiy for the benchmarking pipeline setup

Will execute on pull requests, and run all benchmarks against master and post a comparison as a PR comment

coveralls · 2024-03-25T18:18:11Z

coverage: 87.176% (+0.009%) from 87.167%
when pulling cefe77b on epochcoder:feature/mybatis-benchmarks-setup
into e9e9a29 on mybatis:master.

see: https://docs.github.com/en/actions/using-workflows/events-that-trigger-workflows#pull_request_target

… this in two steps

epochcoder · 2024-03-25T18:41:10Z

It ran the initial benchmarks on this PR, but failed to post the comments due to permissions.

I changed the event to be pull_request_target, so it will only start running on PR's after this is merged. (since it needs to be in master)

The reason for the event change is explained here

This is what it would have posted after the initial commit:

Note

These results are affected by shared workloads on GitHub runners. Use the results only to detect possible regressions, but always rerun on more stable machine before making any conclusions!

Benchmark results (pull-request, `1339c93`)

Benchmark                                                           Mode  Cnt    Score   Error  Units
BasicBlogBenchmark.retrieveSingleBlog                               avgt    5    0.588 ± 0.003  us/op
BasicBlogBenchmark.retrieveSingleBlogUsingConstructorWithResultMap  avgt    5  104.569 ± 1.134  us/op

See https://github.com/mybatis/mybatis-3/pull/3120/checks?sha=1339c93d34e870d0794ec6de460841adee5a903c for where it tried to post ;-)

harawata · 2024-03-25T21:14:49Z

@epochcoder ,

Thank you for the PR, but I don't like the idea very much.

In terms of performance, our focus is mostly on the mapping part (i.e. setting parameters, getting column values).
Parsing config/mappers is a heavy task, but it occurs only once during the startup in production environment.

So, this benchmark will not reflect the effect on production usage correctly and could encourage micro-optimization (which, in most cases, is not a good thing).
If a PR is aimed at improving performance, the effect should be measured against particular (and realistic) usage scenarios (which requires fair amount of data usually).

epochcoder · 2024-03-25T21:36:09Z

Thank you for the feedback @harawata, I should have been more clear, the included benchmark is just a dummy, this PR was more meant to showcase that we can have the possibility to include benchmarks in mybatis that matter, whatever that might mean for the project.

I've gone over a few past PRs and seen benchmarks here and there, and it's a pity that they cannot stay withing the code base and demonstrate particular improvements (or most importantly, regressions)

The benchmark I have on my branch of #101 reflects a more real scenario and definitely helped me ensure we do not affect current result mapping performance.

I do agree that we definitely do not want micro improvements! And that was also not supposed to be the purpose here.

But feel free to close if you think this was not a good idea ;-) and thanks for taking a look!

harawata · 2024-03-26T21:41:26Z

@epochcoder ,

Thank you for the follow-up.
If it's measured against realistic usages, it's not necessarily a bad idea.
I'm still unsure about running resource-consuming benchmarks, but GitHub is strong enough, maybe?

And, in case we adopt this, I would also like to see memory usage which is equally important as execution speed (I assume it's not included in the current setup, but correct me if I am wrong).

@hazendaz @kazuki43zoo Any thoughts?

gavlyukovskiy · 2024-03-27T07:21:42Z

To share some experience, we've been using this setup for a couple of months on one of our open source projects, it helped us quite a lot for detecting regressions early and then investigating further the results. We run it on every PR (around 50 PRs so far) and currently have 10 benchmarks in total, which takes around 6 minutes in total to execute on both master and PR branches (though you could tune that a bit to run less iterations or fewer benchmarks).
I think it is worth having the benchmarking setup for this project, even if you don't always run it on CI, but locally.

To reduce the noise, pipeline could also use labels / comments / manual dispatch as a trigger so that you manually trigger it when a PR is touching something that could affect the performance. We used that setup on one of our projects (not open-source, but also on GitHub). If that would be relevant for you, I could help with the implementation.

I'm still unsure about running resource-consuming benchmarks, but GitHub is strong enough, maybe?

From what I could find, the GitHub doesn't count this usage on open-source projects towards its GitHub Actions limits. Also, despite running on shared GitHub runners, the results were consistent (+-10%) for us, though it's better to rerun the benchmarks on some stable hardware when in doubt.

epochcoder · 2024-03-27T08:59:14Z

@harawata

And, in case we adopt this, I would also like to see memory usage which is equally important as execution speed (I assume it's not included in the current setup, but correct me if I am wrong).

Thats correct ;-) Ill add it to this PR. After #101 Merges, I can create a PR to drop these dummy benchmarks in favor of the ones I built in https://github.com/epochcoder/mybatis-3/commits/feature/101-jmh-performance-test/

.github/workflows/benchmarks.yaml

willie added 2 commits March 25, 2024 19:06

feat: Add initial benchmarking setup

7a36385

Will execute on pull requests, and run all benchmarks against master and post a comparison as a PR comment

feat: Add check for master branch too

3d0c272

willie added 3 commits March 25, 2024 19:18

feat: Fix pull request demo

1339c93

feat: Use pull request target, so we can comment on the PR

9fe0501

see: https://docs.github.com/en/actions/using-workflows/events-that-trigger-workflows#pull_request_target

feat: Because we changed the event that triggers, we don't need to do…

37adbf7

… this in two steps

feat: Add gc memory profiling

f8d25aa

gavlyukovskiy reviewed Apr 15, 2024

View reviewed changes

.github/workflows/benchmarks.yaml Outdated Show resolved Hide resolved

epochcoder added 2 commits April 15, 2024 10:29

Merge branch 'master' into feature/mybatis-benchmarks-setup

4df7342

fix: use correct sha when checking out

cefe77b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add initial benchmarking setup #3120

feat: Add initial benchmarking setup #3120

epochcoder commented Mar 25, 2024 •

edited

Loading

coveralls commented Mar 25, 2024 •

edited

Loading

epochcoder commented Mar 25, 2024 •

edited

Loading

harawata commented Mar 25, 2024 •

edited

Loading

epochcoder commented Mar 25, 2024

harawata commented Mar 26, 2024

gavlyukovskiy commented Mar 27, 2024

epochcoder commented Mar 27, 2024 •

edited

Loading

feat: Add initial benchmarking setup #3120

Are you sure you want to change the base?

feat: Add initial benchmarking setup #3120

Conversation

epochcoder commented Mar 25, 2024 • edited Loading

coveralls commented Mar 25, 2024 • edited Loading

epochcoder commented Mar 25, 2024 • edited Loading

Benchmark results (pull-request, 1339c93)

harawata commented Mar 25, 2024 • edited Loading

epochcoder commented Mar 25, 2024

harawata commented Mar 26, 2024

gavlyukovskiy commented Mar 27, 2024

epochcoder commented Mar 27, 2024 • edited Loading

epochcoder commented Mar 25, 2024 •

edited

Loading

coveralls commented Mar 25, 2024 •

edited

Loading

epochcoder commented Mar 25, 2024 •

edited

Loading

Benchmark results (pull-request, `1339c93`)

harawata commented Mar 25, 2024 •

edited

Loading

epochcoder commented Mar 27, 2024 •

edited

Loading