Make LLVM Profiling robust for multithreaded programs #47778
Conversation
It's not super clear to me what the purpose of this PR is. Is the intent to allow access to these timings within Julia code? If so, why is the timing struct embedding a vector<> and not a Julia array? Also, the previous implementation of timing reporting should have been thread-safe, albeit not manipulable from Julia, since we locked before ever operating on the stream.
@pchintalapudi This is very much a work in progress :)
Exactly, we want to access the timings from within Julia while potentially having another thread do code generation. Interesting, we thought that we might run into a case where we unset the streams while codegen is trying to write to them and finds a null pointer. Where does the locking happen?
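A minimal sketch of that scenario, with illustrative names rather than the actual globals in Julia's C++ sources: one thread disables logging by clearing a shared stream pointer while another thread, mid-optimization, still tries to write through it.

```cpp
// Sketch only: dump_stream, disable_logging, and log_timing are hypothetical
// names, not the real symbols in the Julia runtime.
#include <atomic>
#include <cstdio>

static std::atomic<std::FILE *> dump_stream{nullptr};

// Thread A: turning logging off from Julia.
void disable_logging() {
    dump_stream.store(nullptr, std::memory_order_release);
}

// Thread B: logging from inside an ongoing optimization. If it read the
// pointer once at the start and reused it, or checked it non-atomically,
// it could race with the store above and write through a stale/null pointer.
void log_timing(const char *module_name, double time_ns) {
    std::FILE *s = dump_stream.load(std::memory_order_acquire);
    if (s != nullptr)
        std::fprintf(s, "- module: %s\n  time_ns: %.0f\n", module_name, time_ns);
}
```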
@pchintalapudi Does the actual optimization in
🤔 @bachdavi: now that I'm looking at the PR change, we maybe don't want to hold the lock over the whole LLVM optimization, since that would force serialization of two different threads doing LLVM optimization if we were profiling. So I think that the string-buffer approach is probably the best one. 👍 Sorry I didn't think of this when we discussed it earlier! Thanks for the quick change today :)
We actually specifically want to avoid holding a lock during optimization, so that in the future multiple modules can be optimizing at the same time. Rather than write to a string first, I would say holding the data in variables and dumping the entire yaml block post-optimization using those variables would be a better solution. Guess I got sniped by @NHDaly :)
💡 @pchintalapudi, yeah, just keeping the results in variables seems even better. 👍 👍
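As a rough sketch of that idea (illustrative names and a plain mutex, not the code this PR actually adds): the timings are collected in locals while no lock is held, the complete YAML record is assembled in a buffer, and the lock is taken only to dump the finished block.

```cpp
// Sketch, not the PR's implementation: collect results in locals, then emit
// the whole YAML record at once so concurrent optimizations never interleave
// partial output and no lock is held while LLVM passes run.
#include <chrono>
#include <iostream>
#include <mutex>
#include <sstream>
#include <string>

static std::mutex dump_mutex;

void optimize_and_log(const std::string &module_name) {
    auto before = std::chrono::steady_clock::now();
    // run_llvm_passes(module);  // hypothetical: optimization runs lock-free
    auto after = std::chrono::steady_clock::now();
    double time_ns =
        std::chrono::duration<double, std::nano>(after - before).count();

    // Build the complete record in one buffer...
    std::ostringstream yaml;
    yaml << "- module: " << module_name << "\n"
         << "  time_ns: " << time_ns << "\n";

    // ...and lock only for the final write.
    std::lock_guard<std::mutex> lock(dump_mutex);
    std::cout << yaml.str();
}
```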
As an aside, we don't currently measure machine code optimization and emission time (this happens in jitlayers.cpp, CompilerT::operator()), although it probably constitutes a decent fraction of optimization time. That may also be something of interest if you're looking to collect optimization statistics.
I was wondering about that, @pchintalapudi: Does that get measured in
It does get measured by
This uses a
LGTM, thanks, @bachdavi!
I've updated the description as well, to explain this situation better.
@pchintalapudi: Can you do a final review pass as well?
Friendly ping for @pchintalapudi. Do you want to review? We've been using this in production at RAI and so far it seems to be working correctly. Can we merge this? CC: @IanButterworth as well, since this is profiling related.
Seems fine to me
Needs a rebase
Also, are the test failures here real? Or unrelated?
Mh, it seems so indeed!
Thanks Valentin and David. The remaining failures look unrelated:
and
* Use stringstream to atomically write LLVM opt timings
* Add boolean to ensure we don't _only_ write the after block
* Use ios_printf

Co-authored-by: Nathan Daly <NHDaly@gmail.com>
(cherry picked from commit 8985403)
This is a draft PR to make LLVM optimization logging correct for multithreaded programs.
It guards against the case where you enable or disable LLVM optimization logging (@snoopl) in one thread while another thread is midway through an optimization, or stops it before it has finished. Without this PR, this can result in a malformed .yaml file, since the logging might start with the "after" clause, or it might contain only the "begin" clause. After this PR, such cases will simply not be logged, and only LLVM optimizations that occurred entirely while logging was enabled will be logged.
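Roughly what that guard looks like, as a sketch under assumed names (dump_llvm_opt_stream, run_passes); the real change lives in Julia's C++ JIT code and, per the commit message above, uses a stringstream plus a boolean: the stream is checked before the passes run, the "before" clause is buffered, and the "after" clause is emitted only if the "before" clause was actually written.

```cpp
// Sketch with hypothetical names, not the exact code in this PR: buffer the
// record and only emit it when both halves were captured while logging was on.
#include <atomic>
#include <cstdio>
#include <sstream>

static std::atomic<std::FILE *> dump_llvm_opt_stream{nullptr};

void optimize_module_with_logging() {
    std::ostringstream record;
    bool wrote_before = false;

    if (dump_llvm_opt_stream.load() != nullptr) {
        record << "- before:\n    ...\n";   // "before" clause of the YAML entry
        wrote_before = true;
    }

    // run_passes(module);  // hypothetical: the LLVM pass pipeline runs here

    std::FILE *s = dump_llvm_opt_stream.load();
    // Emit the "after" clause only if the "before" clause was written; skip
    // the record entirely if logging was toggled mid-optimization.
    if (s != nullptr && wrote_before) {
        record << "  after:\n    ...\n";
        std::fputs(record.str().c_str(), s); // one write for the whole record
    }
}
```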