Create a Base.root_task method and document it #35726

mkitti · 2020-05-04T08:59:01Z

Currently, the constant Base.roottask is not documented as part of the public API per #35715. The closest API method is Base.current_task().

Rather than documenting Base.roottask I suggest creating Base.root_task() whose implementation could change further down the road if needed. The name is consistent with the existing Base.current_task().

Currently the implementation just returns Base.roottask. Alternatively, this could call jl_get_root_task, but more work might be needed to expose it.

Root Task detection is currently used by JavaCall.jl to detect if initialization is occurring on the root Task since otherwise a segmentation fault would occur if JULIA_COPY_STACKS were not set to 1 or "yes".

Root Task detection may be needed to implement a work around for using JavaCall from @async per @vtjnash and @c42f : #35048 (comment) #35048 (comment)

c42f · 2020-05-05T10:41:33Z

I'm not sure we actually want to have this as a public API. If the root task is a special resource, then it'll need to be shared by all packages, so I expect we'd want to reserve it for the system to run a specialized event loop. Then the API might be something like result = run_on_root_task(f) for user defined functions f?

mkitti · 2020-05-06T06:30:08Z

I agree that the root Task is a special resource, and that such a facility such as run_on_root_task should exist. Does that facility need to exist in Core, Base, or stdlib and should it be active by default? For the moment, I think we should implement it as a package.

Exposing a reference to the root task in the public API does not compromise its status as as special resource or prevent it from being shared by all packages. However, having a reference is necessary so that

A package on the receiving end of run_on_root_task that implements the loop to service it would need that reference to verify it is executing on the correct Task.
A package could determine if it is running on the root Task or not by comparing root_task() == current_task().
A call from a package not on the root_task could use yieldto(root_task())

c42f · 2020-05-07T01:52:03Z

Exposing a reference to the root task in the public API does not compromise its status as as special resource or prevent it from being shared by all packages

True, but it will compromise our ability to supply a different, more intrinsically sharable or useful API in the future. Also, just getting a reference to the root task really isn't very useful by itself: it doesn't imply that you have a way to run code on that task or otherwise use it as a resource for doing anything useful!

A call from a package not on the root_task could use yieldto(root_task())

But this doesn't allow you to run code that you'd like. It just schedules the root task for whatever code the root task happens to be running already.

For it to be actually useful, the system must either guarantee that the root task is the task which runs user code at startup, or is otherwise left in a waiting state, ready to run some user-defined function. In both cases, it's about having a way to run given code on the task, not having a reference to the task data structure.

So how do we move forward? I think the best way to take action right now is to prototype a shared event loop for the root task in a package, relying on the current implementation detail of which task runs which code at system initialization in order to steal control of the computation which the root task is running. Revise plays tricks like this in Revise.steal_repl_backend. There's nothing stopping you from using Base.roottask if it helps in this; it's just not guaranteed to exist in the future. But that's a good thing - it means Base is free to settle on a really solid API once we've figured out what it should be. Part of that could be changing the way that the REPL backend works internally to make your package easier to implement. For example, running both the REPL frontend and backend on new tasks and leaving the root task blocked on a Channel, ready to run user code.

In the longer term this might turn into some official API in Base for freeing up the root task as necessary.

tkf · 2020-05-07T02:14:18Z

Is isroottask() = roottask === current_task() all what JavaCall.jl needs at the moment? This may be an OK API to have in Base?

It may also be something useful even if we run_on_root_task have. I think run_on_root_task(f) would have FIFO behavior so that the resource would be used fairly. But, if you really don't want to let it go once you are on the root task, you can do isroottask() ? f() : run_on_root_task(f).

mkitti · 2020-05-07T19:25:33Z

@tkf isroottask() would be useful to have and is a good compromise.

https://github.com/mkitti/JavaCall.jl/blame/master/src/jvm.jl#L171

Perhaps it should take an optional Task for comparison if we are interested in checking a Task other than the current one.

isroottask( task::Task = current_task() ) = Base.roottask === task

c42f · 2020-05-12T10:48:14Z

I'm ok with isroottask as a name. But I don't really understand why it's a generally useful function to have. I mean, I understand why you need to run on the root task for JavaCall, but the JavaCall problems are very implementation specific (to both the JVM and to the Julia runtime).

So isroottask() doesn't even express the semantics you want! If Julia were to run the logical julia-level root task on a different stack for some reason, issroottask() is not the check you would want.

Given that the test you want is very implementation-specific, I don't see what the problem is with reaching into the implementation and just doing the Base.roottask == current_task() check yourself in JavaCall?

tkf · 2020-05-12T22:59:02Z

It may be partially my "fault" as the first version of the "public API" PR #35715 (which may be a part of the cause behind this PR) used very strict wording. If so, I think this PR is good feedback to #35715. The reality of software is that implementation details leak sometimes (often?) and it's still useful to rely on it and build a stable interface.

c42f · 2020-05-13T05:13:22Z

The reality of software is that implementation details leak sometimes (often?) and it's still useful to rely on it and build a stable interface.

Yes! All I'm arguing here is that, in its current form, Base is not equipped with the right tools to provide a usable abstract API which can make JavaCall reliable. That is, anything we could provide immediately without further design work in Base is very closely tied to the current implementation.

Therefore, I'm arguing that JavaCall is the appropriate package which should be providing the stable abstraction here ("please run this java function!"), not Base ("is this task safe to run Java code on?").

Undoubtedly this means JavaCall will need to rely on implementation detail of Base for the moment, but adding isroottask will not change this fact.

mkitti · 2020-05-14T01:07:46Z

The actual question is: "Base, are we running on a valid stack where the dynamic library I just loaded installed a signal handler?"

Currently, the answer is yes if:

The current Task is the root Task where we never had to copy the stack OR
The environmental variable JULIA_COPY_STACKS is 1 or yes.

It is not clear to me that this is necessarily specific to embedding Java.

c42f · 2020-05-14T04:25:09Z

"Base, are we running on a valid stack where the dynamic library I just loaded installed a signal handler?"

I was under the impression it was more complicated than this. Has the following situation changed?

#31104 (comment)

mkitti · 2020-05-14T07:27:03Z

With JULIA_COPY_STACKS=1 on Linux and Mac, we can load JavaCall pretty well now even with @async

$ JULIA_COPY_STACKS=1 ~/src/julia-1.4.1/bin/julia
               _
   _       _ _(_)_     |  Documentation: https://docs.julialang.org
  (_)     | (_) (_)    |
   _ _   _| |_  __ _   |  Type "?" for help, "]?" for Pkg help.
  | | | | | | |/ _` |  |
  | | |_| | | | (_| |  |  Version 1.4.1 (2020-04-14)
 _/ |\__'_|_|_|\__'_|  |  Official https://julialang.org/ release
|__/                   |

julia> using JavaCall
[ Info: Precompiling JavaCall [494afd89-becb-516b-aafa-70d2670c0337]

julia> JavaCall.init()

julia> jls = @jimport java.lang.System
JavaObject{Symbol("java.lang.System")}

julia> jcall(jls,"getProperty",JString,(JString,),"java.version")
"1.8.0_252"

julia> jls_out = jfield(jls,"out",@jimport java.io.PrintStream)
JavaObject{Symbol("java.io.PrintStream")}(Ptr{Nothing} @0x0000000002317218)

julia> @async jcall(System_out,"println",Nothing,(JString,),"Hello world");
Hello world

Confusingly, JULIA_COPY_STACKS=1 does not work on Windows (Julia crashes with it set), but JavaCall works well on that Windows x64 without it:
https://travis-ci.org/github/JuliaInterop/JavaCall.jl

It is very buggy on x86 Windows:
https://ci.appveyor.com/project/aviks/javacall-jl-6c24s/build/job/2i41xg4iqen6h3e9

mkitti · 2020-05-14T07:30:00Z

I created a generic worker loop here with basic functionality: https://github.com/mkitti/TaskWorkers.jl

Much to do still. Comments welcome.

barche · 2023-10-14T22:22:01Z

Since Qt version 6.5, this is no longer JavaCall-specific, but the same issue also affects QML.jl. There, too, as a fix we need to determine if Julia code is running on the root task (also because of the stack differences between tasks).

mkitti added 2 commits May 4, 2020 22:52

Add a Base.root_task(), export, and document it for public API.

7ee06dd

Add root_task to parallel.md

a287eb3

mkitti force-pushed the root_task_func_and_doc branch from 538368e to a287eb3 Compare May 5, 2020 03:52

Correctly associated doc string with root_task

3c28f34

Add Base.isroottask to test if a Task is the root Task

dfbf849

mkitti mentioned this pull request Jun 7, 2020

Accessors for current_task, safe_restore in TLS #36064

Merged

mkitti mentioned this pull request Jan 27, 2021

remove unused roottask constant #31983

Closed

vtjnash requested a review from Keno April 20, 2021 18:48

mkitti mentioned this pull request Oct 11, 2023

JVM fails to load in 1.1 (JavaCall.jl) #31104

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create a Base.root_task method and document it #35726

Create a Base.root_task method and document it #35726

mkitti commented May 4, 2020

c42f commented May 5, 2020

mkitti commented May 6, 2020

c42f commented May 7, 2020

tkf commented May 7, 2020

mkitti commented May 7, 2020

c42f commented May 12, 2020

tkf commented May 12, 2020 •

edited

Loading

c42f commented May 13, 2020

mkitti commented May 14, 2020

c42f commented May 14, 2020

mkitti commented May 14, 2020

mkitti commented May 14, 2020

barche commented Oct 14, 2023

Create a Base.root_task method and document it #35726

Are you sure you want to change the base?

Create a Base.root_task method and document it #35726

Conversation

mkitti commented May 4, 2020

c42f commented May 5, 2020

mkitti commented May 6, 2020

c42f commented May 7, 2020

tkf commented May 7, 2020

mkitti commented May 7, 2020

c42f commented May 12, 2020

tkf commented May 12, 2020 • edited Loading

c42f commented May 13, 2020

mkitti commented May 14, 2020

c42f commented May 14, 2020

mkitti commented May 14, 2020

mkitti commented May 14, 2020

barche commented Oct 14, 2023

tkf commented May 12, 2020 •

edited

Loading