static compile round 2 #4898

vtjnash · 2013-11-23T08:15:02Z

This is largely complete. I'll probably try to merge it piecemeal, but I thought people might like to see the status of this.

Big TODO items remaining:

Based on: Squash reloadso changes for cleaner history Sanitize macro names in llvm to avoid implied symbol Re-add jl_dump_bitcode and clean up debugging code generate llvm global variables for most literal_pointer_val switch to llvm-managed global variables, load and save them get much further in the static compile boot process (barely) working static-compiled repl static compiling

aviks · 2013-11-23T10:26:37Z

Woo hoo!

Will there be any hooks to compile (and load) packages?

timholy · 2013-11-23T11:03:05Z

This is huge, especially if it will (eventually) be possible to leverage for packages!

StefanKarpinski · 2013-11-23T15:56:41Z

Yes, this is great to see. Very, very exciting.

staticfloat · 2013-11-23T17:36:42Z

@vtjnash when you're ready for testing, just let me know and I'll try to break it on all the boxes I run Julia on. ;)

JeffBezanson · 2013-11-24T03:11:04Z

This is looking pretty good. It is actually a fairly non-invasive change; basically all we needed were some tables to map things to their names in the dynamic symbol table. And the same changes will surely work alongside MCJIT, but @loladiro can confirm that.

vtjnash · 2013-11-24T05:09:12Z

yeah, its almost surprising how minimal the changes are.

well, sort-of. originally i used some tables to map them to names, but that caused some issues with the linker, aside from being inefficient. in the second commit here, i changed to just mapping them to indices in a private julia table.

the MCJIT should work about the same, except we will need to dump all of the modules

renames the --bare [-b] flag to --build the --build flag now requires an argument (where to save the system image) the build flag triggers mode selection so that the code can get correctly generated and saved

partially address some minor issues with getopt usage miscounting the arguments

vtjnash · 2013-12-01T04:26:32Z

it is very exhilarating to watch julia launch so fast: on some win32 testing where I would launch time julia-readline, wait for the prompt, hit ^D, and wait for it to close, average timings dropped from > 8 seconds to < 2 seconds!

this just has one small item remaining (merging function pointer lookups in ccall, for better efficiency).

johnmyleswhite · 2013-12-01T21:00:45Z

Just tried this. It's amazing how fast the REPL starts on my machine with this branch. Well done!

staticfloat · 2013-12-01T21:07:00Z

Wow. Just... wow. I figured I had to get in on the buzz and see how much of a deal this actually makes..... On my (newer) macbook pro:

$ time julia -e 'return 0'

real  0m3.078s
user  0m3.055s
sys   0m0.118s

$ time ./julia-fast -e 'return 0'

real  0m0.251s
user  0m0.232s
sys   0m0.071s

This is awesome. Can't wait till we can apply this to packages as well!

Keno · 2013-12-01T21:12:40Z

0.3 is gonna be awesome :)

johnmyleswhite · 2013-12-01T22:52:04Z

In case it's helpful information, this passes all tests on my OS X machine, but segfaults as soon as I type 1 + 1 in the REPL.

vtjnash · 2013-12-02T02:13:36Z

@johnmyleswhite I'm not sure what is going wrong there -- for some reason, jl_parse_input_line is returning (<null>, <null>) (e.g. a tuple of 2 NULLs). If you hit enter on the first repl prompt, the later ones seem to accept input just fine.

StefanKarpinski · 2013-12-13T15:16:28Z

That's great. I really do think that we should consider moving some of the parts of Base that are less commonly used and require loading external libraries out. The linear algebra stuff is getting a bit out of hand – I mean, I love that stuff, but it's starting to be a lot of stuff to ship with.

tshort · 2013-12-13T21:59:26Z

I'm late to the party, but I'd like to add my YAHOOO! and thanks to Jameson and Isaiah.

ViralBShah · 2013-12-14T06:05:08Z

I am late too - but this is really amazing. We probably could move some of the more uncommon linear algebra stuff out of base, but hey, with these load times, I am encouraged to add stuff rather than remove!

StefanKarpinski · 2013-12-14T14:47:41Z

Well, I think the end goal here should be matching load times of python and ruby, so a 10x further speedup would be ideal.

ivarne · 2013-12-14T15:50:27Z

Baseline memory usage might also be a good reason to push less commonly used stuff out of Base.

tknopp · 2013-12-14T19:09:48Z

I would like to second that it would be great when base Julia would be a little bit more lightweight. This especially important when embedding Julia. Lua has such a success as an embedding language as it is so lightweight.

From my perspective it would be great if their would be, beside Base, about 4-5 modules living in the Julia tree that are precompiled and part of the Julia distribution. The Pkg management in Julia is great, but I think that their is a too high gap between putting things into Base or putting things into a package. And its quite hard to draw a distinct line. Just an example: I have been looking for a median filter and this would be a perfect example of a function that should not be part of Base but part of a "Signal" module, which could live in the Julia source tree. The fft routines are an example of which I think should not be Part of base but still in one of the "high quality" module that could live in the Julia repo.

JeffBezanson · 2013-12-14T20:34:28Z

I agree it is hard to know where the line is. I think the default should be "batteries included", but it would be great if it were easier to remove pieces you don't need.

A lot of our memory use on startup is just openblas. Particularly with multiple threads, openblas allocates a large amount of memory.

ViralBShah · 2013-12-14T20:57:27Z

My default is also to have "batteries included", but I do believe there may be a few things we can move out of Base. Even so, it is unlikely to have any impact on startup time or memory.

With @vtjnash 's patch for setting the openblas threads, we should see much lesser memory utilization. Also, the default max number of threads for openblas are now much lower in our build than before.

tknopp · 2013-12-15T05:30:46Z

@JeffBezanson: Does "batteries included" also mean "available at startup"? In Python I also have to "import os" although it is in the standard library.
But anyway, I think that it makes sense to split Base into 5 modules forming the standard library of Julia. Then make it easy to chose which of these is available at startup.

JeffBezanson · 2013-12-15T05:38:05Z

I think that's very reasonable; if it's easy to change what's available at startup, then that choice doesn't matter so much.

ivarne · 2013-12-15T06:24:07Z

It is fairly easy to add using LinAlg to the juliarc file. Only problem then is that you get a fragmented environment where different machines have different configuration. Maybe we could ship a default juliarc file to include the standard packages.

johnmyleswhite · 2013-12-15T06:26:21Z

I'd prefer that there be a command-line switch like --minimal that avoids pulling in things like LinAlg. This means that almost all users will get the full system, but conservative users will still have an easy way to remove things they don't want.

tknopp · 2013-12-15T06:46:03Z

Whats wrong with a user base choosing different default imports? If Julia wants to be more than a "Matlab environment" it should be easy to get a small memory footprint and startup time.

I also do not get whats so bad about explicit imports. In most programming languages one has to import something before it is usable (import os in Python). If it should be there at startup, it can be put into juliarc.jl

johnmyleswhite · 2013-12-15T06:54:09Z

What's wrong with the user base having different defaults is that you have to go to more work to make sure that code is portable across different Julia installations.

Personally, I think Python's approach to hiding basic functionality in modules is an example of bad design for a language used for scientific programming. Both Matlab and R introduce a huge amount of functionality by default and this is something I think is very desirable for aiding discoverability. I would be very sad to see Julia stop doing this. The hiding of functionality is the main reason I did not use Python historically.

tknopp · 2013-12-15T07:43:55Z

It is a different thing what is available at the REPL and whats available when executing a Julia program. If you have one Base module an 5 Standard modules, it could be convention to import all in the REPL but only the Base module when executing a Julia program.

This would not make writing portable code hard. One would have to import the Standard modules when one uses them in a different module.

For the discoverability there can be other solutions then importing everything.

The point why I am proposing the two level structure is that currently it is quite hard to get things into Base. There are various functions in Images.jl, that IMHO are "batteries" and should be included (e.g. gaussian filter). But proposing this might lead to a controversial discussion. On the other hand I would assume that everyone would agree that this should go into a "second level stdlib"

tknopp · 2013-12-15T13:06:20Z

I have created #5155 for a dedicated discussion on a Base/Standard module.

gitfoxi · 2013-12-15T13:19:15Z

Julia has two competing notions of hierarchical namespace. One is Module, the other is Type. If you were only allowed one notion hierarchical namespace, which would you choose?

Taking this to it's logical conclusion, imagine a global Type hierarchy that includes every Type -- and so every Method -- in every Module. If I call a specific Method on a specific Type it is up to the compiler to resolve the code to execute. Whether that code is in some pre-compiled base .so that is mmap'd on startup, another .so that isn't, fresh source code in my ~/tmp directory or in a package on Github doesn't matter to me as long as the system has a way to find, compile and execute. I shouldn't have to using or import anything. If I call allknowledge = translate(transmogrify(url"wikipedia.org", :LightPurple), MongolianLanguage()) then that ends my conversation with the computer. Leave the compiling to the compiler -- that's what it's good at.

In all seriousness though, just having a function available in a so/dylib/dll isn't something you should optimize. It's the OS' job to figure it out. Otherwise you'd have hundreds of files in /usr/lib and -- oh, you do? Well, hundreds more than that if you can imagine, it would look like /usr/include.

tknopp · 2013-12-15T13:40:48Z

I have not said one function per dll. Modularization should be done in a sane manner. And if done right it actually helps a lot structuring source code. I agree that not needing using seems to be the perfect world but I do not see how this could be implemented efficiently.

JeffBezanson · 2013-12-15T19:09:49Z

The only purpose of modules and using is to control the answer to the question "when I say x, what x does it refer to?" If that were entirely automated and there were no using, it would be equivalent to having a single namespace since the only x you could get would be whatever the language picks.

StefanKarpinski · 2013-12-15T19:13:11Z

Also, types aren't a hierarchical namespace in any way that makes sense to me.

JeffBezanson · 2013-12-15T19:13:44Z

One could actually argue that everything exported should be part of a single namespace, such as Base or some common pool. Then you'd have some of the properties of a single namespace but still be able to hide definitions.

toivoh · 2013-12-15T19:18:37Z

In the name of modularity, please don't pool all exported definitions! I for one like to organize my code using modules internal to my modules. I want those to be able export things that are not ultimately exported. I also think that it's really helpful to have some explicit control of what gets imported into a given module, so that I know what it depends on.

JeffBezanson · 2013-12-15T19:23:03Z

I agree; I was playing a bit of devil's advocate. Modules are one of those things where people tend to expect it to read their mind and know what they want, but many people want something different.

toivoh · 2013-12-15T20:18:20Z

Oh, I should have seen that :)

JeffBezanson · 2013-12-16T18:03:50Z

I am noticing this change has disrupted my workflow --- I'm used to reading email or something while waiting for julia to start up, and that is not possible anymore :)

vtjnash · 2013-12-16T18:27:03Z

So true.

Also, I like disruptive changes :P

aviks · 2013-12-16T20:34:12Z

While developing the JavaCall.jl package, most bugs cause a segfault. Further, a JVM cannot be loaded twice in the same process, even after being destroyed cleanly... so the module cannot be reloaded easily. This change has therefore saved me many hours of time in the last two weeks.

vtjnash added 3 commits November 22, 2013 23:14

remove dependence on llvm-mangled names, improve Makefile rules

96ae019

find sys.so relative to sys.ji

b4eaf40

ghost assigned JeffBezanson Nov 25, 2013

vtjnash and others added 14 commits November 28, 2013 17:07

create a valid sys.dylib file for any compile

1f51c45

renames the --bare [-b] flag to --build the --build flag now requires an argument (where to save the system image) the build flag triggers mode selection so that the code can get correctly generated and saved

better error messages for literal_pointer_val in ccall

f8c2456

cleanup some elements of static compile, based on Jeff's comments. Also

8ea70ac

partially address some minor issues with getopt usage miscounting the arguments

make static_compile build work on linux

7f002aa

try to make travis happy with static_compile

9ac166d

partial windows compatibility

dfbf414

lookup all ccall/cglobal values at runtime (very inefficiently)

67a11cb

fix some types and nulls checking in previous commit

f4f4c2f

get static compile most of the way to supporting windows

f0ec45d

fix #4213. seriously, that was all that was needed?

d2a088b

various attempts at fixing windows support for static_compile

f6a0f7a

fix stack alignment bug on win32

8022255

fix LLVM issues with corrected stack alignment code for windows

7425938

functional static compile support for win32

abac0df

tknopp mentioned this pull request Dec 15, 2013

Shrinking Base and Introducing a Standard Library #5155

Closed

ivarne mentioned this pull request Dec 22, 2013

Splash screen days-count fails #5218

Closed

tkelman deleted the jn/static_compile_2 branch April 19, 2015 11:30

static compile round 2 #4898

static compile round 2 #4898

Conversation

vtjnash commented Nov 23, 2013

aviks commented Nov 23, 2013

timholy commented Nov 23, 2013

StefanKarpinski commented Nov 23, 2013

staticfloat commented Nov 23, 2013

JeffBezanson commented Nov 24, 2013

vtjnash commented Nov 24, 2013

vtjnash commented Dec 1, 2013

johnmyleswhite commented Dec 1, 2013

staticfloat commented Dec 1, 2013

Keno commented Dec 1, 2013

johnmyleswhite commented Dec 1, 2013

vtjnash commented Dec 2, 2013

StefanKarpinski commented Dec 13, 2013

tshort commented Dec 13, 2013

ViralBShah commented Dec 14, 2013

StefanKarpinski commented Dec 14, 2013

ivarne commented Dec 14, 2013

tknopp commented Dec 14, 2013

JeffBezanson commented Dec 14, 2013

ViralBShah commented Dec 14, 2013

tknopp commented Dec 15, 2013

JeffBezanson commented Dec 15, 2013

ivarne commented Dec 15, 2013

johnmyleswhite commented Dec 15, 2013

tknopp commented Dec 15, 2013

johnmyleswhite commented Dec 15, 2013

tknopp commented Dec 15, 2013

tknopp commented Dec 15, 2013

gitfoxi commented Dec 15, 2013

tknopp commented Dec 15, 2013

JeffBezanson commented Dec 15, 2013

StefanKarpinski commented Dec 15, 2013

JeffBezanson commented Dec 15, 2013

toivoh commented Dec 15, 2013

JeffBezanson commented Dec 15, 2013

toivoh commented Dec 15, 2013

JeffBezanson commented Dec 16, 2013

vtjnash commented Dec 16, 2013

aviks commented Dec 16, 2013