
[WIP] Install documentation for LightGBM on GPU #389

Closed
2 of 3 tasks
Laurae2 opened this issue Apr 9, 2017 · 41 comments

Comments

@Laurae2
Contributor

Laurae2 commented Apr 9, 2017

Related to PR #368

Is there any documentation to install, set up, and use LightGBM on GPU? As the docs folder and the Wiki did not change, I was wondering if there are specific steps required.

Similar to this, but for this repository.

Similar question for:

  • Python package GPU installation/usage
  • R package GPU installation/usage

To-do:

  • Linux
  • Windows
  • Mac OS / macOS (need someone with a Mac who can/wants to do this...)
@huanzhang12
Contributor

@Laurae2 I will work on moving the documentation from my development repository to here.

@huanzhang12
Contributor

For Python, I think just using the GPU-enabled shared library and passing device=gpu as an additional config parameter is sufficient. For the R package I am not sure...
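
As a minimal sketch of what this looks like from the Python side — the device parameter is from this thread, and gpu_platform_id / gpu_device_id are the platform/device selection parameters the docs discussed later in the thread cover; the training call itself is commented out because it assumes a GPU-enabled build is installed:

```python
# Sketch, not official docs: enabling GPU training from the Python package,
# assuming lib_lightgbm was compiled with GPU support.
params = {
    "objective": "binary",
    "device": "gpu",       # the one extra parameter needed for GPU training
    "gpu_platform_id": 0,  # optional: pick the OpenCL platform
    "gpu_device_id": 0,    # optional: pick the device on that platform
}

# With a GPU-enabled build installed, training would then look like:
# import lightgbm as lgb
# bst = lgb.train(params, lgb.Dataset(X, y), num_boost_round=10)
```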

@guolinke
Collaborator

I updated some documents for GPU, @huanzhang12 can you check them?

For the R/Python packages, I think they both need to pass the additional parameter device=gpu to enable it.
@Laurae2 @wxchan you can create a PR to simplify it.

@huanzhang12
Contributor

huanzhang12 commented Apr 10, 2017

@guolinke For the building procedure, we need to mention the dependency packages (cmake >= 3.2, boost >= 1.56, OpenCL >= 1.2) and how to install them.
On Windows it will probably work (nothing I used is platform-dependent), but it is untested.

@huanzhang12
Contributor

@guolinke @Laurae2 It seems I can't make a pull request to the wiki (GitHub does not have this feature), so I am not able to edit it directly. But most of the detailed instructions are on my development repository page. You can probably move most of the material to the wiki, perhaps adding a "GPU tutorial" to help people set up GPU training and achieve a good speedup.

@guolinke
Collaborator

@huanzhang12 you can put your materials into docs/... , and I can add a link to it in the wiki.

@huanzhang12
Contributor

@guolinke OK, I think I will put a tutorial there.

@Laurae2
Contributor Author

Laurae2 commented Apr 10, 2017

@guolinke for the R package we might have to convert from cmake to make in Makevars.

I found this but I don't know how it could be integrated: https://github.com/forexample/r-cmake

@huanzhang12 R on Windows uses MinGW, while on Unix it uses the default installation method. Are these enough on their own, or do we need more to install with GPU support? If cmake is a must, we might try to find a workaround for R.

Also, can we use environment variables? (if there are any to use)
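
A sketch of that environment-variable idea, in Python for illustration (the variable names BOOST_ROOT and OPENCL_ROOT are assumptions — nothing in LightGBM defines them, and the real check would live in the R package's build scripts):

```python
import os

# Hypothetical detection: decide whether a GPU build is even possible by
# reading environment variables the user would set to point at dependencies.
# BOOST_ROOT / OPENCL_ROOT are illustrative names, not LightGBM conventions.
def gpu_build_possible(env=os.environ):
    return bool(env.get("BOOST_ROOT")) and bool(env.get("OPENCL_ROOT"))
```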

We may also attempt to detect whether Boost exists (by checking whether the ../include/LightGBM folder exists):

  • If it does not exist => CPU only (this is our current way of installing the R package)
  • If it exists => install the GPU version
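
The folder-existence check above could be sketched like this (in Python for illustration only; the real logic would live in the R package's install scripts):

```python
from pathlib import Path

# Hypothetical sketch of the detection rule above: install the GPU version
# only when the ../include/LightGBM folder is present next to the package
# source directory, otherwise fall back to the CPU-only build.
def choose_build(pkg_src_dir):
    marker = Path(pkg_src_dir).parent / "include" / "LightGBM"
    return "gpu" if marker.is_dir() else "cpu"
```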

@wxchan
Contributor

wxchan commented Apr 10, 2017

I don't have a machine to test the GPU version right now. @huanzhang12 can you also help update the Python package?

@huanzhang12
Contributor

@Laurae2 It is fine if we don't have cmake. I list cmake 3.x as a dependency just because I need to look for the OpenCL and Boost headers/libraries automatically in CMakeLists.txt, and cmake 3.x can do this more reliably (than older cmake versions or hard-coded paths). Without cmake, if everything has been installed to a standard system location (like /usr/include, /usr/lib64, etc.) and the compiler can find them, compilation with R should be fine.

But the problem is detecting the existence of OpenCL and Boost and enabling GPU support accordingly. If we can't do this automatically, we will probably have to provide two Makevars, and a user must ensure that the necessary dependencies have been installed and manually use the one with GPU support to compile.

@Laurae2 I am not sure which environment variables you want to use. Do you want to detect the existence of OpenCL/Boost using environment variables?

@wxchan Currently it seems the Python package is working, by compiling LightGBM with GPU support as normal and then running python setup.py install. Is there anything else we need to do?

For both Python and R, as @guolinke mentioned, the user needs to pass the additional parameter device=gpu to LightGBM to enable the GPU at run-time. I am not sure if we want to do anything else, like adding a global function SetDevice("xpu") to the Python/R interfaces to globally enable the GPU on all later LightGBM calls. For the time being, I think it is acceptable to just let the user pass an additional parameter (device=gpu) to LightGBM each time.
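
The hypothetical SetDevice idea could look roughly like this — purely illustrative, no such function exists in LightGBM's API:

```python
# Illustrative only: a module-level default device merged into every call's
# parameters, so users would not need to repeat device=gpu each time.
_global_params = {}

def set_device(device):
    _global_params["device"] = device

def with_defaults(params):
    merged = dict(_global_params)
    merged.update(params)  # per-call parameters override the global default
    return merged
```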

BTW, the GPU code can be tested without real GPUs installed. The beauty of OpenCL (unlike CUDA) is that it is a universal standard, targeting a wide range of devices including CPUs and GPUs. You can install the Intel OpenCL runtime or the AMD APP SDK to get OpenCL working on CPUs (slow, but good enough for testing). Currently in .travis.yml we test the GPU code this way (and in fact we already use the Python interface there).
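
To see which OpenCL devices (CPU or GPU) a machine exposes, something like the following works — assuming the third-party pyopencl package, which is not a LightGBM dependency; the helper degrades to an empty list when no OpenCL runtime or Python binding is present:

```python
def opencl_devices():
    """List (platform, device) name pairs visible to OpenCL, or [] when the
    optional pyopencl binding or the OpenCL runtime is unavailable."""
    try:
        import pyopencl as cl  # third-party; an assumption, not a LightGBM dep
        platforms = cl.get_platforms()
    except Exception:
        return []
    pairs = []
    for platform in platforms:
        for device in platform.get_devices():
            pairs.append((platform.name, device.name))
    return pairs
```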

@wxchan
Contributor

wxchan commented Apr 11, 2017

Can it work on Mac? I got a segfault on my machine.

[LightGBM] [Info] This is the GPU trainer!!
[LightGBM] [Info] Total Bins 6143
[LightGBM] [Info] Number of data: 7000, number of used features: 28
[LightGBM] [Info] Using GPU Device: HD Graphics 5000, Vendor: Intel
[LightGBM] [Info] Compiling OpenCL Kernel with 256 bins...
Segmentation fault: 11
Thread 3 Crashed:
0   libstdc++.6.dylib             	0x0000000105964ff8 __cxxabiv1::__si_class_type_info::__do_dyncast(long, __cxxabiv1::__class_type_info::__sub_kind, __cxxabiv1::__class_type_info const*, void const*, __cxxabiv1::__class_type_info const*, void const*, __cxxabiv1::__class_type_info::__dyncast_result&) const + 24
1   libc++abi.dylib               	0x00007fffd556a44e __cxxabiv1::__class_type_info::can_catch(__cxxabiv1::__shim_type_info const*, void*&) const + 146
2   libc++abi.dylib               	0x00007fffd556bc0b default_terminate_handler() + 199
3   libobjc.A.dylib               	0x00007fffd6075f33 _objc_terminate() + 124
4   libc++abi.dylib               	0x00007fffd5568d69 std::__terminate(void (*)()) + 8
5   libc++abi.dylib               	0x00007fffd55687de __cxa_throw + 121
6   libboost_filesystem-mt.dylib  	0x0000000105932b72 boost::filesystem::detail::create_directory(boost::filesystem::path const&, boost::system::error_code*) + 274
7   libboost_filesystem-mt.dylib  	0x00000001059328dd boost::filesystem::detail::create_directories(boost::filesystem::path const&, boost::system::error_code*) + 461
8   lib_lightgbm.so               	0x000000010582182a boost::compute::detail::program_binary_path(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, bool) + 618
9   ???                           	0x0000000000000021 0 + 33
10  ???                           	0x000000010113f620 func_new.kwlist + 48

@huanzhang12
Contributor

huanzhang12 commented Apr 11, 2017

@wxchan It should work on Mac, as it detects your Intel HD 5000 GPU as an OpenCL device. Glad to see you got it compiled successfully!

Based on the backtrace, it seems the problem is in the offline cache of Boost.Compute. The offline cache stores compiled GPU kernels so that they do not need to be compiled again the next time you launch LightGBM with the GPU. From the backtrace, I guess it crashed while creating the cache directory.

Based on compute/include/boost/compute/detail/path.hpp, the default cache path is set to

    static const std::string appdata = detail::getenv("HOME")
        + path_delim() + ".boost_compute";

On my Linux machine it creates a folder ".boost_compute" in my home folder; I am not sure how it works on Mac. If you can figure out the exact reason for the crash, you can submit a PR to Boost.Compute.
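
For reference, the Python equivalent of that default — i.e. the directory to inspect or delete when a stale cache is suspected (on Unix-like systems; Mac behaviour is exactly what is in question here):

```python
import os

# Mirror of Boost.Compute's default offline cache location quoted above:
# $HOME/.boost_compute on Unix-like systems.
def boost_compute_cache_dir(home=None):
    home = home or os.environ.get("HOME", "")
    return os.path.join(home, ".boost_compute")
```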

Otherwise, you can remove the macro BOOST_COMPUTE_USE_OFFLINE_CACHE in src/treelearner/gpu_tree_learner.h to disable the offline kernel cache, then make clean and make again.

@wxchan
Contributor

wxchan commented Apr 11, 2017

It works after commenting out the BOOST_COMPUTE_USE_OFFLINE_CACHE line. Could this be added to the instructions?

@huanzhang12
Contributor

@wxchan Glad to know it works for you! I can add this to the instructions, but please report this bug to Boost.Compute so that they can fix it in a future release. The offline cache is a nice feature to have; otherwise the user has to wait for kernel compilation (Compiling OpenCL Kernel with 256 bins...) each time they launch LightGBM, and it can take some time.

I guess it is not a hard problem to fix, probably just adding an #ifdef for the OSX case. You can try to print the variable dir in https://github.com/boostorg/compute/blob/master/include/boost/compute/detail/path.hpp#L46-L48 and see what happens. I don't have any OSX system at hand, so I can't test it.

@Laurae2
Contributor Author

Laurae2 commented Apr 12, 2017

@huanzhang12 @guolinke I managed to compile with the GPU trainer in R + Windows! A PR for enabling GPU in the R package will come very soon.

[screenshot: LightGBM GPU trainer running in R on Windows]

@gugatr0n1c

@Laurae2 Which method did you use? Visual Studio or MinGW?

@Laurae2
Contributor Author

Laurae2 commented Apr 12, 2017

@gugatr0n1c I used only MinGW for the CLI, Python, and R; there is no need for Visual Studio to compile LightGBM and Boost (performance might vary, though).

@huanzhang12 I got the same issue on Windows as @wxchan's issue on Mac. Commenting out line 26 (BOOST_COMPUTE_USE_OFFLINE_CACHE) fixed it.

@huanzhang12
Contributor

huanzhang12 commented Apr 12, 2017

@Laurae2 Glad to know you got it working on Windows! I think we need to look into the BOOST_COMPUTE_USE_OFFLINE_CACHE issue a little, because the offline cache is a good feature to have; otherwise we have to wait a few seconds each time for the GPU kernels to compile, which is annoying.

@Laurae2 You can probably try to remove the OpenMP pragma at https://github.com/Microsoft/LightGBM/blob/master/src/treelearner/gpu_tree_learner.cpp#L569. I think Boost.Compute probably has a bug when using multiple threads to build the kernel. We need to report it to Boost.Compute if we can track down the issue.

@huanzhang12
Contributor

@Laurae2 I was able to fix the offline cache issue in Boost.Compute on Windows. I have created a pull request at the Boost.Compute repository: boostorg/compute#704

@Laurae2
Contributor Author

Laurae2 commented Apr 18, 2017

@huanzhang12 We now have good documentation for Linux and Windows, and on how to choose a device/platform. We still lack documentation for Mac, and I don't have access to my Mac currently.

Anyone volunteering to write some Mac documentation?

@huanzhang12
Contributor

huanzhang12 commented Apr 18, 2017

@Laurae2 Thanks for your hard work on getting the instructions on Windows ready!

Based on the previous issues on Mac you mentioned, it could be tricky to get everything working on Mac. Currently I don't have access to any Mac computers, and I can't find any cloud computing service providing Mac virtual machines with GPUs. So we need to look for some volunteers here.

@Laurae2
Contributor Author

Laurae2 commented Apr 25, 2017

@guolinke Great! Now the last one to do is Mac.

Perhaps you can add a call for contributions for docs on this.

@guolinke guolinke changed the title Documentation for LightGBM on GPU [WIP] Install documentation for LightGBM on GPU Apr 25, 2017
@guolinke
Collaborator

@Laurae2 Did we solve the GPU build for R?

@Laurae2
Contributor Author

Laurae2 commented Apr 25, 2017

@guolinke The GPU build works for R on Windows and on Linux, as they work nearly identically in R (just feed 4 extra variables in Makeconf + modify Makevars + add gpu_tree_learner in include); it is the safest way due to how R confines its own environment variables.

On Linux you are free to use the default compiler, which may be shared with the CLI/Python builds; on Windows it is mandatory to use Rtools' MinGW.

For Mac + R, it should be the same as Linux, except it first requires being able to compile the CPU-only version (gcc, OpenMP issues...).

@zhukunism

@Laurae2, can you please give more details on the GPU build for R on Linux? I am not sure what is involved in the "feed 4 extra variables in Makeconf + modify Makevars + add gpu_tree_learner in include" steps. Thanks!

@Laurae2
Contributor Author

Laurae2 commented Jun 1, 2017

@zhukunism On Linux it depends too much on where things are installed by the OS (or where you install them), so I can't really make it more detailed. The same rules from Windows apply on Linux, with a different file naming scheme:

[screenshot: Linux file naming scheme for the build variables]

@bushmanov

I successfully built LightGBM with GPU support on Ubuntu 16.04 and installed the Python version. I am able to run it in Python with device="gpu".

As far as the R version is concerned, which is installed by further running ./unix_build_package.sh and installing the resulting lightgbm_0.1.tar.gz package, it runs successfully on the CPU but crashes as soon as I insert device="gpu".

I would really appreciate clear instructions on installing the R version on Linux with GPU support.

@guolinke
Collaborator

guolinke commented Jun 5, 2017

@bushmanov
I am working on an easier install method for the R package in this PR: #584.
You are welcome to try it.
For GPU support, you need to set use_gpu <- TRUE in R-package/src/install.libs.R.

@zhukunism

@Laurae2, I set the Boost and OpenCL environment variables as you suggested, but I still get the errors below when building on my Ubuntu machine:

    * installing *source* package ‘lightgbm’ ...
    ** libs
    make: Nothing to be done for 'all'.
    * installing to /home/zhukun/Workspace/Library/lightgbm/install/lightgbm/libs
    ** R
    ** data
    ** demo
    ** preparing package for lazy loading
    ** help
    *** installing help indices
    ** building package indices
    ** testing if installed package can be loaded
    Error in dyn.load(file, DLLpath = DLLpath, ...) :
      unable to load shared object '/home/zhukun/Workspace/Library/lightgbm/install/lightgbm/libs/lightgbm.so':
      /home/zhukun/Workspace/Library/lightgbm/install/lightgbm/libs/lightgbm.so: undefined symbol: clGetCommandQueueInfo
    Error: loading failed
    Execution halted
    ERROR: loading failed

The OpenCL headers and .so are installed properly:

$ ls /usr/include/CL/
cl2.hpp cl_d3d10.h cl_d3d11.h cl_dx9_media_sharing.h cl_egl.h cl_ext.h cl_gl_ext.h cl_gl.h cl.h cl.hpp cl_platform.h opencl.h

$ ls /usr/lib/x86_64-linux-gnu/libOpenCL.so
/usr/lib/x86_64-linux-gnu/libOpenCL.so

Do you have any ideas? many thanks!
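
An undefined-symbol error like the one above usually means lib_lightgbm.so was built without linking against libOpenCL (-lOpenCL). A quick diagnostic, sketched under that assumption — clGetCommandQueueInfo is a standard OpenCL entry point, so a healthy system libOpenCL should export it:

```python
import ctypes
import ctypes.util

def libopencl_exports(symbol="clGetCommandQueueInfo"):
    """Return True if the system libOpenCL exports `symbol`,
    False if the library or the symbol cannot be found."""
    path = ctypes.util.find_library("OpenCL")
    if path is None:
        return False
    try:
        return hasattr(ctypes.CDLL(path), symbol)
    except OSError:
        return False
```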

@Laurae2
Contributor Author

Laurae2 commented Jun 9, 2017

@zhukunism For the R installation, you can now use a precompiled lib, which you have to put in the root of the LightGBM folder (if it was compiled elsewhere). Compile it as you would for the CLI, then compile the R package.

It is now much easier to compile R with GPU support that way. The current R installation can't compile with custom flags (you can override them by editing the package's src/install.libs.R).

Remember to set use_precompile to TRUE if you use a precompiled lib.

@jzun

jzun commented Jun 22, 2017

The following error came up when I used R 3.3.3 to install LightGBM with devtools:

Error in inDL(x, as.logical(local), as.logical(now), ...) : 
unable to load the shared object ‘C:/Users/DIUNI/Documents/R/win-library/3.3/lightgbm/libs/i386/lib_lightgbm.dll’:
LoadLibrary failure:  %1 is not a valid Win32 application. 

PS: cmake 3.9, Rtools 3.4, and VS2017 have been installed.
Any reply will be greatly appreciated!

@guolinke
Collaborator

@jzun It seems your R version is 32-bit (i386); can you use 64-bit R?

@jzun

jzun commented Jun 22, 2017

@guolinke It seems I used the 64-bit one to install; the header line in the R console:

R version 3.3.3 (2017-03-06) -- "Another Canoe"
Copyright (C) 2017 The R Foundation for Statistical Computing
Platform: x86_64-w64-mingw32/x64 (64-bit)

@guolinke
Collaborator

guolinke commented Jun 22, 2017

@jzun Maybe you are using the 32-bit Rtools?
You can list the folders in your C:\Rtools.

And do you have a folder C:\R32?

@jzun

jzun commented Jun 22, 2017

@guolinke
It worked after I reinstalled R and Rtools without the 32-bit components.
And the GPU version works too!
Thank you!

@jzun

jzun commented Jun 22, 2017

@guolinke
But something confusing came up in the running time test:
I used a GTX 1060 to train the multiclass demo, which took about 0.4–0.5 sec, but only about 0.01 sec on an AMD 1700 CPU, which is a little strange...

@guolinke
Collaborator

@jzun
I guess you are running with a small dataset.
When the data is small, using the GPU cannot yield a speed-up.

@jzun

jzun commented Jun 22, 2017

@guolinke
Yeah, I guess so.
I plan to translate the LightGBM installation guide into Chinese and put it on my WeChat Subscription called "统计译文"; is that okay?
I think we should encourage people to install and use LightGBM, and to report their suggestions and problems through effective channels.

@guolinke
Collaborator

@jzun sure

@Laurae2 Laurae2 closed this as completed Oct 1, 2017
@github-actions

This issue has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Aug 23, 2023