[R-package] Support trees with linear models at leaves #3319

btrotta · 2020-08-19T08:24:33Z

PR #3299 implements trees with linear models at the leaves. I have only tested this with the Python package, and it's possible that some additional changes are needed to get it working for R. To calculate the linear model, we need a Dataset object that contains the full feature data (for numerical features, not needed for categorical), rather than just the binned data. Therefore, when we create the dataset we need to know whether linear_tree is True. In the Python interface, we first create the dataset, then call lgb.train as follows:

train_data = lgb.Dataset(x)
params = {'linear_tree'=True}
est =  lgb.train(params, train_data)

So Python doesn't know the dataset parameters (e.g. linear_tree, or max_bin, etc), until we call lgb.train. This works because the dataset is initialised in the first line, but the data isn't actually loaded until lgb.train is called. The code that handles this seems quite complex and delicate, and I had to make a couple of small changes to get it to work properly (e.g. to use the refit functionality on an existing model). So I imagine some similar tweaks would be required for the R interface.

Also, the new code requires the Eigen library for linear algebra, which is causing some issues with the R code checks in CI. See this comment for details: #3299 (comment)

The text was updated successfully, but these errors were encountered:

jameslamb · 2020-08-19T14:57:04Z

Thanks @btrotta ! If no one else picks this up before me, I'll come back and get it at some point.

For the issue with diagnostic-suppressing pragmas, we might have to do something like we do for the LightGBM ones:

LightGBM/build-cran-package.sh

Line 45 in 1804fd1

echo "Removing unknown pragmas in headers"

jameslamb · 2020-08-19T14:58:43Z

I just added this to #2302 in the R package section, so based on our convention I'll close it.

Anyone reading this issue...please leave a comment if you'd like to create a pull request for it and I can re-open it!

jameslamb · 2020-12-27T07:18:57Z

When this is picked up, it should also get changes similar to #3685.

build_r.R and build-cran-package.sh should be modified to only copy in the specific parts of Eigen that are needed (see MANIFEST.in changes in [python-package] remove unused Eigen files, compile with EIGEN_MPL2_ONLY (fixes #3684) #3685)
-DEIGEN_MPL2_ONLY should be added to LGB_CPPFLAGS in R-package/configure.ac and R-package/configure.win

…icrosoft#3319)

…3319) (#3699) * [R-package] enable use of trees with linear models at leaves (fixes #3319) * remove problematic pragmas * fix tests * try to fix build scripts * try fixing pragma check * more pragma checks * ok fix pragma stuff for real * empty commit * regenerate documentation * try skipping test * uncomment CI * add note on missing value types for R * add tests on saving and re-loading booster

btrotta mentioned this issue Aug 19, 2020

Trees with linear models at leaves #3299

Merged

jameslamb added r-package feature request labels Aug 19, 2020

guolinke mentioned this issue Aug 19, 2020

Feature Requests & Voting Hub #2302

Open

jameslamb closed this as completed Aug 19, 2020

jameslamb mentioned this issue Dec 27, 2020

[python-package] remove unused Eigen files, compile with EIGEN_MPL2_ONLY (fixes #3684) #3685

Merged

jameslamb added a commit to jameslamb/LightGBM that referenced this issue Dec 31, 2020

[R-package] enable use of trees with linear models at leaves (fixes m…

89474ed

…icrosoft#3319)

jameslamb reopened this Dec 31, 2020

jameslamb mentioned this issue Dec 31, 2020

[R-package] enable use of trees with linear models at leaves (fixes #3319) #3699

Merged

StrikerRUS closed this as completed in #3699 Jan 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[R-package] Support trees with linear models at leaves #3319

[R-package] Support trees with linear models at leaves #3319

btrotta commented Aug 19, 2020

jameslamb commented Aug 19, 2020

jameslamb commented Aug 19, 2020

jameslamb commented Dec 27, 2020

[R-package] Support trees with linear models at leaves #3319

[R-package] Support trees with linear models at leaves #3319

Comments

btrotta commented Aug 19, 2020

jameslamb commented Aug 19, 2020

jameslamb commented Aug 19, 2020

jameslamb commented Dec 27, 2020