[R-package] Fix best_iter and best_score #2159
Conversation
Manual tests done:
- With early stopping + with validation set
- With early stopping + without validation set
- Without early stopping + with validation set
- Without early stopping + without validation set

Also tested with multiple metrics / validation sets.
There are some problems with the metrics' order in Python: #2127. I suppose the same is true for R too.
@StrikerRUS In R lists, the order is fixed: the first element always remains the first element. You may even have two elements with the exact same names without any conflict. However, users may provide the same metric multiple times by mistake; in that case we deduplicate them. Example:

library(lightgbm)
data(agaricus.train, package = "lightgbm")
train <- agaricus.train
dtrain <- lgb.Dataset(train$data, label = train$label)
data(agaricus.test, package = "lightgbm")
test <- agaricus.test
dtest <- lgb.Dataset.create.valid(dtrain, test$data, label = test$label)
# "l2" is passed twice on purpose to demonstrate the deduplication
params <- list(objective = "regression", metric = c("l2", "l1", "l2"))
valids <- list(test = dtest)
model <- lgb.train(params,
                   dtrain,
                   100,
                   valids,
                   min_data = 1,
                   learning_rate = 0.5)
str(model$record_evals$test, max.level = 1)
# List of 2
#  $ l2:List of 2
#  $ l1:List of 2
@Laurae2 would it be better to have a parameter named "first_metric_only" in R as well?
BTW, if both R and Python have a "first_metric_only" option, I think we should have the same option in the CLI version.
@guolinke Yes, I think we should have it. As the handling would be different for each wrapper, R and Python would have their own implementations using callbacks.
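As a rough illustration (hypothetical code, not the actual lightgbm callback API), a "first_metric_only"-style check inside an early-stopping callback could consider only the first recorded metric:

# Hypothetical sketch: decide on improvement using only the first metric.
# `eval_results` stands in for the per-iteration evaluation list a callback
# would receive, e.g. list(l2 = 0.01, l1 = 0.05).
first_metric_improved <- function(eval_results, best_so_far, higher_better = FALSE) {
  current <- eval_results[[1L]]  # only the first metric drives the decision
  if (higher_better) current > best_so_far else current < best_so_far
}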
@StrikerRUS do you know why the Travis MPI/Python jobs are failing?
@Laurae2 is this implemented in this PR? |
Also refer to this:
@guolinke This PR uses all metrics; we can add first_metric_only later. Note that the best score / iteration is taken from the first metric when it was not computed by the model.
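For instance (a minimal sketch reusing dtrain and valids from the example above, with early stopping enabled so that the fields get populated):

# Early stopping makes best_iter / best_score meaningful; "l2" is the
# first metric, so it drives both fields.
params <- list(objective = "regression", metric = c("l2", "l1"))
model <- lgb.train(params,
                   dtrain,
                   100,
                   valids,
                   early_stopping_rounds = 10,
                   min_data = 1,
                   learning_rate = 0.5)
model$best_iter   # iteration selected by early stopping
model$best_score  # value of the first metric at that iteration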
Closing/reopening for CI |
This doesn't work for the branch's CI; it works only for the PR's CI.
@jameslamb do we merge it as is for the moment? |
I've updated the branch. Now all checks should be OK. |
Looks good to me! Thank you @Laurae2, and apologies for my delayed review.
This should fix #2158 and #2029.
"best" rule:
Later, we should enforce the metric used for early stopping should be only the first one, or at worst give the user the ability to choose the metric (best is the first).
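Illustration only (not lightgbm internals), assuming the record_evals layout shown in the earlier snippet:

# Pick the best iteration from the first metric's recorded values.
first_metric_values <- unlist(model$record_evals$test[[1L]]$eval)
best_iter <- which.min(first_metric_values)   # which.max() for maximized metrics such as AUC
best_score <- first_metric_values[best_iter]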
note: @jameslamb there are spaces from RStudio left to fix
Example (change to metric = "auc" and max_depth = 3 to test maximization):
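A minimal sketch of such an example (reusing dtrain and valids from the first snippet; "auc" is a maximized metric, so it exercises the maximization path):

# AUC is maximized, so best_iter should track the highest recorded value.
params <- list(objective = "regression", metric = "auc", max_depth = 3)
model <- lgb.train(params,
                   dtrain,
                   100,
                   valids,
                   early_stopping_rounds = 10,
                   min_data = 1,
                   learning_rate = 0.5)
model$best_iter
model$best_score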