
Backward compatibility with v.1.7.6 #9624

Closed
iftg opened this issue Oct 3, 2023 · 7 comments

Comments

iftg commented Oct 3, 2023

Hi,
When training a model with v.2.0.0 I'm getting substantially different results from v.1.7.6. Could you please clarify why this may be happening even though my code remains unchanged?
I'm using:
```python
import xgboost as xgb

paramIn = {
    'disable_default_eval_metric': False,
    'objective': 'reg:squarederror',
    'eval_metric': 'rmse',
    'max_depth': 3,
    'base_score': 0.,
    'max_leaves': 0,
    'min_child_weight': 1,
    'max_delta_step': 0,
    'subsample': 1,
    'colsample_bytree': 1,
    'lambda': 0,
    'alpha': 0,
    'eta': 0.05,
    'gamma': 0,
}

# X is a TxN numpy array, y is a Tx1 numpy array
dtrain = xgb.DMatrix(X, feature_names=feature_names, label=y, nthread=-1)

evals_result = {}
bst = xgb.train(paramIn,
                dtrain,
                num_boost_round=500,
                early_stopping_rounds=1000,  # i.e. no early stopping
                obj=custom_obj,
                custom_metric=custom_eval,
                evals=[(dtrain, 'train')],
                evals_result=evals_result,
                verbose_eval=False)
```
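(`custom_obj` and `custom_eval` are defined elsewhere in my code. As a hypothetical stand-in — not my actual functions — with the signatures `xgb.train` expects for a squared-error problem:)

```python
import numpy as np
import xgboost as xgb

def custom_obj(preds: np.ndarray, dtrain: xgb.DMatrix):
    # Hypothetical placeholder: squared-error objective.
    # xgb.train's `obj` must return per-row gradient and hessian.
    labels = dtrain.get_label()
    grad = preds - labels       # d/dp [0.5 * (p - y)^2]
    hess = np.ones_like(preds)  # second derivative is constant 1
    return grad, hess

def custom_eval(preds: np.ndarray, dtrain: xgb.DMatrix):
    # Hypothetical placeholder: xgb.train's `custom_metric` must
    # return a (name, value) pair.
    labels = dtrain.get_label()
    return 'my_rmse', float(np.sqrt(np.mean((preds - labels) ** 2)))
```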

xbanke commented Oct 4, 2023

I met a similar problem. When I train a model with rank:pairwise, the results are quite different from the previous version. It looks like base_score defaults to 0.5 in 1.7.6 but to 0.0 in 2.0.0. Also, the 2.0.0 documentation says "rank:pairwise: Use LambdaRank to perform pair-wise ranking using the ranknet objective.", whereas 1.7.0 says "rank:pairwise: Use LambdaMART to perform pairwise ranking where the pairwise loss is minimized."

hcho3 (Collaborator) commented Oct 4, 2023

Some defaults have changed in version 2.0.

  • The tree_method parameter now defaults to hist. Previously (in 1.7) it defaulted to approx or exact.
  • The base_score parameter is no longer fixed at 0.5. When unspecified, base_score is now estimated from the input labels.
  • The learning-to-rank algorithm has a brand-new implementation. We chose to re-implement it for a number of reasons: 1) better alignment with the current academic literature on learning-to-rank; 2) the previous implementation was non-deterministic (multiple runs would yield different results); 3) we wanted to support unbiased learning-to-rank for biased data sources such as click data.

See the Release Note for the full list of changes.
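In practice, if you need a 2.x run to line up with a 1.7-era run, pin the moved defaults explicitly rather than relying on them. A minimal sketch (the parameter names are real XGBoost parameters; the values shown are simply the old 1.7 behavior, and the random data is only for illustration):

```python
import numpy as np
import xgboost as xgb

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 5))
y = rng.standard_normal(100)
dtrain = xgb.DMatrix(X, label=y)

params = {
    'objective': 'reg:squarederror',
    'tree_method': 'exact',  # 2.0 defaults to 'hist'; 1.7 used 'exact'/'approx'
    'base_score': 0.5,       # 2.0 otherwise estimates this from the labels
}
bst = xgb.train(params, dtrain, num_boost_round=10)
```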

In general, the developers of XGBoost do not guarantee that different versions of XGBoost will behave identically. (Making such a guarantee would prevent us from making necessary improvements.) Instead, we make the following guarantees:

  • Reproducible execution, defined as follows: if you run the same training script on the same machine with the same version of XGBoost, you will get identical results.
  • You can train a model with a previous version of XGBoost and save it. Later versions of XGBoost can load the saved model and produce predictions identical to those of the previous version (a sketch follows below).
  • Any changes will be documented in the release notes.

However, if you observe a significant degradation in model accuracy, or training taking significantly longer, please file a new GitHub issue.
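To illustrate the second guarantee, a sketch of the cross-version round trip (the file name is arbitrary; the two halves run under different installed versions, hence the comments):

```python
import xgboost as xgb

# Under the old version (e.g. 1.7.6): train and save in the JSON format.
#   bst_old = xgb.train(params, dtrain, num_boost_round=500)
#   bst_old.save_model('model.json')

# Under the new version (e.g. 2.0.0): load and predict.
bst_new = xgb.Booster()
bst_new.load_model('model.json')
preds = bst_new.predict(dtrain)  # expected to match the 1.7.6 predictions
```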

iftg (Author) commented Oct 4, 2023

Thank you for the explanation. In my case 1.7.6 used 'tree_method': 'exact', and this was the only source of the difference. I explicitly specify base_score, so that point is irrelevant in my case. I don't think I use learning-to-rank, as I am running a regression ('objective': 'reg:squarederror').

iftg (Author) commented Oct 4, 2023

Ah, and BTW, for my problem 'exact' works better than 'hist', hands down.

iftg closed this as completed Oct 5, 2023
nomagic commented Oct 31, 2023

Is base_score := F_0 (as in, e.g., the Friedman paper)?

iftg (Author) commented Oct 31, 2023

I am explicitly using 0 in my problem. However, in other applications other settings may work better.
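For what it's worth, reading hcho3's explanation above together with Friedman's formulation: base_score plays the role of the constant initial model

$$F_0(x) = \arg\min_\gamma \sum_{i=1}^{N} L(y_i, \gamma),$$

which for squared error is the label mean. So 2.0's behavior of estimating base_score from the input labels is consistent with F_0, while 1.7's fixed 0.5 was simply a hard-coded starting point. (This is an interpretation of the comments above, not an official statement from the maintainers.)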

mvalley21 commented
Adding to this: setting base_score to 0.5 in xgboost 2.x resolved the differences I saw between 2.x and 1.7.
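As a related sketch, in 2.x you can check which intercept the booster actually ended up with via Booster.save_config(); the JSON path below is what we observe in 2.0 and may differ in other versions:

```python
import json
import numpy as np
import xgboost as xgb

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 5))
y = rng.standard_normal(100) + 3.0  # deliberately non-centered labels
dtrain = xgb.DMatrix(X, label=y)

# No base_score given: 2.x estimates it from the labels.
bst = xgb.train({'objective': 'reg:squarederror'}, dtrain, num_boost_round=1)

config = json.loads(bst.save_config())
print(config['learner']['learner_model_param']['base_score'])  # ~3.0, not 0.5
```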
