
Introduced CPU performance gaps discussion #6697

Closed
ShvetsKS opened this issue Feb 9, 2021 · 8 comments

Comments

@ShvetsKS
Contributor

ShvetsKS commented Feb 9, 2021

Two performance problems were discovered in recently merged PRs:

  1. Thread-safe prediction: Make prediction thread safe. #6648

     Santander training stage, measured before #6648, with the thread-safe #6648, and on master (#6696 PR):

     | stage | before #6648 | thread-safe #6648 | master (#6696) |
     | --- | --- | --- | --- |
     | full training | 136s | 165s | 140s |
     | PredictRaw | 55s | 78s | 59s |

Would it be possible to have a non-thread-safe PredictDMatrix call to reduce overheads in subsampling cases, and to initialize the buffer only once, on the first training iteration? I'd appreciate any ideas.

  2. Dropped binary format: Drop binary format for memory snapshot. #6513 leads to a stable 10% performance regression, observed on the Higgs1m dataset after this commit (16s vs. 14.3s for full training).
     Before that commit we had the option to keep the old behavior by setting enable-experimental-json-serialization to False, but currently there is no such possibility.
     Could you share your thoughts on the best way to mitigate it?
@ShvetsKS ShvetsKS changed the title from "Introduced cpu performance gaps" to "Introduced CPU performance gaps discussion" Feb 9, 2021
@trivialfis
Member

trivialfis commented Feb 9, 2021

I'm confused: why does thread-safe prediction have any impact on training? I thought we had enabled the prediction cache for both regression and multi-class on CPU?

@trivialfis
Member

If the prediction happens during evaluation, we can use thread-local static storage, I believe.

As for the JSON thing, it occurs at the end of training and is used to release memory.
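A minimal sketch of the thread-local static storage idea mentioned above, assuming a hypothetical per-thread scratch buffer (`FVecScratch` and `ThreadLocalScratch` are illustrative names, not XGBoost's actual API):

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Per-thread scratch holding a dense copy of one row's features.
struct FVecScratch {
  std::vector<float> feats;
};

// Each thread gets its own scratch buffer via thread_local: it is
// allocated once per thread and reused across prediction calls,
// avoiding both the data race of a shared buffer and the cost of a
// fresh allocation on every call.
inline FVecScratch& ThreadLocalScratch(std::size_t n_features) {
  thread_local FVecScratch scratch;
  if (scratch.feats.size() < n_features) {
    scratch.feats.resize(n_features);  // grow lazily, never shrink
  }
  return scratch;
}
```

This keeps concurrent callers (e.g. Dask-style parallel prediction over data blocks) safe without serializing them, at the cost of one buffer per live thread.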

@ShvetsKS
Contributor Author

ShvetsKS commented Feb 9, 2021

> I'm confused: why does thread-safe prediction have any impact on training? I thought we had enabled the prediction cache for both regression and multi-class on CPU?

Yes, prediction caching is enabled for regression, binary, and multi-class classification, but only for the subsample == 1 case.

@ShvetsKS
Contributor Author

ShvetsKS commented Feb 9, 2021

> As for the JSON thing, it occurs at the end of training and is used to release memory.

Is it possible to have some alternative to enable-experimental-json-serialization to reduce the observed gaps?

@trivialfis
Member

Thanks for the reply. I think it's possible to change the subsampling implementation to allow caching.

As for the serialization time, I will try to find a way to remove it, either with BSON or with a better way to release memory.

@trivialfis
Member

The thread safety was mostly for the Dask interface, along with some other feature requests. In Dask, prediction is done on each block of data in parallel.

@ShvetsKS
Contributor Author

> The thread safety was mostly for the Dask interface, along with some other feature requests. In Dask, prediction is done on each block of data in parallel.

So, if I understand correctly, we could initialize feat_vecs externally in the training-side function PredictRaw, only once on the 0-th iteration, and propagate it down to PredictDMatrix?
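The proposal above could be sketched roughly as follows: allocate one feature-vector buffer per thread on the first training iteration and pass the pool down to the prediction loop, rather than rebuilding it inside every call. `FeatVecPool` and `PredictRawSketch` are hypothetical illustrations, not XGBoost's real PredictRaw/PredictDMatrix signatures:

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Pool of per-thread feature-vector buffers, owned by the training loop.
struct FeatVecPool {
  std::vector<std::vector<float>> per_thread;  // one buffer per thread
  void Init(std::size_t n_threads, std::size_t n_features) {
    if (per_thread.empty()) {  // allocate only on first use (iteration 0)
      per_thread.assign(n_threads, std::vector<float>(n_features));
    }
  }
};

// Training-side caller: Init is a no-op after the 0-th iteration, so
// subsequent iterations reuse the same buffers with no reallocation.
void PredictRawSketch(FeatVecPool* pool, std::size_t n_threads,
                      std::size_t n_features) {
  pool->Init(n_threads, n_features);
  // ... hand pool->per_thread[tid] down to the per-row prediction kernel ...
}
```

Because the pool is owned by the (single-threaded) training driver rather than shared global state, the public thread-safe prediction path is left untouched while training avoids the per-call buffer setup cost.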

@trivialfis
Member

Closed by #7545 .
