[ci] [python] reduce unnecessary data loading in tests #3486

jameslamb · 2020-10-26T05:47:14Z

I spent some time this weekend learning snakeviz, a Python profiler. I tried running it over the Python package tests here, and I think I found an opportunity to cut the test run time.

I realized we're spending time loading and reloading the same datasets. This PR proposes reducing those calls by caching the results of sklearn.datasets.load_* calls. The datasets are really small so I don't think holding them all in memory will cause an issue.

UPDATE: Ok from some experiments, the speedup from this seems really small on my laptop. But might be larger in CI environments, where we know that disk I/O is a lot slower (#2965 (comment)).

Results

These results were obtained on my laptop. The blockquote results come from one run of each test. Then I ran them all again 10 times to get a better estimate of the mean time.

`test_basic.py`

before (mean = 1.33s)

58216 function calls (57604 primitive calls) in 0.462 seconds
==== 10 passed, 3 warnings in 2.18s ====

trials (seconds): [1.37, 1.40, 1.34, 1.37, 1.31, 1.30, 1.31, 1.31, 1.31, 1.31]

after (mean = 1.36s)

53470 function calls (52858 primitive calls) in 0.396 seconds
==== 10 passed, 3 warnings in 1.36s ====

trials (seconds): [1.39, 1.39, 1.40, 1.36, 1.35, 1.41, 1.40, 1.29, 1.31, 1.35]

`test_engine.py`

before (mean = 12.78s)

2749043 function calls (2651956 primitive calls) in 9.456 seconds
==== 63 passed, 2 skipped, 20 warnings in 13.69s ====

trials (seconds): [14.16, 13.13, 12.85, 12.66, 12.34, 12.60, 13.17, 12.98, 12.40, 11.50]

after (mean = 11.84s)

2749043 function calls (2651956 primitive calls) in 9.456 seconds
==== 63 passed, 2 skipped, 20 warnings in 11.29s ====

trials (seconds): [12.18, 10.71, 11.09, 10.88, 10.77, 12.64, 12.47, 14.90, 11.62, 11.14]

`test_sklearn.py`

before(mean = 9.68s)

4292703 function calls (4172793 primitive calls) in 8.044 seconds
==== 34 passed, 15 warnings in 11.07s ====

trials (seconds): [10.17, 9.52, 9.34, 9.58, 9.22, 9.34, 9.75, 10.21, 9.67, 9.99]

after (mean = 9.50s)

2945200 function calls (2843392 primitive calls) in 8.867 seconds
==== 34 passed, 15 warnings in 9.26s ====

trials (seconds): [9.67, 8.64, 9.32, 8.71, 8.73, 9.29, 8.97, 12.73, 9.48, 9.44]

How to reproduce these tests

# install lightgbm
pushd python-package
python setup.py install
popd

# install dependencies
pip install snakeviz pytest-profiling

# profile tests
pytest --profile tests/python_package_test/test_basic.py
pytest --profile tests/python_package_test/test_engine.py
pytest --profile tests/python_package_test/test_sklearn.py

# (optional) visualize profiling data
snakeviz prof/combined.prof

jameslamb · 2020-10-26T17:27:27Z

interesting, I see this in some tests:

../../../../miniconda/envs/test-env/lib/python3.6/functools.py:477: in lru_cache
    raise TypeError('Expected maxsize to be an integer or None')
E   TypeError: Expected maxsize to be an integer or None

Maybe there was not a default value in older versions of Python? Because today it has a default of 128: https://docs.python.org/3/library/functools.html#functools.lru_cache

Anyway, I'm going to try switching these to just @cache. Since they're just tests that we completely control, not user code, I think it's ok to have an unbounded cache. And that should be faster. From https://docs.python.org/3/library/functools.html#functools.cache:

Returns the same as lru_cache(maxsize=None), creating a thin wrapper around a dictionary lookup for the function arguments. Because it never needs to evict old values, this is smaller and faster than lru_cache() with a size limit.

UPDATE: nevermind, functools.cache was added in Python 3.9.

StrikerRUS · 2020-10-26T19:14:31Z

@jameslamb I think we can have maxsize equals to 2 for functions with only one bool argument return_X_y=True/False and 32 for load_digits. None doesn't look as a good default value:
https://github.com/python/cpython/blob/920cb647ba23feab7987d0dac1bd63bfc2ffc4c0/Lib/functools.py#L549-L562

jameslamb · 2020-10-26T19:34:03Z

@StrikerRUS can you explain why you think None is a bad default?

From that code snippet, it looks like it would be faster than setting a maxsize (since there is no code about evicting things from the cache).

I understand why it would be bad in user code, since the cache can grow without limit, but for these unit tests where we completely control the set of unique combinations and know it to be small, I think we should have a preference for the faster option.

StrikerRUS · 2020-10-26T21:16:41Z

@jameslamb
maxsize=None means no LRU feature. I thought you want to use it.

jameslamb · 2020-10-26T21:23:32Z

@jameslamb
maxsize=None means no LRU feature. I thought you want to use it.

oh I see. No I really just cared more about the caching than using Least Recently Used (LRU), since we have so few variations of kwargs for each dataset.

I did forget though that functools.lru_cache isn't available in Python 2.7. Will push something to fix that 😬

StrikerRUS · 2020-10-26T23:06:15Z

@jameslamb Ah OK! I misunderstood you then.

tests/python_package_test/test_basic.py

tests/python_package_test/utils.py

StrikerRUS

@jameslamb Please check my comments below.

tests/python_package_test/utils.py

StrikerRUS · 2020-10-28T21:31:51Z

tests/python_package_test/utils.py

+    import warnings
+    warnings.warn("Could not import functools.lru_cache", RuntimeWarning)
+
+    def lru_cache(maxsize=None):


Please memoize this too

LightGBM/tests/python_package_test/test_plotting.py

Line 19 in 5cc9e67

self.X_train, self.X_test, self.y_train, self.y_test = train_test_split(*load_breast_cancer(return_X_y=True),

what would be the purpose? That's the only call to load_breast_cancer() in that module. So I think the caching would add a tiny bit of overhead for no benefit.

How does it differ with other calls you've memoized? 5 calls of load_breast_cancer() in test_plotting.py is even more than 3 calls of the same function in test_basic.py, for example.

there is only 1 call in test_plotting.py

git grep load_breast_cancer tests/python_package_test/test_plotting.py

Yes, you are right, but the method in which this call is performed (setUp) is called before each test. So, actually we have 5 calls.

OOOOOOOOOOO haha ok. I haven't used unittest.TestCase in a while, I forgot which one was a "run before every test" setup and which one was a "run exactly once, before any tests" one.

Ok yes I'll update this

added in dfb0fd3

Thanks! Actually I think we should refactor this to "run exactly once, before any tests" (setUpClass()), but it is another issue, of course.

tests/python_package_test/test_sklearn.py

Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

StrikerRUS

LGTM, thanks a lot!

github-actions · 2023-08-24T01:15:26Z

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

jameslamb added 2 commits October 25, 2020 23:55

[ci] [python] reduce unnecessary data loading in tests

79c7007

add profiling files to gitignore

f792015

jameslamb added the maintenance label Oct 26, 2020

jameslamb requested a review from StrikerRUS October 26, 2020 05:47

jameslamb requested review from chivee, guolinke, henry0312, Laurae2 and wxchan as code owners October 26, 2020 05:47

jameslamb added 2 commits October 26, 2020 12:29

just use cache()

c71a30f

default on cache size

979b76a

jameslamb added 2 commits October 26, 2020 16:32

patch lru_cache on Python 2.7

a86a6c2

linting

afff7d3

StrikerRUS reviewed Oct 26, 2020

View reviewed changes

tests/python_package_test/test_basic.py Outdated Show resolved Hide resolved

jameslamb added 3 commits October 26, 2020 22:24

Merge branch 'master' into misc/faster-tests

c3067ee

reduce duplicated code

25a9bc7

missing warnings

3af0a5c

StrikerRUS reviewed Oct 28, 2020

View reviewed changes

tests/python_package_test/utils.py Show resolved Hide resolved

tests/python_package_test/utils.py Outdated Show resolved Hide resolved

jameslamb added 4 commits October 27, 2020 23:05

Merge branch 'master' into misc/faster-tests

999c5b6

fix imports

3394222

fix lru_cache backport

3a5fe60

missing kwargs

c1cb1b2

StrikerRUS requested changes Oct 28, 2020

View reviewed changes

Apply suggestions from code review

16ec8aa

Co-authored-by: Nikita Titov <nekit94-08@mail.ru>

jameslamb added 2 commits October 28, 2020 17:38

reduce duplicated code

4c2e7be

cache in test_plotting

dfb0fd3

StrikerRUS approved these changes Oct 29, 2020

View reviewed changes

StrikerRUS merged commit 03c4d45 into microsoft:master Oct 29, 2020

jameslamb deleted the misc/faster-tests branch February 7, 2021 03:36

jameslamb mentioned this pull request Sep 1, 2021

[python-package] early stopping min_delta (fixes #2526) #4580

Merged

jameslamb mentioned this pull request Apr 27, 2022

add profiling jameslamb/lightgbm-dask-testing#44

Open

jameslamb mentioned this pull request Sep 3, 2022

Decouple Boosting Types (fixes #3128) #4827

Merged

github-actions bot locked as resolved and limited conversation to collaborators Aug 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ci] [python] reduce unnecessary data loading in tests #3486

[ci] [python] reduce unnecessary data loading in tests #3486

jameslamb commented Oct 26, 2020

jameslamb commented Oct 26, 2020 •

edited

Loading

StrikerRUS commented Oct 26, 2020

jameslamb commented Oct 26, 2020

StrikerRUS commented Oct 26, 2020

jameslamb commented Oct 26, 2020

StrikerRUS commented Oct 26, 2020

StrikerRUS left a comment

StrikerRUS Oct 28, 2020

jameslamb Oct 28, 2020

StrikerRUS Oct 28, 2020 •

edited

Loading

jameslamb Oct 29, 2020

StrikerRUS Oct 29, 2020

jameslamb Oct 29, 2020

jameslamb Oct 29, 2020

StrikerRUS Oct 29, 2020

StrikerRUS left a comment

github-actions bot commented Aug 24, 2023

[ci] [python] reduce unnecessary data loading in tests #3486

[ci] [python] reduce unnecessary data loading in tests #3486

Conversation

jameslamb commented Oct 26, 2020

Results

test_basic.py

test_engine.py

test_sklearn.py

How to reproduce these tests

jameslamb commented Oct 26, 2020 • edited Loading

StrikerRUS commented Oct 26, 2020

jameslamb commented Oct 26, 2020

StrikerRUS commented Oct 26, 2020

jameslamb commented Oct 26, 2020

StrikerRUS commented Oct 26, 2020

StrikerRUS left a comment

Choose a reason for hiding this comment

StrikerRUS Oct 28, 2020

Choose a reason for hiding this comment

jameslamb Oct 28, 2020

Choose a reason for hiding this comment

StrikerRUS Oct 28, 2020 • edited Loading

Choose a reason for hiding this comment

jameslamb Oct 29, 2020

Choose a reason for hiding this comment

StrikerRUS Oct 29, 2020

Choose a reason for hiding this comment

jameslamb Oct 29, 2020

Choose a reason for hiding this comment

jameslamb Oct 29, 2020

Choose a reason for hiding this comment

StrikerRUS Oct 29, 2020

Choose a reason for hiding this comment

StrikerRUS left a comment

Choose a reason for hiding this comment

github-actions bot commented Aug 24, 2023

`test_basic.py`

`test_engine.py`

`test_sklearn.py`

jameslamb commented Oct 26, 2020 •

edited

Loading

StrikerRUS Oct 28, 2020 •

edited

Loading