
Implemented proper work with multiple threads #1361

Merged: 17 commits into intel:master on Sep 12, 2023

Conversation

@olegkkruglov (Contributor) commented Jul 13, 2023

Changes proposed in this pull request:

  • Add patching for the _FuncWrapper class from sklearn.utils.parallel so that it calls the patched config_context (see the sketch after this list)
  • Fix patching of get_config so that the patched version runs inside sklearn.utils.parallel
  • Remove the unnecessary queue propagation via config in BaseSVC._fit_proba, which caused a test failure after the get_config patch fix
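
For illustration, a minimal sketch of the idea behind the _FuncWrapper patch, assuming only that sklearnex exposes get_config/config_context and that the configuration is thread-local: capture the active (patched) config when work is dispatched and re-enter it inside each joblib worker, so that options such as target_offload survive n_jobs > 1. The ConfigCarryingWrapper class and _report_offload_target helper are hypothetical stand-ins, not the actual patched sklearn internals.

# Hypothetical sketch of the mechanism, not the PR's actual patch:
# sklearn's _FuncWrapper captures the active config at dispatch time and
# re-applies it in each worker; the patch makes that path use the
# sklearnex-patched get_config/config_context so target_offload propagates.
from joblib import Parallel, delayed

from sklearnex import config_context, get_config


class ConfigCarryingWrapper:
    """Wrap a callable together with the config active at dispatch time."""

    def __init__(self, function):
        self.function = function
        # Captured in the dispatching thread, where target_offload is set.
        self.config = get_config()

    def __call__(self, *args, **kwargs):
        # Re-entered in the worker thread, so offload settings are not lost.
        # Assumes config_context accepts every key returned by get_config().
        with config_context(**self.config):
            return self.function(*args, **kwargs)


def _report_offload_target():
    # Hypothetical helper: report the target_offload seen inside the worker.
    return get_config().get("target_offload", "auto")


if __name__ == "__main__":
    with config_context(target_offload="gpu"):
        results = Parallel(n_jobs=2, prefer="threads")(
            delayed(ConfigCarryingWrapper(_report_offload_target))()
            for _ in range(4)
        )
    print(results)  # expected: ['gpu', 'gpu', 'gpu', 'gpu']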

@olegkkruglov force-pushed the parallel-fix branch 2 times, most recently from ed4ccb1 to 8907d32 on July 31, 2023 at 22:54
@samir-nasibli reopened this on Jul 31, 2023
@samir-nasibli (Contributor) commented Jul 31, 2023

In future PRs, please don't mix implementing a common feature with a bug fix; for now we can leave it as is.
It would be better to disable the test and fix it on a separate branch :)

@olegkkruglov (Contributor, Author)

/intelci: run

@samir-nasibli (Contributor)

/intelci: run

@olegkkruglov (Contributor, Author)

/intelci: run

@olegkkruglov (Contributor, Author)

/intelci: run

@samir-nasibli (Contributor)

@olegkkruglov please rebase your branches and run intelci

@olegkkruglov (Contributor, Author)

/intelci: run

@samir-nasibli (Contributor)

/intelci: run

@ethanglaser (Contributor) commented Sep 5, 2023

Any additional tests/examples for this functionality? Or is it covered by existing ones?

@samir-nasibli (Contributor) commented Sep 5, 2023

You can use this reproducer for the test:

import numpy as np

from sklearnex import patch_sklearn, config_context

patch_sklearn()

from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier, ExtraTreesClassifier
from sklearn.svm import SVC

# Make the sklearnex INFO log messages shown below visible.
import logging

logging.basicConfig()
logging.getLogger("sklearnex").setLevel(logging.INFO)

X, y = make_classification(
    n_samples=1000,
    n_features=4,
    n_informative=2,
    n_redundant=0,
    random_state=0,
    shuffle=False,
)

with config_context(target_offload="gpu"):
    ExtraTreesClassifier(max_depth=2, random_state=0).fit(X, y)
    # decision_function
    ensemble = BaggingClassifier(
        SVC(decision_function_shape="ovr"), n_jobs=3, random_state=0
    ).fit(X, y)

The result is:

Intel(R) Extension for Scikit-learn* enabled (https://github.com/intel/scikit-learn-intelex)
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU
INFO:sklearnex: sklearn.svm.SVC.fit: running accelerated version on GPU

On master it falls back to CPU, so your branch works correctly with n_jobs enabled in this case.

@ethanglaser (Contributor)

> On master it falls back to CPU, so your branch works correctly with n_jobs enabled in this case.

Perhaps adding something similar, with the allow_fallback_to_host=False flag added to config_context, would be a good test of whether things are working properly?

@samir-nasibli (Contributor)

> On master it falls back to CPU, so your branch works correctly with n_jobs enabled in this case.

> Perhaps adding something similar, with the allow_fallback_to_host=False flag added to config_context, would be a good test of whether things are working properly?

Good point; we can update the logger for that as well.
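
For illustration, a hedged sketch of such a check; this is not the test added by the PR, and the assumption that fitting raises when GPU offload is requested with allow_fallback_to_host=False and no GPU is available is exactly that, an assumption.

# Hypothetical test sketch: with allow_fallback_to_host=False a silent CPU
# fallback inside the joblib workers is not allowed, so the fit must either
# run on GPU or raise.
from sklearnex import config_context, patch_sklearn

patch_sklearn()

from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.svm import SVC


def test_no_silent_fallback_in_parallel():
    x, y = make_classification(random_state=42)
    try:
        with config_context(target_offload="gpu", allow_fallback_to_host=False):
            BaggingClassifier(SVC(), n_jobs=2, random_state=0).fit(x, y)
    except Exception:
        # Assumed behavior: without a usable GPU the fit raises instead of
        # silently falling back to CPU in the worker threads.
        pass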

@Alexsandruss (Contributor)

@Mergifyio rebase

@mergify (bot) commented Sep 7, 2023

rebase

❌ Base branch update has failed

Git reported the following error:

Rebasing (1/7)
Auto-merging sklearnex/dispatcher.py
CONFLICT (content): Merge conflict in sklearnex/dispatcher.py
error: could not apply 284f7a7... Implemented proper work with multiple threads
hint: Resolve all conflicts manually, mark them as resolved with
hint: "git add/rm <conflicted_files>", then run "git rebase --continue".
hint: You can instead skip this commit: run "git rebase --skip".
hint: To abort and get back to the state before "git rebase", run "git rebase --abort".
Could not apply 284f7a7... Implemented proper work with multiple threads

@Alexsandruss (Contributor)

/intelci: run

def test_config_context_in_parallel():
    x, y = make_classification(random_state=42)
    try:
        with config_context(target_offload="gpu"):
(Contributor)

Suggested change:
-        with config_context(target_offload="gpu"):
+        with config_context(target_offload="gpu", allow_fallback_to_host=False):

@ethanglaser (Contributor) commented Sep 7, 2023

Actually maybe this modification isn't necessary, but I am a bit confused by the test - when would dpctl be available but no GPU? Thanks for adding the test.

(Contributor)

dpctl is not only for GPU devices. For example, the Azure Pipelines instances used in CI have dpctl installed without a GPU.
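
For illustration, one way a test can tell "dpctl installed" apart from "GPU actually present" (a sketch; the skip logic and the estimator used here are assumptions, not the PR's code):

# Hypothetical sketch: skip GPU-offload assertions when dpctl is installed
# but no SYCL GPU device is exposed (e.g. the Azure Pipelines CI instances).
import pytest

try:
    import dpctl

    # dpctl.get_devices enumerates SYCL devices; an empty list means dpctl
    # is present but there is no GPU to offload to.
    GPU_AVAILABLE = len(dpctl.get_devices(device_type="gpu")) > 0
except ImportError:
    GPU_AVAILABLE = False


@pytest.mark.skipif(not GPU_AVAILABLE, reason="no SYCL GPU device available")
def test_offload_runs_on_gpu():
    from sklearn.datasets import make_classification

    from sklearnex import config_context
    from sklearnex.svm import SVC

    x, y = make_classification(random_state=42)
    with config_context(target_offload="gpu", allow_fallback_to_host=False):
        SVC().fit(x, y)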

@Alexsandruss (Contributor)

/intelci: run

@Alexsandruss (Contributor)

/intelci: run

@Alexsandruss (Contributor)

/intelci: run


@samir-nasibli (Contributor)

Please attach the GPU CI job as well.

@ethanglaser (Contributor)

> Please attach the GPU CI job as well.

http://intel-ci.intel.com/ee51704d-b860-f1a2-a5d9-a4bf010d0e2e

@napetrov (Contributor)

@Alexsandruss should the example at https://github.com/intel/scikit-learn-intelex/blob/master/examples/sklearnex/n_jobs.py be updated as well?

@Alexsandruss (Contributor)

Will be in the next PR with the n_jobs parameter update.

@Alexsandruss merged commit 90e531e into intel:master on Sep 12, 2023

16 of 17 checks passed
@@ -54,7 +54,9 @@ pytest --verbose --pyargs ${daal4py_dir}/daal4py/sklearn
return_code=$(($return_code + $?))

echo "Pytest of sklearnex running ..."
pytest --verbose --pyargs ${daal4py_dir}/sklearnex
# TODO: investigate why test_monkeypatch.py might cause failures of other tests
(Contributor)

Do we have a proper tracker for these issues?
