FIX: bugs in evaluator #590

guijiql · 2020-12-18T05:57:02Z

1.Individual Evaluator can't raise NotImplementedError when used with eval_setting is full.
2.metrics may be disordered in log information.
3.GAUC will be calculated by error when there are items with same predicted scores for one user.

…eased the robustness of data.utils.data_preparation, which can divide the dataset into two parts (train, test) or three parts (train, valid, test).

…dle empty dataset now.

FEA: Add config['benchmark_filename'] to load pre-split dataset.

FEA: Increased the robustness of trainer.evaluate && bug fix in GeneralFullDataLoader.

FIX: optimize the update_attentive_A function in KGAT

tsotfsk · 2020-12-21T08:56:22Z

@guijiql Add some test cases in the tests/.

tsotfsk · 2020-12-22T03:22:39Z

recbole/evaluator/proxy_evaluator.py

        for metrics, evaluator in metric_eval_bind:
-            used_metrics = list(metrics_set.intersection(set(metrics.keys())))
+            used_metrics = [metric for metric in metrics_list if metric in metrics.keys()]


in metrics.keys() -> in metrics

tsotfsk · 2020-12-22T03:29:35Z

recbole/evaluator/metrics.py

+    all_with_pos = np.any(pos_len_list == 0)
+    all_with_neg = np.any(neg_len_list == 0)
+    non_zero_idx = np.full(len(user_len_list), True, dtype=np.bool)
+    if all_with_pos:


What's the meaning of this? why does all_with_pos mean np.any(pos_len_list == 0) , if pos_len_list = array([1,2,3]), then np.any(pos_len_list == 0) is False, so all_with_pos is False?

tsotfsk · 2020-12-22T03:47:46Z

tests/metrics/test_rank_metrics.py

+from recbole.evaluator.metrics import metrics_dict
+
+
+class TestCases(object):


Maybe, pos_rank_sum can‘t ensure your metric is right, please test the function evaluator.collect to ensure your pos_rank_sum is right. The reason to add the test is that it requires complex logic to calculate it.

chenyushuo and others added 14 commits December 18, 2020 10:54

FEA: add config['benchmark_filename'] to load pre-split dataset; incr…

0772227

…eased the robustness of data.utils.data_preparation, which can divide the dataset into two parts (train, test) or three parts (train, valid, test).

FIX: Increased the robustness of GeneralFullDataLoader, which can han…

3769b2d

…dle empty dataset now.

FIX: can't raise error in IndividualEvaluator

dd157a5

FIX: metrics disorder

9ced718

FIX: GAUC calculation error

c7cbd34

FIX: rename & comment format

5c1c147

REVERT: revert modify in data.utils

2dcac28

Merge pull request RUCAIBox#588 from chenyushuo/0.2.x

e8db062

FEA: Add config['benchmark_filename'] to load pre-split dataset.

update notes

fd86870

FEA: Increased the robustness of trainer.evaluate

e84aeb7

FIX: bug fix in GeneralFullDataLoader.

50bf9e8

Merge pull request RUCAIBox#596 from chenyushuo/0.2.x

27dad3f

FEA: Increased the robustness of trainer.evaluate && bug fix in GeneralFullDataLoader.

FIX: optimize update_attentive_A function in KGAT

4b4b9a8

Merge pull request RUCAIBox#597 from ShanleiMu/0.2.x

da3972c

FIX: optimize the update_attentive_A function in KGAT

chenyushuo requested a review from tsotfsk December 21, 2020 08:50

guijiql added 11 commits December 21, 2020 21:55

FIX: can't raise error in IndividualEvaluator

83f514e

FIX: metrics disorder

10243bb

FIX: GAUC calculation error

ab95863

FIX: rename & comment format

48f1078

FEA: add parameters check in gauc

eb84ef3

FEA: add GAUC check & GAUC test

aed05ee

update metrics.py

d6ea8e2

update metrics.py

02b3d1c

Update evaluators.py

3620841

FEA: add ranking metric test

bd40a3a

Merge branch '0.2.x' of https://github.com/guijiql/RecBole into 0.2.x

b7c56bb

tsotfsk suggested changes Dec 22, 2020

View reviewed changes

FIX: rename bool variable in GAUC & remove keys in build

6e9a9c6

tsotfsk suggested changes Dec 22, 2020

View reviewed changes

FEA: add RankEvaluator collect test

634bad6

tsotfsk approved these changes Dec 24, 2020

View reviewed changes

chenyushuo merged commit 94d5bd8 into RUCAIBox:0.2.x Dec 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX: bugs in evaluator #590

FIX: bugs in evaluator #590

guijiql commented Dec 18, 2020

tsotfsk commented Dec 21, 2020 •

edited

Loading

tsotfsk Dec 22, 2020

tsotfsk Dec 22, 2020

tsotfsk Dec 22, 2020 •

edited

Loading

		from recbole.evaluator.metrics import metrics_dict


		class TestCases(object):

FIX: bugs in evaluator #590

FIX: bugs in evaluator #590

Conversation

guijiql commented Dec 18, 2020

tsotfsk commented Dec 21, 2020 • edited Loading

tsotfsk Dec 22, 2020

Choose a reason for hiding this comment

tsotfsk Dec 22, 2020

Choose a reason for hiding this comment

tsotfsk Dec 22, 2020 • edited Loading

Choose a reason for hiding this comment

tsotfsk commented Dec 21, 2020 •

edited

Loading

tsotfsk Dec 22, 2020 •

edited

Loading