Selection of objective "rank:ndcg" results in lower NDCG than "rank:pairwise" #4177

Closed
Edmondguo opened this issue Feb 25, 2019 · 14 comments
Labels: LTR Learning to rank

@Edmondguo

Thanks for adding ranking task support to xgboost! I have a few questions:

  1. The docs say "Use LambdaMART to perform pairwise ranking where the pairwise loss is minimized". I want to know the particular form of this "pairwise loss" (see the sketch after this list).
  2. I cannot understand "Use LambdaMART to perform pairwise ranking". According to "From RankNet to LambdaRank to LambdaMART: An Overview", LambdaMART is a listwise method, which optimizes NDCG.
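For reference, a sketch of the pairwise cross-entropy loss from the RankNet paper (s_i and s_j are the model scores for documents i and j, and σ is a shape parameter; the replies below discuss whether this is what rank:pairwise derives its lambdas from):

```latex
% RankNet pairwise cross-entropy loss for a pair (i, j) where
% document i should rank above document j:
C_{ij} = \log\!\left( 1 + e^{-\sigma (s_i - s_j)} \right)
```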
@kretes

kretes commented Feb 25, 2019

Hi @Edmondguo. I can just point to some historical discussion and my understanding of how xgboost works, so it would still be good to get some official confirmation for that, e.g. from @hcho3.

I believe rank:pairwise is a pairwise method that tries to minimize the number of pairwise errors.
rank:ndcg is a method following LambdaMART, and when you dig into the code, it confirms that rank:ndcg is an extension of rank:pairwise with additional weights added to the loss of each pair.

However, in a few experiments it looks as if rank:ndcg performs worse than rank:pairwise, which might be due to the implementation; see e.g. #2092 (comment).

Some time ago we verified that, in our case, rank:ndcg performs a bit worse than rank:pairwise when evaluated on NDCG.

@hcho3
Collaborator

hcho3 commented Feb 25, 2019

> rank:ndcg is an extension of rank:pairwise with additional weights added to the loss of each pair.

Exactly. In "From RankNet to LambdaRank to LambdaMART", LambdaMART optimizes NDCG by optimizing the pairwise loss (via the lambdas), weighted by the change in NDCG from swapping each pair.
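A sketch of the formula being referred to, from the Burges overview paper (s_i, s_j are the model scores and σ a shape parameter): for a pair (i, j) with document i more relevant than document j, the lambda is the RankNet gradient scaled by the NDCG change from swapping the two documents:

```latex
% LambdaMART gradient ("lambda") for a pair (i, j) where document i
% is more relevant than document j:
\lambda_{ij} = \frac{-\sigma}{1 + e^{\sigma (s_i - s_j)}}
               \left| \Delta \mathrm{NDCG}_{ij} \right|
```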

@Edmondguo
Author

> Hi @Edmondguo. I can just point to some historical discussion and my understanding of how xgboost works, so it would still be good to get some official confirmation for that, e.g. from @hcho3.
>
> I believe rank:pairwise is a pairwise method that tries to minimize the number of pairwise errors. rank:ndcg is a method following LambdaMART, and when you dig into the code, it confirms that rank:ndcg is an extension of rank:pairwise with additional weights added to the loss of each pair.
>
> However, in a few experiments it looks as if rank:ndcg performs worse than rank:pairwise, which might be due to the implementation; see e.g. #2092 (comment).
>
> Some time ago we verified that, in our case, rank:ndcg performs a bit worse than rank:pairwise when evaluated on NDCG.

Thank you very much! In my experiment I also found that rank:ndcg performs worse than rank:pairwise.

@Edmondguo
Author

> rank:ndcg is an extension of rank:pairwise with additional weights added to the loss of each pair.
>
> Exactly. In "From RankNet to LambdaRank to LambdaMART", LambdaMART optimizes NDCG by optimizing the pairwise loss (via the lambdas), weighted by the change in NDCG from swapping each pair.

Thank you! So does that mean that in rank:pairwise, xgboost uses the lambdas derived from the cross-entropy loss in RankNet as the loss function?

@hcho3
Collaborator

hcho3 commented Feb 26, 2019

@Edmondguo Yes
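For anyone who wants to check this on their own setup, here is a minimal sketch comparing the two objectives on synthetic grouped data; the data shapes, group sizes, and hyperparameters are illustrative assumptions, not values from this thread:

```python
# Minimal sketch: train with rank:pairwise and rank:ndcg on the same
# synthetic grouped data and watch the train-set NDCG.
import numpy as np
import xgboost as xgb

rng = np.random.RandomState(0)
X = rng.rand(1000, 10)               # 1000 documents, 10 features
y = (X[:, 0] * 5).astype(int)        # graded relevance labels 0..4
group_sizes = [50] * 20              # 20 queries, 50 documents each

dtrain = xgb.DMatrix(X, label=y)
dtrain.set_group(group_sizes)        # tell XGBoost where each query starts

for objective in ("rank:pairwise", "rank:ndcg"):
    params = {"objective": objective, "eval_metric": "ndcg", "eta": 0.1}
    xgb.train(params, dtrain, num_boost_round=50,
              evals=[(dtrain, "train")], verbose_eval=10)
```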

@hcho3
Collaborator

hcho3 commented Feb 26, 2019

@Edmondguo @kretes Would you be interested in posting an example where you get a better NDCG metric by choosing rank:pairwise instead of rank:ndcg? I'd like to see if this is a bug or just chance.

@Edmondguo
Author

> @Edmondguo @kretes Would you be interested in posting an example where you get a better NDCG metric by choosing rank:pairwise instead of rank:ndcg? I'd like to see if this is a bug or just chance.

The project I am dealing with uses a ranking model for quantitative stock selection. It is hard to share an example because the data is too big. In this case "rank:pairwise" performs much better than "rank:ndcg" under the same booster parameters: the NDCG is 0.5138 for "rank:ndcg" and 0.5586 for "rank:pairwise".

@hcho3
Collaborator

hcho3 commented Feb 26, 2019

@Edmondguo Does your data have multiple relevance judgment levels (1, 2, 3, 4, ...)?

@Edmondguo
Author

> @Edmondguo Does your data have multiple relevance judgment levels (1, 2, 3, 4, ...)?

Yes. Before I train the model, I change y into the levels (1, 2, 3, ..., 30).
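A side note on why so many relevance levels can matter: NDCG implementations commonly use the exponential gain 2^rel − 1 (whether XGBoost's internals did so here is an assumption), and with labels up to 30 the top gain is about 1.07e9, so a single document can dominate the metric. A quick sketch:

```python
# Sketch of DCG with the common exponential gain (2^rel - 1);
# labels as high as 30 make one document dwarf all others.
import numpy as np

def dcg_exp_gain(relevances):
    rel = np.asarray(relevances, dtype=float)
    gains = 2.0 ** rel - 1.0                         # exponential gain
    discounts = np.log2(np.arange(2, len(rel) + 2))  # log2(rank + 1)
    return float(np.sum(gains / discounts))

print(dcg_exp_gain([30, 1, 1]))  # ~1.07e9: the label-30 doc dominates
print(dcg_exp_gain([3, 2, 1]))   # ~9.39 on a small-label scale
```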

@hcho3
Collaborator

hcho3 commented Feb 27, 2019

It would be nice if there were a toy example we could use to show rank:pairwise outperforming rank:ndcg. Without one, it is hard to find out why rank:ndcg is not working well.

@kretes

kretes commented Mar 4, 2019

Hello.

I believe I found an example where this is reproducible: rank:pairwise reaches an NDCG of 1 while rank:ndcg cannot.
See this gist: https://gist.github.com/kretes/1228e571aeba2a57f617352af633cd40.

I hope this will help nail down the issue.

@hcho3 hcho3 changed the title What is the particular loss function in "rank:pairwise"? Selection of objective "rank:ndcg" results in lower NDCG than "rank:pairwise" Mar 8, 2019
@sano176

sano176 commented Oct 30, 2019

I met this problem too and couldn't find a reason to explain it: "objective = rank:pairwise" performs better than "objective = rank:ndcg".

@chloe-wang

@Edmondguo Just want to follow up on this issue. I met the same problem. Did you figure out the reason?

@trivialfis trivialfis added the LTR Learning to rank label Dec 17, 2020
@trivialfis
Member

Some explanation is given in #6352. For future work, see #6450.
