Convert TopK reduction to be multiline example based #1752

jackgerrits · 2019-02-07T23:43:52Z

JohnLangford · 2019-02-08T16:47:25Z

test/train-sets/ref/topk-rec.stderr

 weighted label sum = 15.000000
 average loss = 0.000002
 best constant = 1.666667
-total feature number = 39
+total feature number = 36


This changes because you eliminate newline examples with a constant feature?

Yes exactly

JohnLangford · 2019-02-08T16:50:05Z

test/train-sets/ref/topk-rec.stderr

+0.000004 0.000001            6            6.0   1.0000   1.0007        4
+0.000003 0.000000            7            7.0   2.0000   2.0004        4
+0.000003 0.000000            8            8.0   1.0000   1.0003        4
+0.000002 0.000000            9            9.0   3.0000   2.9995        4


The number of multiline examples should be 3, right?

Yes it is 3, but I am recreating the previous behavior of printing statistics after each example. See line 91 in the reduction. What do you think is the correct thing to do here? Only print the statistics after each set of examples?

Yes. All predictions from a top-k call should (conceptually) be on a single line. The idea of this reduction is that you are return the top-k-of-n things.

This reverts commit 2c8f965.

JohnLangford · 2019-02-08T21:26:20Z

Merged, thanks :-)

RunTests: use test label number instead of counter (VowpalWabbit/vowpal_wabbit#1753) Small Json parser cleanup (VowpalWabbit/vowpal_wabbit#1759) Type erase json parser context for easier deletion ((VowpalWabbit/vowpal_wabbit#1760) Fix static linking (VowpalWabbit/vowpal_wabbit#1758) Fix build scripts forcing Debug builds. Add LTO mode and fix VW defau…(VowpalWabbit/vowpal_wabbit#1735) Do not define BOOST_TEST_DYN_LINK when statically linking (VowpalWabbit/vowpal_wabbit#1750) Convert TopK reduction to be multiline example based (VowpalWabbit/vowpal_wabbit#1752) vw java 11 compatibility (VowpalWabbit/vowpal_wabbit#1700) cbify: --cbify_ldf for multiline (csoaa_ldf) input datasets (VowpalWabbit/vowpal_wabbit#1681) Merge pull request #1751 from yannstad/fix-tests (VowpalWabbit/vowpal_wabbit#1751) [tests] Make repeat.py compatible with python 3 (VowpalWabbit/vowpal_wabbit#1747)

* Make topk a multiline learner * Fix test for new format and rename B to K * revert destructor usage * remove all from data, move to output seq in finish * Revert "remove all from data, move to output seq in finish" This reverts commit 2c8f965.

jackgerrits and others added 4 commits February 7, 2019 18:23

Make topk a multiline learner

73dc5f9

Fix test for new format and rename B to K

477da92

revert destructor usage

d2b65fd

Merge branch 'master' into jagerrit/top_multi

d3787fa

JohnLangford reviewed Feb 8, 2019

View reviewed changes

jackgerrits and others added 3 commits February 8, 2019 14:50

remove all from data, move to output seq in finish

2c8f965

Merge branch 'master' into jagerrit/top_multi

e7c2b18

Revert "remove all from data, move to output seq in finish"

42b7063

This reverts commit 2c8f965.

JohnLangford merged commit a3c0943 into VowpalWabbit:master Feb 8, 2019

homezcx mentioned this pull request Feb 21, 2019

Update VowpalWabbit submodule to ecb130592465cfb084b87eabdd4a103f9ddeb891 VowpalWabbit/reinforcement_learning#46

Merged

jackgerrits deleted the jagerrit/top_multi branch April 5, 2019 16:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convert TopK reduction to be multiline example based #1752

Convert TopK reduction to be multiline example based #1752

jackgerrits commented Feb 7, 2019

JohnLangford Feb 8, 2019

jackgerrits Feb 8, 2019

JohnLangford Feb 8, 2019

jackgerrits Feb 8, 2019

JohnLangford Feb 8, 2019

JohnLangford commented Feb 8, 2019

Convert TopK reduction to be multiline example based #1752

Convert TopK reduction to be multiline example based #1752

Conversation

jackgerrits commented Feb 7, 2019

JohnLangford Feb 8, 2019

Choose a reason for hiding this comment

jackgerrits Feb 8, 2019

Choose a reason for hiding this comment

JohnLangford Feb 8, 2019

Choose a reason for hiding this comment

jackgerrits Feb 8, 2019

Choose a reason for hiding this comment

JohnLangford Feb 8, 2019

Choose a reason for hiding this comment

JohnLangford commented Feb 8, 2019