FastNLPProcessor annotatation is not stable #402

kwalcock · 2020-06-23T21:41:23Z

See also clulab/eidos#861. A different answer is computed from the same document. This is now a unit test in https://github.com/clulab/processors/blob/kwalcock-envBug/corenlp/src/test/scala/org/clulab/processors/TestFastNLPProcessorEnv.scala. Moving it from Eidos to Processors gets me a little closer to the cause and helps with debugging.

MihaiSurdeanu · 2020-06-23T23:26:41Z

Can you please paste an example of differences here?

kwalcock · 2020-06-23T23:36:44Z

First run	Second run
O	O
O	O
O	O
>=2.0	>=2.0
O	O
O	O
O	O
O	O
2012-09-25	2012-09-18
******18	******18
O	O
2012-09-19	2012-09-19
******19	******19
2012-W37	2012-W37
2012-W37	2012-W37
O	O
O	O
3	3
O	O
O	O
O	O
2012-09	2012-09
2012-09	2012-09
2012-09	2012-09
O	O

MihaiSurdeanu · 2020-06-24T14:13:46Z

This is a Stanford SUTime bug... Maybe it should be filed in the Stanford CoreNLP github?

kwalcock · 2020-06-24T15:06:08Z

That was almost my conclusion. I'd like to make sure that we are not misusing SUTime by, for instance, not calling some reset method between documents. I haven't yet found the line that makes the change that needs to be undone, though. I'll check what remedies Stanford might offer.

MihaiSurdeanu · 2020-06-24T15:07:08Z

Good point. Thanks!

kwalcock · 2020-07-02T20:08:23Z

The problem does seem to be with SUTime and I will file an issue there shortly. This here is for practice.

The rules for dealing with time are encoded in src/edu/stanford/nlp/time/rules/english.sutime.txt. The rules

  ENV.defaults["stage"] = 4
  ...
  {  pattern: ( [ { tag:/VBD/ } | /have/ ] []{0,2} [ $hasTemporal ] ),
     action: VTag( $0[-1].temporal.value, "resolveTo", RESOLVE_TO_PAST )
  }
  {  pattern: ( [ $hasTemporal ] []{0,2} [ { tag:/VBD/ } | /have/ ] ),
     action: VTag( $0[0].temporal.value, "resolveTo", RESOLVE_TO_PAST )
  }
  {  pattern: ( (/would/ | /could/ | /should/ | /will/ | /going/ /to/ | /'/ /ll/ | /'ll/ )
                []{0,2} [ $hasTemporal ]
              ),
     action: VTag( $0[-1].temporal.value, "resolveTo", RESOLVE_TO_FUTURE )
  }
  {  pattern: ( [ $hasTemporal ] []{0,2}
                (/would/ | /could/ | /should/ | /will/ | /going/ /to/ | /'/ /ll/ | /'ll/ ) ),
     action: VTag( $0[0].temporal.value, "resolveTo", RESOLVE_TO_FUTURE )
  }

arrange for a value tag (VTag) to be added to the environment. The tag's key is "resolveTo" and the value will depend on the matching pattern. This ends up happening in ValueFunctions.java where I can observe the change take place. The problem is that the environment influences other operations, the whole point of it, but that it cannot easily be reset. The first document is annotated without a resolveTo tag and SUTime acts in one way. The second document is annotated with a side effect of the resolveTo tag being added. The first document gets read again, but the side effect influences behavior and a different result gets produced. I see no support anywhere for restoring the environment to its initial condition between documents short of doing something like throwing everything away and starting with a new object, which would be very expensive on a per document basis.

Opinions to the contrary are very welcome.

MihaiSurdeanu · 2020-07-02T20:09:57Z

Nice catch!
I think you're correct.

kwalcock · 2020-07-02T22:37:04Z

Other related code is GenericTimeExpressionPatterns.java.determineRelFlags:

  public int determineRelFlags(CoreMap annotation, TimeExpression te)
  {
    int flags = 0;
    boolean flagsSet = false;
    if (te.value.getTags() != null) {
      Value v = te.value.getTags().getTag("resolveTo");
      if (v != null && v.get() instanceof Number) {
        flags = ((Number) v.get()).intValue();
        flagsSet = true;
      }
    }
    if (!flagsSet) {
      if (te.getTemporal() instanceof SUTime.PartialTime) {
        flags = SUTime.RESOLVE_TO_CLOSEST;
      }
    }
    return flags;
  }
}

and SUTime.PartialTime.resolve(). I'll reference them as well.

kwalcock · 2020-07-06T16:24:10Z

Submitted as stanfordnlp/CoreNLP#1061...

kwalcock · 2021-06-10T15:41:38Z

We just recently read 40,000 documents twice and the same phenomenon was observed. The reading changes.

MihaiSurdeanu · 2021-06-10T15:49:05Z

do you have examples of what changes? The SUTime output?

kwalcock · 2021-06-10T16:12:20Z

It's the exact same as before, which shouldn't be surprising. The week wanders around, and we have lots and lots of examples. It wasn't somehow a temporary problem. The repeatability consequence is troubling. Aside from results reported in papers, Eidos with a given version is supposed to report the same results downstream for the same document.

kwalcock self-assigned this Jun 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FastNLPProcessor annotatation is not stable #402

FastNLPProcessor annotatation is not stable #402

kwalcock commented Jun 23, 2020

MihaiSurdeanu commented Jun 23, 2020

kwalcock commented Jun 23, 2020

MihaiSurdeanu commented Jun 24, 2020

kwalcock commented Jun 24, 2020

MihaiSurdeanu commented Jun 24, 2020

kwalcock commented Jul 2, 2020

MihaiSurdeanu commented Jul 2, 2020

kwalcock commented Jul 2, 2020

kwalcock commented Jul 6, 2020

kwalcock commented Jun 10, 2021

MihaiSurdeanu commented Jun 10, 2021

kwalcock commented Jun 10, 2021

FastNLPProcessor annotatation is not stable #402

FastNLPProcessor annotatation is not stable #402

Comments

kwalcock commented Jun 23, 2020

MihaiSurdeanu commented Jun 23, 2020

kwalcock commented Jun 23, 2020

MihaiSurdeanu commented Jun 24, 2020

kwalcock commented Jun 24, 2020

MihaiSurdeanu commented Jun 24, 2020

kwalcock commented Jul 2, 2020

MihaiSurdeanu commented Jul 2, 2020

kwalcock commented Jul 2, 2020

kwalcock commented Jul 6, 2020

kwalcock commented Jun 10, 2021

MihaiSurdeanu commented Jun 10, 2021

kwalcock commented Jun 10, 2021