addition of grammaticalGender to ATTLIST for Lexical Entry and Form? #64

Open
anasfkhan81 opened this issue Jul 6, 2022 · 2 comments

anasfkhan81 commented Jul 6, 2022

This would be useful for languages with grammatical gender. In languages like Italian, the same Lexical Entry can have singular and plural forms of different genders, so it would be good to have this in the ATTLIST for Form too. I also think part of speech would make more sense as part of the ATTLIST for Lexical Entry if we start adding grammatical attributes at the level of Lexical Entry.

Member

jmccrae commented Jul 6, 2022

This seems very reasonable. Do we have any examples of existing wordnets that we could base this on?

Member

goodmami commented Jul 7, 2022

Something like this would be very useful. My initial reaction is that a dedicated attribute for gender isn't great, because many languages do not use grammatical gender, but then again we already have non-universal things like adjposition and partOfSpeech (and its limited values). But also, what about other features, like number, person, tense, aspect, etc.?

Currently we can use <Tag>:

<LexicalEntry id="oewn-goose-n">
  <Lemma writtenForm="goose" partOfSpeech="n">
    <Tag category="NUM">sg</Tag>
  </Lemma>
  <Form writtenForm="geese">
    <Tag category="NUM">pl</Tag>
  </Form>
  ...

But this does seem excessively verbose, and would be worse with multiple features. It might be nice to have something that encodes multiple features in some regular way on a single attribute, e.g., following the features of Unimorph:

  <LexicalEntry id="oewn-sleep-v">
    <Lemma writtenForm="sleep" partOfSpeech="v" />
    <Form writtenForm="sleeps" features="3;SG;PRS;IND" />
    <Form writtenForm="slept" features="PST" />
  ...
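
To make the single-attribute idea a bit more concrete, here is a rough Python sketch of how a consumer might read it. It assumes the hypothetical `features` attribute from the snippet above (not part of the current schema) with semicolon-separated labels; the helper names `parse_features` and `forms_with_features` are made up for illustration, and the sample closes the `LexicalEntry` element early (leaving out the elided content) only so that it parses.

```python
import xml.etree.ElementTree as ET

# Closed-off copy of the fragment above. The `features` attribute is only
# the proposal from this comment; it is not defined in the current schema.
SAMPLE = """
<LexicalEntry id="oewn-sleep-v">
  <Lemma writtenForm="sleep" partOfSpeech="v" />
  <Form writtenForm="sleeps" features="3;SG;PRS;IND" />
  <Form writtenForm="slept" features="PST" />
</LexicalEntry>
"""


def parse_features(value):
    """Split a semicolon-separated feature string into a list of labels."""
    return [label.strip() for label in value.split(";") if label.strip()]


def forms_with_features(entry_xml):
    """Map each Form's writtenForm to its list of feature labels."""
    entry = ET.fromstring(entry_xml)
    return {
        form.get("writtenForm"): parse_features(form.get("features", ""))
        for form in entry.iter("Form")
    }


forms = forms_with_features(SAMPLE)
print(forms)
```

A form without the attribute simply gets an empty list, so the attribute could stay optional for languages or entries that have no use for it.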
