-
Notifications
You must be signed in to change notification settings - Fork 4
LisbonDocumentationDiscussion
ChikaraHashimoto edited this page Aug 28, 2005
·
11 revisions
FrancisBond presented a brief introduction and a related work that is in progress in NTT (TheSlides). After that, the participants discussed how to document, how to motivate people to write documents, among others.
- It is hard to keep track of grammars and tools that are being developed by DELPH-IN members all over the world.
- So we are using the DELPH-IN wikis and mailing-lists, but these are still incomplete.
- We need documentations.
- Lexical Type Documentation for the Japanese grammar, JACY
- In NTT's HINOKI treebanking project, treebankers are not grammar developers nor linguists. They need a comprehensive and detailed documentation, especially for lexical types, to annotate corpora.
- Requirements:
- Semi-automatic documentation
- Flexible browsing
- Visible to members all over the world
- Key ideas:
- Put every information into Relational Database
- Browsing it through the Web
- Description on a high level of the linguistic analysis is often scattered in different papers. It is useful to collect electronic versions of as much papers as possible links.
- Granularity of documentation and modes of documentation depends on users, purposes, or what you want to do with it.
- Modes of documentation
- documentation for "internal" people, technical, in the grammar files, viewable from outside
- detailed documentation about phenomena, book or papers.
- very external documentation about installation etc, online
- The grammar could be annotated with examples, typically for the rules and lexical types.
- Documentation process should be automated as much as one can, as the NTT system does.
- Grammar writers should set up multi-language test suites on certain phenomena and differences in the grammars; something like the MRS testsuite for all grammars.
- Top-level documentation has to be done not only for grammars but also for systems.
- The model in which one system is maintained by one developer is frightening. We need documentation to change this situation.
- The Spanish grammar started with documenting each line of implementation, different phenomena with examples, types. So, it is documented by writing the grammar.
- Some things that have been done for one purpose will be used for another purpose. So, the documentation will have to be adapted to the needs.
- Researchers need things that can be inserted into one's CV, and LSA counts electronic documentations as publications, and so does W3C. This can motivate people to write documents.
- A reason for documentation is also to keep track for oneselves, another is to keep track of Matrix adaptations, to support the Matrix development.
- Interests for documentation it to set up a standard, discussion in the community, comparability of the grammars (how do we want to compare?)
- High-level documentation can be published. low-level documentation, navigation, like the lexical type documentation is very useful, on what scale will we need it?
Home | Forum | Discussions | Events