This project aims at casting the French lexicon data from Wiktionary into an OWL ontology. Check it out at chalier.fr/flont!
The French Wiktionary is an immense source of lexical knowledge about the French language. Its community model allowed it to reach a broad coverage of the language, making it one of the best sources available online. Moreover, it's open-sourced. Yet, its content is hardly machine readable, preventing its usage in general application. We tackle this by parsing the Wiktionary data and populating an handcrafted ontology.
- Input data: Wiktionary
- WikiText parsing: WikiTextParser
- Python OWL interface: Owlready
- Ontology crafting: Protégé
- Web application framework: Django
Contributions are welcomed. See the readmes in subdirectories for documentation on how to get started with the code. See the to-do list for inputs on what you can do. Write clean code (use a linter). Then send a pull request with your changes.