C-Value-Term-Extraction

This repository contains implementations of the C-value term extraction algorithm (without filters) described in:

Automatic Recognition of Multi-Word Terms: the C-value/NC-value Method. Katerina Frantzi, Sophia Ananiadou, Hideki Mima
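
For reference, the C-value measure defined in that paper can be written as below. Here |a| is the length of candidate term a in words, f(a) is its frequency in the corpus, T_a is the set of longer candidate terms that contain a, and P(T_a) is the number of such terms. This restates the paper's definition; it is not confirmed that the code in this repository follows it term by term.

$$
\text{C-value}(a) =
\begin{cases}
\log_2 |a| \cdot f(a), & \text{if } a \text{ is not nested} \\
\log_2 |a| \cdot \left( f(a) - \dfrac{1}{P(T_a)} \sum_{b \in T_a} f(b) \right), & \text{otherwise}
\end{cases}
$$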

The sample test file 'Turku.txt' was part-of-speech tagged with Stanford CoreNLP, producing 'Turku-tagged.txt'.
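
As an illustration of how tagged text can be read back into (word, tag) pairs, here is a minimal sketch. It assumes each token is written as `word_TAG` and that tokens are separated by whitespace; the actual layout of 'Turku-tagged.txt' may differ, and the function name `read_tagged` and the `sep` parameter are hypothetical, not taken from Main.py.

```python
def read_tagged(text, sep="_"):
    """Split tagged text into (word, tag) pairs.

    Assumption: tokens look like 'word_TAG'. Tokens that do not
    contain the separator are skipped.
    """
    pairs = []
    for item in text.split():
        # rpartition splits on the LAST separator, so words that
        # themselves contain the separator keep their full form.
        word, found, tag = item.rpartition(sep)
        if found:
            pairs.append((word, tag))
    return pairs


# Hypothetical sample line, not copied from the repository's data.
sample = "Turku_NNP is_VBZ a_DT city_NN"
print(read_tagged(sample))
```

If the tagger was run with a different separator, such as `/`, pass it as `sep`.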

python3 Main.py path_to_/Turku-tagged.txt ligui_filter max_len freq_threshold C_Value_threshold

The program prints the ten terms with the highest C-values.
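
To make the ranking step concrete, the sketch below scores candidate terms and returns the highest-scoring ones. It uses the plain definition of C-value from the paper: log2 of the term length in words, multiplied by the term's frequency, reduced by the mean frequency of the longer candidates that contain it when there are any. The names `is_nested`, `c_values`, `top_terms` and the toy frequencies are hypothetical and do not come from Main.py, which may compute the score differently.

```python
import math


def is_nested(short, long):
    """True if `short` occurs as a contiguous word sequence inside a longer `long`."""
    n, m = len(short), len(long)
    return n < m and any(long[i:i + n] == short for i in range(m - n + 1))


def c_values(freq):
    """Map each candidate (a tuple of words) to its C-value.

    `freq` maps candidate terms to corpus frequencies.
    """
    scores = {}
    for a, f_a in freq.items():
        # Frequencies of longer candidates that contain `a`.
        containers = [f_b for b, f_b in freq.items() if is_nested(a, b)]
        if containers:
            adjusted = f_a - sum(containers) / len(containers)
        else:
            adjusted = f_a
        scores[a] = math.log2(len(a)) * adjusted
    return scores


def top_terms(freq, k=10):
    """Return the k candidates with the highest C-value, best first."""
    scores = c_values(freq)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:k]


# Toy frequencies, invented for illustration only.
toy_freq = {
    ("basal", "cell", "carcinoma"): 10,
    ("cell", "carcinoma"): 12,
    ("ulcerated", "basal", "cell", "carcinoma"): 4,
}

for term, score in top_terms(toy_freq):
    print(" ".join(term), round(score, 3))
```

Note that a single-word candidate always scores 0 under this definition, because log2(1) = 0; the measure targets multi-word terms.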
