evaluation

Evaluation and agreement scripts for the DISCOSUMO project. Each evaluation script takes both the manual annotations and the automatic summarization output as input. The formatting of these files is highly project-specific. However, the evaluation functions for precision, recall, ROUGE, Jaccard, Cohen's kappa and Fleiss' kappa may be applicable to other domains too.
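For readers who want to reuse the metrics outside this project, here is a minimal sketch of set-based precision, recall and Jaccard, plus Cohen's kappa for two raters. It is not the code from the scripts in this repository: the function names and the input shapes (collections of post ids, and two equally long label lists) are assumptions made for illustration.

```python
def precision_recall(selected, reference):
    """Precision and recall of a selected set of post ids against a reference set."""
    selected, reference = set(selected), set(reference)
    overlap = len(selected & reference)
    precision = overlap / len(selected) if selected else 0.0
    recall = overlap / len(reference) if reference else 0.0
    return precision, recall


def jaccard(a, b):
    """Jaccard similarity of two selections (1.0 if both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two raters who labelled the same items in the same order."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both raters must label the same non-empty list of items")
    labels_a, labels_b = list(labels_a), list(labels_b)
    n = len(labels_a)
    # Observed agreement: fraction of items with identical labels.
    observed = sum(x == y for x, y in zip(labels_a, labels_b)) / n
    # Chance agreement from each rater's own label distribution.
    categories = set(labels_a) | set(labels_b)
    expected = sum(
        (labels_a.count(c) / n) * (labels_b.count(c) / n) for c in categories
    )
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1 - expected)
```

With a system selection of `["p1", "p2", "p3"]` and a reference of `["p2", "p3", "p4", "p5"]`, two posts overlap, so precision is 2/3, recall is 1/2 and Jaccard is 2/5.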

The script agreement_and_eval.py also implements three baselines: random, position and length.
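The README does not say how the three baselines are defined. The sketch below shows one plausible reading, under clearly labelled assumptions: a thread is a list of `(post_id, text)` pairs in thread order, each baseline selects `k` posts, "position" means the first `k` posts and "length" means the `k` longest posts by word count. All names here are hypothetical and may differ from what agreement_and_eval.py actually does.

```python
import random


def random_baseline(posts, k, seed=None):
    """Assumed 'random' baseline: k post ids sampled uniformly without replacement."""
    rng = random.Random(seed)
    return [pid for pid, _ in rng.sample(posts, min(k, len(posts)))]


def position_baseline(posts, k):
    """Assumed 'position' baseline: the first k posts of the thread."""
    return [pid for pid, _ in posts[:k]]


def length_baseline(posts, k):
    """Assumed 'length' baseline: the k posts with the most whitespace-separated words."""
    # sorted() is stable, so ties keep their original thread order.
    ranked = sorted(posts, key=lambda p: len(p[1].split()), reverse=True)
    return [pid for pid, _ in ranked[:k]]
```

Passing a fixed `seed` to the random baseline makes its selection repeatable, which helps when comparing runs.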
The script eval.py is the only script that assumes that the ground truth labels are in the feature file (also for the oracle ranking) and therefore takes only one argument.
The script agreement_fleisskappa.py does not evaluate system output; it only computes Fleiss' kappa and takes only the manual annotations as input.
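As an illustration of the statistic itself, here is a minimal sketch of Fleiss' kappa using the standard formula. It is not the code from agreement_fleisskappa.py. The input format is an assumption: a matrix `counts` in which `counts[i][j]` is the number of raters who assigned item `i` to category `j`, with every item rated by the same number of raters.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa for a matrix of per-item category counts."""
    if not counts:
        raise ValueError("counts must contain at least one item")
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2:
        raise ValueError("at least two raters are required")
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every item must be rated by the same number of raters")
    n_categories = len(counts[0])

    # Share of all assignments that went to each category.
    p_j = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    # Agreement per item: fraction of rater pairs that agree on that item.
    p_i = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_i) / n_items        # mean observed agreement
    p_e = sum(p * p for p in p_j)     # agreement expected by chance
    if p_e == 1.0:
        return 1.0  # all raters used one category for every item
    return (p_bar - p_e) / (1 - p_e)
```

For example, with three raters and two categories (say, "selected" and "not selected"), the matrix `[[2, 1], [1, 2], [3, 0], [0, 3]]` gives a mean observed agreement of 2/3 and a chance agreement of 1/2, so kappa is 1/3.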

The files postids_per_thread.queries.txt, 106long20threads.postfeats.norm.out and 106long20threads.threadfeats.out are needed by some of the scripts to look up the post ids per thread or the number of posts per thread.

License

See the LICENSE file for license rights and limitations (GNU-GPL v3.0).
