mounicam/lexical_simplification

A Word-Complexity Lexicon and A Neural Readability Ranking Model for Lexical Simplification

This repository contains the code and resources from the paper of the same title (see Citation).

Repo Structure:

  1. word_complexity_lexicon: Lexicon with complexity scores for the ~15,000 most frequent words in the Google Ngram Corpus. The scores are calculated by aggregating human ratings. We release both the aggregated ratings and the individual ratings from each annotator.

  2. SimplePPDBpp: The SimplePPDB++ resource, consisting of around 14.1 million paraphrase rules along with their readability scores.

  3. neural_readability_ranker: Code for our neural readability ranker model.
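
Because the lexicon ships both per-annotator and aggregated ratings, the aggregation step can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the `(word, annotator_id, rating)` row layout, the function name `aggregate_ratings`, the toy rating values, and the use of a plain mean as the aggregation rule. Check the files in word_complexity_lexicon for the actual format and the paper for the actual aggregation method.

```python
from collections import defaultdict
from statistics import mean


def aggregate_ratings(rows):
    """Average per-annotator ratings into one score per word.

    `rows` is an iterable of (word, annotator_id, rating) tuples.
    This layout is hypothetical, not the repository's file format.
    """
    ratings_by_word = defaultdict(list)
    for word, _annotator_id, rating in rows:
        ratings_by_word[word].append(float(rating))
    # Assumption: a simple mean; the released scores may be aggregated differently.
    return {word: mean(ratings) for word, ratings in ratings_by_word.items()}


# Toy data, made up for illustration only.
sample_rows = [
    ("cat", "annotator_1", 1),
    ("cat", "annotator_2", 2),
    ("ubiquitous", "annotator_1", 5),
    ("ubiquitous", "annotator_2", 6),
]

scores = aggregate_ratings(sample_rows)
for word, score in sorted(scores.items()):
    print(f"{word}\t{score:.2f}")
```

To use this with the released per-annotator file, only the parsing of rows would need to change to match the real column layout.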

Citation

Please cite the following paper if you use the above resources in your research:

@InProceedings{EMNLP-2018-Maddela,
  author    = "Maddela, Mounica and Xu, Wei",
  title     = "A Word-Complexity Lexicon and A Neural Readability Ranking Model for Lexical Simplification",
  booktitle = "Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)",
  year      = "2018"
}
