Skip to content
This repository has been archived by the owner on Jan 2, 2024. It is now read-only.

Gives some statistics about English language vocabulary

License

Notifications You must be signed in to change notification settings

openderock/vocabulary-list-statistics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Vocabulary List Statistics

Gives some statistics about english language vocabulary. the base vocabulary list is gain using hackerb9/gwordlist and hermitdave/FrequencyWords lists.

Features

  • 50,000 english cleaned vocabulary. cleaned using extract-lemmatized-nonstop-words.
  • Based on revising 349,066,176,882 words.
  • Sorted by relative frequency.
  • Relative Frequency percent per word.
  • Cumulative Relative Frequency percent perword sorted by relative frequency.

Install

Using Yarn

yarn add vocabulary-list-statistics

Using NPM

npm i --save vocabulary-list-statistics

You can also use the Excel version.

Usage

const vocabulary = require('vocabulary-list-statistics');

console.log(vocabulary[12].cumulative);
// logs Cumulative Relative Frequency of 12th word in the list 

Releases

No releases published

Packages

No packages published