Vocabulary Richness Calculator

This Python script calculates and compares the vocabulary richness of two text files. It reports which text has the richer vocabulary, measured as the ratio of unique words to the total number of words.
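The ratio described above can be sketched in a few lines. This is a minimal illustration and not the script's actual code: the function name `type_token_ratio` and the regex-based word splitting are assumptions made for the example.

```python
import re


def type_token_ratio(text: str) -> float:
    """Return unique words / total words for the text (0.0 if it has no words)."""
    # Assumption: a "word" is a run of letters or apostrophes, compared
    # case-insensitively. The real script may tokenize differently.
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    return len(set(words)) / len(words)


# "the" appears twice, so there are 5 unique words out of 6 in total.
print(type_token_ratio("the cat sat on the mat"))
```

A value closer to 1.0 means fewer repeated words.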

Prerequisites

Python 3.x
NLTK library (Natural Language Toolkit)

You can install NLTK using pip:

pip install nltk

Usage

Clone the Repository: Clone this repository to your local machine.
Place Text Files: Place the text files you want to analyze in the same directory as this script.
Run the Script: Open your terminal or command prompt, navigate to the project directory, and execute the following command:
```
python vocabulary.py
```

Customization

You can easily modify this script to work with other text files by changing the file names in the read_book1 and read_book2 functions.

Feel free to adjust the lemmatization or tokenization methods to suit your specific requirements.
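One way to make tokenization and lemmatization swappable is to pass them in as functions. The sketch below is hypothetical: `richness`, `regex_tokenize`, `lowercase`, and `strip_plural` are names invented for this example, and `strip_plural` is only a crude stand-in for a real lemmatizer.

```python
import re
from typing import Callable, List


def regex_tokenize(text: str) -> List[str]:
    """Split text into runs of letters and apostrophes (illustrative tokenizer)."""
    return re.findall(r"[A-Za-z']+", text)


def lowercase(word: str) -> str:
    """Default normalizer: case-fold only."""
    return word.lower()


def strip_plural(word: str) -> str:
    """Crude stand-in for a lemmatizer: lowercase and drop a trailing 's'."""
    word = word.lower()
    return word[:-1] if word.endswith("s") and len(word) > 3 else word


def richness(
    text: str,
    tokenize: Callable[[str], List[str]] = regex_tokenize,
    normalize: Callable[[str], str] = lowercase,
) -> float:
    """Unique normalized tokens divided by total tokens (0.0 if there are none)."""
    tokens = [normalize(token) for token in tokenize(text)]
    return len(set(tokens)) / len(tokens) if tokens else 0.0


sample = "Cats chase cats; a cat chases dogs."
print(richness(sample))                          # 6 unique of 7 tokens
print(richness(sample, normalize=strip_plural))  # 4 unique of 7 tokens
```

With NLTK installed and its data downloaded, `nltk.word_tokenize` and `WordNetLemmatizer().lemmatize` have compatible call shapes and could be passed as `tokenize` and `normalize` instead.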

Example

Suppose you have two text files, "dorian.txt" and "jekyll.txt", containing the text of "The Picture of Dorian Gray" and "The Strange Case of Dr. Jekyll and Mr. Hyde", respectively. Running the script tells you which book has the richer vocabulary.
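The comparison in this example could look roughly like the following. It is a sketch under stated assumptions, not the script itself: `richness_of_file` and `richer_file` are hypothetical helpers, the files are assumed to be UTF-8, and words are split with a simple regex.

```python
import re
from pathlib import Path


def richness_of_file(path: str) -> float:
    """Unique words / total words for one text file (0.0 if it has no words)."""
    # Assumption: the file is UTF-8 and words are runs of letters/apostrophes.
    text = Path(path).read_text(encoding="utf-8")
    words = re.findall(r"[a-z']+", text.lower())
    return len(set(words)) / len(words) if words else 0.0


def richer_file(path_a: str, path_b: str):
    """Return the path with the higher ratio, or None if the ratios are equal."""
    ratio_a = richness_of_file(path_a)
    ratio_b = richness_of_file(path_b)
    if ratio_a == ratio_b:
        return None
    return path_a if ratio_a > ratio_b else path_b


# Only runs the comparison when both example files are actually present.
if Path("dorian.txt").exists() and Path("jekyll.txt").exists():
    winner = richer_file("dorian.txt", "jekyll.txt")
    print("Tie" if winner is None else f"Richer vocabulary: {winner}")
```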

Author

Cristina Matacuta

License

This project is licensed under the MIT License - see the LICENSE.md file for details.
