ZipfAnatomy - supplementary materials

This is a collection of bash, Perl, and gnuplot scripts as well as tables and figures accompanying my paper "Corrections of Zipf's and Heaps' Laws Derived from Hapax Rate Models".

Supplementary plots

If you are primarily interested in analysing the supplementary plots for my paper, go to directories herdan/ and zipf/ and browse the PDF files.

Reproducing the results on another dataset

Before running these scripts, you need to download plain texts and frequency lists (these were from the Project Gutenberg corpus and the National Corpus of Polish in my case) and to put them in appropriate directories. To work with other data, you are invited to modify scripts:

make_init_herdan.bash

make_init_zipf.bash

The scripts assume that that the Perl scripts from my Github TypeToken repository are reachable from the bash variable $PATH.
The main script is make.bash, it calls everything else and it generates directories herdan/ and zipf/ that contain tables and figures.
There are two parallel pipelines of text processing. The first pipeline assumes that you have plain text files as the input. These are the "herdan" files. Another pipeline assumes that you have frequency lists as the input. These are the "zipf" files.

Good luck!

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
herdan		herdan
zipf		zipf
LICENSE		LICENSE
README.md		README.md
fitlog_herdan.txt		fitlog_herdan.txt
fitlog_zipf.txt		fitlog_zipf.txt
make.bash		make.bash
make_combination_herdan.bash		make_combination_herdan.bash
make_combination_herdan.gpl		make_combination_herdan.gpl
make_combination_top.bash		make_combination_top.bash
make_combination_top.gpl		make_combination_top.gpl
make_combination_zipf.bash		make_combination_zipf.bash
make_combination_zipf.gpl		make_combination_zipf.gpl
make_init_herdan.bash		make_init_herdan.bash
make_init_zipf.bash		make_init_zipf.bash
make_parameters_herdan_1.pl		make_parameters_herdan_1.pl
make_parameters_herdan_2.pl		make_parameters_herdan_2.pl
make_parameters_zipf_1.pl		make_parameters_zipf_1.pl
make_parameters_zipf_2.pl		make_parameters_zipf_2.pl
make_rank.pl		make_rank.pl
make_rank_sparse.pl		make_rank_sparse.pl
make_tokens.pl		make_tokens.pl
parameters_herdan.txt		parameters_herdan.txt
parameters_zipf.txt		parameters_zipf.txt
token_ratio.eps		token_ratio.eps
token_ratio.pdf		token_ratio.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ZipfAnatomy - supplementary materials

Supplementary plots

Reproducing the results on another dataset

About

Releases

Packages

Languages

License

lukasz-debowski/ZipfAnatomy

Folders and files

Latest commit

History

Repository files navigation

ZipfAnatomy - supplementary materials

Supplementary plots

Reproducing the results on another dataset

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages