nickname-and-diminutive-names-lookup

A CSV file containing US given names (first names) and their associated nicknames or diminutive names.

This lookup file was initially created by mining this genealogy page. Because the lookup is based on a dataset compiled for genealogy purposes, it includes some old names that are rarely used today alongside more recent ones. Example entries are "gregory", "greg" and "geoffrey", "geoff". A significant effort also went into making the file machine readable: values are separated by commas and human conventions are removed, so that an entry such as "rickie(y)" becomes two separate names, "rickie" and "ricky".

There are Java, Perl and Python parsers provided for convenience.
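For readers who only need a quick lookup, the following is a minimal Python sketch of reading such a file. It is not the parser shipped with the project. It assumes that each row holds a given name in the first column followed by its nicknames, which is inferred from the examples above rather than from a documented schema, and `load_nicknames` is a hypothetical helper name. The sample rows are the two examples quoted in this README.

```python
import csv
import io
from collections import defaultdict


def load_nicknames(lines):
    """Map each given name to the set of nicknames listed for it.

    Assumption: the first cell of a row is the given name and the
    remaining cells are its nicknames. `lines` is any iterable of
    text lines, such as an open file.
    """
    lookup = defaultdict(set)
    for row in csv.reader(lines):
        # Normalise case and whitespace, and drop empty cells.
        names = [cell.strip().lower() for cell in row if cell.strip()]
        if len(names) < 2:
            continue  # skip blank rows and rows without any nickname
        given, nicknames = names[0], names[1:]
        lookup[given].update(nicknames)
    return dict(lookup)


# In-memory sample standing in for the real CSV file.
sample = io.StringIO("gregory,greg\ngeoffrey,geoff\n")
lookup = load_nicknames(sample)
print(sorted(lookup["gregory"]))  # prints ['greg']
```

To read from disk instead, pass a file opened with `open(path, newline="")` in place of the `io.StringIO` sample. Rows that repeat a given name are merged into a single set, so duplicates in the list do not cause errors.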

This is a relatively large list with about 1600 names. Any help cleaning up or extending the list is greatly appreciated. Think of it as a wiki: just request to join the project and you'll be added.

This project was created by the Web Science and Digital Libraries Research Group at Old Dominion University. More information about the creation of this lookup can be found here.