New Yorker Caption Preprocessing

Main Article: Image-to-Text Generation for New Yorker Cartoons

Code used for preprocessing and filtering the New Yorker cartoon captions provided in https://github.com/nextml/caption-contest-data

from preprocess_captions import get_file_id_to_captions

caption_starts = {"i", "you", "we", "he", "she", "they", "im"}
file_id_to_captions = get_file_id_to_captions(caption_starts, 15)
print(file_id_to_captions[583])

['he stole food from the office refrigerator',
 'im guessing a man bun',
 'they dont write people up the way they used to',
 ...]

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
preprocess_captions.py		preprocess_captions.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

New Yorker Caption Preprocessing

About

Releases

Packages

Languages

License

eugenet12/newyorker-caption-preprocessing

Folders and files

Latest commit

History

Repository files navigation

New Yorker Caption Preprocessing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages