Python generator that yields cleaned-up transcripts from various sources (currently Fox and MSNBC) to build a giant corpus of text for a brain to consume. Gross!
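A minimal sketch of what the "yield cleaned-up transcripts" step could look like, assuming the raw pages have already been fetched as HTML strings. The names `clean_transcript` and `iter_clean_transcripts` are hypothetical and not taken from this project, and the cleanup rules (strip tags, decode entities, collapse whitespace) are only a guess at what "cleaned up" means here.

```python
import html
import re
from typing import Iterable, Iterator

# Hypothetical cleanup rules: the real crawler may do more (or less) than this.
_TAG_RE = re.compile(r"<[^>]+>")
_WS_RE = re.compile(r"\s+")


def clean_transcript(raw: str) -> str:
    """Strip HTML tags, decode entities, and collapse runs of whitespace."""
    text = _TAG_RE.sub(" ", raw)
    text = html.unescape(text)
    return _WS_RE.sub(" ", text).strip()


def iter_clean_transcripts(raw_pages: Iterable[str]) -> Iterator[str]:
    """Lazily yield one cleaned transcript per raw page, skipping empty ones."""
    for raw in raw_pages:
        cleaned = clean_transcript(raw)
        if cleaned:
            yield cleaned
```

Because it is a generator, the corpus can be written out one transcript at a time (for example, one per line in a text file) without holding every page in memory.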
Requires Python 3, up to and including 3.7, because gpt-simple relies on TensorFlow < 2.
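The version constraint can be checked up front before any heavy imports. This is only a sketch: the helper name `is_supported_python` is hypothetical, and the bounds simply restate the range given above.

```python
import sys

# Bounds restate the requirement above: any Python 3 release up to 3.7.
MIN_SUPPORTED = (3, 0)
MAX_SUPPORTED = (3, 7)


def is_supported_python(version=None):
    """Return True if the (major, minor) version falls in the supported range.

    `version` defaults to the running interpreter; pass a tuple to check
    another version, e.g. (3, 8, 0).
    """
    major_minor = tuple((version or sys.version_info)[:2])
    return MIN_SUPPORTED <= major_minor <= MAX_SUPPORTED


# Example guard (commented out so importing this sketch has no side effects):
# if not is_supported_python():
#     sys.exit("Python 3.0 to 3.7 is required (TensorFlow < 2)")
```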
Python helper to create and fine-tune brains from a large text corpus (such as the one generated by the webcrawler).