NLP from scratch Series 1: Char-level Lannguage Model (LM) Bigram count-based LM Bigram Neural Network LM Transformer based LM