Contextual Word Representation: BERT Suggested Readings: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding