Note: This repo is updated regularly as I learn. If you want to learn NLP, just start from the first point and work your way to the bottom. Everything is arranged hierarchically, from basic concepts to advanced.
- A comprehensive article on tokenization (covers subword tokenization, Byte Pair Encoding, Unigram Subword Tokenization, WordPiece, SentencePiece); a minimal BPE sketch follows this list
- Subword Tokenization
- Byte Pair Encoding
- Byte Pair Encoding - Wikipedia (simplest explanation)
- Byte Pair Encoding - Paper
- Byte Pair Encoding (video) - Abhishek Thakur
- Comprehensive notebook
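To make the BPE merge loop concrete, here is a minimal training sketch in plain Python, adapted from the algorithm described in the paper above (the toy corpus and the number of merges are made up for illustration):

```python
import re
from collections import Counter

def get_pair_counts(vocab):
    """Count how often each adjacent symbol pair occurs, weighted by word frequency."""
    pairs = Counter()
    for word, freq in vocab.items():
        symbols = word.split()
        for left, right in zip(symbols, symbols[1:]):
            pairs[(left, right)] += freq
    return pairs

def merge_pair(pair, vocab):
    """Merge every occurrence of `pair` into a single new symbol."""
    pattern = re.compile(r"(?<!\S)" + re.escape(" ".join(pair)) + r"(?!\S)")
    return {pattern.sub("".join(pair), word): freq for word, freq in vocab.items()}

# Toy corpus: each word is pre-split into characters plus an end-of-word marker.
vocab = {"l o w </w>": 5, "l o w e r </w>": 2, "n e w e s t </w>": 6, "w i d e s t </w>": 3}

for _ in range(10):  # learn 10 merge rules
    pairs = get_pair_counts(vocab)
    best = max(pairs, key=pairs.get)
    print("merge:", best)
    vocab = merge_pair(best, vocab)

print(vocab)
```

Frequent character sequences like `es` and `est` get merged into single tokens first, which is exactly why BPE handles rare words gracefully: they fall back to smaller, known subwords.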
spaCy comes in really handy for performing NLP tasks at state-of-the-art speeds. Here are some tutorials to get familiar with it
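As a quick taste before the tutorials, here is a minimal sketch of a spaCy pipeline (it assumes the small English model has been installed with `python -m spacy download en_core_web_sm`):

```python
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("Apple is looking at buying a U.K. startup for $1 billion.")

# Tokenization, part-of-speech tags, and lemmas come out of one pipeline pass.
for token in doc:
    print(token.text, token.pos_, token.lemma_)

# Named entities are available on the same Doc object.
for ent in doc.ents:
    print(ent.text, ent.label_)
```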
Great, after implementing a basic project, it's time to get a bit mathematical
Watch the first lecture of the most sought-after NLP course (CS224N by Stanford)
- Efficient Estimation of Word Representations in Vector space - original word2vec paper
- Distributed Representations of Words and Phrases and their Compositionality - negative sampling paper
After watching the above lecture and going through the suggested readings (Stanford CS224N), let's understand more about word2vec
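A minimal sketch of training a skip-gram word2vec model with negative sampling, matching the two papers above. It assumes gensim >= 4.0; the tiny corpus is only for illustration, as real training needs far more data:

```python
from gensim.models import Word2Vec

# Toy corpus: a list of tokenized sentences.
sentences = [
    ["the", "king", "rules", "the", "kingdom"],
    ["the", "queen", "rules", "the", "kingdom"],
    ["the", "cat", "sat", "on", "the", "mat"],
]

# sg=1 selects skip-gram; negative=5 enables negative sampling.
model = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=1, negative=5)

print(model.wv["king"].shape)          # (50,) dense vector for "king"
print(model.wv.most_similar("king"))   # nearest neighbours in embedding space
```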
Great, now it's time to do one more project to solidify the concepts learnt so far
Suggested Readings
- GloVe: Global Vectors for Word Representation (original GloVe paper)
- Improving Distributional Similarity with Lessons Learned from Word Embeddings
- Evaluation methods for unsupervised word embeddings
Additional Readings
- Linear Algebraic Structure of Word Senses, with Applications to Polysemy
- On the Dimensionality of Word Embedding
- A Latent Variable Model Approach to PMI-based Word Embeddings
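To experiment with the ideas from the readings above, you can load pretrained GloVe vectors through gensim's downloader API (a sketch; `glove-wiki-gigaword-100` is one of the pretrained sets published via gensim-data, and the first call downloads it):

```python
import gensim.downloader as api

# Pretrained 100-dimensional GloVe vectors trained on Wikipedia + Gigaword.
glove = api.load("glove-wiki-gigaword-100")

print(glove["water"].shape)                  # (100,)
print(glove.most_similar("water", topn=3))   # nearest neighbours

# The classic analogy query: king - man + woman ~ queen
print(glove.most_similar(positive=["king", "woman"], negative=["man"], topn=1))
```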
So far we have seen word embeddings applied in NLP to get vector representations of words. Now let's try them on a tabular dataset with categorical features: we will convert the categorical features into embeddings rather than using traditional approaches like one-hot encoding or label encoding.
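A minimal PyTorch sketch of the idea (the column, the category count, and the embedding size are hypothetical): each label-encoded category index is mapped to a dense, trainable vector instead of a sparse one-hot vector.

```python
import torch
import torch.nn as nn

# Hypothetical "city" column with 4 distinct categories, each mapped
# to a dense 3-d vector instead of a 4-d one-hot vector.
num_categories, emb_dim = 4, 3
embedding = nn.Embedding(num_categories, emb_dim)

# Label-encode the column first (string -> integer index), then embed.
city_ids = torch.tensor([0, 2, 1, 3, 2])   # a batch of 5 rows
city_vectors = embedding(city_ids)          # shape: (5, 3), learned during training

print(city_vectors.shape)
```

The embedding weights are trained jointly with the rest of the model, so categories that behave similarly for the task end up with similar vectors.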
Let's now move on to the deep learning part of NLP.
Suggested Readings
- Matrix Calculus Notes
- Review of differential calculus
- CS231N notes on network architecture
- CS231N notes on backprop (a worked backprop sketch follows these readings)
- Derivatives, Backpropagation, and Vectorization
- Learning representations by back-propagating errors
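To tie the calculus notes together, here is a hand-written backprop sketch for a tiny one-hidden-layer regression network in NumPy (the sizes and learning rate are arbitrary toy choices):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 3))   # 4 examples, 3 features
y = rng.normal(size=(4, 1))   # regression targets
W1 = rng.normal(size=(3, 5))
W2 = rng.normal(size=(5, 1))

for step in range(100):
    # Forward pass
    h = np.maximum(0, X @ W1)          # ReLU hidden layer
    y_hat = h @ W2
    loss = np.mean((y_hat - y) ** 2)

    # Backward pass: chain rule, layer by layer
    d_yhat = 2 * (y_hat - y) / len(y)  # dL/dy_hat
    dW2 = h.T @ d_yhat
    dh = d_yhat @ W2.T
    dh[h <= 0] = 0                     # gradient of ReLU
    dW1 = X.T @ dh

    # Gradient descent update
    W1 -= 0.01 * dW1
    W2 -= 0.01 * dW2

print(round(float(loss), 4))
```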
Additional Readings
This material will get you started with RNNs
- Will add resources here soon (forgot to save them while learning :( )
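Until those links land, here is a minimal PyTorch sketch showing the shapes involved in a single-layer RNN (all sizes are arbitrary):

```python
import torch
import torch.nn as nn

# A single-layer Elman RNN over a toy batch.
rnn = nn.RNN(input_size=8, hidden_size=16, batch_first=True)

x = torch.randn(2, 5, 8)     # batch of 2 sequences, 5 timesteps, 8 features
h0 = torch.zeros(1, 2, 16)   # initial hidden state: (layers, batch, hidden)

output, hn = rnn(x, h0)
print(output.shape)          # (2, 5, 16): hidden state at every timestep
print(hn.shape)              # (1, 2, 16): final hidden state
```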
- Seq to Seq Models with Attention - Jay Alammar
- Attention in detail
- Attention Mechanism in Deep Learning
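The articles above cover several attention variants; here is a sketch of the simplest scoring scheme, scaled dot-product attention, in PyTorch (the shapes are toy choices):

```python
import torch
import torch.nn.functional as F

def dot_product_attention(query, keys, values):
    """Weight the values by how well each key matches the query."""
    d_k = query.size(-1)
    scores = query @ keys.transpose(-2, -1) / d_k ** 0.5  # similarity to each key
    weights = F.softmax(scores, dim=-1)                    # normalize to a distribution
    return weights @ values, weights

# 1 query attending over 4 encoder states of size 8.
q = torch.randn(1, 8)
K = torch.randn(4, 8)
V = torch.randn(4, 8)

context, attn = dot_product_attention(q, K, V)
print(context.shape, attn.shape)  # torch.Size([1, 8]) torch.Size([1, 4])
```

In a seq2seq decoder, the query is the current decoder state, the keys and values are the encoder outputs, and the returned context vector is what gets fed into the next decoding step.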