Skip to content

tushark01/Natural-Language-Processing

 
 

Repository files navigation

Natural Language Processing (from scratch to GPT-3)

Note: This repo is updated regularly as I learn. If you want to learn NLP, just start from the first point and go on till the bottom. Everything is hierarchically arranged (from basic concepts to advanced)


Basic stuff

Cosine Similarity


Text Processing

Tokenization

Spacy

Spacy comes really handy to perform NLP tasks at state-of-the-art computation speeds. Here are some tutorials to get familiar with it


Stemming and Lemmatization

Videos

Articles

Code


Bag Of Words

Videos

Articles

Code


TF-IDF

Videos

Articles

Code


Great you are now done with some of the basics, time to implement a basic project

SpamClassifier Project


Great, after implementing a basic project it's time to get a bit mathematical

Watch the first lecture of the most sought after course of NLP (CS224N by Stanford)

CS224N Lecture-1

Suggested Readings


Word2Vec

After watching above lecture and going through the suggested readings (Stanford CS224N), let's understand more about word2vec

Code


Great, now it's time to do one more projects to solidify the concepts learnt so far

Predict Stock Price Movement Based on News Headlines


More about Word Embeddings (CS224N - Lecture-2)

Suggested Readings

Additional Readings


We have been seeing word embeddings applied in NLP to get the vector representation of the words. But now let's try them on tabular dataset with categorical features. We will convert the categorical features in word embeddings rather than traditional approaches like one-hot encoding, label encoding, etc.

Word Embeddings for Categorical Features


Let's now move on to the deep learning part of NLP.

Refresher on Neural Networks

CS224N - Lecture-3 & 4 (Neural Networks and Backpropagation)

Suggested Readings

Additional Readings

RNN

This material will get you started with RNN

Videos

Articles

Code

LSTM

Videos

Articles

Code

Bidirectional RNN

Videos

Articles

Code


Sequence to Sequence Learning

  • Will add resources here soon (forgot to add while learning :( )

Attention Mechanism

Articles

Code


Transformers

Articles

Videos

About

Documenting all the concepts and codes learned in NLP

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 96.5%
  • Python 3.5%