Skip to content

Latest commit

 

History

History
31 lines (29 loc) · 944 Bytes

README.md

File metadata and controls

31 lines (29 loc) · 944 Bytes
  • Soundex
  • Levenshtein Edit Distance
  • Levenshtein Distance in 'nltk' Library
    • Levenshtein Distance
    • Damerau-Levenshtein Distance
  • Heteronyms Detection
    • Example I
    • Example II
  • Navigating Wordnet Relationships
  • Word-Sense Disambiguation
  • Lesk Algorithm
    • Example I
    • Example II
    • Example III
  • Automatic POS Tagging + Lesk with spaCy
  • Tokenisation
    • Word Tokenisation
    • Sentence Tokenisation
    • Tweet Tokenisation
    • Custom Tokenisation (using Regex)
  • Stemmer
    • Porter Stemmer
    • Snowball Stemmer
  • Lemmatizer
    • Wordnet Lemmatizer
  • Naive Bayes Classifier