Skip to content

Subrata15/Text-Name-and-Brand-Similarity

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text-Name-and-Brand-Similarity

Sometimes when we crawl or got data text, we confuse because the data similar and close to another sentences, such as name of brand, head of news, head of journal, or any other sentences. So I create a similarity check function to help us know how similar sentences to each other. This project contain :

Cosine Similarity
Difflib Get Close Match (based on Fuzzy Logic)
ReGex
Pandas
Numpy
Math
Time
Sys
OS

To test this project, you can run python main.py.
To know how to use cosine similarity function, you can open and check in the cosine_function.ipynb
*** Note : The output of main.py is data table with close match and similarity, you can custom the treshold inside this code, enjoy ***

About

Using Pandas, Numpy, Difflib, Math, ReGex, Time, and Sys

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published