Skip to content

Latest commit

 

History

History
15 lines (11 loc) · 3.37 KB

File metadata and controls

15 lines (11 loc) · 3.37 KB

Product Recommendation by Azure OpenAI Embedding model (text-embedding-ada-002)

In a previous repo (Content Based Product Recommendation Samples), I showed how to use TF-IDF to vectorize product key phrases and recommend products based on cosine similarity. Now, I have improved the demo by using Azure OpenAI’s Embedding model (text-embedding-ada-002), which has a powerful word embedding capability. This model can also vectorize product key phrases and recommend products based on cosine similarity, but with better results. You can find the updated repo here.

TF-IDF is a statistical measure that assigns a weight to each word based on how frequently it appears in a document and how rare it is across all documents. Word embedding is a technique that converts a word to a vector of numbers that captures its semantic and syntactic features. Words that are similar in meaning or usage have similar vectors, while words that are different have different vectors. TF-IDF vectors are sparse and do not account for semantic similarities between words, while word embedding vectors are dense and can capture the relationships between words1. Word embedding can also handle words that are not in the vocabulary by using subword information1, while TF-IDF can only use the words that are in the predefined vocabulary.

Content:

Enjoy!