Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 299 Bytes

README.md

File metadata and controls

2 lines (2 loc) · 299 Bytes

Webpage-Similarity-II

This is a modified version of the Webpage-Similarity project. With the addition of 190 more wikipedia pages, a more efficient method of data store is required. The main focus of this project is to integrate persistent data stores and switch the similarity metric to TF-IDF.