This is a modified version of the Webpage-Similarity project. With 190 more Wikipedia pages added, the project needs a more efficient method of data management. Its main focus is clustering the pages and using persistent data stores and extendible hashing for quick data retrieval.
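To illustrate the extendible hashing idea mentioned above, here is a minimal in-memory sketch. It is not the project's actual code: the class name `ExtendibleHashSketch`, the string keys and values, and the bucket capacity are all assumptions made for illustration, and a real persistent store would write each bucket to disk rather than keep it in a `HashMap`.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of extendible hashing; names and sizes are illustrative only.
public class ExtendibleHashSketch {
    // A bucket stores a bounded number of entries and remembers how many
    // low-order hash bits all of its keys share (its local depth).
    private static final class Bucket {
        int localDepth;
        final Map<String, String> entries = new HashMap<>();
        Bucket(int localDepth) { this.localDepth = localDepth; }
    }

    private static final int MAX_DEPTH = 20; // guard against keys with identical hashes
    private final int bucketCapacity;
    private int globalDepth = 1;
    private final List<Bucket> directory = new ArrayList<>();

    public ExtendibleHashSketch(int bucketCapacity) {
        if (bucketCapacity < 1) throw new IllegalArgumentException("capacity must be >= 1");
        this.bucketCapacity = bucketCapacity;
        directory.add(new Bucket(1));
        directory.add(new Bucket(1));
    }

    // The directory is indexed by the lowest globalDepth bits of the hash.
    private int slot(String key) {
        return key.hashCode() & ((1 << globalDepth) - 1);
    }

    public String get(String key) {
        return directory.get(slot(key)).entries.get(key);
    }

    public int getGlobalDepth() { return globalDepth; }

    public void put(String key, String value) {
        while (true) {
            Bucket b = directory.get(slot(key));
            if (b.entries.containsKey(key) || b.entries.size() < bucketCapacity) {
                b.entries.put(key, value);
                return;
            }
            split(b); // bucket is full: split it and retry
        }
    }

    private void split(Bucket full) {
        if (full.localDepth == globalDepth) {
            if (globalDepth >= MAX_DEPTH) throw new IllegalStateException("too many colliding keys");
            // Double the directory: slot i + 2^d initially shares the bucket of slot i.
            directory.addAll(new ArrayList<>(directory));
            globalDepth++;
        }
        int bit = 1 << full.localDepth; // the new bit that separates the two halves
        full.localDepth++;
        Bucket sibling = new Bucket(full.localDepth);
        // Re-point the directory slots that have the new bit set.
        for (int i = 0; i < directory.size(); i++) {
            if (directory.get(i) == full && (i & bit) != 0) directory.set(i, sibling);
        }
        // Move the entries whose hash has the new bit set into the sibling.
        full.entries.entrySet().removeIf(e -> {
            if ((e.getKey().hashCode() & bit) != 0) {
                sibling.entries.put(e.getKey(), e.getValue());
                return true;
            }
            return false;
        });
    }
}
```

A lookup touches one directory slot and one bucket, which is why the structure suits quick retrieval. Only the overflowing bucket is split when the table grows, so the rest of the data stays in place.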
Updated Nov 7, 2023 - Java