Skip to content

Latest commit

 

History

History
30 lines (17 loc) · 696 Bytes

README.md

File metadata and controls

30 lines (17 loc) · 696 Bytes

info-retrieval-notebooks

  • Designed for viewing in GitHub.

Implemented Algorithms and Datastructures

search

  • InvertedIndex

  • InvertedIndex (via vector space model, linear algebra, sparse matrices)

  • Similarity search (via cosine similarity)

  • Fuzzy string search

  • Ranking and evaluation

databases

  • Basic database operations (project, select, cartesian product)

  • more database operations (equi join, merge join, hash join, group by)

  • SPARQL to SQL algorithm

  • SQL to SPARQL algorithm

Used datasets

IMDB movies dataset

https://www.kaggle.com/datasets/ashpalsingh1525/imdb-movies-dataset