Skip to content

School project with inverted trees, wildcard searches with bi-gram and TF-IDF scores.

Notifications You must be signed in to change notification settings

MartinsBumanis/TextAnalysisTool

Repository files navigation

TextAnalysisTool

School task:

  1. GetPostings - get all postings for the document. Document is defined by a 4character number. Put it all in a dictionary and sort with inverted tree.

  2. Query the dictionary for specific terms. Find if the terms are in all the documents or any of them(QueryOr and QueryAnd)

  3. Check the TD-IDF scores of terms.

  4. Wildcard search using bi-gram index.

About

School project with inverted trees, wildcard searches with bi-gram and TF-IDF scores.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages