This is a small proof-of-concept application that implements live full-text search over the popular Russian site habrahabr.ru.
The Reindexer in-memory DB is used as the storage and full-text search engine.
The current dataset contains about 5GB of data: 170K articles with 6M comments.
The frontend for the project is written with vue.js and is located in repository
- Install & build Reindexer's dependencies
- Install habr-search
go get github.com/olegator77/habr-search
- Import (download) dataset from habrahabr.ru
habr-search import -startid 1 -finishid 355000 -dumppath <path-to-store-data> -webrootpath <path to store images>
This step is very long: downloading all the data can take 8+ hours and requires about 5GB of free disk space. You can reduce both the time and the size by narrowing the ID range, e.g. by setting startid to 350000.
- Load imported data into Reindexer
habr-search load -dumppath <path-to-store-data> -webrootpath <path to store images>
This step takes about 5 minutes for the whole dataset.
- Install and build frontend
- Follow the instructions
- Copy the built frontend to the webrootpath folder
- Run service
habr-search run -dumppath <path-to-store-data> -webrootpath <path of webroot>
Open http://127.0.0.1:8881 in your browser.
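
The advice in the import step about narrowing the ID range can be made concrete with a rough estimate. The sketch below scales the quoted figures (about 8 hours and 5GB for IDs 1 to 355000) linearly to a smaller range. Linear scaling is an assumption, not something habr-search guarantees: article sizes vary, and some IDs may not correspond to an article. The `estimate` function and its constants are hypothetical helpers for illustration only.

```go
package main

import "fmt"

// Figures quoted in the import step for the full ID range.
const (
	fullStartID  = 1
	fullFinishID = 355000
	fullHours    = 8.0 // "about 8+ hours"
	fullDiskGB   = 5.0 // "about 5GB"
)

// estimate scales the full-range figures linearly to the given ID range.
// Assumption: download time and size are proportional to the number of IDs.
func estimate(startID, finishID int) (hours, diskGB float64) {
	if finishID < startID {
		return 0, 0
	}
	fraction := float64(finishID-startID+1) / float64(fullFinishID-fullStartID+1)
	return fullHours * fraction, fullDiskGB * fraction
}

func main() {
	// Range suggested in the import step: startid 350000, finishid 355000.
	hours, disk := estimate(350000, 355000)
	fmt.Printf("estimated: %.1f minutes, %.2f GB\n", hours*60, disk)
}
```

Under this assumption, importing IDs 350000 to 355000 comes out at roughly 7 minutes and well under 0.1GB, which makes it a reasonable range for a first trial run.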