Skip to content

Encode text to embedding so that retrieve info with semantic information.

Notifications You must be signed in to change notification settings

Dikea/Sentence-Embedding-Retrieval

Repository files navigation

Sentence Embeddings

Introduction

Build sentence embeddings encoder for searching purpose.

NOTE: the project is dividied to three parts, data preprocess, model training and searching supply. Each part has a directory to keep its codes and data, and common used codes and data is placed in tools directory.

Graphs

Running Steps

Step 1: Prepare data for model training

./model_data_preprocess.sh

Step 2: Train model

cd sentence_embeddings_model
python bin/model.py 

Step 3: Prepare data for searching

./search_data_preprocess.sh

Step 4: Supply searching

cd sentence_embeddings_search
supervisord -c conf/sup.qsearch.conf 

After process enabled, url (e.g. http://211.159.179.239:6008/in/nlp/sentence/search?query=产后失眠怎么办&size=100) can be visited.

Analysis

For more infomation of model and searching, please refer to model_analysis

Reference

About

Encode text to embedding so that retrieve info with semantic information.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published