Context based Question and Answering system

A context based question answering system using a variation of the model described in this paper

Unlike the paper, this implementation is trained on SQUAD v2.0, which is a much harder dataset, as it has many questions which are impossible to answer based on information given in the context.

Setting up

conda env create --name qa_env -f qnaenv.yaml

To download the pretrained glove word embeddings, dataset and pretrained model run,

python DownloadData.py

Training

As this isn't a relatively large model, you can train it on CPU, with minimum 8 GB system memory. On a Intel i5, each epoch takes roughly 35min and you'll probably need to train it for at least 20 epochs to see good results. The authors have used 200 GRU units, so that maybe helpful, incase yoo have the computational means to train the model.

Testing

Generate a predictions file, results.json using

python Test.py

Evaluate the model using the model using the official evaluation script you can run,

python evaluate.py --data_file Datasets/dev-v2.0.json --pred_file results.json

Pre-trained model scores

Small model(50 GRU units)

 {
   "exact": 22.311126084393162,
   "f1": 23.42393885183988,
   "total": 11873,
   "HasAns_exact": 0.4892037786774629,
   "HasAns_f1": 2.7180205782548734,
   "HasAns_total": 5928,
   "NoAns_exact": 44.07064760302775,
   "NoAns_f1": 44.07064760302775,
   "NoAns_total": 5945
 }

Large model(200 GRU units, 10 epochs)

 {
   "exact": 25.38532805525141,
   "f1": 26.50248099500217,
   "total": 11873,
   "HasAns_exact": 0.4048582995951417,
   "HasAns_f1": 2.6423678902936114,
   "HasAns_total": 5928,
   "NoAns_exact": 50.29436501261564,
   "NoAns_f1": 50.29436501261564,
   "NoAns_total": 5945
 }

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
static/js		static/js
templates		templates
.gitignore		.gitignore
CreateDatasets.py		CreateDatasets.py
DownloadData.py		DownloadData.py
Model.py		Model.py
Preprocessing.py		Preprocessing.py
README.md		README.md
Test_model.py		Test_model.py
TrainModel.py		TrainModel.py
demo_flask_app.py		demo_flask_app.py
evaluate-v2.0.py		evaluate-v2.0.py
params.py		params.py
preprocessing.pkl		preprocessing.pkl
qnaenv.yaml		qnaenv.yaml
results.json		results.json
test_data.pkl		test_data.pkl
train_data.pkl		train_data.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Context based Question and Answering system

Setting up

Training

Testing

Pre-trained model scores

About

Releases

Packages

Languages

Aftaab99/Context-based-QnA-system

Folders and files

Latest commit

History

Repository files navigation

Context based Question and Answering system

Setting up

Training

Testing

Pre-trained model scores

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages