Skip to content

The code of Paper "Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text".

Notifications You must be signed in to change notification settings

SiyuanWangw/LReasoner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LReasoner

The source code of Paper "Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text"

Our ensemble system is the first to surpass human performance on both EASY set and HARD set of ReClor (EvalAI leaderboard). If you find this paper useful, please cite this paper:

@misc{wang2021logicdriven,
      title={Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text}, 
      author={Siyuan Wang and Wanjun Zhong and Duyu Tang and Zhongyu Wei and Zhihao Fan and Daxin Jiang and Ming Zhou and Nan Duan},
      year={2021},
      eprint={2105.03659},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Setting up

  1. To set up the environment, please install the packages in the requirements.txt.
pip install -r requirements.txt
  1. To get the datasets, you can refer to the paper ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning to get the original data. We also provide our preprocessed data in reclor-data directory. Or you can start from the original data and preprocess it by the following steps:
  • Step 1: exrtact the logical symbols and identify the logical expressions in the context, then infer the entailed logical expressions;
  • Step 2: select a logical expreesion in the context and construct the negative samples based on it.
cd DataPreprocess
python extract_logical_expressions_v2.py
python construct_negative_samples_v2.py

Usage

Then you can run the LReasoner system in the scripts directory as following:

1. to run LReasoner_roberta
        bash run_roberta_DA_CE.sh
    
2. to run LReasoner_albert
        bash run_albert_DA_CE.sh

Here CE means context extension while DA means data augmentation.

Result

We obtain the following results:

Model Test EASY HARD
LReasoner-RoBERTa 62.4 81.4 47.5
LReasoner-ALBERT 70.7 81.1 62.5
Human Performance 63.0 57.1 67.2

The checkpoints of LReasoner-RoBERTa and LReasoner-ALBERT can be accessed here: https://drive.google.com/drive/folders/1Gqoa7DR9wZmMhpLHuXQnznEhdnlZnkbp?usp=sharing.

We only evaluate our model on the ReClor dataset and plan to try more datasets. As codes are publicly available, anyone interested could also have a try.

About

The code of Paper "Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text".

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published