Official implementation of Towards Multi-Modal Sarcasm Detection via Hierarchical Congruity Modeling with Knowledge Enhancement.
Citation of our paper: Hui Liu, Wenya Wang, and Haoliang Li. 2022. Towards Multi-Modal Sarcasm Detection via Hierarchical Congruity Modeling with Knowledge Enhancement. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 4995–5006, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
To run our code, first replace the dataset paths in the main function of train.py with your own paths. Alternatively, you can unzip twitter.zip (the dataset file) and place it in the sarcasm project directory to run the code immediately. In addition, you can generate the image embeddings, dependency information, and the three kinds of knowledge using data_process.ipynb.
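As a minimal sketch of this first step, the snippet below shows how the dataset paths might be wired up; the variable names are illustrative assumptions and may differ from the actual identifiers inside train.py's main function.

```python
import os

# Illustrative sketch only: the variable names below are assumptions and may
# differ from the actual identifiers used in train.py's main function.
DATA_ROOT = "./twitter"                         # folder produced by unzipping twitter.zip
TEXT_PATH = os.path.join(DATA_ROOT, "text")     # tweet texts
IMAGE_PATH = os.path.join(DATA_ROOT, "images")  # tweet images

if not os.path.isdir(DATA_ROOT):
    print("Unzip twitter.zip (or edit the paths above) before running train.py.")
```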
Then you need to specify the path to parameter.json for train.py, which contains the hyperparameters of our model. Concretely, there are two parameter files in our project for reproducing the performance reported in our paper: parameter.json (with knowledge enhancement) and parameter_without_know.json (without knowledge enhancement). However, you may need to tune the parameters for your machine. Moreover, please use a one-layer MLP for the text-image branch without knowledge and a two-layer MLP for the text-image branch with knowledge at the final classification layer. This is because the text-knowledge branch acts as an optimization shortcut, so a more complex classification layer is needed to mitigate this problem. You may also need to tune the lam value, the cross-attention type, and the number of layers.
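As a rough illustration of how the hyperparameter file might be inspected before training, the sketch below loads it with the standard json module; the key names ("lam", "cross_attention_type", "layer_num") are assumptions for illustration and may not match the actual keys in parameter.json.

```python
import json

# Hedged sketch: inspect the hyperparameter file passed to train.py.
# The key names below are assumptions for illustration; check the real
# contents of parameter.json / parameter_without_know.json.
with open("parameter.json") as f:
    params = json.load(f)

for key in ("lam", "cross_attention_type", "layer_num"):
    print(key, "=", params.get(key, "<not found>"))
```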
Finally, you can run the command below:
CUDA_VISIBLE_DEVICES=1 python train.py
For the experimental environment, please refer to requirements.txt; we only list the core packages.
For the dataset, please download it from this link: https://portland-my.sharepoint.com/:u:/g/personal/liuhui3-c_my_cityu_edu_hk/Eb59O8EpvO5Ft_M4FKmfPgsBjq_V_1HefEyooyjpL2t8yA?e=lTw6Up. For the original dataset, see https://github.com/headacheboy/data-of-multimodal-sarcasm-detection.
We performed our experiments on a 24-GB 3090 Ti, with one run taking nearly one hour.
Total parameters: 112,540,942.
We report the average results of multiple runs.
If you find this repo useful for your research, please consider citing the paper.
Thanks for the dataset from https://github.com/headacheboy/data-of-multimodal-sarcasm-detection
Thanks for the ViT model from https://github.com/lukemelas/PyTorch-Pretrained-ViT
Thanks for the ClipCap model from https://github.com/rmokady/CLIP_prefix_caption