CLUEEmotion2020

CLUE Emotion Analysis Dataset 情感分析数据集

Data Description

This dataset in data directory is emotion analysis corpus, with each sample annotated with one emotion label. The label set is like, happiness, sadness, anger, disgust, fear and surprise.

This dataset is from the following paper:

Minglei Li, Yunfei Long, Qin Lu, and Wenjie Li. “Emotion Corpus Construction Based on Selection from Hashtags.” In Proceedings of International Conference on Language Resources and Evaluation (LREC). Portorož, Slovenia, 2016

The corpus statistics and lable distribution are as follows:

The train, valid and test set is split by the ratio of 8:1:1 and encoded in UTF-8.

Baseline results

Test results of different classification models on this dataset.

Models	Accuracy	Parameters
BERT-base	60.7%	Epoch 3, batch 32, max_seq_len 128

Reproduce the results

The code is based on the original CLUE source code, which is based on the original Google BERT code, and the pre-trained language model is BERT Base Chinese version.

Env

tensorflow 1.12

Run command

cd models/bert
./run_classifier_emotion.sh

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
CLUEdataset/emotion		CLUEdataset/emotion
models		models
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CLUEEmotion2020

Data Description

Baseline results

Reproduce the results

Env

Run command

About

Releases

Packages

Contributors 2

Languages

CLUEbenchmark/CLUEmotionAnalysis2020

Folders and files

Latest commit

History

Repository files navigation

CLUEEmotion2020

Data Description

Baseline results

Reproduce the results

Env

Run command

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages