GitHub - ifuseok/AMCNN: Attention Based Multi Channel CNN

Attention-Based Multi-Channel CNN source code

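Detecting Discriminatory and Hate Expressions Using Deep Learning: Attention-Based Multi-Channel CNN Modeling

This repository collects the source code for the model architecture used in the paper above.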

Test on NSMC Binary Classification

git clone https://github.com/ifuseok/AMCNN.git
cd AMCNN
git clone https://github.com/e9t/nsmc.git

python train.py --train_data nsmc/ratings_train.txt --document document --label label
python test.py --test_data nsmc/ratings_test.txt --document document --label label
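The `--document document --label label` arguments above name the text and label columns of the NSMC files, which are tab-separated with the columns `id`, `document`, and `label`. As a rough illustration of what that column selection amounts to, here is a minimal sketch using only the standard library. The function name `read_nsmc` and the choice to skip rows with empty text are assumptions made for this example; this is not the loading code in `train.py`.

```python
import csv
import io


def read_nsmc(handle, document_col="document", label_col="label"):
    """Yield (text, label) pairs from an NSMC-style tab-separated file.

    Hypothetical helper for illustration; not taken from the repository.
    """
    # QUOTE_NONE keeps stray quote characters in review text as literal text.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        text = (row.get(document_col) or "").strip()
        if not text:
            continue  # assumption: rows without review text are dropped
        yield text, int(row[label_col])


# A tiny in-memory sample in the same layout as nsmc/ratings_train.txt.
sample = io.StringIO(
    "id\tdocument\tlabel\n"
    "1\t재미있어요\t1\n"
    "2\t\t0\n"
    "3\t별로 \"추천\" 안함\t0\n"
)
pairs = list(read_nsmc(sample))
```

With a real file, the same function could be called as `read_nsmc(open("nsmc/ratings_train.txt", encoding="utf-8", newline=""))`.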

Pre-Trained Embedding Weights

Word2Vec was trained on an internet news comment dataset, and the resulting vectors are used as the pre-trained embedding.
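Using word vectors as a pre-trained embedding usually means copying them into a matrix with one row per vocabulary index, which then initializes the model's embedding layer. The sketch below shows that step in plain Python. The names `build_embedding_matrix`, `toy_vocab`, and `toy_vectors` are hypothetical, the vocabulary indices are assumed to be contiguous from 0, and tokens without a vector are assumed to get an all-zero row; the repository's own code may handle these details differently.

```python
def build_embedding_matrix(vocab, vectors, dim):
    """Return a list of rows, one per vocabulary index.

    vocab:   token -> index (assumed contiguous, starting at 0)
    vectors: token -> sequence of floats, e.g. exported from Word2Vec
    dim:     embedding dimension
    Tokens missing from `vectors` keep an all-zero row (an assumption).
    """
    matrix = [[0.0] * dim for _ in range(len(vocab))]
    for token, index in vocab.items():
        vec = vectors.get(token)
        if vec is None:
            continue  # no pre-trained vector for this token
        if len(vec) != dim:
            raise ValueError(f"vector for {token!r} has length {len(vec)}, expected {dim}")
        matrix[index] = list(vec)
    return matrix


# Toy data standing in for real Word2Vec output.
toy_vectors = {"영화": [0.1, 0.2, 0.3], "재미": [0.4, 0.5, 0.6]}
toy_vocab = {"<pad>": 0, "영화": 1, "재미": 2, "최고": 3}
matrix = build_embedding_matrix(toy_vocab, toy_vectors, dim=3)
```

In practice the resulting matrix would be converted to an array before being handed to the embedding layer as its initial weights.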

Requirements

transformers == 3.x.x
tensorflow >= 2.0.0
keras >= 2.2.4
emoji
scikit-learn
pandas
gensim

References

Pre-trained weights data: https://www.kaggle.com/junbumlee/kcbert-pretraining-corpus-korean-news-comments
Tokenizer reference: https://github.com/Beomi/KcBERT
Base paper for the model architecture: Multichannel CNN with Attention for Text Classification

Repository files

Tokenizer
w2v_pretrain_emb
.gitignore
AttentionLayer.py
Metric.py
Model.py
Readme.md
Token.py
test.py
train.py
