Implementation of the paper "Not All Attention Is Needed: Gated Attention Network for Sequence Data" (GA-Net)
The GA-Net model is implemented in Python using the PyTorch framework. It is compared against soft-attention and LSTM baselines on a text-classification task using the TREC and IMDb datasets.
There are two networks in the model:
- Backbone Network
- Auxiliary Network
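The two-network structure above can be sketched in PyTorch as follows. This is a minimal illustrative skeleton, not the paper's exact configuration: the layer sizes, the choice of BiLSTM encoders for both networks, and the simple renormalized gating in `forward` are all assumptions made for the sake of a runnable example.

```python
import torch
import torch.nn as nn

class GANet(nn.Module):
    """Sketch of GA-Net: a backbone network that encodes and attends over
    the sequence, plus an auxiliary network that emits a per-token gate
    probability deciding which tokens receive attention at all.
    Hyperparameters here are illustrative, not the paper's."""

    def __init__(self, vocab_size, embed_dim=100, hidden_dim=128, num_classes=6):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        # Backbone network: BiLSTM encoder + attention scorer
        self.backbone = nn.LSTM(embed_dim, hidden_dim,
                                batch_first=True, bidirectional=True)
        self.attn_score = nn.Linear(2 * hidden_dim, 1)
        # Auxiliary network: small BiLSTM producing gate probabilities
        self.aux = nn.LSTM(embed_dim, hidden_dim // 2,
                           batch_first=True, bidirectional=True)
        self.gate_prob = nn.Linear(hidden_dim, 1)
        self.classifier = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, tokens):
        x = self.embed(tokens)                    # (B, T, E)
        h, _ = self.backbone(x)                   # (B, T, 2H)
        scores = self.attn_score(h).squeeze(-1)   # (B, T)
        g, _ = self.aux(x)
        # Probability of each token's gate being open
        p_open = torch.sigmoid(self.gate_prob(g)).squeeze(-1)  # (B, T)
        # Hard binary gates (a simple 0.5 threshold here; the paper trains
        # the discrete gates end-to-end with a continuous relaxation)
        gates = (p_open > 0.5).float()
        # Keep attention mass only on gated-open tokens, then renormalize
        weights = gates * torch.softmax(scores, dim=-1)
        weights = weights / weights.sum(dim=-1, keepdim=True).clamp(min=1e-9)
        context = (weights.unsqueeze(-1) * h).sum(dim=1)       # (B, 2H)
        return self.classifier(context), p_open, weights
```

The returned `p_open` and `weights` are exactly the two quantities the repository visualizes per token.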
Soft attention assigns some weight (low or high) to every input token, whereas the gated attention network selects only the most important tokens to attend to.
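The difference can be seen on a toy example. The scores and the binary gate below are hand-set for illustration (in GA-Net the gates come from the auxiliary network): soft attention leaves every token with non-zero weight, while gated attention zeroes out closed tokens entirely.

```python
import torch

# Toy attention scores for a 6-token sentence (illustrative values)
scores = torch.tensor([2.0, 0.1, -1.0, 3.0, 0.5, -2.0])

# Soft attention: every token receives some weight, however small
soft = torch.softmax(scores, dim=-1)

# Gated attention: closed gates remove tokens before the softmax,
# so those tokens get exactly zero attention weight
gates = torch.tensor([1.0, 0.0, 0.0, 1.0, 1.0, 0.0])
masked = scores.masked_fill(gates == 0, float("-inf"))
gated = torch.softmax(masked, dim=-1)

print(soft)   # all six weights are non-zero
print(gated)  # weight is concentrated on the three open tokens
```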
The repository also visualizes, for each input token, the probability of its gate being open and the resulting gated attention weight.