A curated list of “Temporally Language Grounding” and related area
- TALL: Temporal Activity Localization via Language Query - Gao et al,
ICCV 2017
. [code1] [code2] - Localizing Moments in Video with Natural Language - Hendricks et al,
ICCV 2017
. [code]
- Localizing Moments in Video with Temporal Language - Hendricks et al,
EMNLP 2018
. - MAC: Mining Activity Concepts for Language-based Temporal Localization - Ge et al,
WACV 2018
. [code1] [code2] - Temporally Grounding Natural Sentence in Video - Chen et al,
EMNLP 2018
. - Cross-modal Moment Localization in Videos - Liu et al,
ACM MM 2018
. [code] - Attentive Moment Retrieval in Videos - Liu et al,
SIGIR 2018
. [code] - Multi-modal Circulant Fusion for Video-to-Language and Backward - Wu et al,
IJCAI 2018
.
- Localizing Natural Language in Videos - Chen et al,
AAAI 2019
. - Semantic Proposal for Activity Localization in Videos via Sentence Query - Chen et al,
AAAI 2019
. - Multilevel Language and Vision Integration for Text-to-Clip Retrieval - Xu et al,
AAAI 2019
. [code] - To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression - Yuan et al,
AAAI 2019
. [code] - Read,Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos - He et al,
AAAI 2019
. [code] - Tripping through time Efficient Localization of Activities in Videos - He et al,
CVPR Workshop 2019
. - MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment - Zhang et al,
CVPR 2019
. - Language-driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model - Wang et al,
CVPR 2019
. - Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos - Zhu et al,
SIGIR 2019
. - Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos - Yuan et al,
NeurIPS 2019
.