Here are
52 public repositories
matching this topic...
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Updated
Feb 27, 2023
Python
pytorch implementation of video captioning
Updated
Aug 19, 2019
Python
Video to Text: Natural language description generator for some given video. [Video Captioning]
Updated
May 3, 2022
Python
Auto transcribe tool based on whisper
Updated
Apr 27, 2023
Python
This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as input a video and generates a caption in English describing the video.
Updated
Oct 12, 2019
Python
Machine Learning and having it Deep and Structured (MLDS) in 2018 spring
Updated
Apr 19, 2019
Python
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Updated
Apr 9, 2024
Python
A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.
Updated
Sep 25, 2024
Python
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
Updated
Sep 6, 2023
Python
这是一个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。 视频描述生成任务指的是:输入一个视频,输出一句描述整个视频内容的文字(前提是视频较短且可以用一句话来描述)。本repo主要目的是帮助视力障碍者欣赏网络视频、感知周围环境,促进“无障碍视频”的发展。
Updated
Mar 12, 2022
Python
A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
Updated
Jul 30, 2023
Python
What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
Updated
Nov 10, 2024
Python
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
Updated
Feb 7, 2022
Python
Video captioning baseline models on Video2Commonsense Dataset.
Updated
Apr 15, 2021
Python
Attention Bidirectional Video Recurrent Net
Updated
Feb 20, 2019
Python
🎬 Video Captioning: ICCV '15 paper implementation
Updated
May 30, 2018
Python
[ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Updated
Dec 25, 2024
Python
A PyTorch implementation of EmpiricalMVM
Updated
Dec 18, 2023
Python
implement video caption based on openNMT
Updated
Apr 19, 2018
Python
[ICCV 2023] Accurate and Fast Compressed Video Captioning
Updated
Feb 18, 2024
Python
Improve this page
Add a description, image, and links to the
video-captioning
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
video-captioning
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.