🍉
I may be slow to respond before the due date of ACL.
PhD@CUHK, Research Engineer@Alibaba
- Shatin, N.T., HKSAR
- https://lixin4ever.github.io/
- @lixin4ever
Pinned Loading
-
DAMO-NLP-SG/VideoLLaMA3
DAMO-NLP-SG/VideoLLaMA3 PublicFrontier Multimodal Foundation Models for Image and Video Understanding
-
DAMO-NLP-SG/VideoLLaMA2
DAMO-NLP-SG/VideoLLaMA2 PublicVideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
-
DAMO-NLP-SG/CLEX
DAMO-NLP-SG/CLEX Public[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
-
DAMO-NLP-SG/VCD
DAMO-NLP-SG/VCD Public[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
-
DAMO-NLP-SG/Inf-CLIP
DAMO-NLP-SG/Inf-CLIP PublicThe official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training scheme.
-
DAMO-NLP-SG/Video-LLaMA
DAMO-NLP-SG/Video-LLaMA Public[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.