orrzohar/README.md

I'm Orr Zohar 👋

My research focuses on Large Multi-Modal Models, especially Large Image/Video + Language models, with the goal of making these models capable of evaluating the quality of actions in video. Recent relevant work:

  • 💫 Video-STaR: A self-training method that allows any labeled video dataset to be used for instruction tuning.
  • 🤖 VideoAgent: A novel agent-based system that uses a large language model to iteratively identify and compile crucial information from long-form videos.

Pinned

  1. Video-STaR (Public)

    Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

    Python · 47 stars · 4 forks

  2. PROB (Public)

    [CVPR 2023] Official PyTorch code for PROB: Probabilistic Objectness for Open World Object Detection

    Python · 112 stars · 16 forks

  3. LOVM (Public)

    [NeurIPS 2023] Official PyTorch code for LOVM: Language-Only Vision Model Selection

    Python · 20 stars

  4. FOMO (Public)

    Official PyTorch code for Open World Object Detection in the Era of Foundation Models

    Python · 65 stars · 4 forks