Orr Zohar orrzohar

I'm Orr Zohar 👋

My research focuses on Large Multi-Modal Models, especially Large Image/Video + Langauge models, with the hope of pushing these models to be capable of evaluating the quality of actions in video. Recent relevant work:

Apollo coming soon..
💫 Video-STAR: Introduced a method that allows the utilization of any labeled video dataset for instruction tuning.
🤖 VideoAgent: A novel agent-based system that utilizes a large language model to iteratively identify and compile crucial information from long-form videos

Created a pull request in huggingface/transformers that received 73 comments
Feb 11

SmolVLM2

What does this PR do? SmolVLM2 support Before submitting This PR fixes a typo or improves the docs (you can dismiss the other checks if that's th…

+5,159 −132 lines changed • 73 comments

Opened 2 other pull requests in 1 repository

orrzohar/transformers 2 merged

Add draft test to smolvlm
This contribution was made on Feb 12

updated video loader + processor for SmolVLM2
This contribution was made on Feb 12

Reviewed 1 pull request in 1 repository

huggingface/transformers 1 pull request

SmolVLM2
This contribution was made on Feb 13

7 contributions in private repositories Feb 1 – Feb 5

	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb
Sun
Mon
Tue
Wed
Thu
Fri
Sat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Orr Zohar orrzohar

Achievements

Achievements

Highlights

Block or report orrzohar

I'm Orr Zohar 👋

Pinned Loading

377 contributions in the last year

Activity overview

Contribution activity

February 2025

Created a pull request in huggingface/transformers that received 73 comments

SmolVLM2

	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb
Sun
Mon
Tue
Wed
Thu
Fri
Sat

	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb
Sun
Mon
Tue
Wed
Thu
Fri
Sat

	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb
Sun
Mon
Tue
Wed
Thu
Fri
Sat