Skip to content
View dongyh20's full-sized avatar
  • Tsinghua University

Highlights

  • Pro

Block or report dongyh20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dongyh20/README.md

Hi there 👋

🔭 I’m currently working on the topic of visual perception and my long-term goal is to build embodied fundation models.

⚡ Recently I'm focusing on vision-language model, embodied AI and 3D world model.

📫 If you are also interested in relevant issues, feel free to chat with me!

Pinned Loading

  1. Octopus Octopus Public

    🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.

    Python 253 18

  2. Oryx-mllm/Oryx Oryx-mllm/Oryx Public

    MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

    Python 228 9

  3. EvolvingLMMs-Lab/lmms-eval EvolvingLMMs-Lab/lmms-eval Public

    Accelerating the development of large multimodal models (LMMs) with lmms-eval

    Python 1.4k 116

  4. Chain-of-Spot Chain-of-Spot Public

    Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models

    Python 84 6