add new feature for llava-next-video #417

ZhangYuanhan-AI · 2024-05-10T03:42:29Z

llava-next-video: https://llava-vl.github.io/blog/2024-04-30-llava-next-video/

Add video support

hkunzhe · 2024-05-11T06:52:07Z

python/sglang/utils.py

 from io import BytesIO
 from json import dumps

+import cv2


opencv-python should be added in pyproject.toml.

hkunzhe · 2024-05-11T07:38:23Z

examples/quick_start/srt_example_llava_v.py

There are many user-defined paths and some parameters in argparse are not being used.

merrymercy · 2024-05-12T03:58:11Z

README.md

@@ -1,411 +1,74 @@
+To refine and clean up the README you've provided for the SGLang project, I'll focus on improving clarity, organization, and conciseness. This includes providing clear installation instructions, simplifying steps where possible, and ensuring the document is easy to navigate. Here's a revised version:


Revert the change to README.md

merrymercy · 2024-05-12T04:00:06Z

python/pyproject.toml

@@ -20,7 +20,7 @@ dependencies = [

 [project.optional-dependencies]
 srt = ["aiohttp", "fastapi", "psutil", "rpyc", "torch", "uvloop", "uvicorn",
-       "zmq", "vllm>=0.3.3", "interegular", "pydantic", "pillow", "outlines>=0.0.27"]
+       "zmq", "vllm==0.3.3", "interegular", "pydantic", "pillow", "outlines>=0.0.27"]


Recently, we moved to vllm==0.4.2, can you update accordingly?
There are only minor changes. You can change similar to the changes in this PR #380

merrymercy · 2024-05-12T04:00:44Z

examples/demo/model_call.py

@@ -0,0 +1,198 @@
+# from flask import Flask, request, jsonify


Rename examples/demo to examples/usage/llava_video

merrymercy · 2024-05-12T04:01:27Z

examples/quick_start/srt_example_llava_v.py

+    compile_and_cleanup_final_results(cur_chunk, num_batches, save_dir)
+
+
+import argparse


move this to the beginning of the file.

merrymercy · 2024-05-12T04:03:44Z

examples/quick_start/srt_example_llava_v.py

+    parser.add_argument('--chunk-idx', type=int, default=0, help='The index of the chunk to process.')
+    parser.add_argument('--num-chunks', type=int, default=8, help='The number of chunks to process.')
+    parser.add_argument('--save-dir', type=str, default="./work_dirs/llava_video", help='The directory to save the processed video files.')
+    parser.add_argument('--video-dir', type=str, default="/mnt/bn/vl-research/workspace/yhzhang/data/sora/", help='The directory to save the processed video files.')


It seems there are many personal paths. Can someone else easily run this script?
Maybe you can move all things under a single folder examples/llava_video and add a README.md with an example command that runs on your Q98Z4OTh8RwmDonc.mp4

merrymercy · 2024-05-12T04:06:03Z

python/sglang/utils.py

+    return video_base64
+
+
+# def encode_video_base64(video_path):


remove unused code.

merrymercy · 2024-05-12T04:10:22Z

python/sglang/srt/managers/router/model_rpc.py

@@ -32,6 +32,9 @@
 )
 from vllm.logger import _default_handler as vllm_default_handler

+from typing import Any, Dict, List, Optional, Tuple, Union


merge this with L7

ZhangYuanhan and others added 18 commits March 25, 2024 01:52

Add personal work directory to .gitignore

663a267

Add video support and update model configuration files

198267e

Fix import order and remove unused code

79cbd78

Fix model download path

47608bb

adapt video demo

0c837d2

update requirement

d85b14b

add backend

66dfa6d

add demo

1c87c1d

update

4fa7661

update

c775602

update

688f023

update readme

affffd9

add llavavid server

b9c698d

Update model_overide_args in launch_server_llavavid.py

2c06ca1

update llava-next-video-7b

70971cb

update

74dc8dc

Add model override arguments for num_frames == 32

1f9d9ad

Merge pull request #2 from ZhangYuanhan-AI/dev/sglang_video_public

7bfc070

Add video support

hkunzhe reviewed May 11, 2024

View reviewed changes

examples/quick_start/srt_example_llava_v.py

Copy link

hkunzhe May 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are many user-defined paths and some parameters in argparse are not being used.

merrymercy requested changes May 12, 2024

View reviewed changes

merrymercy reviewed May 12, 2024

View reviewed changes

Remove unused imports and update dependencies

90be603

ZhangYuanhan-AI closed this May 12, 2024

ZhangYuanhan-AI deleted the llava_video branch May 12, 2024 05:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add new feature for llava-next-video #417

add new feature for llava-next-video #417

ZhangYuanhan-AI commented May 10, 2024

hkunzhe May 11, 2024

hkunzhe May 11, 2024

merrymercy May 12, 2024

merrymercy May 12, 2024

merrymercy May 12, 2024 •

edited

Loading

merrymercy May 12, 2024

merrymercy May 12, 2024

merrymercy May 12, 2024

merrymercy May 12, 2024

		@@ -1,411 +1,74 @@
		To refine and clean up the README you've provided for the SGLang project, I'll focus on improving clarity, organization, and conciseness. This includes providing clear installation instructions, simplifying steps where possible, and ensuring the document is easy to navigate. Here's a revised version:

		@@ -0,0 +1,198 @@
		# from flask import Flask, request, jsonify

		compile_and_cleanup_final_results(cur_chunk, num_batches, save_dir)


		import argparse

add new feature for llava-next-video #417

add new feature for llava-next-video #417

Conversation

ZhangYuanhan-AI commented May 10, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

merrymercy May 12, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

merrymercy May 12, 2024 •

edited

Loading