Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities
Kaiwen Cai, Zhekai Duan, Gaowen Liu, Charles Fleming, Chris Xiaoxuan Lu
👉 Download the supplementary material of the paper
- [2024-03-15] Our preprint paper is available on arXiv.
- [2024-07-02] Our paper is accepted by ECCV 2024. 🎉
- [2024-07-20] Training and testing code is released.
Our EdgeVL framework adapts a large vision-language model (e.g., CLIP) to edge devices across RGB and depth modalities in two stages: Stage 1 distills the teacher's image embeddings into a compact dual-modality student encoder, and Stage 2 applies quantization-aware contrastive learning so the embedding space stays discriminative after quantization.
```bash
# Stage 1
DATASET=eurosat; CONFIG=swint_mix; QUANT_CONFIG=disable
python run.py --phase=train --config=configs/${DATASET}/${CONFIG}.yaml --quant_config=quantization_configs/${QUANT_CONFIG}.yaml
```
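Conceptually, Stage 1 trains the student encoder to mimic the frozen teacher's RGB embeddings on both RGB and depth inputs, without any labels. Below is a minimal PyTorch sketch of such a dual-modality distillation objective; the `teacher` and `student` callables are placeholders for illustration, not the actual modules in this repo.

```python
# Illustrative sketch of a dual-modality distillation loss (not the repo's code).
import torch
import torch.nn.functional as F

def distillation_loss(teacher, student, rgb, depth):
    """Align the student's RGB *and* depth embeddings with the frozen
    teacher's RGB embedding, so the student inherits the teacher's
    feature space for both modalities."""
    with torch.no_grad():
        target = F.normalize(teacher(rgb), dim=-1)   # frozen teacher features
    z_rgb = F.normalize(student(rgb), dim=-1)
    z_depth = F.normalize(student(depth), dim=-1)
    # Cosine-distance distillation on both modalities
    return (1 - (z_rgb * target).sum(-1)).mean() + \
           (1 - (z_depth * target).sum(-1)).mean()
```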
```bash
# Stage 2
DATASET=eurosat; CONFIG=swint_mix_ctrs; QUANT_CONFIG=jacob
python run.py --phase=train_ctrs --config=configs/${DATASET}/${CONFIG}.yaml --quant_config=quantization_configs/${QUANT_CONFIG}.yaml
```
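Stage 2 fine-tunes the student with quantization in the loop so that embeddings remain discriminative under low-bit deployment (the `jacob` config name suggests Jacob et al.-style integer quantization). The sketch below shows one plausible form of a quantization-aware contrastive objective, assuming straight-through fake quantization and an InfoNCE loss; it is illustrative only and not the loss implemented in `run.py`.

```python
# Illustrative sketch of quantization-aware contrastive learning (not the repo's code).
import torch
import torch.nn.functional as F

def fake_quantize(x, num_bits=8):
    """Uniform affine fake-quantization with a straight-through
    estimator, simulating integer quantization during training."""
    qmin, qmax = 0, 2 ** num_bits - 1
    scale = (x.max() - x.min()).clamp(min=1e-8) / (qmax - qmin)
    zero_point = qmin - torch.round(x.min() / scale)
    q = torch.clamp(torch.round(x / scale + zero_point), qmin, qmax)
    deq = (q - zero_point) * scale
    return x + (deq - x).detach()  # gradients pass through as identity

def contrastive_loss(student, rgb, depth, tau=0.07):
    """InfoNCE between quantized RGB and depth embeddings of the same
    scenes, pulling matching pairs together in the quantized space."""
    z_rgb = F.normalize(fake_quantize(student(rgb)), dim=-1)
    z_depth = F.normalize(fake_quantize(student(depth)), dim=-1)
    logits = z_rgb @ z_depth.t() / tau
    labels = torch.arange(logits.size(0), device=logits.device)
    return F.cross_entropy(logits, labels)
```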
After training, evaluate the checkpoint on a target modality (here, depth):

```bash
RUN_NAME=[run_name]; QUANT_CONFIG=jacob; TEST_MODAL=depth
python run.py --phase=test --run_name=${RUN_NAME} --quant_config=quantization_configs/${QUANT_CONFIG}.yaml --test_modal=${TEST_MODAL} --static_or_dynamic=static
```
Alternatively, you can download the pretrained weights from Hugging Face:
```bash
cd edgevl
git lfs install
git clone https://huggingface.co/ramfais/edgevl_weights
mkdir logs && mv edgevl_weights/* logs
```
Then select a model for inference by setting `RUN_NAME` to one of `datt_scannet`, `datt_eurosat`, `swint_scannet`, `swint_eurosat`, `vits_scannet`, or `vits_eurosat`:
```bash
RUN_NAME=datt_scannet; QUANT_CONFIG=jacob; TEST_MODAL=depth
python run.py --phase=test --run_name=${RUN_NAME} --quant_config=quantization_configs/${QUANT_CONFIG}.yaml --test_modal=${TEST_MODAL} --static_or_dynamic=static
```
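To evaluate every released checkpoint in one go, you could loop over the run names with a small Python driver; this is just a convenience wrapper around the command above, using only the flags documented there.

```python
# Hypothetical convenience sweep over the released checkpoints,
# equivalent to running the test command once per RUN_NAME.
import subprocess

RUNS = ["datt_scannet", "datt_eurosat", "swint_scannet",
        "swint_eurosat", "vits_scannet", "vits_eurosat"]

for run_name in RUNS:
    subprocess.run(
        ["python", "run.py", "--phase=test",
         f"--run_name={run_name}",
         "--quant_config=quantization_configs/jacob.yaml",
         "--test_modal=depth",
         "--static_or_dynamic=static"],
        check=True,
    )
```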
If you find our work useful, please consider citing:

```bibtex
@inproceedings{cai2024selfadapting,
  author    = {Cai, Kaiwen and Duan, Zhekai and Liu, Gaowen and Fleming, Charles and Lu, Chris Xiaoxuan},
  title     = {Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities},
  booktitle = {European Conference on Computer Vision (ECCV)},
  year      = {2024},
}
```