[model] Support Audio #6701

BUAADreamer · 2025-01-18T15:41:50Z

What does this PR do?

Support audio-text to text finetuning and inference of MiniCPM-o-2.6/qwen2-audio

Fixes #5756
Fixes #6738

Before submitting

Did you read the contributor guideline?
Did you write any new necessary tests?

setup.py

src/llamafactory/data/collator.py

src/llamafactory/data/mm_plugin.py

hiyouga

LGTM

support qwen2_audio

4a5cec0

BUAADreamer temporarily deployed to tests January 18, 2025 15:42 — with GitHub Actions Inactive

BUAADreamer force-pushed the qwen2_audio branch from 1f020ec to bf6a7b1 Compare January 18, 2025 15:46

BUAADreamer temporarily deployed to tests January 18, 2025 15:46 — with GitHub Actions Inactive

BUAADreamer force-pushed the qwen2_audio branch from bf6a7b1 to cee3ccf Compare January 18, 2025 15:53

BUAADreamer temporarily deployed to tests January 18, 2025 15:54 — with GitHub Actions Inactive

hiyouga had a problem deploying to tests January 31, 2025 20:54 — with GitHub Actions Failure

hiyouga force-pushed the qwen2_audio branch from 13ae781 to 764ffdf Compare January 31, 2025 20:55

hiyouga reviewed Jan 31, 2025

View reviewed changes

setup.py Outdated Show resolved Hide resolved

hiyouga reviewed Jan 31, 2025

View reviewed changes

src/llamafactory/data/collator.py Outdated Show resolved Hide resolved

src/llamafactory/data/mm_plugin.py Outdated Show resolved Hide resolved

src/llamafactory/data/mm_plugin.py Outdated Show resolved Hide resolved

BUAADreamer force-pushed the qwen2_audio branch 3 times, most recently from 5455ce9 to b9dec36 Compare February 1, 2025 01:32

hiyouga approved these changes Feb 1, 2025

View reviewed changes

BUAADreamer force-pushed the qwen2_audio branch 2 times, most recently from fdc002f to caf80f0 Compare February 1, 2025 06:38

support audio

ff1cf31

BUAADreamer force-pushed the qwen2_audio branch from caf80f0 to ff1cf31 Compare February 1, 2025 06:48

BUAADreamer requested a review from hiyouga February 4, 2025 04:22

hiyouga added 3 commits February 5, 2025 04:29

improve code

6ae7eb5

lint

52ef94c

fix

2074400

hiyouga approved these changes Feb 4, 2025

View reviewed changes

hiyouga added 3 commits February 5, 2025 04:42

Merge remote-tracking branch 'upstream/main' into qwen2_audio

cec2154

fix

478508b

fix

19737e1

hiyouga approved these changes Feb 4, 2025

View reviewed changes

hiyouga added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Feb 4, 2025

hiyouga merged commit 24c7842 into hiyouga:main Feb 4, 2025
11 of 12 checks passed

This was referenced Feb 5, 2025

Upstream branch main (revision 24c78429) neuro-inc/LLaMA-Factory#8

Open

Upstream branch main (revision 74ade3a1) neuro-inc/LLaMA-Factory#9

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[model] Support Audio #6701

[model] Support Audio #6701

BUAADreamer commented Jan 18, 2025 •

edited by hiyouga

Loading

hiyouga left a comment

[model] Support Audio #6701

[model] Support Audio #6701

Conversation

BUAADreamer commented Jan 18, 2025 • edited by hiyouga Loading

What does this PR do?

Before submitting

hiyouga left a comment

Choose a reason for hiding this comment

BUAADreamer commented Jan 18, 2025 •

edited by hiyouga

Loading