Add diarization recipe v3 #347

xx205 · 2024-08-11T15:55:50Z

Add diarization recipe v3 for voxconverse dataset.

Highlights

update silero-vad to v5.1 from v3.1
new diarization method umap+hdbscan

Results

Dev set

system	MISS	FA	SC	DER
This repo (with oracle SAD)	2.3	0.0	1.3	3.6
This repo (with system SAD)	3.4	0.6	1.4	5.4
DIHARD 2019 baseline ¹	11.1	1.4	11.3	23.8
DIHARD 2019 baseline w/ SE ¹	9.3	1.3	9.7	20.2
(SyncNet ASD only) ¹	2.2	4.1	4.0	10.4
(AVSE ASD only) ¹	2.0	5.9	4.6	12.4
(proposed) ¹	2.4	2.3	3.0	7.7

Test set

system MISS FA SC DER

This repo (with oracle SAD) 1.6 0.0 1.9 3.5

This repo (with system SAD) 3.8 1.7 1.8 7.4

Spot the conversation: speaker diarisation in the wild, https://arxiv.org/pdf/2007.01216.pdf ↩ ↩² ↩³ ↩⁴ ↩⁵

…init

czy97 · 2024-08-19T09:17:10Z

The news part should be updated

czy97 · 2024-08-19T11:09:38Z

I think it is better to link the local directory and path.sh file directly if we reuse them.

Update News section in README.md

Update clustering method

czy97

Well Done!

czy97 · 2024-08-19T09:07:01Z

examples/voxconverse/v3/README.md

+  * Refer to [voxceleb sv recipe](https://github.com/wenet-e2e/wespeaker/tree/master/examples/voxceleb/v2)
+  * [pretrained model path](https://wespeaker-1256283475.cos.ap-shanghai.myqcloud.com/models/voxceleb/voxceleb_resnet34_LM.onnx)
+* Speaker activity detection model: oracle SAD (from ground truth annotation) or system SAD (VAD model pretrained by silero, https://github.com/snakers4/silero-vad)
+* Clustering method: spectral clustering


The clustering method should be umap + dbscan?

czy97 · 2024-08-19T11:03:26Z

wespeaker/cli/speaker.py

@@ -29,7 +29,7 @@
 from wespeaker.cli.utils import get_args
 from wespeaker.models.speaker_model import get_speaker_model
 from wespeaker.utils.checkpoint import load_checkpoint
-from wespeaker.diar.spectral_clusterer import cluster
+from wespeaker.diar.umap_clusterer import cluster


@JiJiJiang I am not sure whether we should change the client script.

Yes, just keep it as the better one.

czy97 · 2024-08-19T11:06:37Z

wespeaker/diar/make_system_sad.py


 import torch
+import silero_vad
 from wespeaker.utils.file_utils import read_scp


 def get_args():
    parser = argparse.ArgumentParser(description='')


should we also edit the v1 and v2 version, if we change the arguments of this script?

Yes, also update the results if change into silero vad v5.1.

xx205 added 4 commits August 11, 2024 15:51

Add diarization recipe v3

46707ab

resolve pylint issues and add missing modifications

fdcf72a

eliminate trailing whitespace

700dfe0

deterministic clustering; update README.md

77c340d

xx205 requested review from cdliang11 and JiJiJiang August 12, 2024 02:44

xx205 added 3 commits August 12, 2024 04:49

fix args usage in umap_clusterer.py

7636e32

local import; remove unused diarization args; self.model.eval() when …

2731690

…init

compact embedding clustering procedure into a single source file

4ac134d

xx205 requested review from wsstriving and czy97 August 12, 2024 16:27

xx205 and others added 6 commits August 19, 2024 16:20

link to local and path.sh; update requirements.txt and extract_emb.py

f894bb2

Merge branch 'master' into voxconverse_v3

5f9e416

fix lint error: extract_emb.py

69d2134

Update README.md

03c0e48

Update News section in README.md

Update voxconverse/v3/README.md

a33c1ce

Update clustering method

Update README.md

78e52f8

czy97 approved these changes Aug 20, 2024

View reviewed changes

czy97 merged commit 5ac089e into wenet-e2e:master Aug 20, 2024
4 checks passed

xx205 deleted the voxconverse_v3 branch August 20, 2024 14:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add diarization recipe v3 #347

Add diarization recipe v3 #347

xx205 commented Aug 11, 2024 •

edited

Loading

czy97 commented Aug 19, 2024

czy97 commented Aug 19, 2024

czy97 left a comment

czy97 Aug 19, 2024

czy97 Aug 19, 2024

JiJiJiang Aug 20, 2024

czy97 Aug 19, 2024

JiJiJiang Aug 20, 2024

Add diarization recipe v3 #347

Add diarization recipe v3 #347

Conversation

xx205 commented Aug 11, 2024 • edited Loading

Highlights

Results

Footnotes

czy97 commented Aug 19, 2024

czy97 commented Aug 19, 2024

czy97 left a comment

Choose a reason for hiding this comment

czy97 Aug 19, 2024

Choose a reason for hiding this comment

czy97 Aug 19, 2024

Choose a reason for hiding this comment

JiJiJiang Aug 20, 2024

Choose a reason for hiding this comment

czy97 Aug 19, 2024

Choose a reason for hiding this comment

JiJiJiang Aug 20, 2024

Choose a reason for hiding this comment

xx205 commented Aug 11, 2024 •

edited

Loading