speech-embedding

Here are 2 public repositories matching this topic...

bunyaminergen / WavLMMSDD

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

microsoft speech embedding speaker-diarization diarization nvidia-nemo wavlm speech-embedding

Updated Feb 14, 2025
Jupyter Notebook

DigitalPhonetics / BetterFinetuning

Star

Code accompanying our paper on finetuning self-supervised general speech representations with a combination of contrastive and non-contrastive methods.

self-supervised-learning speech-embedding

Updated Oct 5, 2022
Python

Improve this page

Add a description, image, and links to the speech-embedding topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-embedding topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly