Comments on the speaker overview paper #339
wsstriving started this conversation in General
Replies: 3 comments 1 reply
-
Update the dataset information about VoxBlink2.
-
Update the coverage of the latest improvements in open-source toolkits.
-
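Thanks for sharing this survey of the speaker verification (SV) field. Speaking only to the part I know, I would like to offer a correction to the toolkit description and the table: asv-subtools supports C++ deployment. Also, Kaldi is only an optional tool for feature extraction and training; the core network part has been PyTorch from the very beginning, and it also supports an online dataloader that extracts features in Python, so it is easy to customize as well.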
-
Dear all,
We are opening a discussion here on the paper "Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning". Because of the page limit (26 pages), some topics may not be covered in sufficient depth in the paper.
We hope to draw on your discussion, comments, and suggestions to improve the arXiv version, which has no strict page limit. In particular, we look forward to comments from industry professionals that uncover real-world challenges and applications of speaker modeling that are not yet covered.
The comments can cover, but are not limited to, the following aspects:
Many thanks in advance!