GitHub - MattiaLimone/dnn-hmm: A Deep LSTM-CNN-HMM Neural Network system for Speaker Identification

DNN-HMM system for text-independent speaker identification

Table of Contents

About The Project
- Built With
Getting Started
- Prerequisites
- Installation
Contributing
License
Contact

About The Project

Deep learning approaches are progressively gaining popularity as an alternative to Hidden Markov Models (HMMs) for speaker identification. Promising results have been obtained with Convolutional Neural Networks (CNNs) fed with raw speech samples or raw spectral features, although this methodology does not fully take into account the temporal sequence in which speech is produced.

DNN-HMM (Deep Neural Network-Hidden Markov Model) is a methodology that combines the statistical modeling power of HMMs with the learning power of deep neural networks. While this technique has seen wide use in the speech recognition field, few studies have tried to apply it to speaker identification tasks.

This study proposes a novel approach to the DNN-HMM methodology for text-independent speaker identification. It uses both convolutional and Long Short-Term Memory (LSTM) networks to extract high-level features from the entire audio and temporal features from each frame, which are then used to predict the emission probabilities of an HMM.
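To make the last step more concrete, here is a minimal NumPy sketch of how hybrid DNN-HMM systems commonly plug network outputs into an HMM. It is not the code in this repository: the function names, the toy numbers, and the use of a single two-state HMM are all assumptions made for illustration. The sketch converts per-frame state posteriors (as a network's softmax layer would produce) into scaled log-likelihoods by dividing by the state priors, then runs Viterbi decoding with those values as emission scores.

```python
import numpy as np


def posteriors_to_log_likelihoods(posteriors, state_priors, eps=1e-10):
    """Turn p(state | frame) into log p(frame | state) up to a per-frame constant.

    By Bayes' rule, p(x | s) is proportional to p(s | x) / p(s).
    """
    return np.log(posteriors + eps) - np.log(state_priors + eps)


def viterbi(log_emissions, log_trans, log_init):
    """Return the most likely state path and its log-score.

    log_emissions: (n_frames, n_states) emission log-scores
    log_trans:     (n_states, n_states), log_trans[i, j] = log p(j | i)
    log_init:      (n_states,) initial state log-probabilities
    """
    n_frames, n_states = log_emissions.shape
    delta = np.empty((n_frames, n_states))
    backptr = np.zeros((n_frames, n_states), dtype=int)

    delta[0] = log_init + log_emissions[0]
    for t in range(1, n_frames):
        # scores[i, j]: best score ending in state i at t-1, then moving to j
        scores = delta[t - 1][:, None] + log_trans
        backptr[t] = np.argmax(scores, axis=0)
        delta[t] = scores[backptr[t], np.arange(n_states)] + log_emissions[t]

    # Trace the best path backwards from the best final state
    path = np.empty(n_frames, dtype=int)
    path[-1] = np.argmax(delta[-1])
    for t in range(n_frames - 2, -1, -1):
        path[t] = backptr[t + 1, path[t + 1]]
    return path, float(delta[-1].max())


# Toy data (hypothetical): 4 frames, 2 HMM states
posteriors = np.array([[0.9, 0.1], [0.8, 0.2], [0.2, 0.8], [0.1, 0.9]])
state_priors = np.array([0.5, 0.5])
transitions = np.array([[0.7, 0.3], [0.3, 0.7]])
initial = np.array([0.5, 0.5])

log_emissions = posteriors_to_log_likelihoods(posteriors, state_priors)
path, score = viterbi(log_emissions, np.log(transitions), np.log(initial))
print(path, score)
```

In a speaker identification setting, one possible design is to keep one HMM per speaker and pick the speaker whose model gives the highest score for an utterance; whether this project does exactly that is not stated here.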

Experiments conducted on the TIMIT dataset showed promising results, suggesting that, if properly tuned, the proposed non-sequential architecture may converge faster and perform better than other known methods.

Built With

Getting Started

Prerequisites

Install the requirements using the pip utility (this may require running as sudo).

#PyPI
pip install -r requirements.txt

Installation

First, clone the GitHub repository:

git clone https://github.com/MattiaLimone/dnn-hmm.git

Additionally, you have to install the following library, which probably failed to install during the prerequisites step. It is a copy of LPCTorch (https://github.com/yliess86/LPCTorch) with updated dependencies.

Use the pip utility to install the dependency from our repository (this may require running as sudo):

pip install https://github.com/Attornado/LPCTorch2/archive/refs/heads/master.zip

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

License

Distributed under the GNU General Public License v2.0. See LICENSE.txt for more information.


Contact

Mattia Limone [LinkedIn profile]

Andrea Terlizzi [Send an email]

Carmine Iannotti [LinkedIn profile]

Luca Strefezza

