GitHub - gouzouni625/personalized_automatic_speech_recognition: Implementation of a Personalised Automatic Speech Recognition desktop application

Personalized Automatic Speech Recognition (PASR)

Implementation of a desktop application for adapting to the voice of a single user (personalized) or environment and applying automatic speech recognition on a set of e-mails on english.

Description

This project is developed as part of my thesis on ASR during my undergraduate studies at the school of Electrical and Computer Engineering of Aristotle University of Thessaloniki, Greece.

It is a desktop application that can be used for automatic speech recognition. Concretely, using this application, one can:

Provide sample recordings of one's voice that will be used to adapt the speech recognition engine to one's voice.
Provide a set of e-mails that will be used as a search corpus during the recognition.
Dictate any sequence of words from within the provided corpus, and get the written transcript as a result.

Implementation

The speech recognition engine consist of two parts. The ASR part and the Correction part.

ASR

For the ASR part, CMUSphinx is used. The provided sample recordings are used to adapt the default acoustic model of CMUSphinx to the user's voice. The provided e-mails are used to create a language model and a dictionary for CMUSphinx.

Correction

The Correction part is an algorithm designed to correct any errors in the output of the ASR part based on the corpus, the language model and the dictionary mentioned above.

Installation

The application is written in the Java programming language and the JavaFX library is used. The development and testing is done on Ubuntu 14.04.

Prerequisites

Java 8
JavaFX

Ubuntu 14.04: The easies way to install Java 8 and JavaFX is to install Oracle JDK. To do that see this post.
Ubuntu 16.04: You can install Oracle JDK the same way you would install it on Ubuntu 14.04 but you can also install OpenJDK and OpenJFX from aptitude:
```
apt-get install openjdk-8-jdk
apt-get install openjfx
```

Python version 2.7 or greater. You can install Python with the following command:
```
apt-get install python
```
autoconf, libtool, bison, python-dev, swig and wget packages. You can install these packages with the following command:
```
   apt-get install autoconf libtool bison python-dev swig wget
```
Gradle version 2.4 or greater. Gradle is used to build the application from its sources. To automatically install Gradle, see the installation Steps.

Steps

After you have installed all the Prerequisites you are ready to install the application using the following commands:

Clone the repository:

git clone https://github.com/gouzouni625/personalized_automatic_speech_recognition.git

Run the setup.py script that will install CMUSphinx (Note that the installation will be done inside the directory personalized_automatic_speech_recognition, no files will be created or changed anywhere else on your file system):
```
cd personalized_automatic_speech_recognition
./setup.py
```
The setup script will look for Java at the location /usr/lib/jvm/default-java. If this is not the valid location of your Java installation, you should provide the correct path as an argument to the setup script like this:
```
./setup.py  --java-path /your/java/installation/path
```
If you installed Oracle JDK using a PPA, the java path will probably be:
```
./setup.py  --java-path /usr/lib/jvm/java-8-oracle
```
If you don't have Gradle installed, the setup.py script can install it for you (the installation will be done inside the directory of the cloned repository) by passing the flag:
```
./setup.py  --java-path /your/java/installation/path --gradle-install
```
After the installation script is done, you can check the setup.log file to make sure everything was installed correctly.
After that, you can run the application. A helper script has been created for this purpose. Simply run:
```
./start.sh
```

Name		Name	Last commit message	Last commit date
Latest commit History 354 Commits
database		database
docs		docs
doxygen		doxygen
libs		libs
pocketsphinx @ e077311		pocketsphinx @ e077311
sphinxbase @ 729223e		sphinxbase @ 729223e
sphinxtrain @ 3f48dcb		sphinxtrain @ 3f48dcb
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
build.gradle		build.gradle
install_gradle.sh		install_gradle.sh
setup.py		setup.py
setup.sh		setup.sh
start.sh		start.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Personalized Automatic Speech Recognition (PASR)

Description

Implementation

ASR

Correction

Installation

Prerequisites

Steps

Further Reading

About

Releases

Packages

Languages

gouzouni625/personalized_automatic_speech_recognition

Folders and files

Latest commit

History

Repository files navigation

Personalized Automatic Speech Recognition (PASR)

Description

Implementation

ASR

Correction

Installation

Prerequisites

Steps

Further Reading

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages