Lingua-Libre

! ===========================================================================================
! DEPRECATED: This project is the (now archived) 2016-2017 version                           
! You can find the current version on https://github.com/orgs/lingua-libre/ and https://lingualibre.org
! ===========================================================================================

Lingua-Libre

LinguaLibre.fr – Massive Open Audio Recording is an opensource platform and webapps created to ease mass recording of texts into clean, well cut, well named and apps friendly audio files. It is designed from the start to ease the creation of consistent datasets of audio files. We believe it is the best tool available to create dataset from few dozens to several thousands audios files. Recording productivity can reach up to 1000 audio recordings / hour, given a clean words list and an experienced user. Lingua-Libre is funded by Wikimedia France and actively used by the Wikimedia community.

Requirements

PHP 5.5
Composer 1.2
Symfony 1.5
MySQL 5 , one instance

Installation & Development

In order to create a level playing field for development this project uses docker & docker-compose, as well as composer for dependency management.

Install PHP dependencies

docker run -it --rm --user $(id -u):$(id -g) -v "$PWD":/app -v ~/.composer:/composer -w /app composer composer install

Start-up the system

export UID && docker-compose up

This will keep stay attached to your command line and show you useful debug information. The application should now already be available in your browser at http://localhost:8000.

Use a second command line and continue with the following commands to complete installation.

(Re-)Create Database

docker-compose exec app /code/bin/console doctrine:database:create --if-not-exists
docker-compose exec app /code/bin/console doctrine:schema:update --force
docker-compose exec app /code/bin/console doctrine:fixtures:load

Build the Javascript files

make -C web/js common.js

Technical todos

Hard dependencies

The following dependencies exist in the code and are yet to be addressed in the development environment or, at least, the README (e.g. required version).

make by usage of web/js/Makefile to build Javascript
yui-compressor in web/js/Makefile
oggenc in ExportController.php

Javascript build process

Currently the Javascript (web/js/common.js) is build using make. This is an anachronism to say the least. Switching to a more modern tool, like e.g. grunt, would resolve the hard dependency to both make and yui-compressor. Additionally, the jQuery file now commited into this repo (web/js/vendor/jquery-3.2.1.min.js) can then be replaced by Javascript package management (e.g. npm).

History

Shtooka Recorder (2010) by Nicolas Vion - a notable desktop software which had a deep impact on the open audio reccording ecosystems. Hundreds of applications use data produced by this software.
SWAC Recorder (2013) by Nicolas Vion - a revamp of the earlier, less known but easier to install, with better user experience.
Lingua-Libre (2016) by Nicolas Vion - a cloud variation of the earlier versions, the project was funded by Wikimedia France, and create with feedbacks from local linguistic academics. The grant is associated with the project to reccord and preserve dying French minorities languages, but can be used for all languages around the world, as well as to reccord the voice of your important love ones. The clean, sharp, net audiofiles outputed ease the creation of various derivated applications.

Functionalities

In order to provide very consistent, app-friendly files, the current functionality are :

easy usage without downlaod nor installation, via LinguaLibre.fr
speakers' profiles, with language, gender, age, origin and few other data recommended to us by linguists.
wordlist support
intuitive interface with audio curve went speaking
roll back capability
auto roll-back / do-again when saturation is detected
consistent cut before / after the said words
auto equalization for sound's level

Equipment (recommendation)

Silent room / Recording studio
1 x Scarlett2 Solo Studio Pack 2nd Generation, comprising portable :
- 1 x microphone
- 1 x headset
- 1 x external sound card
- 1 x cables
Microphone's addons :
- 1 x Pod / Arm stand
- 1 x Anti-pop filter
- 1 x Anti-vibration system
1 x modest PC (audio recording chain is external)
Internet connexion

Cost : US$250 for external audio equipments + US$300 for optional PC = 250 ~ 550US$.

Working process

Data gathering : prepare a text file with a list of words/sentences, one by line.
Speaker : find a willing speaker
Facility : find a calm studio or room
Hardware installation : install the equipment in the room so to work comfortably
Software settings: connect to LinguaLibre.fr's studio, edit the settings according to your needs
Recording : start your high quality massive audio recording. 1000~1500 items per half day is fair.
Applications : be creative, invent your apps ! :D

License

GNU GENERAL PUBLIC LICENSE -- thanks to a Wikimedia-France's funding.

Name		Name	Last commit message	Last commit date
Latest commit History 247 Commits
app		app
bin		bin
docker		docker
src		src
var		var
web		web
.gitignore		.gitignore
AUTHORS		AUTHORS
COPYING		COPYING
README.md		README.md
composer.json		composer.json
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lingua-Libre

Requirements

Installation & Development

Install PHP dependencies

Start-up the system

(Re-)Create Database

Build the Javascript files

Technical todos

Hard dependencies

Javascript build process

History

Functionalities

Equipment (recommendation)

Working process

License

About

Releases

Packages

Contributors 9

Languages

License

wikimedia-france/Lingua-Libre

Folders and files

Latest commit

History

Repository files navigation

Lingua-Libre

Requirements

Installation & Development

Install PHP dependencies

Start-up the system

(Re-)Create Database

Build the Javascript files

Technical todos

Hard dependencies

Javascript build process

History

Functionalities

Equipment (recommendation)

Working process

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 9

Languages

Packages