Source separation neural network

This is the implementation of the source separation neural network which employs the U-net structure. The implementation refers to Singing Voice Separation with Deep U-Net Convolutional Networks.

Model architecture

The encoder and decoder blocks of the U-net each contains 6 convolutional blocks. The model takes the STFT magnitude spectrogram of the input signal and outputs masked STFT spectrogram.

Folder structure

source_separation
+--README.md
+--mask_data
|  +--mixtures
|     +--train
|     +--val
|     +--test
|  +--targets
|     +--train
|     +--val
|     +--test
+--model
+--pickle_data
|  +--train
|  +--val
|  +--test
+--src_formatted
+--test_result

Installation

To run the code, python, pytorch, torchaudio, numpy, and librosa are required.

How to run

Have the data in the mask_data folder as the structure above. Every sample in training and validation set must of equal length for batch processing.
Run serialize.py to obtain the pickle data.
Run mask_main.py to execute the training and inference.

Authors

Diep Luong

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
src_formatted		src_formatted
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Source separation neural network

Model architecture

Folder structure

Installation

How to run

Authors

About

Releases

Packages

Languages

lndip/source_separation

Folders and files

Latest commit

History

Repository files navigation

Source separation neural network

Model architecture

Folder structure

Installation

How to run

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages