AudioFlex

Features

Time Stretching: Increase or decrease the length of audio clips without altering the pitch.
Pitch Shifting: Modify the pitch of audio files without changing the duration.
File I/O: Read and write support for .wav audio files. (Sample audio files located in the audio directory)

How does AudioFlex work?

(i) Change the Time of a Signal without Varying the Frequency

Functionality: Time Stretching (Speed Change) Purpose: This functionality allows users to increase or decrease the playback duration of an audio signal without changing its pitch. This is crucial in applications such as adjusting music tracks to fit a specific time slot, creating slow-motion effects in audio without altering pitch, or making spoken content more accessible by slowing it down for clarity.

How It Works:

Phase Vocoder: The primary technology behind this feature is the Phase Vocoder, which processes audio in the frequency domain using Fast Fourier Transform (FFT). The phase vocoder breaks the sound into small overlapping segments or frames, applies an FFT to convert each frame from the time domain to the frequency domain, and then modifies the timing of these frames.
Time Stretching Algorithm: In the frequency domain, the algorithm adjusts the time intervals between successive frames without changing their spectral content. This is accomplished by manipulating the phase of the frequency components to ensure coherence in the overlapped windows during synthesis. When the frames are transformed back to the time domain using the Inverse Fast Fourier Transform (IFFT) and recombined, the result is a longer or shorter version of the original sound with the pitch preserved.

(ii) Change the Frequency of a Signal without Varying the Time

Functionality: Pitch Shifting (Key Change) Purpose: This functionality allows users to change the pitch or key of an audio signal without affecting its duration. This feature is essential in music production for key modulation and harmony generation, voice modification in post-production for entertainment or privacy purposes, and making music and educational content more versatile.

How It Works:

Phase Vocoder: Similar to time stretching, pitch shifting also uses the phase vocoder technology. However, instead of changing the time intervals between the frames, pitch shifting modifies the frequencies within each frame.
Pitch Shifting Algorithm: By multiplying the frequency components obtained from the FFT by a constant factor (related to the desired pitch shift), the frequencies are effectively raised or lowered. For instance, to increase the pitch, frequencies are scaled up, whereas to decrease the pitch, they are scaled down. The phase components must also be appropriately adjusted to maintain a natural sound and prevent artifacts. After the frequency adjustment, the signal is transformed back to the time domain using IFFT, preserving the original duration but with a modified pitch.

Tutorial

Prerequisites

C++ Compiler (GCC or Clang)
libsndfile library
fftw3 library

Compiling on Ubuntu / Debian

Install the necessary libraries:

sudo apt-get install libsndfile1-dev libfftw3-dev

Compile the project:
- Navigate to the project directory and compile using g++:
```
g++ -o main main.cpp -lsndfile -lfftw3
```
- Or use the provided Makefile:
```
make
```

Example Usage

Example 1

Time Stretch Factor (Speed Change): 1.5
Pitch Shift Factor (Key Change): 1

./main ./audio/sample.wav output-1.5-1.wav 1.5 1

Example 2

Time Stretch Factor (Speed Change): 0.8
Pitch Shift Factor (Key Change): 1

./main ./audio/sample.wav output-0.8-1.wav 0.8 1

Example 3

Time Stretch Factor (Speed Change): 1
Pitch Shift Factor (Key Change): 0.8

./main ./audio/sample.wav output-1-0.8.wav 1 0.8

Example 4

Time Stretch Factor (Speed Change): 1
Pitch Shift Factor (Key Change): 1.2

./main ./audio/sample.wav output-1-1.2.wav 1 1.2

Example 5

Time Stretch Factor (Speed Change): 1.5
Pitch Shift Factor (Key Change): 0.8

./main ./audio/sample.wav output-1.5-0.8.wav 1.5 0.8

Docker Usage

Build the Docker image:
```
docker build -t audioflex .
```

Run the Docker container:

docker run --rm -it -v $(pwd):/app audioflex

Python Script Tool Usage

Prerequisites

Python 3
rosa library
matplotlib library
numpy library (< 2.0.0)

Verify/Test with `rosa.py`

python3 scripts/rosa.py <input_wav_file> <output_wav_file> <time_speed> <pitch_shift>

Plot Audio Spectrogram with `plot.py`

python3 scripts/plot.py <input_wav_file> <output_plot_file>

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
assets		assets
audio		audio
scripts		scripts
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
main.cpp		main.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AudioFlex

Features

How does AudioFlex work?

(i) Change the Time of a Signal without Varying the Frequency

(ii) Change the Frequency of a Signal without Varying the Time

Tutorial

Prerequisites

Compiling on Ubuntu / Debian

Example Usage

Example 1

Example 2

Example 3

Example 4

Example 5

Docker Usage

Python Script Tool Usage

Prerequisites

Verify/Test with `rosa.py`

Plot Audio Spectrogram with `plot.py`

About

Releases

Packages

Languages

License

ryanlinjui/AudioFlex

Folders and files

Latest commit

History

Repository files navigation

AudioFlex

Features

How does AudioFlex work?

(i) Change the Time of a Signal without Varying the Frequency

(ii) Change the Frequency of a Signal without Varying the Time

Tutorial

Prerequisites

Compiling on Ubuntu / Debian

Example Usage

Example 1

Example 2

Example 3

Example 4

Example 5

Docker Usage

Python Script Tool Usage

Prerequisites

Verify/Test with rosa.py

Plot Audio Spectrogram with plot.py

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Verify/Test with `rosa.py`

Plot Audio Spectrogram with `plot.py`

Packages