- A Python package to provide suggestion on volume at which the music audio file needs to be played for better experience and feeling.
- In backend, it extracts various generic features for particular audio and analyze among them and provide feedback on volumne on it.
- This tools helps in maintaining good vibes along the music playout.
- First user need to download the script and run Volume Suggester.py in the local system.
- After running it, user will be prompted to select an audio file(mp3 file) using dialog box.
- Once user has selected the audio file, following feature extraction and analysis graph will be generated at the backend.
- Generic Audio Features:
Channels
: (number of channels; 1 for mono, 2 for stereo audio)Sample Width
: (number of bytes per sample; 1 means 8-bit, 2 means 16-bit)Frame Rate / Sample Rate
: (frequency of samples used (in Hertz))Frame Width
: (Number of bytes for each “frame”. One frame contains a sample for each channel.)Audio Length / Duration
: (audio file length (in milliseconds))Frame Count
: (the number of frames from the sample)Intensity
: (loudness in dBFS (dB relative to the maximum possible loudness))
- Plot on
Amplitude over Time
Analysis - Following Derived Audio Features:
Spectogram
RMS/Energy Spectogram
Zero Crossing Rate
Mel Frequency Cepstral Coefficients
Mel Frequency Spectogram
Chroma Feature
Tempogram
- Generic Audio Features:
- After these feature extraction is done, user will be able to Play/Pause(using CTRL button) and Stop(using ESC button) the selected song.
- Input:
- Single Audio File (.mp3)
- Output:
- Output Folder consisting of different feature extraction
- Implemented following features in the package:
-
Generic Audio Features:
Channels
: (number of channels; 1 for mono, 2 for stereo audio)Sample Width
: (number of bytes per sample; 1 means 8-bit, 2 means 16-bit)Frame Rate / Sample Rate
: (frequency of samples used (in Hertz))Frame Width
: (Number of bytes for each “frame”. One frame contains a sample for each channel.)Audio Length / Duration
: (audio file length (in milliseconds))Frame Count
: (the number of frames from the sample)Intensity
: (loudness in dBFS (dB relative to the maximum possible loudness))
-
Derived Audio Features:
Amplitude over Time
Spectogram
RMS/Energy Spectogram
Zero Crossing Rate
Mel Frequency Cepstral Coefficients
Mel Frequency Spectogram
Chroma Feature
Tempogram
-
Player for playing selected audio file
-
Suggestion on volume on the basis of various feature extraction.
-
- In order to use this package, one need to ensure the following requirements:
- Python 3.7 or later installed on your machine.
- ffmpeg pacakge must be installed and its PATH must be added to Environment variables.
This command will import the PyVolSuggester
module.
import PyVolSuggester
This command will install all the uninstalled required libraries used in script.
from PyVolSuggester import Suggester
This will prompt user for input of ffmpeg
path and audio file selection(.wav format).
Suggester.main()
This will extract the generic features of the selected audio file.
Suggester.extract()
This will allow user to play, plause and stop the audio file.
Suggester.play_pause_stop()
This will generate a amplitude over time plot and save it as an image to output directory.
Suggester.amplitude_wave()
This will generate a spectogram plot and save it as an image to output directory.
Suggester.spectogram()
This will generate a RMS/Energy Spectogram plot and save it as an image to output directory.
Suggester.rms_energy_spectogram()
This will generate a Zero Crossing Rate plot and save it as an image to output directory.
Suggester.zero_crossing_rate()
This will generate a Mel Frequency Cepstral Coefficients plot and save it as an image to output directory.
Suggester.mel_frequency_cepstral_coefficients()
This will generate a Mel Frequency Spectogram plot and save it as an image to output directory.
Suggester.mel_frequency_spectogram()
This will generate a Chroma Feature plot and save it as an image to output directory.
Suggester.chroma_feature()
This will generate a tempogram plot and save it as an image to output directory.
Suggester.tempogram()
This will suggest user the most feasible volume for that particular audio.
Suggester.suggest_volume()
pip install PyVolSuggester