
[Feature Request] Constant-Q Transform, custom FFT and perceptual frequency scales #30

TF3RDL opened this issue Apr 7, 2022 · 7 comments

TF3RDL commented Apr 7, 2022

Although FFTs are fine, they get a bit boring for me, so for octave-band analysis I prefer the constant-Q transform (actually the variable-Q transform) over the FFT. My CQT implementation (built from a bunch of Goertzel filters) is slow, though, and it would need a sliding DFT to do the CQT in real time.
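
For reference, each CQT/VQT bin in my approach is just one Goertzel filter with its own block length (longer blocks for lower frequencies). A minimal sketch (function name and normalization are mine, not from any particular library):

```js
// Sketch of one Goertzel bin: magnitude of `samples` at frequency `freq` (Hz).
function goertzelMagnitude(samples, freq, sampleRate) {
  const omega = 2 * Math.PI * freq / sampleRate;
  const coeff = 2 * Math.cos(omega);
  let s1 = 0, s2 = 0;
  for (let n = 0; n < samples.length; n++) {
    const s0 = samples[n] + coeff * s1 - s2;
    s2 = s1;
    s1 = s0;
  }
  // power of the bin, then normalize so a full-scale sine reads ~1
  const power = s1 * s1 + s2 * s2 - coeff * s1 * s2;
  return Math.sqrt(Math.max(power, 0)) / (samples.length / 2);
}
```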

I'm also aware that spectrum analyzers built on the Web Audio API don't need to use AnalyserNode.getByteFrequencyData; you can feed getFloatTimeDomainData into any FFT library instead, just like my sketch does. But beware: you need to apply a window function (Hann or similar) to the samples before running the FFT, see #3
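
Roughly like this (just a sketch; `runFFT` is a placeholder for whatever FFT library you use, so its signature here is only assumed):

```js
// Grab raw time-domain samples from the AnalyserNode.
const timeData = new Float32Array(analyser.fftSize);
analyser.getFloatTimeDomainData(timeData);

// Apply a Hann window: w[n] = 0.5 * (1 - cos(2*pi*n / (N - 1)))
for (let n = 0; n < timeData.length; n++) {
  timeData[n] *= 0.5 * (1 - Math.cos((2 * Math.PI * n) / (timeData.length - 1)));
}

// Hand the windowed frame to your FFT library of choice.
const spectrum = runFFT(timeData); // placeholder call
```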

I also think perceptual frequency scales like Mel and Bark should be added, because they show the bass frequencies less prominently than a logarithmic scale but more prominently than a linear one.
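
The usual Hz-to-scale conversions are simple; a sketch:

```js
// Mel (O'Shaughnessy): m = 2595 * log10(1 + f / 700)
function hertzToMel(f) {
  return 2595 * Math.log10(1 + f / 700);
}

// Bark (Traunmüller): z = 26.81 * f / (1960 + f) - 0.53
function hertzToBark(f) {
  return 26.81 * f / (1960 + f) - 0.53;
}

// Bar positions then come from interpolating between the converted
// minimum and maximum frequencies of the analyzer.
```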


hvianna commented May 7, 2022

Thank you for letting me know about these techniques! Looks like I have a lot to catch up on! 😅

Also, thank you for sharing your sketch! It made me realize that using linear values for the amplitude (instead of dB) makes a huge difference in visualization. I'll have this added as an option in the next release. Next, I think weighting filters would also be a good addition.
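
The conversion itself is the usual one, since getFloatFrequencyData() returns magnitudes in dB (sketch):

```js
// Linear amplitude from a dB magnitude: 10^(dB / 20)
function dbToLinear(db) {
  return Math.pow(10, db / 20);
}
```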

Can you recommend any good references for equations/algorithms of the CQT/variable-Q transform, perceptual scales and weighting filters?

Cheers!


TF3RDL commented May 18, 2022

The equation for the Bark scale is from Traunmüller's work, and A-weighting, as well as the other things, is already covered on Wikipedia
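
For example, the standard A-weighting curve (the formula on Wikipedia, roughly 0 dB at 1 kHz) looks like this as code (just a sketch):

```js
// A-weighting in dB for a frequency f (Hz).
function aWeighting(f) {
  const f2 = f * f;
  const ra = (12194 ** 2 * f2 * f2) /
    ((f2 + 20.6 ** 2) *
     Math.sqrt((f2 + 107.7 ** 2) * (f2 + 737.9 ** 2)) *
     (f2 + 12194 ** 2));
  return 20 * Math.log10(ra) + 2.0;
}
```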

As for the constant-Q transform, I prefer the sliding DFT approach, which works best for real-time audio visualization, and there is even a paper about it
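
The per-bin update is very cheap: on each new sample, subtract the sample leaving the window, add the new one, and rotate by the bin's phase step. A sketch (class name and normalization are mine):

```js
class SlidingDFTBin {
  constructor(k, N) {
    this.N = N;
    this.buffer = new Float32Array(N); // circular buffer of the last N samples
    this.pos = 0;
    this.re = 0;
    this.im = 0;
    const w = (2 * Math.PI * k) / N;
    this.cos = Math.cos(w);
    this.sin = Math.sin(w);
  }
  update(x) {
    const old = this.buffer[this.pos]; // sample leaving the window
    this.buffer[this.pos] = x;
    this.pos = (this.pos + 1) % this.N;
    const re = this.re - old + x;
    const im = this.im;
    // complex multiply by e^(j*2*pi*k/N)
    this.re = re * this.cos - im * this.sin;
    this.im = re * this.sin + im * this.cos;
    return Math.hypot(this.re, this.im) / (this.N / 2); // magnitude
  }
}
```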


TF3RDL commented Dec 5, 2022

Here's the problem I realized before you implement the CQT: the Brown-Puckette method requires the real/imaginary parts of the spectrum, which AnalyserNode doesn't expose (getByteFrequencyData/getFloatFrequencyData only output logarithmic magnitude values). So it requires custom FFT functionality (which can be implemented with any FFT library, including ones like this that come bundled with FFT functions), and implementing the sliding CQT requires AudioWorklets, since it doesn't work well with getFloatTimeDomainData as the source of waveform data.
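
By AudioWorklets I mean roughly this kind of setup (sketch; the processor name and the CQT update call are placeholders):

```js
// sliding-cqt-processor.js
class SlidingCQTProcessor extends AudioWorkletProcessor {
  process(inputs, outputs, parameters) {
    const channel = inputs[0][0];
    if (channel) {
      for (let i = 0; i < channel.length; i++) {
        // feed each sample into the sliding CQT bins here, e.g.
        // updateSlidingCQT(channel[i]);
      }
      // bin magnitudes could be sent back via this.port.postMessage(...)
    }
    return true; // keep the processor alive
  }
}
registerProcessor('sliding-cqt', SlidingCQTProcessor);

// Main thread (sketch):
// await audioCtx.audioWorklet.addModule('sliding-cqt-processor.js');
// const node = new AudioWorkletNode(audioCtx, 'sliding-cqt');
// source.connect(node);
```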


hvianna commented Dec 10, 2022

@TF3RDL Thanks for following up on this!

For the next beta release, I've made some improvements to the linear amplitude mode and I'm finishing up the work on the weighting filters. I'll try to take a look at the perceptual scales next.


TF3RDL commented Dec 30, 2022

As for the custom FFT, this could allow non-power-of-two sizes, zero-padding, and the use of different FFT streams or even non-audio data as input (since a custom FFT doesn't depend on the Web Audio API), not just window functions, right?

I'm not sure about the performance impact of using a custom FFT instead of getByteFrequencyData/getFloatFrequencyData, but I do know that non-power-of-two FFTs are noticeably slower
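
Zero-padding is one way around that penalty: pad the windowed frame up to the next power of two before the FFT. A sketch:

```js
// Pad a frame with zeros up to the next power of two so a fast
// radix-2 FFT can still be used (at the cost of interpolated bins).
function zeroPadToPow2(frame) {
  const size = 2 ** Math.ceil(Math.log2(frame.length));
  const padded = new Float32Array(size); // initialized to zeros
  padded.set(frame);
  return padded;
}
```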


TF3RDL commented Apr 17, 2024

Of course, an analog-style analyzer mode (an IIR filter bank, no FFT required) might be better performance-wise, though I think it works best if you implement this type of non-FFT analyzer with a custom implementation (using AudioWorklets) rather than a bunch of BiquadFilterNodes each connected to its own AnalyserNode
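
One band of such a filter bank is just a bandpass biquad run per sample inside the worklet, followed by peak/RMS detection for the bar height. A sketch using the standard RBJ cookbook bandpass coefficients (names are mine):

```js
// Returns a per-sample bandpass filter centered at f0 (Hz) with quality Q.
function makeBandpass(f0, Q, sampleRate) {
  const w0 = (2 * Math.PI * f0) / sampleRate;
  const alpha = Math.sin(w0) / (2 * Q);
  const a0 = 1 + alpha;
  const b0 = alpha / a0, b1 = 0, b2 = -alpha / a0;
  const a1 = (-2 * Math.cos(w0)) / a0, a2 = (1 - alpha) / a0;
  let x1 = 0, x2 = 0, y1 = 0, y2 = 0;
  return function process(x) {
    const y = b0 * x + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2;
    x2 = x1; x1 = x;
    y2 = y1; y1 = y;
    return y;
  };
}
```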


hvianna commented May 1, 2024

I need to work on making the rendering function more independent of WebAudio / FFT, but I worry that a generic solution might impact performance.

By the way, I really like the idea of fading peaks in your demo. I'll try adding these next!
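
Roughly what I have in mind for the peaks (just a sketch; field names and timing constants are made up):

```js
// Hold each bar's peak for a while, then fade it out.
function updatePeak(peak, barValue) {
  if (barValue >= peak.value) {
    peak.value = barValue;
    peak.hold = 30;  // frames to hold before fading
    peak.alpha = 1;  // fully opaque
  } else if (peak.hold > 0) {
    peak.hold--;
  } else {
    peak.alpha = Math.max(0, peak.alpha - 0.05); // fade out
    if (peak.alpha === 0) peak.value = 0;
  }
  return peak;
}
```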
