Add NAS KWS model, Dynamic Augmentation and Automated Evaluation Notebook #280

alicangok · 2024-01-01T23:42:01Z

Major changes:

Dynamic Augmentation is introduced
- Instead of fixing the augmented examples during dataset creation, the dataset loader now generates unique training examples during each epoch, significantly boosting robustness against noise and time shifts.
- The more costly "speed augmentation" remains fixed, carried out once during dataset creation.
- For stability of validation results across epochs, the validation examples (original + augmentations) are also fixed; they are constructed during initial dataset creation.
Changed the dataset filename (dataset2.pt->dataset3.pt) to avoid potential mix-ups, as this PR introduces a major change
- Added "shift_limits" property to each sample (for possible future feature compatibility, regarding voice activity detection)
- The generated dataset contains the following:
  - The original training samples from Google Speech Commands, and 2 augmented versions of each sample with different speeds.
  - Additional training samples from Librispeech as additional examples for the "background" class.
  - The original validation samples from Google Speech Commands, and 2 augmented versions of each sample with different speeds, time shifts, and added white noise.
  - The original test samples from Google Speech Commands without any augmentation.
Dataset creation is significantly faster (90 mins -> 4 mins), thanks to more efficient operations done in batches.
The network found via "Neural Architecture Search" is introduced, which significantly improves accuracy than its predecessors (v2 & v3), having a higher parameter count, slightly increased #MACs, and latency (3.2ms -> 3.9ms).
From: @EyubogluMerve: Added automated evaluation notebook for specified noise types and SNR levels..
- Added a new dataset (signalmixer.py)
- Modified msnoise.py to:
  - include "Tradeshow" as another type of noise
  - carry out proper train/test splits

Summary of Improvements:

Along with the previous PR, we have improved the KWS20 accuracy from ~86.5% to 92.5% on the validation set which includes augmented samples, and from 87.6% to 93.7% on the clean test set.

The impact of each change on the KWS20 accuracy are as follows:

pytsmod tempo augmentation -> torchaudio speed augmentation: +1%
v3 -> v2 model: +1.5%
v2 -> NAS model: +2.5%
Dynamic noise & shift augmentation: +1%
Total: +6% Absolute change in accuracy, from 86.5%->92.5%
- 44% decrease in error rates, with even more significant reduction in false alarm rates.

alicangok · 2024-01-08T11:32:50Z

~~Changed the pull request to "draft" mode: Awaiting requested changes at: alicangok#1~~

* changed files are added * name changes are done * Update msnoise.py copyrights * Update signalmixer.py copyright notices * Update Automated_Evaluation_KWS.ipynb copyright notices * signalmixer parameters are updated * Notebook is updated using current paths * Correct os.path.join usage for non-Linux operating systems * Define `data_path` once --------- Co-authored-by: Alican Gök <alicangok@gmail.com>

change the name of the notebook

ermanok

Added minor comments.

datasets/kws20.py

aniktash

Some minor text updates suggested

notebooks/KWS_Noise_Evaluation.ipynb

MaximGorkem

Only few small comments, looks and trains nice.

datasets/kws20.py

…ng into kws/dynamicaug_nas

alicangok added 4 commits January 2, 2024 01:35

Add NAS KWS model and Dynamic Augmentation

dfc3e42

Fix line endings

d873c85

Remove utf-8 copyright character

4486295

Fix import

76027dd

alicangok marked this pull request as draft January 8, 2024 11:32

rotx-eva and others added 2 commits January 9, 2024 10:12

Merge branch 'develop' into kws/dynamicaug_nas

4c3d904

alicangok marked this pull request as ready for review January 15, 2024 12:06

alicangok changed the title ~~Add NAS KWS model and Dynamic Augmentation~~ Add NAS KWS model, Dynamic Augmentation and Automated Evaluation Script Jan 15, 2024

alicangok changed the title ~~Add NAS KWS model, Dynamic Augmentation and Automated Evaluation Script~~ Add NAS KWS model, Dynamic Augmentation and Automated Evaluation Notebook Jan 15, 2024

Rename Automated_Evaluation_KWS.ipynb to KWS_Noise_Evaluation.ipynb

5ef304f

change the name of the notebook

ermanok requested review from ermanok, MaximGorkem and aniktash January 17, 2024 20:54

ermanok reviewed Jan 17, 2024

View reviewed changes

datasets/kws20.py Outdated Show resolved Hide resolved

datasets/kws20.py Outdated Show resolved Hide resolved

aniktash reviewed Jan 17, 2024

View reviewed changes

notebooks/KWS_Noise_Evaluation.ipynb Outdated Show resolved Hide resolved

notebooks/KWS_Noise_Evaluation.ipynb Outdated Show resolved Hide resolved

notebooks/KWS_Noise_Evaluation.ipynb Outdated Show resolved Hide resolved

alicangok mentioned this pull request Jan 21, 2024

Add NAS KWS model (trained using dynamic augmentation) analogdevicesinc/ai8x-synthesis#324

Merged

alicangok added 3 commits January 21, 2024 22:11

Merge branch 'develop' into kws/dynamicaug_nas

eb7919c

Text updates for the noise evaluation notebook

3535eca

Change the data filename and fix minor typo in kws20.py

77ee949

alicangok requested a review from aniktash January 22, 2024 14:01

alicangok added 2 commits January 22, 2024 17:05

Typo

4f001fb

typo

9ffde0e

MaximGorkem reviewed Jan 22, 2024

View reviewed changes

datasets/kws20.py Outdated Show resolved Hide resolved

datasets/kws20.py Outdated Show resolved Hide resolved

alicangok added 4 commits January 23, 2024 20:22

Merge branch 'kws/dynamicaug_nas' of github.com:alicangok/ai8x-traini…

e81cec9

…ng into kws/dynamicaug_nas

Remove the word from copyright, minor comment hanges

7ec991d

Merge branch 'MaximIntegratedAI:develop' into kws/dynamicaug_nas

94b56da

signalmixer copyright notice

a9246b4

alicangok marked this pull request as draft January 24, 2024 22:30

alicangok marked this pull request as ready for review January 24, 2024 22:44

alicangok and others added 3 commits January 25, 2024 01:44

correct copyright notices for files with apache licence

45eb2b1

Copyright notices (#2)

4ba542f

Revert the change on weights of classes

8f50465

rotx-eva approved these changes Feb 1, 2024

View reviewed changes

rotx-eva merged commit 3a4a661 into analogdevicesinc:develop Feb 1, 2024
2 checks passed

rotx-eva mentioned this pull request Mar 13, 2024

Update KWS, MSNoise, Signalmixer Data Loaders & Evaluation Notebook, Add New Scripts for Mixed Signals #299

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NAS KWS model, Dynamic Augmentation and Automated Evaluation Notebook #280

Add NAS KWS model, Dynamic Augmentation and Automated Evaluation Notebook #280

alicangok commented Jan 1, 2024 •

edited

Loading

alicangok commented Jan 8, 2024 •

edited

Loading

ermanok left a comment

aniktash left a comment

MaximGorkem left a comment

Add NAS KWS model, Dynamic Augmentation and Automated Evaluation Notebook #280

Add NAS KWS model, Dynamic Augmentation and Automated Evaluation Notebook #280

Conversation

alicangok commented Jan 1, 2024 • edited Loading

Major changes:

Summary of Improvements:

alicangok commented Jan 8, 2024 • edited Loading

ermanok left a comment

Choose a reason for hiding this comment

aniktash left a comment

Choose a reason for hiding this comment

MaximGorkem left a comment

Choose a reason for hiding this comment

alicangok commented Jan 1, 2024 •

edited

Loading

alicangok commented Jan 8, 2024 •

edited

Loading