Robust discriminative adversarial learning (RDAL)

The RDAL repository contains the code to reproduce the main results in the paper Adversarial Representation Learning for Robust Privacy Preservation in Audio (OJSP 2023)

Structure

The folder RDAL\data_final contains the data to run the code. The data can be obtained from TAU Sound Events and Speech Privacy Preservation
The folder RDAL\pickle_data contains the serialized data in pickle form after running RDAL\src\serialize_data.py.
The folder RDAL\src contains the source code of RDAL+M.
The folder RDAL\models contains the trained models in each run of each specific training mode. RDAL\models\tsne_supervised contains the trained models in the supervised training specifically for the TSNE visualization.
The folder RDAL\pickle_results contains the results in pickle format of each run. RDAL\best_pickle_results contains the results of the best model in each approach (baseline, NaiveAdv, RDAL, and RDAL+M) for ROC curve plotting

Requirement

The code is written in Python, and the models are implemented using Pytorch. Make sure all libraries in environment.yml are available.

How to run

Download and place the extracted data in data_final. Remove .gitkeep from the empty folders if needed.
Run
```
 python serialize_data.py
```
to serialize the data. This file will calculate the short time Fourier trasform (STFT) spectrograms of both the merged signals and the sound event signals and then serialize the spectrograms with the sound event labels and speech labels into .pickle format for training the adversarial training process and the STFT spectrograms of the merged signals containing speech and their sound event correspondences for pre-training the source separtion network in RDAL+M setup.
There are 4 setups provided in this package: The baseline, NaiveAdv, RDAL, and RDAL+M can be run from baseline_main.py, naive_rdal_main.py, rdal_main.py, and rdal_mask_main.py respectively. To run the model, run the following command:
```
python {main_file_name}.py {job_idx}
```
The models from the training process of a specific training mode with a specific job_idx is saved in the folder models. The results are saved in the folder pickle_results, in .pickle format.
Result reading:

read_pickle.ipynb is provided to read the results from the pickle file. The plot of the predicted probability densities from the attacker model can be generated in the notebook.
The TSNE plot can be generarted from visualize_data.ipynb. The trained model for the supervised training of the feature extractor on the sound event and the speech label can be found in the models directory.

Reference

Please consider citing our paper if the work is useful for your research.

@ARTICLE{gharib2023privacy,
  author={Gharib, Shayan and Tran, Minh and Luong, Diep and Drossos, Konstantinos and Virtanen, Tuomas},
  journal={IEEE Open Journal of Signal Processing},
  title={Adversarial Representation Learning for Robust Privacy Preservation in Audio},
  year={2023},
  note={submitted for publication},
}

Contact

Diep Luong (lndiep1811@gmail.com)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Robust discriminative adversarial learning (RDAL)

Structure

Requirement

How to run

Reference

Contact

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
best_pickle_results		best_pickle_results
data_final		data_final
models		models
pickle_data		pickle_data
pickle_results		pickle_results
src		src
LICENSE.txt		LICENSE.txt
README.md		README.md
environment.yml		environment.yml

License

lndip/RDAL

Folders and files

Latest commit

History

Repository files navigation

Robust discriminative adversarial learning (RDAL)

Structure

Requirement

How to run

Reference

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages