This is the code for our ICASSP 2021 accepted paper: "Attack on practical speaker verification system using universal adversarial perturbations"
Paper link: https://ieeexplore.ieee.org/abstract/document/9413467 or https://arxiv.org/abs/2105.09022
You should download the LibriSpeech test-clean data set and the BUT Speech@FIT Reverb Database. Remember the paths where they are saved.
The model from Chung et al. (2020) (cited at the bottom of this page) is used as our speaker embedding encoder. Please download the pretrained model here and put it in the `./checkpoint/` folder. To evaluate EER and minDCF for the pretrained model, you can change the path config in `./config/eval_libri_speaker.yaml` and run the following command. You will get a 1.36% EER with a 0.4321 score threshold on the LibriSpeech test-clean data set.
```
python Testspeaker.py --config config/eval_libri_speaker.yaml
```
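For reference, EER is the operating point where the false accept and false reject rates are equal, and the reported threshold is the score at that point. Below is a minimal sketch of that computation from raw trial scores (given as NumPy arrays); the function and variable names are illustrative and not the repository's actual API.

```python
import numpy as np

def compute_eer(target_scores, impostor_scores):
    """Return (EER, threshold) from same-speaker and different-speaker trial scores."""
    thresholds = np.sort(np.concatenate([target_scores, impostor_scores]))
    # False reject rate: target trials scored below the threshold.
    frr = np.array([(target_scores < t).mean() for t in thresholds])
    # False accept rate: impostor trials scored at or above the threshold.
    far = np.array([(impostor_scores >= t).mean() for t in thresholds])
    idx = np.argmin(np.abs(frr - far))  # point where the two rates cross
    return (frr[idx] + far[idx]) / 2, thresholds[idx]
```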
There are 40 speakers in the LibriSpeech test-clean set. We have to select enrollment, training, and testing audios for each speaker. You can set the numbers of enrollment and training audios in `./datas/splits.py`, and you also have to change the `wav_path` in it to your save path. Then run `python ./datas/splits.py` to generate the split files in the `./datas/splits/` folder.
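The split logic is roughly the following; this is a simplified sketch, not the actual `./datas/splits.py`, and the directory layout and counts are assumptions based on the standard LibriSpeech structure.

```python
import glob
import os
import random

wav_path = "/path/to/LibriSpeech/test-clean"   # change to your save path
n_enroll, n_train = 5, 10                      # illustrative enrollment / training counts

random.seed(0)
splits = {"enroll": [], "train": [], "test": []}
for speaker in sorted(os.listdir(wav_path)):
    # LibriSpeech stores utterances under <speaker>/<chapter>/<utt>.flac
    utts = sorted(glob.glob(os.path.join(wav_path, speaker, "*", "*.flac")))
    random.shuffle(utts)
    splits["enroll"] += utts[:n_enroll]
    splits["train"] += utts[n_enroll:n_enroll + n_train]
    splits["test"] += utts[n_enroll + n_train:]

os.makedirs("./datas/splits", exist_ok=True)
for name, files in splits.items():
    with open(os.path.join("./datas/splits", name + ".txt"), "w") as f:
        f.write("\n".join(files))
```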
Change the configs in `./config/attack_config.yaml` and then run the following command. It has four main parts: enrolling every speaker; matching different adversary and target speaker pairs, including intra-gender and inter-gender matches; generating an adversarial perturbation for each pair on the training audios; and evaluating the perturbation on the test audios. The results will be written into a txt file under the `out_path` set in your config.
```
python attack.py --config config/attack_config.yaml
```
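Conceptually, for each adversary/target pair one universal perturbation is optimized over the adversary's training utterances so that their embeddings move toward the target speaker's enrolled embedding. The PyTorch-style sketch below illustrates the idea only; the encoder interface, loss, hyper-parameters, and perturbation constraint are placeholders rather than the repository's exact implementation.

```python
import torch
import torch.nn.functional as F

def attack_pair(encoder, train_wavs, target_emb, steps=1000, lr=1e-3, epsilon=0.05):
    """Optimize one universal additive perturbation for a single adversary/target pair.

    train_wavs: list of fixed-length waveform tensors of the adversary
    target_emb: L2-normalized enrolled embedding of the targeted speaker, shape (1, D)
    """
    delta = torch.zeros_like(train_wavs[0], requires_grad=True)
    optimizer = torch.optim.Adam([delta], lr=lr)

    for _ in range(steps):
        losses = []
        for wav in train_wavs:
            adv = torch.clamp(wav + delta, -1.0, 1.0)              # keep a valid waveform
            emb = F.normalize(encoder(adv.unsqueeze(0)), dim=-1)
            # Pull the adversarial embedding toward the target speaker.
            losses.append(1.0 - F.cosine_similarity(emb, target_emb).mean())
        loss = torch.stack(losses).mean()

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        with torch.no_grad():
            delta.clamp_(-epsilon, epsilon)                        # bound the perturbation

    return delta.detach()
```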
Every time a perturbation is generated, the testing process follows immediately, and you can find the test results in `out_path`. It should also support evaluating a perturbation separately, for example with different test audios of the adversary.
To evaluate the attack success rate of audio adversarial examples, you should first combine the adversarial perturbation with the test audios. You can use the following command to generate the adversarial examples. Change `wav_root` to your data save path and `noise_root` to the adversarial perturbation save path.
```
python generate_adv_examples.py --wav_root /path/to/wav_data --wav_file ./datas/splits/test.txt --noise_root /path/to/adversarial_perturbation --out_root ./output
```
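In essence, this step tiles (or truncates) the universal perturbation to each test utterance's length, adds it to the waveform, clips to a valid range, and writes the result. A minimal sketch, assuming 16 kHz mono wav input and the soundfile package (the helper name below is illustrative):

```python
import numpy as np
import soundfile as sf

def make_adv_example(wav_file, noise_file, out_file):
    wav, sr = sf.read(wav_file)
    noise, _ = sf.read(noise_file)
    # Repeat or truncate the universal perturbation to match the utterance length.
    reps = int(np.ceil(len(wav) / len(noise)))
    noise = np.tile(noise, reps)[: len(wav)]
    adv = np.clip(wav + noise, -1.0, 1.0)
    sf.write(out_file, adv, sr)
```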
If your training process includes RIR simulation, you should run the following command to generate adversarial examples with test RIRs.
```
python generate_adv_examples.py --rir --rir_root /path/to/rir_wavs --wav_root /path/to/wav_data --wav_file ./datas/splits/test.txt --noise_root /path/to/adversarial_perturbation --out_root ./output
```
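With `--rir`, a test room impulse response is additionally applied to simulate playback over the air. A hedged sketch of that extra step is below; whether the RIR is convolved with the perturbation, the speech, or the mixture in the actual script may differ from this illustration.

```python
import numpy as np
import soundfile as sf
from scipy.signal import fftconvolve

def apply_rir(signal, rir_file):
    """Convolve a waveform with a room impulse response and keep the original length."""
    rir, _ = sf.read(rir_file)
    rir = rir / (np.max(np.abs(rir)) + 1e-8)     # normalize the impulse response
    return fftconvolve(signal, rir, mode="full")[: len(signal)]
```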
After generating the adversarial examples, you can change the path config in `config/test_config.yaml` and run the following command to get the attack success rate.
```
python Testattack.py --config config/test_config.yaml
```
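The attack success rate is simply the fraction of adversarial test trials whose verification score against the targeted speaker's enrollment is at or above the system's decision threshold (e.g. the 0.4321 threshold reported above). A minimal sketch, with illustrative names:

```python
import numpy as np

def attack_success_rate(adv_scores, threshold=0.4321):
    """adv_scores: similarity scores between adversarial-example embeddings
    and the targeted speaker's enrolled embedding."""
    return 100.0 * np.mean(np.asarray(adv_scores) >= threshold)
```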
Attack type | Steps | ASR(%) | WER(%) | SNR(dB) |
---|---|---|---|---|
Clean data | N/A | 0 | 12.95 | N/A |
intra-gender/baseline | 236 | 98.43 | 32.33 | 16.90 |
intra-gender/ours | 846 | 98.65 | 19.43 | 23.66 |
inter-gender/baseline | 617 | 96.63 | 37.57 | 16.55 |
inter-gender/ours | 1872 | 96.40 | 21.53 | 22.26 |
Attack type | Steps | ASR(%) | WER(%) |
---|---|---|---|
Clean data with rir | N/A | 0 | 29.17 |
intra-gender/baseline | 279 | 99.21 | 79.33 |
intra-gender/ours | 1003 | 98.82 | 66.48 |
inter-gender/baseline | 748 | 97.20 | 82.71 |
inter-gender/ours | 1525 | 96.41 | 72.47 |
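For reference, the SNR column in the first table is the signal-to-noise ratio between the clean speech and the added perturbation, in dB. A standard way to compute it is sketched below; the paper's exact measurement protocol may differ.

```python
import numpy as np

def snr_db(clean, noise):
    """SNR in dB between the clean waveform and the added adversarial perturbation."""
    return 10.0 * np.log10(np.sum(clean ** 2) / (np.sum(noise ** 2) + 1e-12))
```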
Physical attack scenario:
Intra-gender attack
Attack type | ASR(%) | WER(%) | CER(%) |
---|---|---|---|
Clean | 0 | 11.42 | 5.78 |
Gaussian | 0 | 17.77 | 10.06 |
Baseline | 80.00 | 21.82 | 14.48 |
Ours | 100.00 | 14.97 | 7.53 |
Our ASV model code is cloned from the project accompanying the following paper:
```bibtex
@inproceedings{chung2020in,
  title={In defence of metric learning for speaker recognition},
  author={Chung, Joon Son and Huh, Jaesung and Mun, Seongkyu and Lee, Minjae and Heo, Hee Soo and Choe, Soyeon and Ham, Chiheon and Jung, Sunghwan and Lee, Bong-Jin and Han, Icksang},
  booktitle={Interspeech},
  year={2020}
}
```
If you find our paper useful for your work, please cite the following.
```bibtex
@inproceedings{zhang2021attack,
  title={Attack on Practical Speaker Verification System Using Universal Adversarial Perturbations},
  author={Zhang, Weiyi and Zhao, Shuning and Liu, Le and Li, Jianmin and Cheng, Xingliang and Zheng, Thomas Fang and Hu, Xiaolin},
  booktitle={ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  pages={2575--2579},
  year={2021},
  organization={IEEE}
}
```