강연 / 세미나

세미나
세미나
일정

MINDS Seminar Series | Low-quality Fake Audio Detection through Frequency Feature Masking

기간 : 2022-11-01 ~ 2022-11-01
시간 : 17:00 ~ 18:00
개최 장소 : Online streaming (Zoom)
개요
MINDS Seminar Series | Low-quality Fake Audio Detection through Frequency Feature Masking
분야Field
날짜Date 2022-11-01 ~ 2022-11-01 시간Time 17:00 ~ 18:00
장소Place Online streaming (Zoom) 초청자Host
연사Speaker Il Youp Kwak 소속Affiliation
TOPIC MINDS Seminar Series | Low-quality Fake Audio Detection through Frequency Feature Masking
소개 및 안내사항Content The first Audio Deep Synthesis Detection Challenge (ADD 2022) competition was held which dealt with audio deepfake detection, audio deep synthesis, audio fake game, and adversarial attacks. Our team participated in track 1, classifying bona fide and fake utterances in noisy environments. Through exploratory data analysis, we found that noisy signals appear in similar frequency bands for given voice samples. If a model is trained to rely heavily on information in frequency bands where noise exists, performance will be poor. In this paper, we propose a data augmentation method, Frequency Feature Masking (FFM) that randomly masks frequency bands. FFM makes a model robust by not relying on specific frequency bands and prevents overfitting. We applied FFM and mixup augmentation on five spectrogram-based deep neural network architectures that performed well for spoofing detection using mel-spectrogram and constant Q transform (CQT) features. Our best submission achieved 23.8% in EER and ranked 3rd on track 1. To demonstrate the usefulness of our proposed FFM augmentation, we further experimented with FFM augmentation using ASVspoof 2019 Logical Access (LA) datasets.

https://us06web.zoom.us/j/6888961076?pwd=ejYxN05jNmhUa25PU2JzSUJvQ1haQT09
ID : 688 896 1076 / PW : 54321
학회명Field MINDS Seminar Series | Low-quality Fake Audio Detection through Frequency Feature Masking
날짜Date 2022-11-01 ~ 2022-11-01 시간Time 17:00 ~ 18:00
장소Place Online streaming (Zoom) 초청자Host
소개 및 안내사항Content The first Audio Deep Synthesis Detection Challenge (ADD 2022) competition was held which dealt with audio deepfake detection, audio deep synthesis, audio fake game, and adversarial attacks. Our team participated in track 1, classifying bona fide and fake utterances in noisy environments. Through exploratory data analysis, we found that noisy signals appear in similar frequency bands for given voice samples. If a model is trained to rely heavily on information in frequency bands where noise exists, performance will be poor. In this paper, we propose a data augmentation method, Frequency Feature Masking (FFM) that randomly masks frequency bands. FFM makes a model robust by not relying on specific frequency bands and prevents overfitting. We applied FFM and mixup augmentation on five spectrogram-based deep neural network architectures that performed well for spoofing detection using mel-spectrogram and constant Q transform (CQT) features. Our best submission achieved 23.8% in EER and ranked 3rd on track 1. To demonstrate the usefulness of our proposed FFM augmentation, we further experimented with FFM augmentation using ASVspoof 2019 Logical Access (LA) datasets.

https://us06web.zoom.us/j/6888961076?pwd=ejYxN05jNmhUa25PU2JzSUJvQ1haQT09
ID : 688 896 1076 / PW : 54321
성명Field MINDS Seminar Series | Low-quality Fake Audio Detection through Frequency Feature Masking
날짜Date 2022-11-01 ~ 2022-11-01 시간Time 17:00 ~ 18:00
소속Affiliation 초청자Host
소개 및 안내사항Content The first Audio Deep Synthesis Detection Challenge (ADD 2022) competition was held which dealt with audio deepfake detection, audio deep synthesis, audio fake game, and adversarial attacks. Our team participated in track 1, classifying bona fide and fake utterances in noisy environments. Through exploratory data analysis, we found that noisy signals appear in similar frequency bands for given voice samples. If a model is trained to rely heavily on information in frequency bands where noise exists, performance will be poor. In this paper, we propose a data augmentation method, Frequency Feature Masking (FFM) that randomly masks frequency bands. FFM makes a model robust by not relying on specific frequency bands and prevents overfitting. We applied FFM and mixup augmentation on five spectrogram-based deep neural network architectures that performed well for spoofing detection using mel-spectrogram and constant Q transform (CQT) features. Our best submission achieved 23.8% in EER and ranked 3rd on track 1. To demonstrate the usefulness of our proposed FFM augmentation, we further experimented with FFM augmentation using ASVspoof 2019 Logical Access (LA) datasets.

https://us06web.zoom.us/j/6888961076?pwd=ejYxN05jNmhUa25PU2JzSUJvQ1haQT09
ID : 688 896 1076 / PW : 54321
성명Field MINDS Seminar Series | Low-quality Fake Audio Detection through Frequency Feature Masking
날짜Date 2022-11-01 ~ 2022-11-01 시간Time 17:00 ~ 18:00
호실Host 인원수Affiliation Il Youp Kwak
사용목적Affiliation 신청방식Host
소개 및 안내사항Content The first Audio Deep Synthesis Detection Challenge (ADD 2022) competition was held which dealt with audio deepfake detection, audio deep synthesis, audio fake game, and adversarial attacks. Our team participated in track 1, classifying bona fide and fake utterances in noisy environments. Through exploratory data analysis, we found that noisy signals appear in similar frequency bands for given voice samples. If a model is trained to rely heavily on information in frequency bands where noise exists, performance will be poor. In this paper, we propose a data augmentation method, Frequency Feature Masking (FFM) that randomly masks frequency bands. FFM makes a model robust by not relying on specific frequency bands and prevents overfitting. We applied FFM and mixup augmentation on five spectrogram-based deep neural network architectures that performed well for spoofing detection using mel-spectrogram and constant Q transform (CQT) features. Our best submission achieved 23.8% in EER and ranked 3rd on track 1. To demonstrate the usefulness of our proposed FFM augmentation, we further experimented with FFM augmentation using ASVspoof 2019 Logical Access (LA) datasets.

https://us06web.zoom.us/j/6888961076?pwd=ejYxN05jNmhUa25PU2JzSUJvQ1haQT09
ID : 688 896 1076 / PW : 54321
Admin Admin · 2022-10-31 11:45 · 조회 141
2017년 이전 세미나
kartal escort maltepe escort