Enhancing McAdams Coefficient-Based Speaker Anonymisation with Cross-Gender Timbre Transfer
< backSpeaker anonymisation or speaker de-identification involves modifying original speech to resemble the voice of an unspecified speaker, while preserving linguistic content and speech quality. This study introduces a speaker anonymisation system based on the second baseline system from the VoicePrivacy2022 Challenge. The evaluation was performed using the evaluation scripts provided by the Challenge. Enhanced privacy is achieved by using a VAE-GAN timbre transfer model to disguise gender identity through a random gender selection strategy. Additionally, the primary objective utility evaluation shows the potential for further improvements. In the secondary utility evaluation, the proposed system shows favourable results with respect to voice distinctiveness, surpassing all baseline systems.