2016 ICME:Phonetic posteriorgrams for many-to-one voice conversion without parallel data training

时间 2021-01-11

原文原文链接

作者： Helen Meng 单位：港中文 abstract 非平行训练数据进行voice conversion 首先用一个SI-ASR（speaker-independent 语音识别系统）提取PPGs(Phonetic PosteriorGrams)，这个PPGs可以对应于说话者的发音，并且对应于独立说话者的说话内容。然后用DBLSTM（deep bi-LSTM)建模PPGs和target