admin管理员组

文章数量:1123097

I am trying to do accent conversion and I used mfccs as a feature. Now, I have converted mfccs and I want to check how it sounds. For that, I need to synthesize speech from the given mfccs. Do anyone have idea on how can I generate high quality audio from the mfccs? Is there any matlab code or python code for that? Or is there any tutorial on that?

I tried using griffin-lim algorithm which is not very effective. I also found Hifi-GAN which synthesizes great quality audio but it works with melspectrograms not with mfccs. I wanted to use a neural vocoder if there is any to generate speech from the mfccs.

本文标签: deep learningHow to synthesize high quality speech from mfccsStack Overflow