Evaluation methods in speech synthesis (including VC)

subjective

Objective evaluation in speech synthesis

display overall

image 👉https://arxiv.org/pdf/2104.00355.pdf

definition in papers

MSD, F0 RMSE, F0 corr, GPE, FPE image

MCD, GPE, VDE, FFE MCD

training

MCD

definition

image http://www1.se.cuhk.edu.hk/~hccl/publications/pub/2016_paper_297.pdf section 4.2 image

code

mcd (34 mcep), other implementation
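As a rough illustration of the commonly used MCD formula, (10 / ln 10) * sqrt(2 * sum_d (c_d - c'_d)^2) averaged over frames, here is a minimal sketch in plain Python. It assumes the two mel-cepstrum sequences are already time-aligned (for example by DTW) and have the same number of frames and coefficients. The function name `mel_cepstral_distortion` and the `skip_c0` flag are illustrative choices, not taken from the linked implementations.

```python
import math


def mel_cepstral_distortion(ref_frames, syn_frames, skip_c0=True):
    """Mean MCD in dB over time-aligned frames of mel-cepstral coefficients.

    ref_frames, syn_frames: sequences of frames, each frame a sequence of
    coefficients. Alignment (e.g. DTW) is assumed to have been done already.
    """
    if len(ref_frames) != len(syn_frames) or len(ref_frames) == 0:
        raise ValueError("expected two non-empty, equally long frame sequences")

    # c0 mostly reflects energy, so it is often left out of the distance.
    start = 1 if skip_c0 else 0
    scale = 10.0 / math.log(10.0)

    total = 0.0
    for ref, syn in zip(ref_frames, syn_frames):
        sq_sum = sum((r - s) ** 2 for r, s in zip(ref[start:], syn[start:]))
        total += scale * math.sqrt(2.0 * sq_sum)
    return total / len(ref_frames)
```

Whether c0 is included and how the frames are aligned both change the reported value, so numbers are only comparable between systems evaluated with the same settings.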

normal value range

log dB 4.4

F0 RMSE / corr / VDE / FFE

GPE

normal value range

RMSE 22.386
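The F0-related metrics can be sketched together, since they share the same per-frame comparisons. This is a minimal illustration under stated assumptions: both F0 tracks are in Hz, have the same length, and mark unvoiced frames with 0; the gross-error threshold of 20% is the commonly used value. The function name `f0_metrics` and the returned dictionary keys are hypothetical.

```python
import math


def f0_metrics(ref_f0, syn_f0, gross_threshold=0.2):
    """Compute F0 RMSE, F0 correlation, GPE, VDE and FFE for two F0 tracks.

    Assumes equal-length tracks in Hz where a value of 0 means "unvoiced".
    """
    if len(ref_f0) != len(syn_f0) or len(ref_f0) == 0:
        raise ValueError("expected two non-empty, equally long F0 tracks")

    pairs = list(zip(ref_f0, syn_f0))
    # Frames that both tracks consider voiced.
    voiced = [(r, s) for r, s in pairs if r > 0 and s > 0]
    # Frames where the voiced/unvoiced decisions disagree.
    voicing_errors = sum(1 for r, s in pairs if (r > 0) != (s > 0))
    # Voiced frames whose F0 deviates by more than the threshold (relative).
    gross_errors = sum(1 for r, s in voiced if abs(s - r) > gross_threshold * r)

    nan = float("nan")
    n_voiced = len(voiced)
    if n_voiced:
        rmse = math.sqrt(sum((s - r) ** 2 for r, s in voiced) / n_voiced)
        mean_r = sum(r for r, _ in voiced) / n_voiced
        mean_s = sum(s for _, s in voiced) / n_voiced
        cov = sum((r - mean_r) * (s - mean_s) for r, s in voiced)
        var_r = sum((r - mean_r) ** 2 for r, _ in voiced)
        var_s = sum((s - mean_s) ** 2 for _, s in voiced)
        # Pearson correlation is undefined when either track is constant.
        corr = cov / math.sqrt(var_r * var_s) if var_r > 0 and var_s > 0 else nan
        gpe = gross_errors / n_voiced
    else:
        rmse = corr = gpe = nan

    return {
        "f0_rmse": rmse,                                      # over frames voiced in both
        "f0_corr": corr,                                      # over frames voiced in both
        "gpe": gpe,                                           # share of voiced frames
        "vde": voicing_errors / len(pairs),                   # share of all frames
        "ffe": (voicing_errors + gross_errors) / len(pairs),  # share of all frames
    }
```

FFE counts a frame as wrong if it has either a voicing error or a gross pitch error, which is why it is normalized by the total number of frames rather than by the voiced frames only.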

VUV

test

common

content

PER, WER
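Both PER and WER are edit-distance error rates: the number of substitutions, deletions and insertions needed to turn the recognized sequence into the reference, divided by the reference length. A minimal sketch, assuming the synthesized speech has already been transcribed by an ASR system (not shown) and tokenized into words or phonemes; the names `edit_distance` and `error_rate` are illustrative.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(ref_tokens, hyp_tokens):
    """WER when tokens are words, PER when tokens are phonemes."""
    if len(ref_tokens) == 0:
        raise ValueError("reference must not be empty")
    return edit_distance(ref_tokens, hyp_tokens) / len(ref_tokens)


# Usage: one substitution and one deletion against a six-word reference.
wer = error_rate("the cat sat on the mat".split(), "the cat sit on mat".split())
```

Because insertions count as errors, the rate can exceed 1.0 when the hypothesis is much longer than the reference.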

speaker similarity

https://github.com/resemble-ai/Resemblyzer tsne
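Speaker similarity is usually reported as the cosine similarity between speaker embeddings of the converted utterance and a reference utterance of the target speaker. The sketch below covers only the comparison step and assumes the embeddings have already been extracted by a speaker encoder such as Resemblyzer; the function name `cosine_similarity` is illustrative.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two speaker embeddings of equal dimension."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)
```

Values closer to 1 indicate that the encoder places the two utterances near each other in speaker space; the same embeddings can also be projected with t-SNE to inspect speaker clusters visually.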

in the paper "Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss": image