Waveforms from FIRNet (Please beware of loud volume)

| | Ground truth | FIRNet |
|---|---|---|
| Mixed excitation | N/A | |
| Residual signal | | |
| Speech waveform | | |

f0 scaling condition: × 1.00

| Original | WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|---|

f0 scaling condition: × 0.00

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 0.25

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 0.50

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 2.00

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 4.00

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 8.00

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|