Waveforms from FIRNet (Please beware of loud volume)

| | Ground truth | FIRNet |
|---|---|---|
| Mixed excitation | N/A | |
| Residual signal | | |
| Speech waveform | | |

f0 scaling condition: × 1.00

| Original | WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|---|

f0 scaling condition: × 0.00

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 0.25

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 0.50

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 2.00

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 4.00

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|

f0 scaling condition: × 8.00

| WORLD | SiFi-GAN (trained with 1000 utt.) | FIRNet (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 1000 utt.) | FIRNet w/ UnivNet disc. (trained with 18858 utt.) |
|---|---|---|---|---|