Decompose speech into content, style and speaker
train_ddp_dataloader_segment_submit_success_v21.py
speechdecompose_dataloader_segment.py
convert_batch_using_fs2vocoder_denorm.py
Following ming024's FastSpeech2, python3 preprocess.py config/LJSpeech/preprocess.yaml
Following liusongxiang's ppg-vc, python3 3_compute_spk_dvecs_no_flatten.py
local GCR debug : python3 -m torch.distributed.launch train_ddp_dataloader_segment_submit.py --model-dir ./model_debug_dir --log-dir ./log_debug_dir -p config/VCTK/preprocess.yaml -m config/VCTK/model.yaml -t config/VCTK/train.yaml
WebIDE :
dataloader: segment 128 (following Wendison's VQMIVC)