1. data preprocess

Decompose speech into content, style and speaker

Quick look

train

train_ddp_dataloader_segment_submit_success_v21.py

model

speechdecompose_dataloader_segment.py

loss

loss_dataloader_segment.py

inference

convert_batch_using_fs2vocoder_denorm.py

1. data preprocess

extract mel spectrogram

Following ming024's FastSpeech2, python3 preprocess.py config/LJSpeech/preprocess.yaml

extract d-vector

Following liusongxiang's ppg-vc, python3 3_compute_spk_dvecs_no_flatten.py

verify the d-vector using t-sne

2 train

local GCR debug : python3 -m torch.distributed.launch train_ddp_dataloader_segment_submit.py --model-dir ./model_debug_dir --log-dir ./log_debug_dir -p config/VCTK/preprocess.yaml -m config/VCTK/model.yaml -t config/VCTK/train.yaml

WebIDE :

dataloader: segment 128 (following Wendison's VQMIVC)

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
Dataset/VCTK-Corpus		Dataset/VCTK-Corpus
audio		audio
config/VCTK		config/VCTK
datasets		datasets
model		model
preprocessed_data		preprocessed_data
preprocessor		preprocessor
speaker_encoder		speaker_encoder
transformer		transformer
utils		utils
vocoder		vocoder
.amltconfig		.amltconfig
.amltignore		.amltignore
.gitignore		.gitignore
3_compute_spk_dvecs_no_flatten.py		3_compute_spk_dvecs_no_flatten.py
README.md		README.md
before_ptconfig		before_ptconfig
before_ptignore		before_ptignore
convert_batch_using_fs2vocoder.py		convert_batch_using_fs2vocoder.py
convert_batch_using_fs2vocoder_denorm.py		convert_batch_using_fs2vocoder_denorm.py
convert_batch_vqmivc_fail.py		convert_batch_vqmivc_fail.py
evaluate.py		evaluate.py
evaluate_dataloader_segment.py		evaluate_dataloader_segment.py
preprocess.py		preprocess.py
read_npy.py		read_npy.py
submit_train_dist_amlt.sh		submit_train_dist_amlt.sh
submit_train_dist_pt.sh		submit_train_dist_pt.sh
synthesize_fastspeech2.py		synthesize_fastspeech2.py
test.png		test.png
test_vocoder.py		test_vocoder.py
train_ddp.py		train_ddp.py
train_ddp_dataloader_segment.py		train_ddp_dataloader_segment.py
train_ddp_dataloader_segment_submit.py		train_ddp_dataloader_segment_submit.py
train_ddp_dataloader_segment_submit_singlemel.py		train_ddp_dataloader_segment_submit_singlemel.py
train_ddp_dataloader_segment_submit_success_v21.py		train_ddp_dataloader_segment_submit_success_v21.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Quick look

train

model

loss

inference

1. data preprocess

extract mel spectrogram

extract d-vector

verify the d-vector using t-sne

2 train

Reference

About

Releases

Packages

Languages

inconnu11/SpeechDecompose

Folders and files

Latest commit

History

Repository files navigation

Quick look

train

model

loss

inference

1. data preprocess

extract mel spectrogram

extract d-vector

verify the d-vector using t-sne

2 train

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages