Hello, I ran into an issue while trying to use a different English dataset. Why does inference from the packed test set work (CUDA_VISIBLE_DEVICES=0 python tasks/run.py --config usr/configs/midi/e2e/opencpop/ds100_adj_rel.yaml --exp_name $MY_DS_EXP_NAME --reset --infer), while inference from raw input (python inference/svs/ds_e2e.py --config usr/configs/midi/e2e/opencpop/ds100_adj_rel.yaml --exp_name $MY_DS_EXP_NAME) requires the same phoneme set size?
#74
Comments
Same issue.
When using our configs on your dataset, please check that "binary_data_dir" in hparams points to your binarized data directory, because the phoneme dictionary text file determines the dimension of phone_encoder in the model.
So, pointing "binary_data_dir" to our own binarized data should change the dimension of phone_encoder to fit our model?
Sorry, I may have misunderstood your issue. If you want to infer from our pretrained ckpt, please make sure your phoneme dictionary is exactly the same as ours, because some layers in the pretrained ckpt depend on it. Otherwise, the phoneme units may be encoded incorrectly because the dictionaries differ.
If you want to use a custom phoneme dictionary, please follow our guidance and re-run the training.
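A quick way to check the dictionary mismatch described above is to compare the phoneme list used for training against the one used at inference. The following is only a minimal sketch: the function name and the two phoneme lists are hypothetical, and it assumes each dictionary can be read into a plain list of phoneme strings whose order determines the encoded IDs.

```python
def compare_phone_sets(trained, inference):
    """Report how two phoneme lists differ (hypothetical helper)."""
    trained_set, inference_set = set(trained), set(inference)
    return {
        # Phonemes the trained model knows but the inference dictionary lacks.
        "only_in_trained": sorted(trained_set - inference_set),
        # Phonemes the inference dictionary has but the model never saw.
        "only_in_inference": sorted(inference_set - trained_set),
        # A size mismatch changes the embedding dimension the model expects.
        "same_size": len(trained) == len(inference),
        # Even with equal sets, a different order maps phonemes to other IDs.
        "same_order": list(trained) == list(inference),
    }


# Made-up example data, not taken from any real dictionary file.
trained_phones = ["AA", "AE", "AH", "SP"]
inference_phones = ["a", "ai", "an", "ang", "SP"]

report = compare_phone_sets(trained_phones, inference_phones)
for name, value in report.items():
    print(f"{name}: {value}")
```

If "same_size" or "same_order" comes back False, the checkpoint and the inference script are most likely not using the same dictionary.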
We did that but ran into the issue above. We retrained FFT and DiffSinger, and when we try to chain them in sequence, the error above is shown. Can you point us to where the model is defined so we can debug what is causing this issue? We can't pinpoint what requires the missing keys.
We followed this tutorial (https://github.com/MoonInTheRiver/DiffSinger/blob/master/docs/README-SVS.md) to retrain with an English dataset, but when we connect FFT and DiffSinger, the error above is reported.
When we retrain (using a different phoneme dimension) and ignore the phonemes, the validation script can generate singing voice that resembles the new data, but the inference script doesn't work.
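One way to narrow down which keys are reported missing is to diff the parameter names and shapes the model expects against those stored in the checkpoint. This is a hedged sketch only: it assumes the error is a parameter-name or shape mismatch at load time, and all key names and shapes below are invented for illustration. With PyTorch, the expected shapes could be collected with something like `{k: tuple(v.shape) for k, v in model.state_dict().items()}`; how the checkpoint file itself is laid out is not known here.

```python
def diff_state_keys(model_shapes, ckpt_shapes):
    """Compare two {parameter_name: shape} mappings (hypothetical helper)."""
    # Parameters the model expects but the checkpoint does not provide.
    missing = sorted(k for k in model_shapes if k not in ckpt_shapes)
    # Parameters stored in the checkpoint that the model does not define.
    unexpected = sorted(k for k in ckpt_shapes if k not in model_shapes)
    # Parameters present on both sides whose shapes disagree,
    # e.g. an embedding sized by a different phoneme dictionary.
    mismatched = sorted(
        k for k in model_shapes
        if k in ckpt_shapes and model_shapes[k] != ckpt_shapes[k]
    )
    return missing, unexpected, mismatched


# Invented names and shapes, for illustration only.
model_shapes = {
    "encoder.embed.weight": (62, 256),
    "decoder.proj.weight": (80, 256),
    "pitch.embed.weight": (300, 256),
}
ckpt_shapes = {
    "encoder.embed.weight": (45, 256),
    "decoder.proj.weight": (80, 256),
}

missing, unexpected, mismatched = diff_state_keys(model_shapes, ckpt_shapes)
print("missing:", missing)
print("unexpected:", unexpected)
print("mismatched:", mismatched)
```

A mismatched embedding row count would point back to the phoneme dictionary, while missing keys would suggest that the inference script builds a different model than the one that was trained.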
They are in the modules/***.
Originally posted by @Wayne-wonderai in #29 (comment)