This project is about domain adaptation in speech by adversarial network and deep neural naetwork.
My codes are Completed or in progress :
- Speaker-Verification-Wav2Vec2-ECAPA-TDNN-Voxceleb1-Deepmine.ipynb (Completed)
- Part1.ipynb (In progress)
links :
- HF course:https://huggingface.co/course/chapter2/5?fw=pt
- Speech brain github: https://github.com/speechbrain/speechbrain
- pytorch : https://www.learnpytorch.io/00_pytorch_fundamentals/
- baseline code : https://github.com/facebookresearch/fairseq
- baseline article : https://arxiv.org/abs/2110.05777
- asr like baseline : https://arxiv.org/abs/2110.00165
- result of baseline (EER in Speaker Verification) : https://github.com/microsoft/UniSpeech
new article can be the base and heppingÖ - Meta-Generalization for Domain-Invariant Speaker Verification
- MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
- LARGE-SCALE SELF-SUPERVISED SPEECH REPRESENTATION LEARNING FOR AUTOMATIC SPEAKER VERIFICATION-my base1
- End-to-end speaker segmentation for overlap-aware resegmentation
- Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification
Implementation's Hint :
0. https://colab.research.google.com/github/m3hrdadfi/notebooks/blob/main/Fine_Tune_XLSR_Wav2Vec2_on_Persian_ShEMO_ASR_with_%F0%9F%A4%97_Transformers_ipynb.ipynb (very important
- https://huggingface.co/microsoft/wavlm-base-sv
- https://huggingface.co/spaces/microsoft/wavlm-speaker-verification/blob/main/app.py
- https://huggingface.co/speechbrain/spkrec-ecapa-voxceleb
- https://huggingface.co/docs/transformers/model_doc/xlsr_wav2vec2 (Left PART IS IMPORTANT)
- https://huggingface.co/datasets/superb