a list of speech frontend, such as speech enhancement\speech seperation\sound source localization
-
Tasnet: time-domain audio separation network for real-time, single-channel speech separation [Code]
-
Conv-TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation [Code]
-
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation [Code1] [Code2]
-
DANet:Deep Attractor Network (DANet) for single-channel speech separation [Code]
-
TAC end-to-end microphone permutation and number invariant multi-channel speech separation [Code]
-
uPIT-for-speech-separation:Speech separation with utterance-level PIT [Code]
-
LSTM_PIT_Speech_Separation [Code]
-
asteroid:The PyTorch-based audio source separation toolkit for researchers[PDF][Code]
-
sound separation(Google) [Code]
-
sound separation: Deep learning based speech source separation using Pytorch [Code]
-
ONSSEN: An Open-source Speech Separation and Enhancement Library [Code]
-
music-source-separation [Code]
-
Singing-Voice-Separation [Code]
-
Comparison-of-Blind-Source-Separation-techniques[Code]
-
FastICA[Code]
-
A localisation- and precedence-based binaural separation algorithm[Download]
- IRM-based-Speech-Enhancement-using-LSTM [Code]
- nn-irm [Code]
- Speech Enhancement Using a Two-Stage Network for an Efficient Boosting Strategy [Code][PDF]
- SETK: Speech Enhancement Tools integrated with Kaldi [Code]
- sednn:deep_learning_for_speech_enhancement_keras_python [Code]
- Speech_Enhancement_DNN_NMF [Code]
- Deep-Learning-for-Speech-Enhancement [Code]
- gcc-nmf:Real-time GCC-NMF Blind Speech Separation and Enhancement [Code]
- TensorFlow-speech-enhancement-Chinese [Code]
- DNN-Speech-enhancement-demo-tool [Code]
- CNN-for-single-channel-speech-enhancement [Code]
- rnn-speech-denoising [Code]
- DNN-SpeechEnhancement [Code]
- segan_pytorch [Code]
- PHASEN[Code]
- TCNSE [Code]
- pb_chime5:Speech enhancement system for the CHiME-5 dinner party scenario [Code]
- A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement [Code] [PDF]
- speech enhancement toolkit[Code]
- dereverberation Single-Channel Dereverberation in Matlab [Code]
- speech_dereverbaration_using_lp_residual Single Channel Speech Dereverbaration using LP Residual [Code]
- dereverberate [Code]
- dereverberation-and-denoising:Supervised Speech Dereverberation in Noisy Environments using Exemplar-based Sparse Representations [Code]
- DNN_WPE [Code]
- nara_wpe:Different implementations of "Weighted Prediction Error" for speech dereverberation [Code]
- MASP:Microphone Array Speech Processing [Code]
- BeamformingSpeechEnhancer [Code]
- DNN_Localization_And_Separation [Code]
- nn-gev:Neural network supported GEV beamformerCHiME3 [Code]
- chime4-nn-mask:Implementation of NN based mask estimator in pytorch(reuse some programming from nn-gev)[Code]
- beamformit_matlab:A MATLAB implementation of CHiME4 baseline Beamformit [Code]
- pb_chime5:Speech enhancement system for the CHiME-5 dinner party scenario [Code]
- beamformit:麦克风阵列算法 [Code]
- Beamforming-for-speech-enhancement [Code]
- binauralLocalization [Code]
- robotaudition_examples:Some Robot Audition simplified examples (sound source localization and separation), coded in Octave/Matlab [Code]
- WSCM-MUSIC [Code]
- doa-tools [Code]
- Regression and Classification for Direction-of-Arrival Estimation with Convolutional Recurrent Neural Networks [Code] [PDF]
- messl:Model-based EM Source Separation and Localization [Code]
- messlJsalt15:MESSL wrappers etc for JSALT 2015, including CHiME3 [Code]
- fast_sound_source_localization_using_TLSSC:Fast Sound Source Localization Using Two-Level Search Space Clustering [Code]
- Binaural-Auditory-Localization-System [Code]
- Binaural_Localization:ITD-based localization of sound sources in complex acoustic environments [Code]
- Dual_Channel_Beamformer_and_Postfilter [Code]
- 麦克风声源定位 [Code]
- RTF-based-LCMV-GSC [Code]
- DOA [Code]
- sed_eval - Evaluation toolbox for Sound Event Detection [Code]
- Benchmark for sound event localization task of DCASE 2019 challenge [Code]
- sed-crnn DCASE 2017 real-life sound event detection winning method. [Code]
- seld-net [Code]
- AKtools:the open software toolbox for signal acquisition, processing, and inspection in acoustics [SVN Code](username: aktools; password: ak)
- separation_data_preparation[Code]
- MatlabToolbox [Code]
- athena-signal [[Code]](https://github.com/athena-team/athena-signal)
- python_speech_features [Code]
- speechFeatures:语音处理,声源定位中的一些基本特征 [Code]
- sap-voicebox [Code]
- Calculate-SNR-SDR [Code]
- RIR-Generator [Code]
- Python library for Room Impulse Response (RIR) simulation with GPU acceleration [Code]
- ROOMSIM:binaural image source simulation [Code]
- binaural-image-source-model [Code]
- PESQ [Code]
- CCF语音对话与听觉专业组语音对话与听觉前沿研讨会[Link]