Popular repositories Loading
-
Lipreading_using_Temporal_Convolutional_Networks
Lipreading_using_Temporal_Convolutional_Networks PublicForked from mpc001/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
Python
-
awesome-asr-contextualization
awesome-asr-contextualization PublicForked from stevenhillis/awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs
-
ASR_Uighur_Semi-supervised
ASR_Uighur_Semi-supervised PublicForked from lirui-cyber/ASR_Uighur_Semi-supervised
Shell
-
LSLM-Listening-while-Speaking-Language-Model
LSLM-Listening-while-Speaking-Language-Model PublicForked from sanowl/LSLM-Listening-while-Speaking-Language-Model
LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…
Python
-
streaming-llm
streaming-llm PublicForked from mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Python
-
mini-omni
mini-omni PublicForked from gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Python
If the problem persists, check the GitHub status page or contact support.