csf123123

csf csf123123

Popular repositories Loading

Lipreading_using_Temporal_Convolutional_Networks Lipreading_using_Temporal_Convolutional_Networks Public

Forked from mpc001/Lipreading_using_Temporal_Convolutional_Networks

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Python
awesome-asr-contextualization awesome-asr-contextualization Public

Forked from stevenhillis/awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs
ASR_Uighur_Semi-supervised ASR_Uighur_Semi-supervised Public

Forked from lirui-cyber/ASR_Uighur_Semi-supervised

Shell
LSLM-Listening-while-Speaking-Language-Model LSLM-Listening-while-Speaking-Language-Model Public

Forked from sanowl/LSLM-Listening-while-Speaking-Language-Model

LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…

Python
streaming-llm streaming-llm Public

Forked from mit-han-lab/streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python
mini-omni mini-omni Public

Forked from gpt-omni/mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python