RAVEN

RAVEN: Real-Time Audio-Visual Speech Enhancement Using Pre-trained Visual Representations

[Repo]

This is the landing page for the paper "Real-Time Audio-Visual Speech Enhancement Using Pre-trained Visual Representations", accepted at Interspeech 2025.

Demo Video