RAVEN: Real-Time Audio-Visual Speech Enhancement Using Pre-trained Visual Representations
[Repo]
This is the landing page of the paper Real-Time Audio-Visual Speech Enhancement Using Pre-trained Visual Representations, accepted at Interspeech 2025.
Demo Video