Lip2Speech: Lightweight Multi-Speaker Speech Reconstruction with Gabor Features
In environments characterised by noise or the absence of audio signals, visual cues, notably facial and lip movements, serve as valuable substitutes for missing or corrupted speech signals. In these scenarios, speech reconstruction can potentially generate speech from visual data. Recent advancements in this domain have predominantly r