Research intern talk: Unified speech enhancement approach for speech degradation & noise suppression  @MicrosoftResearch
Microsoft Research | Uploaded November 2023 | Updated October 2024
Speakers: Khandokar Md. Nayem
Host: Sebastian Braun

Speech enhancement approaches generally focus on removing additive noise and reverberation, which adversely affect overall speech quality and intelligibility. Another group of signal degradations, such as clipping, bandwidth limitation, and codec degradation, can occur due to poor recording hardware, network transmission, and other pre-processing. These degradations also substantially reduce intelligibility and speech quality. In this work, we use a convolutional recurrent network to remove these speech degradations in conjunction with the noise suppression task, and we propose cascaded and end-to-end approaches. We compare complex mask estimation and direct spectrum estimation for this task using a small, real-time capable DNN. Overall, we propose a cascaded processing approach that addresses the distortion types differently and enables task-tailored, modular processing.
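
As a rough illustration of two ideas mentioned in the abstract, the sketch below simulates hard clipping on a few waveform samples and shows what applying a complex mask to spectrum bins means: each noisy bin is multiplied by a complex mask value, whereas direct spectrum estimation would have the network output the enhanced bins themselves. This is not the implementation from the talk. The function names (`hard_clip`, `ideal_complex_mask`, `apply_complex_mask`) and the toy values are assumptions chosen for illustration, and the "ideal" mask is computed from a known clean signal, which a real system would have to estimate with a DNN.

```python
def hard_clip(samples, threshold):
    """Simulate clipping: limit every sample to [-threshold, threshold]."""
    return [max(-threshold, min(threshold, x)) for x in samples]


def ideal_complex_mask(clean_spec, noisy_spec, eps=1e-12):
    """Oracle complex mask M = S / Y per bin (hypothetical helper).

    Bins whose noisy magnitude is below eps get a zero mask value
    to avoid dividing by (almost) zero.
    """
    return [s / y if abs(y) > eps else 0j
            for s, y in zip(clean_spec, noisy_spec)]


def apply_complex_mask(noisy_spec, mask):
    """Complex masking: enhanced bin = mask value * noisy bin."""
    return [m * y for m, y in zip(mask, noisy_spec)]


if __name__ == "__main__":
    # Toy complex spectrum bins (illustrative values, not real speech).
    clean = [1 + 2j, 0.5 - 1j, -0.3 + 0.1j]
    noise = [0.2 - 0.1j, -0.4 + 0.3j, 0.1 + 0.1j]
    noisy = [s + n for s, n in zip(clean, noise)]

    mask = ideal_complex_mask(clean, noisy)
    enhanced = apply_complex_mask(noisy, mask)
    print("max error:", max(abs(e - s) for e, s in zip(enhanced, clean)))

    # Clipping flattens the peaks of a waveform.
    print(hard_clip([0.2, 0.9, -1.4], threshold=0.8))
```

Because the mask is complex-valued, it can change both the magnitude and the phase of each bin, which a real-valued magnitude mask cannot do.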

Learn more: microsoft.com/en-us/research/video/research-intern-talk-unified-speech-enhancement-approach-for-speech-degradations-noise-suppression