Opine: Leveraging an Optimization-Inspired Deep Unfolding Method for Multi-Channel Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (2024)
Abstract
Proximal gradient theory has demonstrated its superiority in the compressive sensing field for complex signal recovery. As an early trial in the speech front-end field, we propose OPINE, an optimization-inspired deep unfolding framework that simulates the traditional iterative optimization process for multi-channel speech enhancement. Specifically, we formulate the joint optimization of beamforming weights and target speech using the Bayesian maximum a posteriori (MAP) criterion. By splitting the problem and introducing the proximal gradient descent method, the original problem can be reformulated as the alternating solution of two sub-problems. Furthermore, we propose to generalize the proximal function into NN-based modules, enabling end-to-end learning from massive training data. The experiments are conducted on the spatialized LibriSpeech dataset, and quantitative results show that the proposed method achieves performance comparable to existing advanced baselines.
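To make the idea of unfolding a proximal gradient iteration concrete, the following is a minimal sketch on a toy real-valued least-squares problem. It is not the OPINE model: the function names (`unfolded_proximal_gradient`, `soft_threshold`), the toy matrix, and the choice of an l1 soft-threshold as the proximal operator are all assumptions made for illustration. In a deep unfolding framework such as the one described above, the hand-written `prox` would be replaced by a learned NN-based module and the loop would run for a fixed number of stages.

```python
from typing import Callable, List

Vector = List[float]
Matrix = List[List[float]]


def matvec(a: Matrix, x: Vector) -> Vector:
    """Multiply matrix a by vector x."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in a]


def transpose(a: Matrix) -> Matrix:
    """Return the transpose of matrix a."""
    return [list(col) for col in zip(*a)]


def soft_threshold(v: Vector, t: float) -> Vector:
    """Proximal operator of t * ||.||_1 (illustrative stand-in for a learned module)."""
    return [max(abs(v_i) - t, 0.0) * (1.0 if v_i >= 0 else -1.0) for v_i in v]


def unfolded_proximal_gradient(
    a: Matrix,
    y: Vector,
    prox: Callable[[Vector, float], Vector],
    step: float,
    num_stages: int,
) -> Vector:
    """Run a fixed number of proximal gradient stages for 0.5*||a x - y||^2 + g(x).

    Each stage alternates between two sub-steps:
      1. a gradient step on the data-fidelity term, and
      2. a proximal step that handles the prior term g.
    """
    x = [0.0] * len(a[0])
    a_t = transpose(a)
    for _ in range(num_stages):
        residual = [p - q for p, q in zip(matvec(a, x), y)]
        grad = matvec(a_t, residual)  # gradient of 0.5*||a x - y||^2
        z = [x_i - step * g_i for x_i, g_i in zip(x, grad)]
        x = prox(z, step)  # the sub-step a learned module would replace
    return x


# Toy usage: identity observation matrix and an l1 prior with weight lam.
lam = 0.1
x_hat = unfolded_proximal_gradient(
    a=[[1.0, 0.0], [0.0, 1.0]],
    y=[3.0, 0.05],
    prox=lambda v, s: soft_threshold(v, lam * s),
    step=1.0,
    num_stages=10,
)
```

Passing `prox` as a callable keeps the gradient step and the prior step separate, which mirrors the alternating two-sub-problem structure: only the second sub-step changes when the hand-crafted operator is swapped for a trained one.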
Key words
multi-channel speech enhancement, optimization-inspired, proximal gradient descent, deep learning