欢迎访问《大连理工大学学报—

文章摘要

金乃高,殷福亮.一种基于粒子滤波的双模态语音提取方法[J].,2008,(4):596-601

一种基于粒子滤波的双模态语音提取方法

Bimodal speech extraction method based on particle filtering

DOI：10.7511/dllgxb200804025

英文关键词: speech extraction particle filtering higher-order statistics maximum mutual information

基金项目:国家自然科学基金资助项目(6037208260172073).

作者	单位
金乃高,殷福亮

摘要点击次数: 1404

全文下载次数: 1064

中文摘要:

说话人的唇动信息有助于加强对语音的感知．根据说话人语音的双模态特性，将振动信息引入语音提取问题，提出了一种基于粒子滤波的贝叶斯融合架构的双模态语音提取方法．该方法融合说话人的语音和唇动信息，根据信息论中的最大互信息准则与盲源分离中的高阶统计量准则，将音视频互信息与语音峭度的乘积作为代价函数，利用粒子滤波估计混合矩阵，解决时变瞬时混合情况下的语音提取问题．仿真结果表明，该方法在低信噪比情况下仍然能够实现语音信号的有效提取．

英文摘要:

Lip movement information helps language comprehension when the auditory signal is degraded. A bimodal speech extraction method is presented based on the method of audio-visual signal processing. The particle filtering is used to construct a Bayesian fusion framework for bimodal speech extraction problem. By combining maximum mutual information criterion with higher-order statistics criterion of blind signal separation and estimating mixed matrices by particle filtering method, the proposed method can extract the interested instaneous time-varying speech signal by maximizing the product of kurtosis and audio-visual mutual information. Simulation results show that the proposed method improves the performance of the speech extraction system in the low SNR environment.

查看全文查看/发表评论下载PDF阅读器

关闭