金乃高,殷福亮.一种基于粒子滤波的双模态语音提取方法[J].,2008,(4):596-601 |
一种基于粒子滤波的双模态语音提取方法 |
Bimodal speech extraction method based on particle filtering |
|
DOI:10.7511/dllgxb200804025 |
中文关键词: 语音提取 粒子滤波 高阶统计量 最大互信息 |
英文关键词: speech extraction particle filtering higher-order statistics maximum mutual information |
基金项目:国家自然科学基金资助项目(6037208260172073). |
|
摘要点击次数: 1404 |
全文下载次数: 1064 |
中文摘要: |
说话人的唇动信息有助于加强对语音的感知.根据说话人语音的双模态特性,将振动信息引入语音提取问题,提出了一种基于粒子滤波的贝叶斯融合架构的双模态语音提取方法.该方法融合说话人的语音和唇动信息,根据信息论中的最大互信息准则与盲源分离中的高阶统计量准则,将音视频互信息与语音峭度的乘积作为代价函数,利用粒子滤波估计混合矩阵,解决时变瞬时混合情况下的语音提取问题.仿真结果表明,该方法在低信噪比情况下仍然能够实现语音信号的有效提取. |
英文摘要: |
Lip movement information helps language comprehension when the auditory signal is degraded. A bimodal speech extraction method is presented based on the method of audio-visual signal processing. The particle filtering is used to construct a Bayesian fusion framework for bimodal speech extraction problem. By combining maximum mutual information criterion with higher-order statistics criterion of blind signal separation and estimating mixed matrices by particle filtering method, the proposed method can extract the interested instaneous time-varying speech signal by maximizing the product of kurtosis and audio-visual mutual information. Simulation results show that the proposed method improves the performance of the speech extraction system in the low SNR environment. |
查看全文
查看/发表评论 下载PDF阅读器 |
关闭 |