文章摘要
金乃高,殷福亮.一种基于粒子滤波的双模态语音提取方法[J].,2008,(4):596-601
一种基于粒子滤波的双模态语音提取方法
Bimodal speech extraction method based on particle filtering
  
DOI:10.7511/dllgxb200804025
中文关键词: 语音提取  粒子滤波  高阶统计量  最大互信息
英文关键词: speech extraction  particle filtering  higher-order statistics  maximum mutual information
基金项目:国家自然科学基金资助项目(6037208260172073).
作者单位
金乃高,殷福亮  
摘要点击次数: 1404
全文下载次数: 1064
中文摘要:
      说话人的唇动信息有助于加强对语音的感知.根据说话人语音的双模态特性,将振动信息引入语音提取问题,提出了一种基于粒子滤波的贝叶斯融合架构的双模态语音提取方法.该方法融合说话人的语音和唇动信息,根据信息论中的最大互信息准则与盲源分离中的高阶统计量准则,将音视频互信息与语音峭度的乘积作为代价函数,利用粒子滤波估计混合矩阵,解决时变瞬时混合情况下的语音提取问题.仿真结果表明,该方法在低信噪比情况下仍然能够实现语音信号的有效提取.
英文摘要:
      Lip movement information helps language comprehension when the auditory signal is degraded. A bimodal speech extraction method is presented based on the method of audio-visual signal processing. The particle filtering is used to construct a Bayesian fusion framework for bimodal speech extraction problem. By combining maximum mutual information criterion with higher-order statistics criterion of blind signal separation and estimating mixed matrices by particle filtering method, the proposed method can extract the interested instaneous time-varying speech signal by maximizing the product of kurtosis and audio-visual mutual information. Simulation results show that the proposed method improves the performance of the speech extraction system in the low SNR environment.
查看全文   查看/发表评论  下载PDF阅读器
关闭