文章摘要
马彪,周瑜,贺建军.面向大规模类不平衡数据的变分高斯过程分类算法[J].,2016,56(3):279-284
面向大规模类不平衡数据的变分高斯过程分类算法
Variational Gaussian process classification algorithm for large-scale class-imbalanced data
  
DOI:10.7511/dllgxb201603009
中文关键词: 类不平衡问题  高斯过程  变分推理  大规模数据分类
英文关键词: class-imbalanced problem  Gaussian process  variational inference  large-scale data classification
基金项目:国家自然科学基金资助项目(6150305861374170);辽宁省自然科学基金资助项目(20150200842015020099);辽宁省教育厅科学技术研究项目(L2014540L2015127);中央高校基本科研业务费专项资金资助项目(DC201501055DC201501060201).
作者单位
马彪,周瑜,贺建军  
摘要点击次数: 990
全文下载次数: 1097
中文摘要:
      变分高斯过程分类器是最近提出的一种较有效的面向大规模数据的快速核分类算法,其在处理类不平衡问题时,对少数类样本的预测精度通常会较低.针对此问题,通过在似然函数中引入指数权重系数和构造包含相同数目正负类样本的诱导子集解决原始算法的分类面向少数类偏移的问题,建立了一种可以有效处理大规模类不平衡问题的改进变分高斯过程分类算法.在10个大规模UCI数据集上的实验结果表明,改进算法在类不平衡问题上的精度较原始算法得到大幅提高.
英文摘要:
      Variational Gaussian process classifier is an effective fast kernel algorithm proposed recently for large-scale data classification. However, for the class-imbalanced problem, it usually achieves lower accuracy on the samples of minority class. By assigning different index weight coefficients to the likelihood functions and constructing an inducing set containing equal numbers of positive and negative samples to avoid hyperplane biased toward the side of minority class, an improved variational Gaussian process classification algorithm is proposed, which can deal with the large-scale class-imbalanced problem effectively. The experimental results of ten large-scale UCI datasets show that the proposed algorithm can achieve much higher accuracy than the original one for class-imbalanced problem.
查看全文   查看/发表评论  下载PDF阅读器
关闭