马军伟,史舵,顾宏,张杰.PCA方法在蛋白质亚细胞定位中应用[J].,2012,(3):426-430 |
PCA方法在蛋白质亚细胞定位中应用 |
Application of PCA method to predicting protein subcellular location |
|
DOI:10.7511/dllgxb201203019 |
中文关键词: 蛋白质亚细胞定位 主成分分析 伪氨基酸组成 k 近邻分类器 BP神经网络 |
英文关键词: protein subcellular location principal component analysis pseudo-amino acid composition k -NN classifier BP neural network |
基金项目: |
|
摘要点击次数: 1940 |
全文下载次数: 1203 |
中文摘要: |
蛋白质的亚细胞定位与其生物功能密切相关,蛋白质数据库急剧膨胀,迫切需要设计出功能强大的高吞吐量的算法来预测蛋白质的亚细胞位置.许多预测工具都是基于伪氨基酸组成构建而成,应用一种数据分析方法——主成分分析 (PCA)法,确定能反映序列次序效应的最优 λ 值.首先让 λ 取最大以包含尽可能多的序列次序信息,然后利用主成分分析法提取关键主特征.实验结果表明此方法能解决确定最优 λ 值困难的问题,且性能优于已有的预测工具. |
英文摘要: |
The location of a protein subcellular is closely correlated with its biological function. With the rapid expansion of protein databases, it is very important to design a powerful high-throughput algorithm for predicting protein subcellular location. Many prediction tools have been designed based on the pseudo-amino acid composition, and a data analysis method, principal component analysis (PCA) method, is applied to determining in advance the optimal value of λ which reflects sequence order effects. Firstly, the parameter λ is set to the maximum to contain more sequence order information; then, PCA is employed to extract the essential features. Experimental results show that the proposed method solves the above problem, and its performance is better than those of other predictors. |
查看全文
查看/发表评论 下载PDF阅读器 |
关闭 |
|
|
|