Knowledge Management System of Northwest Institute of Plateau Biology, CAS
Sequence-based protein-protein interaction prediction via support vector machine | |
Wang, Yongcui1,2; Wang, Jiguang3; Yang, Zhixia4; Deng, Naiyang1 | |
2010-10-01 | |
发表期刊 | JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY |
ISSN | 1009-6124 |
卷号 | 23期号:5页码:1012-1023 |
文章类型 | Article |
摘要 | This paper develops sequence-based methods for identifying novel protein-protein interactions (PPIs) by means of support vector machines (SVMs). The authors encode proteins ont only in the gene level but also in the amino acid level, and design a procedure to select negative training set for dealing with the training dataset imbalance problem, i.e., the number of interacting protein pairs is scarce relative to large scale non-interacting protein pairs. The proposed methods are validated on PPIs data of Plasmodium falciparum and Escherichia coli, and yields the predictive accuracy of 93.8% and 95.3%, respectively. The functional annotation analysis and database search indicate that our novel predictions are worthy of future experimental validation. The new methods will be useful supplementary tools for the future proteomics studies.; This paper develops sequence-based methods for identifying novel protein-protein interactions (PPIs) by means of support vector machines (SVMs). The authors encode proteins ont only in the gene level but also in the amino acid level, and design a procedure to select negative training set for dealing with the training dataset imbalance problem, i.e., the number of interacting protein pairs is scarce relative to large scale non-interacting protein pairs. The proposed methods are validated on PPIs data of Plasmodium falciparum and Escherichia coli, and yields the predictive accuracy of 93.8% and 95.3%, respectively. The functional annotation analysis and database search indicate that our novel predictions are worthy of future experimental validation. The new methods will be useful supplementary tools for the future proteomics studies. |
关键词 | Imbalance Problem Protein-protein Interactions Sequence-based Support Vector Machine |
WOS标题词 | Science & Technology ; Physical Sciences |
关键词[WOS] | AMINO-ACID-COMPOSITION ; INTERACTION NETWORK ; ESCHERICHIA-COLI ; DATABASE ; COMPLEXES ; RESOURCE ; UPDATE |
收录类别 | SCI ; ISTP |
语种 | 英语 |
WOS研究方向 | Mathematics |
WOS类目 | Mathematics, Interdisciplinary Applications |
WOS记录号 | WOS:000284074000014 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://210.75.249.4/handle/363003/1650 |
专题 | 中国科学院西北高原生物研究所 |
作者单位 | 1.China Agr Univ, Coll Sci, Beijing 100083, Peoples R China 2.Chinese Acad Sci, NW Inst Plateau Biol, Key Lab Adaptat & Evolut Plateau Biota, Xining 810008, Peoples R China 3.Chinese Acad Sci, Acad Math & Syst Sci, Inst Syst Sci, Beijing 100190, Peoples R China 4.Xinjiang Univ, Coll Math & Syst Sci, Urumuchi 830046, Peoples R China |
推荐引用方式 GB/T 7714 | Wang, Yongcui,Wang, Jiguang,Yang, Zhixia,et al. Sequence-based protein-protein interaction prediction via support vector machine[J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY,2010,23(5):1012-1023. |
APA | Wang, Yongcui,Wang, Jiguang,Yang, Zhixia,&Deng, Naiyang.(2010).Sequence-based protein-protein interaction prediction via support vector machine.JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY,23(5),1012-1023. |
MLA | Wang, Yongcui,et al."Sequence-based protein-protein interaction prediction via support vector machine".JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY 23.5(2010):1012-1023. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论