NWIPB OpenIR
Interrogating noise in protein sequences from the perspective of protein-protein interactions prediction
Wang, Yongcui ; Ren, Xianwen ; Zhang, Chunhua ; Deng, Naiyang ; Zhang, Xiangsun
2012-12-21
Journal: JOURNAL OF THEORETICAL BIOLOGY; Wang, YC; Ren, XW; Zhang, CH; Deng, NY; Zhang, XS. Interrogating noise in protein sequences from the perspective of protein-protein interactions prediction. JOURNAL OF THEORETICAL BIOLOGY, 2012, 315: 64-70
Abstract: The past decades have witnessed extensive efforts to study the relationships among proteins. In particular, sequence-based prediction of protein-protein interactions (PPIs) is fundamentally important for speeding up the mapping of organisms' interactomes. High-throughput experimental methods have made the PPIs of many model organisms known, which allows machine learning methods to learn understandable rules from the available PPIs. Under the machine learning framework, composition vectors are usually used to encode proteins as real-valued vectors. However, composition vector values might be highly correlated with the background distribution of amino acids, i.e., amino acids that are frequently observed in nature tend to have large composition vector values. Thus, a formulation that estimates the noise induced by the background distribution of amino acids may be needed in the representation. Here, we introduce two kinds of denoising composition vectors, which were successfully used in the construction of phylogenetic trees, to eliminate the noise. Surprisingly, when these two denoising composition vectors are validated on Escherichia coli (E. coli), Saccharomyces cerevisiae (S. cerevisiae), and human PPI datasets, the predictive performance is not improved and is even worse than that of non-denoised prediction. These results suggest that what counts as noise in phylogenetic tree construction may be valuable information in PPI prediction. (C) 2012 Elsevier Ltd. All rights reserved.
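The encoding idea in the abstract can be illustrated with a minimal sketch. It assumes the simplest composition vector (single amino acid frequencies) and uses relative deviation from a background frequency as a stand-in for denoising; the abstract does not give the two denoising formulas actually used in the paper, so this is not the authors' method. All function names here (`composition_vector`, `background_from`, `denoise`, `encode_pair`) are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order that defines vector positions.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_vector(seq):
    """Return the fraction of each standard amino acid in a sequence."""
    counts = Counter(ch for ch in seq.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [counts[a] / total for a in AMINO_ACIDS]


def background_from(seqs):
    """Estimate background amino acid frequencies from a set of sequences."""
    counts = Counter()
    for s in seqs:
        counts.update(ch for ch in s.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    return [counts[a] / total if total else 0.0 for a in AMINO_ACIDS]


def denoise(vec, background):
    """Assumed denoising: relative deviation (p - p0) / p0, or 0 where p0 is 0."""
    return [(p - p0) / p0 if p0 > 0 else 0.0 for p, p0 in zip(vec, background)]


def encode_pair(seq_a, seq_b, background=None):
    """Encode a protein pair as one feature vector by concatenation."""
    va, vb = composition_vector(seq_a), composition_vector(seq_b)
    if background is not None:
        va, vb = denoise(va, background), denoise(vb, background)
    return va + vb
```

A pair encoded with `encode_pair(seq_a, seq_b)` gives the non-denoised features, and passing `background=background_from(all_sequences)` gives the denoised variant, so the two settings compared in the abstract could be fed to the same classifier.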
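Document type: Journal article
Item identifier: http://210.75.249.4/handle/363003/57170
Collection: Northwest Institute of Plateau Biology, Chinese Academy of Sciences
Recommended citation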
GB/T 7714
Wang, Yongcui, Ren, Xianwen, Zhang, Chunhua, et al. Interrogating noise in protein sequences from the perspective of protein-protein interactions prediction[J]. JOURNAL OF THEORETICAL BIOLOGY, 2012, 315: 64-70.
APA Wang, Yongcui, Ren, Xianwen, Zhang, Chunhua, Deng, Naiyang, & Zhang, Xiangsun. (2012). Interrogating noise in protein sequences from the perspective of protein-protein interactions prediction. JOURNAL OF THEORETICAL BIOLOGY.
MLA Wang, Yongcui, et al. "Interrogating noise in protein sequences from the perspective of protein-protein interactions prediction". JOURNAL OF THEORETICAL BIOLOGY (2012).