An integrated view of baseline protein expression in human tissues

biorxiv(2022)

引用 6|浏览22
暂无评分
摘要
The availability of proteomics datasets in the public domain, and in the PRIDE database in particular, has increased dramatically in recent years. This unprecedented large-scale availability of data provides an opportunity for combined analyses of datasets to get organism-wide protein abundance data in a consistent manner. We have reanalysed 24 public proteomics datasets from healthy human individuals, to assess baseline protein abundance in 31 organs. We defined tissue as a distinct functional or structural region within an organ. Overall, the aggregated dataset contains 67 healthy tissues, corresponding to 3,119 mass spectrometry runs covering 498 samples, coming from 489 individuals. We compared protein abundances between the different organs and studied the distribution of proteins across organs. We also compared the results with data generated in analogous studies. We also performed gene ontology and pathway enrichment analyses to identify organ-specific enriched biological processes and pathways. As a key point, we have integrated the protein abundance results into the resource Expression Atlas, where it can be accessed and visualised either individually or together with gene expression data coming from transcriptomics datasets. We believe this is a good mechanism to make proteomics data more accessible for life scientists. ### Competing Interest Statement The authors have declared no competing interest. * AD : Alzheimer’s Disease DLPFC : Dorsolateral PreFrontal Cortex FOT : Fraction Of Total HPA : Human Protein Atlas GO : Gene Ontology iBAQ : intensity-based absolute quantification IDF : Investigation Description Format MS : Mass Spectrometry PCA : Principal Component Analysis SDRF : Sample and Data Relationship Format
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要