Accurate label-free quantification by directLFQ to compare unlimited numbers of proteomes

biorxiv(2023)

引用 2|浏览29
暂无评分
摘要
Recent advances in mass spectrometry (MS)-based proteomics enable the acquisition of increasingly large datasets within relatively short times, which exposes bottlenecks in the bioinformatics pipeline. Whereas peptide identification is already scalable, most label-free quantification (LFQ) algorithms scale quadratic or cubic with the sample numbers, which may even preclude the analysis of large-scale data. Here we introduce directLFQ, a ratiobased approach for sample normalization and the calculation of protein intensities. It estimates quantities via aligning samples and ion traces by shifting them on top of each other in logarithmic space. Importantly, directLFQ scales linearly with the number of samples, allowing analyses of large studies to finish in minutes instead of days or months. We quantify 10,000 proteomes in 10 minutes and 100,000 proteomes in less than two hours - thousand-fold faster than some implementations of the popular LFQ algorithm MaxLFQ. In-depth characterization of directLFQ reveals excellent normalization properties and benchmark results, comparing favorably to MaxLFQ for both data-dependent acquisition (DDA) and data-independent acquisition (DIA). Additionally, directLFQ provides normalized peptide intensity estimates for peptide-level comparisons. It is available as an open-source Python package and as a GUI with a one-click installer and can be used in the AlphaPept ecosystem as well as downstream of most common computational proteomics pipelines. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
关键词
proteomics,algorithms,quantification,label-free,protein intensity
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要