AlphaPeptStats: an open-source Python package for automated, scalable and industrial-strength statistical analysis of mass spectrometry-based proteomics

bioRxiv (2023)

Abstract
The widespread application of mass spectrometry (MS)-based proteomics in biomedical research increasingly requires robust, transparent and streamlined solutions to extract statistically reliable insights. Existing popular tools were generally developed for specific uses in academic environments and do not fully embrace current open-source principles and software engineering best practices. We have designed and implemented AlphaPeptStats, an inclusive Python package with broad functionality for normalization, imputation, visualization and statistical analysis of proteomics data. It builds modularly on the established stack of Python scientific libraries and is accompanied by a rigorous testing framework with 98% test coverage. It imports the output of a range of popular search engines, and data can be filtered and normalized according to user specifications. At its heart, AlphaPeptStats provides a wide range of robust statistical algorithms, such as t-tests, ANOVA, PCA, hierarchical clustering and multiple covariate analysis, all in an automatable manner. Data visualization capabilities include heat maps, volcano plots and scatter plots in publication-ready format. AlphaPeptStats advances proteomic research through robust tools that enable researchers to explore complex datasets manually or automatically and to identify interesting patterns and outliers.

Competing Interest Statement: The authors have declared no competing interest.
Keywords
AlphaPeptStats, open-source, industrial-strength, spectrometry-based
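
The workflow named in the abstract (normalization followed by a group-wise t-test, the quantities a volcano plot displays) can be illustrated with a minimal sketch. It deliberately does not use the AlphaPeptStats API, which the abstract does not describe; it relies only on the Python standard library. The toy intensities, the function names, the choice of median centering as the normalization step, and the use of Welch's variant of the t-statistic are all assumptions made for illustration.

```python
import math
import statistics

# Hypothetical toy data (not from the paper): protein -> raw intensity per sample.
intensities = {
    "P1": [100.0, 110.0, 90.0, 400.0, 440.0, 360.0],
    "P2": [500.0, 520.0, 480.0, 520.0, 480.0, 500.0],
    "P3": [2000.0, 2100.0, 1900.0, 1900.0, 2000.0, 2100.0],
    "P4": [8000.0, 8200.0, 7800.0, 7800.0, 8000.0, 8200.0],
}
# Group label of each sample, in column order.
groups = ["control"] * 3 + ["treated"] * 3


def log2_transform(table):
    """Log2-transform every intensity."""
    return {p: [math.log2(v) for v in vals] for p, vals in table.items()}


def median_center(table):
    """Subtract each sample's median log2 intensity (one assumed normalization)."""
    n_samples = len(next(iter(table.values())))
    medians = [
        statistics.median(vals[j] for vals in table.values())
        for j in range(n_samples)
    ]
    return {p: [v - medians[j] for j, v in enumerate(vals)] for p, vals in table.items()}


def welch_t(a, b):
    """Welch's t-statistic for mean(b) - mean(a); NaN if both variances are zero."""
    se2 = statistics.variance(a) / len(a) + statistics.variance(b) / len(b)
    if se2 == 0:
        return math.nan
    return (statistics.mean(b) - statistics.mean(a)) / math.sqrt(se2)


def compare_groups(table, groups, g1, g2):
    """Per-protein log2 fold change (g2 vs g1) and Welch t-statistic."""
    out = {}
    for protein, vals in table.items():
        a = [v for v, g in zip(vals, groups) if g == g1]
        b = [v for v, g in zip(vals, groups) if g == g2]
        out[protein] = {
            "log2fc": statistics.mean(b) - statistics.mean(a),
            "t": welch_t(a, b),
        }
    return out


normalized = median_center(log2_transform(intensities))
results = compare_groups(normalized, groups, "control", "treated")
for protein, stats in results.items():
    print(protein, round(stats["log2fc"], 3), round(stats["t"], 3))
```

Converting the t-statistics into p-values and correcting them for multiple testing would require a t-distribution function that the standard library does not provide, so that step is left out of this sketch.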