谷歌浏览器插件
订阅小程序
在清言上使用

Imodulonminer and PyModulon: Software for Unsupervised Mining of Gene Expression Compendia

Anand V Sastry, Yuan,Saugat Poudel,Kevin Rychel,Reo Yoo, Cameron R Lamoureux,Gaoyuan Li, Joshua T Burrows,Siddharth Chauhan,Zachary B Haiman, Tahani Al Bulushi,Yara Seif,Bernhard O Palsson, Daniel C Zielinski

PLoS computational biology(2024)

引用 0|浏览1
暂无评分
摘要
Public gene expression databases are a rapidly expanding resource of organism responses to diverse perturbations, presenting both an opportunity and a challenge for bioinformatics workflows to extract actionable knowledge of transcription regulatory network function. Here, we introduce a five-step computational pipeline, called iModulonMiner, to compile, process, curate, analyze, and characterize the totality of RNA-seq data for a given organism or cell type. This workflow is centered around the data-driven computation of co-regulated gene sets using Independent Component Analysis, called iModulons, which have been shown to have broad applications. As a demonstration, we applied this workflow to generate the iModulon structure of Bacillus subtilis using all high-quality, publicly-available RNA-seq data. Using this structure, we predicted regulatory interactions for multiple transcription factors, identified groups of co-expressed genes that are putatively regulated by undiscovered transcription factors, and predicted properties of a recently discovered single-subunit phage RNA polymerase. We also present a Python package, PyModulon, with functions to characterize, visualize, and explore computed iModulons. The pipeline, available at https://github.com/SBRG/iModulonMiner, can be readily applied to diverse organisms to gain a rapid understanding of their transcriptional regulatory network structure and condition-specific activity.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要