Chrome Extension
WeChat Mini Program
Use on ChatGLM

CPA-Perturb-seq: Multiplexed Single-Cell Characterization of Alternative Polyadenylation Regulators

bioRxiv(2023)

New York Genome Center | Department of Genetics

Cited 3|Views44
Abstract
Most mammalian genes have multiple polyA sites, representing a substantial source of transcript diversity that is governed by the cleavage and polyadenylation (CPA) regulatory machinery. To better understand how these proteins govern polyA site choice we introduce CPA-Perturb-seq, a multiplexed perturbation screen dataset of 42 known CPA regulators with a 3' scRNA-seq readout that enables transcriptome-wide inference of polyA site usage. We develop a statistical framework to specifically identify perturbation-dependent changes in intronic and tandem polyadenylation, and discover modules of co-regulated polyA sites exhibiting distinct functional properties. By training a multi-task deep neural network (APARENT-Perturb) on our dataset, we delineate a cis-regulatory code that predicts responsiveness to perturbation and reveals interactions between distinct regulatory complexes. Finally, we leverage our framework to re-analyze published scRNA-seq datasets, identifying new regulators that affect the relative abundance of alternatively polyadenylated transcripts, and characterizing extensive cellular heterogeneity in 3' UTR length amongst antibody-producing cells. Our work highlights the potential for multiplexed single-cell perturbation screens to further our understanding of post-transcriptional regulation in vitro and in vivo.
More
Translated text
Key words
Gene Expression Regulation,Transcriptional Regulation
PDF
Bibtex
AI Read Science
AI Summary
AI Summary is the key point extracted automatically understanding the full text of the paper, including the background, methods, results, conclusions, icons and other key content, so that you can get the outline of the paper at a glance.
Example
Background
Key content
Introduction
Methods
Results
Related work
Fund
Key content
  • Pretraining has recently greatly promoted the development of natural language processing (NLP)
  • We show that M6 outperforms the baselines in multimodal downstream tasks, and the large M6 with 10 parameters can reach a better performance
  • We propose a method called M6 that is able to process information of multiple modalities and perform both single-modal and cross-modal understanding and generation
  • The model is scaled to large model with 10 billion parameters with sophisticated deployment, and the 10 -parameter M6-large is the largest pretrained model in Chinese
  • Experimental results show that our proposed M6 outperforms the baseline in a number of downstream tasks concerning both single modality and multiple modalities We will continue the pretraining of extremely large models by increasing data to explore the limit of its performance
Try using models to generate summary,it takes about 60s
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本文提出了CPA-Perturb-seq方法,实现了对替代性多腺苷酸化(APA)调控因子在单细胞水平上的大规模扰动分析,并揭示了APA调控的分子机制。

方法】:作者开发了一种统计框架,用于识别内含子和串联多腺苷酸化位点在扰动依赖性变化中的特定模式,并利用多任务深度神经网络(APARENT-Perturb)预测对扰动的响应和揭示不同调控复合物之间的相互作用。

实验】:通过在42个已知CPA调控因子上进行扰动实验,并结合3'端单细胞RNA测序(3' scRNA-seq)数据,作者分析了扰动对转录组范围内多腺苷酸化位点使用的影响,并使用公开的scRNA-seq数据集进行了再分析,发现了影响APA转录本相对丰度的新调控因子,并在抗体产生细胞中揭示了3'非翻译区(3' UTR)长度的广泛细胞异质性。