sideRETRO: a pipeline for identifying somatic and dimorphic insertions of processed pseudogenes or retrocopies

biorxiv(2020)

引用 0|浏览7
暂无评分
摘要
Retrocopies or processed pseudogenes are gene copies resulting from mRNA retrotransposition. These gene duplicates can be fixed, somatically inserted or dimorphic in the genome. However, knowledge regarding unfixed retrocopies (retroCNVs) is still limited, and the development of computational tools for effectively identifying and genotyping them is an urgent need. Here, we present sideRETRO, a pipeline dedicated not only to detecting retroCNVs in whole-genome or whole-exome sequencing data but also to revealing their insertion sites, zygosity, and genomic context and classifying them as somatic or dimorphic events. We show that sideRETRO can identify novel retroCNVs and genotype them (93.2% accuracy), in addition to identifying dimorphic retroCNVs in whole-genome and whole-exome data. Therefore, sideRETRO fills a gap in the literature and presents an efficient and straightforward algorithm to accelerate the study of retroCNVs.
更多
查看译文
关键词
Processed pseudogenes,Retrocopies,Mobile Elements,Bioinformatics,Genomics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要