IUPACpal: efficient identification of inverted repeats in IUPAC-encoded DNA sequences

BMC Bioinformatics (2021)

Abstract
Background: An inverted repeat is a DNA sequence followed downstream by its reverse complement, potentially with a gap in the centre. Inverted repeats are found in both prokaryotic and eukaryotic genomes, and they have been linked with many possible functions. Many international consortia provide a comprehensive description of common genetic variation, making alternative sequence representations, such as IUPAC encoding, necessary for leveraging the full potential of such broad variation datasets.
Results: We present IUPACpal, an exact tool for efficient identification of inverted repeats in IUPAC-encoded DNA sequences, which also allows for potential mismatches and gaps in the inverted repeats.
Conclusion: Within the parameters that were tested, our experimental results show that IUPACpal compares favourably to a similar application packaged with EMBOSS. We show that IUPACpal identifies many previously unidentified inverted repeats when compared with EMBOSS, and that it does so with a speed improvement of orders of magnitude.
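To make the definition in the Background concrete, the following is a minimal sketch of a naive check for a single inverted repeat in an IUPAC-encoded string. It is not IUPACpal's algorithm, which the abstract does not describe. The function names (`can_pair`, `is_inverted_repeat`) and parameters are hypothetical, and the matching rule is an assumption: two IUPAC symbols are taken to pair when at least one base encoded by the first is the complement of a base encoded by the second.

```python
# Standard IUPAC nucleotide codes mapped to the bases each one can stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def can_pair(x, y):
    """Assumed rule: x pairs with y if some base in x complements some base in y."""
    return any(COMPLEMENT[base] in IUPAC[y] for base in IUPAC[x])


def is_inverted_repeat(seq, start, arm_len, gap=0, max_mismatches=0):
    """Naive check (hypothetical helper, not the IUPACpal method).

    Tests whether seq[start:] begins with a left arm of arm_len symbols,
    then gap arbitrary symbols, then a right arm that is the reverse
    complement of the left arm, tolerating up to max_mismatches
    non-pairing positions.
    """
    end = start + 2 * arm_len + gap
    if start < 0 or arm_len <= 0 or gap < 0 or end > len(seq):
        return False
    mismatches = 0
    for k in range(arm_len):
        left = seq[start + k]       # k-th symbol of the left arm
        right = seq[end - 1 - k]    # its partner, read backwards from the right
        if not can_pair(left, right):
            mismatches += 1
            if mismatches > max_mismatches:
                return False
    return True


print(is_inverted_repeat("GAATTC", 0, 3))                    # True
print(is_inverted_repeat("GAANNTTC", 0, 3, gap=2))           # True
print(is_inverted_repeat("GAATTG", 0, 3))                    # False
print(is_inverted_repeat("GAATTG", 0, 3, max_mismatches=1))  # True
print(is_inverted_repeat("RAATTY", 0, 3))                    # True
```

This check runs in time proportional to the arm length for one candidate position. Scanning every position, arm length and gap this way would be far slower than a dedicated tool, which is the gap that software such as IUPACpal is meant to address.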
Keywords
Inverted repeat, Palindrome, Gaps, Mismatches, Software, IUPAC