谷歌浏览器插件
订阅小程序
在清言上使用

Generalised Implementation for Fixed-Length Approximate String Matching under Hamming Distance and Applications

IPDPSW '15 Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop(2015)

引用 7|浏览11
暂无评分
摘要
Approximate string matching is the problem of finding all factors of a text t of length n with a distance at most k from a pattern x of length m ale; n. Fixed-length approximate string matching is the problem of finding all factors of a text t of length n with a distance at most k from any factor of length h of a pattern x of length m ale; n, where h ale; m. It is thus a generalisation of approximate string matching and has numerous direct applications in molecular biology -- motif extraction and circular sequence alignment, to name a few. MaxShiftM is a bit-parallel algorithm for fixed-length approximate string matching under the Hamming distance model with time complexity O (m[h/w]n), where w is the size of the computer word (Crochemore et al., 2010). An implementation of this algorithm is straightforward as long as the maximal length of alignments is less than or equal to w. In this article, our contribution is twofold: first, we propose a generalised implementation of MaxShiftM, that is, for any given h ale; m, under the Hamming distance model, and second, we show how our implementation can be used to improve the accuracy and efficiency of multiple circular sequence alignment in terms of the inferred likelihood-based phylogenies.
更多
查看译文
关键词
bioinformatics,computational complexity,parallel algorithms,string matching,Hamming distance model,MaxShiftM algorithm,O(m⌈h/w⌉n) time complexity,accuracy improvement,bit-parallel algorithm,computer word size,efficiency improvement,fixed-length approximate string matching,inferred likelihood-based phylogenies,maximal alignment length,multiple circular sequence alignment,pattern length,text length,algorithms on strings,approximate string matching,dynamic programming
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要