EightyDVec: a method for protein sequence similarity analysis using physicochemical properties of amino acids

Computer Methods in Biomechanics and Biomedical Engineering: Imaging and Visualization (2022)

Abstract
Similarity analysis of protein sequences can reveal the evolutionary relationships among them. Effective computational algorithms are needed to compare the very large numbers of sequences now available. Alignment-based approaches to this problem are often computationally expensive, especially when the number of sequences is large. This research aims to develop an efficient alignment-free tool for protein sequence comparison and phylogenetic study. The proposed method, EightyDVec, generates features from the physicochemical properties of amino acids that best describe the evolutionary relationships among the species in a protein family. EightyDVec transforms each protein sequence into an 80-dimensional feature vector, so that sequences can be compared directly through these vectors. Four different datasets are used to validate the accuracy of EightyDVec, and the results show that the proposed method is effective for similarity analysis of protein sequences.
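The general alignment-free pattern described above (map each sequence to a fixed-length vector, then compare vectors) can be sketched as follows. This is a minimal illustration and not the EightyDVec feature set: the abstract does not specify the 80 features, so the sketch assumes a hypothetical four-class grouping of amino acids and uses a flattened group-to-group Markov transition matrix (16 values) as a stand-in. The names `GROUPS`, `transition_vector`, and `distance_matrix` are invented for this example.

```python
from itertools import combinations
import math

# Hypothetical coarse grouping of the 20 standard amino acids.
# This is an assumption for illustration only; it is NOT the grouping
# or the 80-dimensional feature set used by EightyDVec.
GROUPS = {
    "hydrophobic": "AVLIMFWPGC",
    "polar": "STYNQ",
    "positive": "KRH",
    "negative": "DE",
}
GROUP_NAMES = list(GROUPS)
AA_TO_GROUP = {aa: i for i, name in enumerate(GROUP_NAMES) for aa in GROUPS[name]}


def transition_vector(seq):
    """Return the flattened, row-normalised group-to-group transition matrix."""
    k = len(GROUP_NAMES)
    counts = [[0.0] * k for _ in range(k)]
    # Residues outside the 20 standard amino acids are skipped.
    states = [AA_TO_GROUP[aa] for aa in seq.upper() if aa in AA_TO_GROUP]
    for a, b in zip(states, states[1:]):
        counts[a][b] += 1
    vec = []
    for row in counts:
        total = sum(row)
        # Rows with no observed transitions stay all-zero.
        vec.extend(c / total if total else 0.0 for c in row)
    return vec


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def distance_matrix(seqs):
    """Pairwise distances for a dict mapping sequence name to sequence."""
    names = list(seqs)
    vecs = {n: transition_vector(seqs[n]) for n in names}
    dist = {n: {m: 0.0 for m in names} for n in names}
    for a, b in combinations(names, 2):
        dist[a][b] = dist[b][a] = euclidean(vecs[a], vecs[b])
    return dist
```

A distance matrix of this kind could then be passed to a standard tree-building procedure such as UPGMA or neighbor joining; whether EightyDVec uses Euclidean distance or one of these procedures is not stated in the abstract.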
Keywords
Sequence similarity, amino acids, physicochemical property, Markov chain transition matrix, phylogenetic