Sentence embedding and fine-tuning to automatically identify duplicate bugs

Frontiers in Computer Science (2023)

Abstract
Industrial software maintenance is critical but burdensome, and activities such as detecting duplicate bug reports are often performed manually. This paper proposes an automated duplicate bug report detection system to improve maintenance efficiency. The system uses vectorization of report contents and deep learning-based sentence embedding to calculate the similarity of whole reports from the vectors of their individual elements. The sentence embedding is realized by fine-tuning Sentence-BERT. The system is validated by experimental comparison with baseline methods, and in these experiments it detects duplicate bug reports more effectively than existing methods.
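The idea of scoring a whole report from the vectors of its individual elements can be illustrated with a minimal Python sketch. It assumes each report element has already been embedded by a sentence encoder such as a fine-tuned Sentence-BERT model. The field names (`title`, `description`), the weighted-average combination rule, and the function names are hypothetical choices made for illustration; the abstract does not specify how the paper combines element similarities.

```python
import math
from typing import Dict, List, Optional

Vector = List[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def report_similarity(
    report_a: Dict[str, Vector],
    report_b: Dict[str, Vector],
    weights: Optional[Dict[str, float]] = None,
) -> float:
    """Weighted average of per-element cosine similarities.

    Assumption: a weighted average is only one plausible way to combine
    element scores; it is not taken from the paper.
    """
    shared = [field for field in report_a if field in report_b]
    if not shared:
        return 0.0
    if weights is None:
        weights = {field: 1.0 for field in shared}
    total = sum(weights.get(field, 0.0) for field in shared)
    if total == 0.0:
        return 0.0
    score = sum(
        weights.get(field, 0.0) * cosine_similarity(report_a[field], report_b[field])
        for field in shared
    )
    return score / total


# Toy 2-dimensional vectors stand in for real sentence embeddings.
new_report = {"title": [1.0, 0.0], "description": [0.0, 1.0]}
candidate = {"title": [1.0, 0.0], "description": [1.0, 0.0]}

equal_weight_score = report_similarity(new_report, candidate)
title_heavy_score = report_similarity(
    new_report, candidate, weights={"title": 3.0, "description": 1.0}
)
```

In this toy case the titles match exactly and the descriptions are orthogonal, so giving the title more weight raises the overall score. In a detection setting, candidates would typically be ranked by such a score and the top results shown to a maintainer.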
Keywords
bug reports, duplicate detection, BERT, sentence embedding, natural language processing, information retrieval