A discriminative model approach for accurate duplicate bug report retrieval 论文

2010引用 336

Software Engineering ResearchSoftware Testing and Debugging TechniquesSoftware System Performance and Reliability

企业软件 Software Engineering Research Software Testing and Debugging Techniques Software System Performance and Reliability

关系图谱

作者

摘要

Bug repositories are usually maintained in software projects. Testers or users submit bug reports to identify various issues with systems. Sometimes two or more bug reports corre-spond to the same defect. To address the problem with du-plicate bug reports, a person called a triager needs to man-ually label these bug reports as duplicates, and link them to their ”master ” reports for subsequent maintenance work. However, in practice there are considerable duplicate bug re-ports sent daily; requesting triagers to manually label these bugs could be highly time consuming. To address this issue, recently, several techniques have be proposed using various similarity based metrics to detect candidate duplicate bug reports for manual verification. Au-tomating triaging has been proved challenging as two reports of the same bug could be written in various ways. There is still much room for improvement in terms of accuracy of du-plicate detection process. In this paper, we leverage recent advances on using discriminative models for information re-trieval to detect duplicate bug reports more accurately. We have validated our approach on three large software bug repositories from Firefox, Eclipse, and OpenOffice. We show that our technique could result in 17–31%, 22–26%, and 35– 43 % relative improvement over state-of-the-art techniques in OpenOffice, Firefox, and Eclipse datasets respectively using commonly available natural language information only.

作者查看全部 (5)

Siau‐Cheng Khoo

Jing Jiang

Xiaoyin Wang

David Lo

A discriminative model approach for accurate duplicate bug report retrieval 论文

详细信息

摘要

作者查看全部 (5)

相关技术

相关事件

相关文章