Title An Approach to Detecting Duplicate Bug Reports using Natural Language and Execution Information
Authors Wang, Xiaoyin
Zhang, Lu
Xie, Tao
Anvik, John
Sun, Jiasu
Affiliation Peking Univ, Minist Educ, Key Lab High Confidence Software Technol, Inst Software,EECS, Beijing 100871, Peoples R China.
Keywords Duplicate bug report
execution information
information retrieval
SOFTWARE
Issue Date 2008
Citation ICSE'08: Proceedings of the Thirtieth International Conference on Software Engineering.
Abstract An open source project typically maintains an open bug repository so that bug reports from all over the world can be gathered. When a new bug report is submitted to the repository, a person, called a triager, examines whether it is a duplicate of an existing bug report. If it is, the triager marks it as DUPLICATE and the bug report is removed from consideration for further work. In the literature, there are approaches exploiting only natural language information to detect duplicate bug reports. In this paper we present a new approach that further involves execution information. In our approach, when a new bug report arrives, its natural language information and execution information are compared with those of the existing bug reports. Then, a small number of existing bug reports are suggested to the triager as the most similar bug reports to the new bug report. Finally, the triager examines the suggested bug reports to determine whether the new bug report duplicates an existing bug report. We calibrated our approach on a subset of the Eclipse bug repository and evaluated our approach on a subset of the Firefox bug repository. The experimental results show that our approach can detect 67%-93% of duplicate bug reports in the Firefox bug repository, compared to 43%-72% using natural language information alone.
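The retrieval step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the bag-of-words cosine similarity, the representation of execution information as a set of invoked method names, the equal 0.5/0.5 weighting, and all names (`suggest`, `nl_vector`, `exec_vector`, the sample reports) are assumptions made for illustration only.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def nl_vector(text: str) -> Counter:
    """Hypothetical natural language representation: lowercase word counts."""
    return Counter(text.lower().split())


def exec_vector(methods: list) -> Counter:
    """Hypothetical execution representation: which methods were invoked (0/1)."""
    return Counter(set(methods))


def suggest(new: dict, existing: dict, k: int = 3) -> list:
    """Return the ids of the k existing reports most similar to the new one."""
    scored = []
    for report_id, report in existing.items():
        nl_sim = cosine(nl_vector(new["text"]), nl_vector(report["text"]))
        ex_sim = cosine(exec_vector(new["trace"]), exec_vector(report["trace"]))
        # Assumption: a plain average; the paper's own combination may differ.
        scored.append((0.5 * nl_sim + 0.5 * ex_sim, report_id))
    scored.sort(reverse=True)
    return [report_id for _, report_id in scored[:k]]


# Made-up sample data, purely for demonstration.
existing = {
    101: {"text": "crash when opening bookmark menu",
          "trace": ["Menu.open", "Bookmark.load"]},
    102: {"text": "page renders slowly on scroll",
          "trace": ["Render.paint", "Scroll.handle"]},
}
new = {"text": "browser crash on bookmark menu",
       "trace": ["Menu.open", "Bookmark.load", "Crash.report"]}

top = suggest(new, existing, k=1)
```

In this sketch the triager would be shown the reports in `top` and would make the final duplicate decision, matching the workflow the abstract describes.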
URI http://hdl.handle.net/20.500.11897/406648
Indexed CPCI-S(ISTP)
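Appears in Collections: School of Electronics Engineering and Computer Science
Key Laboratory of High Confidence Software Technologies (Ministry of Education)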

License: See PKU IR operational policies.