Practical duplicate bug reports detection in a large web-based development community

  • Liang Feng*
  • , Leyi Song
  • , Chaofeng Sha
  • , Xueqing Gong
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

26 Scopus citations

Abstract

Most of large web-based development communities require a bug tracking system to keep track of various bug reports. However, duplicate bug reports tend to result in waste of resources, and may cause potential conflicts. There have been two types of works focusing on this problem: relevant bug report retrieval [8][11][10][13] and duplicate bug report identification [5][12]. The former methods can achieve high accuracy (82%) in the top 10 results in some dataset, but they do not really reduce the workload of developers. The latter methods still need further improvement on the performance. In this paper, we propose a practical duplicate bug reports detection method, which aims to help project team to reduce their workload by combining existing two categories of methods. We also propose some new features extracted from comments, user profiles and query feedback, which are useful for improving the detection performance. Experiments on real dataset show that our method improves the accuracy rate by 23% compared to state-of-the-art work in duplicate bug report identification, and improves the recall rate by up to 8% in relevant bug report retrieval.

Original languageEnglish
Title of host publicationWeb Technologies and Applications - 15th Asia-Pacific Web Conference, APWeb 2013, Proceedings
Pages709-720
Number of pages12
DOIs
StatePublished - 2013
Event15th Asia-Pacific Web Conference on Web Technologies and Applications, APWeb 2013 - Sydney, NSW, Australia
Duration: 4 Apr 20136 Apr 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7808 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th Asia-Pacific Web Conference on Web Technologies and Applications, APWeb 2013
Country/TerritoryAustralia
CitySydney, NSW
Period4/04/136/04/13

Keywords

  • Bug Report
  • Classification
  • Duplicate Detection

Fingerprint

Dive into the research topics of 'Practical duplicate bug reports detection in a large web-based development community'. Together they form a unique fingerprint.

Cite this