This document discusses a study on duplicate defect detection using natural language processing (NLP) techniques applied to defect reports, highlighting the challenges of identifying duplicates in structured natural language. A prototype tool was evaluated at Sony Ericsson, finding it could identify about 2/3 of potential duplicates, leading to significant efficiency gains. A replication study investigated similar methodologies on Android OS defect reports, confirming some results while revealing the need for further empirical evaluations.
Related topics: