A duplicate bug report detection model with enhanced text relevance semantics and multi-feature extraction
Article
Figures
Metrics
Preview PDF
Reference
Related
Cited by
Materials
Abstract:
A duplicate bug report detection model with enhanced text relevance semantics and multi-feature extraction was proposed to address the issues of semantic long-distance dependence and the singleness of bug report features in the current research on duplicate bug report detection. The model introduced the self-attention mechanism to capture the semantic relevance within the bug report text sequence. This mechanism calculates the contextual semantic vector dynamically for semantic analysis and resolves the problem of long-distance dependence. Additionally, the model employed the latent Dirichlet allocation algorithm to capture the topic characteristics of the bug report text. Furthermore, a feature extraction network was constructed to calculate category difference features, providing category information for the bug report simultaneously. Finally, comprehensive detection was performed based on three types of feature vectors. The experimental results demonstrate that the model achieves improved detection performance.
Keywords:
Project Supported:
Supported by National Natural Science Foundation of China(61502401, 12050410248), Sichuan Science and Technology Program(2021YFH0120), and Fundamental Research Funds for the Central Universities, Southwest Minzu University (2020YYXS59).