Write a Blog >>
ICSE 2020
Wed 24 June - Thu 16 July 2020
Thu 9 Jul 2020 07:36 - 07:44 at Baekje - I13-Testing and Debugging 1 Chair(s): Shin Hwei Tan

This is an extended abstract and presentation proposal for the manuscript ID EMSE-D-18-00360R1 accepted by the Empirical Software Engineering journal. The journal paper is not yet online. The accepted manuscript is uploaded along with this proposal.

Automatically predicting the defect type of a software defect from its description can significantly speed up and improve the software defect management process. The standard supervised learning based approach for this task (Thung et al., WCRE2012) needs 90% of labeled data for training the classifier. Creating such data is an expensive and effort-intensive task requiring domain-specific expertise.

In this paper, we propose to circumvent this problem by carrying out concept-based classification (CBC) of software defect reports with help of the Explicit Semantic Analysis (ESA) framework. We first create the concept-based representations of a software defect report and the defect types in the software defect classification scheme by projecting their textual descriptions into a concept-space spanned by the Wikipedia articles. Then, we compute the semantic similarity between these concept-based representations and assign the software defect type that has the highest similarity with the defect report. The proposed CBC approach achieves accuracy (F1 score = 63.16%) similar to the state-of-the-art semi-supervised and active learning approach (Thung et al., , ICPC 2015) for this task without requiring labeled training data. The state-of-the-art approach requires labels for 15% of input defects and achieves accuracy (F1 score) of 62.3%.

Unlike the state-of-the-art approach, our method does not need access to the source-code used to fix the defect. We use just the textual description of the defect reports and the keywords describing the defect types in the defect classification scheme. Note that learning a classifier without labeled training data is known as zero-shot learning and it is a significantly harder task than learning a classifier using labeled data. The proposed concept-based classification of software defect types is the first instance of zero-shot learning philosophy in the software defect analytics domain.

Thu 9 Jul

Displayed time zone: (UTC) Coordinated Universal Time change

07:00 - 08:00
I13-Testing and Debugging 1Demonstrations / Technical Papers / Software Engineering in Practice / Journal First at Baekje
Chair(s): Shin Hwei Tan Southern University of Science and Technology
07:00
12m
Talk
Learning-to-Rank vs Ranking-to-Learn: Strategies for Regression Testing in Continuous IntegrationTechnical
Technical Papers
Antonia Bertolino CNR-ISTI, Antonio Guerriero Università di Napoli Federico II, Breno Miranda Federal University of Pernambuco, Roberto Pietrantuono Università di Napoli Federico II, Stefano Russo Università di Napoli Federico II
07:12
12m
Talk
Debugging InputsArtifact ReusableTechnicalArtifact Available
Technical Papers
Lukas Kirschner Saarland University, Ezekiel Soremekun CISPA Helmholtz Center for Information Security, Andreas Zeller CISPA Helmholtz Center for Information Security
Link to publication DOI Pre-print
07:24
12m
Talk
Property-based Testing for LG Home Appliances using Accelerated Software-in-the-Loop SimulationIEEE Software Best Software Engineering in Practice AwardSEIP
Software Engineering in Practice
Mingyu Park LG Electronics, Hoon Jang Hyundai Motor Company, Taejoon Byun University of Minnesota, Yunja Choi Kyungpook National University
Pre-print
07:36
8m
Talk
Predicting Software Defect Type using Concept-based ClassificationJ1
Journal First
Sangameshwar Patil Dept. of CSE, IIT Madras and TRDDC, TCS, Balaraman Ravindran IIT Madras
07:44
8m
Talk
The Art, Science, and Engineering of Fuzzing: A SurveyJ1
Journal First
Valentin Manès CSRC, KAIST, HyungSeok Han KAIST, Choongwoo Han NAVER Corporation, Sang Kil Cha KAIST, Manuel Egele Boston University, USA, Edward Schwartz Carnegie Mellon University, Maverick Woo Carnegie Mellon University
07:52
3m
Talk
GeekyNote: A Technical Documentation Tool with Coverage, Backtracking, Traces, and CouplingsDemo
Demonstrations
Yung-Pin Cheng National Central University, Wei-Nien Hsiung National Central University, Yu-Shan Wu IsCoollab Co. Ltd, Li-Hsuan Chen IsCoollab Co. Ltd