A defect estimator for source code: linking defect reports with programming constructs usage metrics

KAPUR, R.; Sodhi, B.

DSpace Home
→
Research Publications
→
Year-2020
→
View Item

dc.contributor.author	KAPUR, R.
dc.contributor.author	Sodhi, B.
dc.date.accessioned	2021-07-04T09:14:04Z
dc.date.available	2021-07-04T09:14:04Z
dc.date.issued	2021-07-04
dc.identifier.uri	http://localhost:8080/xmlui/handle/123456789/1996
dc.description.abstract	An important issue faced during software development is to identify defects and the properties of those defects, if found, in a given source file. Determining defectiveness of source code assumes significance due to its implications on software development and maintenance cost. We present a novel system to estimate the presence of defects in source code and detect attributes of the possible defects, such as the severity of defects. The salient elements of our system are: (i) a dataset of newly introduced source code metrics, called PROgramming CONstruct (PROCON) metrics, and (ii) a novel MachineLearning (ML)-based system, called Defect Estimator for Source Code (DESCo), that makes use of PROCON dataset for predicting defectiveness in a given scenario. The dataset was created by processing 30,400+ source files written in four popular programming languages, viz., C, C++, Java, and Python. The results of our experiments show that DESCo system outperforms one of the state-of-the-art methods with an improvement of 44.9%. To verify the correctness of our system, we compared the performance of 12 different ML algorithms with 50+ different combinations of their key parameters. Our system achieves the best results with SVM technique with a mean accuracy measure of 80.8%.	en_US
dc.language.iso	en_US	en_US
dc.subject	Maintaining software	en_US
dc.subject	source code mining	en_US
dc.subject	software defect prediction	en_US
dc.subject	software metrics	en_US
dc.subject	software faults and failures	en_US
dc.subject	automated software engineering	en_US
dc.subject	AI in software engineering	en_US
dc.title	A defect estimator for source code: linking defect reports with programming constructs usage metrics	en_US
dc.type	Article	en_US