DSpace Repository

How repeated data points affect bug prediction performance: A case study

Show simple item record

dc.creator Ozturk, Muhammed Maruf
dc.creator ZENGIN, Ahmet
dc.date 2016-11-30T21:00:00Z
dc.date.accessioned 2020-10-06T09:47:48Z
dc.date.available 2020-10-06T09:47:48Z
dc.identifier 3b663f24-1ac2-4a8d-b0e1-e96fc42b4630
dc.identifier 10.1016/j.asoc.2016.08.002
dc.identifier https://avesis.sdu.edu.tr/publication/details/3b663f24-1ac2-4a8d-b0e1-e96fc42b4630/oai
dc.identifier.uri http://acikerisim.sdu.edu.tr/xmlui/handle/123456789/57816
dc.description In defect prediction studies, open-source and real-world defect data sets are frequently used. The quality of these data sets is one of the main factors affecting the validity of defect prediction methods. One of the issues is repeated data points in defect prediction data sets. The main goal of the paper is to explore how low-level metrics are derived. This paper also presents a cleansing algorithm that removes repeated data points from defect data sets. The method was applied on 20 data sets, including five open source sets, and area under the curve (AUC) and precision performance parameters have been improved by 4.05% and 6.7%, respectively. In addition, this work discusses how static code metrics should be used in bug prediction. The study provides tips to obtain better defect prediction results. (C) 2016 Elsevier B.V. All rights reserved.
dc.language eng
dc.rights info:eu-repo/semantics/closedAccess
dc.title How repeated data points affect bug prediction performance: A case study
dc.type info:eu-repo/semantics/article


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account