Please use this identifier to cite or link to this item: http://dspace.iitrpr.ac.in:8080/xmlui/handle/123456789/495
Full metadata record
DC FieldValueLanguage
dc.contributor.authorDas, B.-
dc.contributor.authorKrishnan, N.C.-
dc.contributor.authorCook, D.J.-
dc.date.accessioned2016-11-19T09:30:10Z-
dc.date.available2016-11-19T09:30:10Z-
dc.date.issued2016-11-19-
dc.identifier.urihttp://localhost:8080/xmlui/handle/123456789/495-
dc.description.abstractAs machine learning techniques mature and are used to tackle complex scientific problems, challenges arise such as the imbalanced class distribution problem, where one of the target class labels is under-represented in comparison with other classes. Existing oversampling approaches for addressing this problem typically do not consider the probability distribution of the minority class while synthetically generating new samples. As a result, the minority class is not represented well which leads to high misclassification error. We introduce two probabilistic oversampling approaches, namely RACOG and wRACOG, to synthetically generating and strategically selecting new minority class samples. The proposed approaches use the joint probability distribution of data attributes and Gibbs sampling to generate new minority class samples. While RACOG selects samples produced by the Gibbs sampler based on a predefined lag, wRACOG selects those samples that have the highest probability of being misclassified by the existing learning model. We validate our approach using nine UCI data sets that were carefully modified to exhibit class imbalance and one new application domain data set with inherent extreme class imbalance. In addition, we compare the classification performance of the proposed methods with three other existing resampling techniquesen_US
dc.language.isoen_USen_US
dc.subjectApproximating joint probability distributionen_US
dc.subjectGibbs samplingen_US
dc.subjectImbalanced class distributionen_US
dc.subjectProbabilistic oversamplingen_US
dc.titleRACOG and wRACOG: Two probabilistic oversampling techniquesen_US
dc.typeArticleen_US
Appears in Collections:Year-2015

Files in This Item:
File Description SizeFormat 
Full Text.pdf3.85 MBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.