DSpace@Çankaya

Small and Unbalanced Data Set Problem in Classification

Basit öğe kaydını göster

dc.contributor.author Par, Öznur Esra
dc.contributor.author Sezer, Ebru Akçapınar
dc.contributor.author Sever, Hayri
dc.date.accessioned 2023-01-04T08:28:53Z
dc.date.available 2023-01-04T08:28:53Z
dc.date.issued 2019
dc.identifier.citation Par, Öznur Esra; Sezer, Ebru Akçapınar; Sever, Hayri (2019). "Small and Unbalanced Data Set Problem in Classification", 27th Signal Processing and Communications Applications Conference (SIU), Sivas Cumhuriyet Univ, Sivas, TURKEY, APR 24-26, 2019. tr_TR
dc.identifier.issn 2165-0608
dc.identifier.uri http://hdl.handle.net/20.500.12416/6019
dc.description.abstract Classification of data is difficult in case of small and unbalanced data set and this problem directly affects the classification performance. Small and / or the imbalance dataset has become a major problem in data mining. Classification algorithms are developed based on the assumption that the data sets are balanced and large enough. The most of the algorithms ignore or misclassify examples of the minority class, focus on the majority class. Small and unbalanced data set problem is frequently encountered in medical data mining due to some limitations. Within the scope of the study, the public accessible data set, hepatitis, was divided into small and imblanced data subsets, each of the data subsets were oversampled by distance based data generation methods. The oversampled data sets were classified by using four different machine learning algorithms (Artificial Neural Networks, Support Vector Machines, Naive Bayes and Decision Tree) and the classification scores were compared. tr_TR
dc.language.iso eng tr_TR
dc.rights info:eu-repo/semantics/closedAccess tr_TR
dc.subject Machine Learning tr_TR
dc.subject Small Data Set tr_TR
dc.subject Imbalanced Data Set tr_TR
dc.subject Oversampling Methods tr_TR
dc.title Small and Unbalanced Data Set Problem in Classification tr_TR
dc.type conferenceObject tr_TR
dc.relation.journal 2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU) tr_TR
dc.contributor.authorID 11916 tr_TR
dc.contributor.department Çankaya Üniversitesi, Mühendislik Fakültesi, Yazılım Mühendisliği Bölümü tr_TR


Bu öğenin dosyaları:

Dosyalar Boyut Biçim Göster

Bu öğe ile ilişkili dosya yok.

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster