Clinical Decision Support Systems: From the Perspective of Small and Imbalanced Data Set

Par, Öznur Esra; Akçapınar Sezer, Ebru; Sever, Hayri

DSpace Home
→
Mühendislik Fakültesi
→
Bilgisayar Mühendisliği Bölümü
→
Bilgisayar Mühendisliği Bölümü Yayın Koleksiyonu
→
View Item

Clinical Decision Support Systems: From the Perspective of Small and Imbalanced Data Set

Par, Öznur Esra; Akçapınar Sezer, Ebru; Sever, Hayri

URI: http://hdl.handle.net/20.500.12416/3895

Date: 2019

Abstract:

Clinical decision support systems are data analysis software that supports health professionals' decision-making the process to reach their ultimate outcome, taking into account patient information. However, the need for decision support systems cannot be denied because of most activities in the field of health care within the decision-making process. Decision support systems used for diagnosis are designed based on disease due to the complexity of diseases, symptoms, and disease-symptoms relationships. In the design and implementation of clinical decision support systems, mathematical modeling, pattern recognition and statistical analysis techniques of large databases and data mining techniques such as classification are also widely used. Classification of data is difficult in case of the small and/or imbalanced data set and this problem directly affects the classification performance. Small and/or imbalance dataset has become a major problem in data mining because classification algorithms are developed based on the assumption that the data sets are balanced and large enough. Most of the algorithms ignore or misclassify examples of the minority class, focus on the majority class. Most health data are small and imbalanced by nature. Learning from imbalanced and small data sets is an important and unsettled problem. Within the scope of the study, the publicly accessible data set, hepatitis was oversampled by distance-based data generation methods. The oversampled data sets were classified by using four different machine learning algorithms. Considering the classification scores of four different machine learning algorithms (Artificial Neural Networks, Support Vector Machines, Naive Bayes and Decision Tree), optimal synthetic data generation rate is recommended.

Show full item record

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Bilgisayar Mühendisliği Bölümü Yayın Koleksiyonu [332]
Bilgisayar Mühendisliği Bölümü yayınlarını içerir.

Search DSpace

Advanced Search

Browse

All of DSpace
- Communities & Collections
- By Issue Date
- Authors
- Titles
- Subjects
- Type
- Language
- Department
- Publisher
- Citation
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Type
- Language
- Department
- Publisher
- Citation

Clinical Decision Support Systems: From the Perspective of Small and Imbalanced Data Set

Clinical Decision Support Systems: From the Perspective of Small and Imbalanced Data Set

Abstract:

Files in this item

This item appears in the following Collection(s)

Search DSpace

Browse

All of DSpace

This Collection

My Account