DSpace Repository

Reporting and analyzing alternative clustering solutions by employing multi-objective genetic algorithm and conducting experiments on cancer data

Show simple item record

dc.contributor.author Peng, Peter
dc.contributor.author Addam, Omer
dc.contributor.author Elzohbi, Mohamad
dc.contributor.author Özyer, Sibel T.
dc.contributor.author Elhajj, Ahmad
dc.contributor.author Gao, Shang
dc.contributor.author Liu, Yimin
dc.contributor.author Özyer, Tansel
dc.contributor.author Kaya, Mehmet
dc.contributor.author Ridley, Mick
dc.date.accessioned 2020-06-02T07:01:22Z
dc.date.available 2020-06-02T07:01:22Z
dc.date.issued 2014-01
dc.identifier.citation Peng, Peter...et.al., "Reporting and analyzing alternative clustering solutions by employing multi-objective genetic algorithm and conducting experiments on cancer data" Knowledge-Based Systems, Vol.56, pp.108-122, (2014). tr_TR
dc.identifier.issn 0950-7051
dc.identifier.uri http://hdl.handle.net/20.500.12416/3999
dc.description.abstract Clustering is an essential research problem which has received considerable attention in the research community for decades. It is a challenge because there is no unique solution that fits all problems and satisfies all applications. We target to get the most appropriate clustering solution for a given application domain. In other words, clustering algorithms in general need prior specification of the number of clusters, and this is hard even for domain experts to estimate especially in a dynamic environment where the data changes and/or become available incrementally. In this paper, we described and analyze the effectiveness of a robust clustering algorithm which integrates multi-objective genetic algorithm into a framework capable of producing alternative clustering solutions; it is called Multi-objective K-Means Genetic Algorithm (MOKGA). We investigate its application for clustering a variety of datasets, including microarray gene expression data. The reported results are promising. Though we concentrate on gene expression and mostly cancer data, the proposed approach is general enough and works equally to cluster other datasets as demonstrated by the two datasets Iris and Ruspini. After running MOKGA, a pareto-optimal front is obtained, and gives the optimal number of clusters as a solution set. The achieved clustering results are then analyzed and validated under several cluster validity techniques proposed in the literature. As a result, the optimal clusters are ranked for each validity index. We apply majority voting to decide on the most appropriate set of validity indexes applicable to every tested dataset. The proposed clustering approach is tested by conducting experiments using seven well cited benchmark data sets. The obtained results are compared with those reported in the literature to demonstrate the applicability and effectiveness of the proposed approach. tr_TR
dc.language.iso eng tr_TR
dc.publisher Elsevıer tr_TR
dc.relation.isversionof 10.1016/j.knosys.2013.11.003 tr_TR
dc.rights info:eu-repo/semantics/closedAccess tr_TR
dc.subject Clustering tr_TR
dc.subject Genetic Algorithm tr_TR
dc.subject Gene Expression Data tr_TR
dc.subject Multi-Objective Optimization tr_TR
dc.subject Cluster Validity Analysis tr_TR
dc.title Reporting and analyzing alternative clustering solutions by employing multi-objective genetic algorithm and conducting experiments on cancer data tr_TR
dc.type article tr_TR
dc.relation.journal Knowledge-Based Systems tr_TR
dc.identifier.volume 56 tr_TR
dc.identifier.startpage 108 tr_TR
dc.identifier.endpage 122 tr_TR
dc.contributor.department Çankaya Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü tr_TR


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record