Fuzzy Cluster Quality Index using Decision Theory

Revathy Ramesh

Fuzzy Cluster Quality Index using Decision Theory

Revathy Ramesh

2013

visibility

…

description

6 pages

link

1 file

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

Clustering can be defined as the process of grouping physical or abstract objects into classes of similar objects. It’s an unsupervised learning problem of organizing unlabeled objects into natural groups in such a way objects in the same group is more similar than objects in the different groups. Conventional clustering algorithms cannot handle uncertainty that exists in the real life experience. Fuzzy clustering handles incompleteness, vagueness in the data set efficiently. The goodness of clustering is measured in terms of cluster validity indices where the results of clustering are validated repeatedly for different cluster partitions to give the maximum efficiency i.e. to determine the optimal number of clusters. Especially, fuzzy clustering has been widely applied in a variety of areas and fuzzy cluster validation plays a very important role in fuzzy clustering. Since then Fuzzy clustering has been evaluated using various cluster validity indices. But primary indices have used...

Tanju Sirmen

Computers, Materials & Continua, 2022

Unsupervised clustering and clustering validity are used as essential instruments of data analytics. Despite clustering being realized under uncertainty, validity indices do not deliver any quantitative evaluation of the uncertainties in the suggested partitionings. Also, validity measures may be biased towards the underlying clustering method. Moreover, neglecting a confidence requirement may result in over-partitioning. In the absence of an error estimate or a confidence parameter, probable clustering errors are forwarded to the later stages of the system. Whereas, having an uncertainty margin of the projected labeling can be very fruitful for many applications such as machine learning. Herein, the validity issue was approached through estimation of the uncertainty and a novel low complexity index proposed for fuzzy clustering. It involves only uni-dimensional membership weights, regardless of the data dimension, stipulates no specific distribution, and is independent of the underlying similarity measure. Inclusive tests and comparisons returned that it can reliably estimate the optimum number of partitions under different data distributions, besides behaving more robust to over partitioning. Also, in the comparative correlation analysis between true clustering error rates and some known internal validity indices, the suggested index exhibited the highest strong correlations. This relationship has been also proven stable through additional statistical acceptance tests. Thus the provided relative uncertainty measure can be used as a probable error estimate in the clustering as well. Besides, it is the only method known that can exclusively identify data points in dubiety and is adjustable according to the required confidence level.

Log In

Fuzzy Cluster Quality Index using Decision Theory

Sign up for access to the world's latest research

Abstract

Related papers

Related papers

Related topics