REVIEW_OF_DIFFERENT_ALGORITHMS_FOR_OUTLIER_DETECTION_v22.pdf

World Academy of Informatics and Management Sciences

REVIEW_OF_DIFFERENT_ALGORITHMS_FOR_OUTLIER_DETECTION_v22.pdf

World Academy of Informatics and Management Sciences

visibility

…

description

4 pages

link

1 file

Clustering plays an important role in data mining. Its main job is division of data into groups. The similar type data is grouped one cluster and dissimilar data is grouped another cluster. But major problem with in clustering is to handle outliers. Outliers occur because of mechanical faults, system behaviour, human fault or mistake of natural deviations. Outlier detection refers to the problem of finding pattern in data that do not conform to expected normal behaviour. A variety of algorithms used to solve the problem of outliers. They are subject of this paper. This paper explores the behaviour of some clustering algorithms that performs on different type's dataset and methods to solve the problem of outliers.

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

IJIRST - International Journal for Innovative Research in Science and Technology

Data mining, in general, deals with the discovery of non-trivial, hidden and interesting knowledge from different types of data. With the development of information technologies, the number of databases, as well as their dimension and complexity, grow rapidly. It is necessary what we need automated analysis of great amount of information. The analysis results are then used for making a decision by a human or program. One of the basic problems of data mining is the outlier detection. The outlier detection problem in some cases is similar to the classification problem. For example, the main concern of clustering-based outlier detection algorithms is to find clusters and outliers, which are often regarded as noise that should be removed in order to make more reliable clustering. In this thesis, the ability to detect outliers can be improved using a combined perspective from outlier detection and cluster identification. In proposed work comparison of four methods will be done like K-Mean, k-Mediods, Iterative k-Mean and density based method. Unlike the traditional clustering-based methods, the proposed algorithm provides much efficient outlier detection and data clustering capabilities in the presence of outliers, so comparison has been made. The purpose of our method is not only to produce data clustering but at the same time to find outliers from the resulting clusters. The goal is to model an unknown nonlinear function based on observed input-output pairs. The whole simulation of this proposed work has been taken in MATLAB environment.

Log In

REVIEW_OF_DIFFERENT_ALGORITHMS_FOR_OUTLIER_DETECTION_v22.pdf

Sign up for access to the world's latest research

Related papers

Related papers