DATA STREAM CLUSTERING ISSUES AND CHALLENGES-A SURVEY.

Rupa, B.; Professor, Assistant; of CSE, Department; GRIET; Hyderabad.; Soujanya., R.; Professor, Assistant; of CSE, Department; GRIET; Hyderabad.

DATA STREAM CLUSTERING ISSUES AND CHALLENGES-A SURVEY

IJAR Indexing

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

In recent years, advances in both hardware and software technology has allowed us to automatically record transactions and other information everyday at a rapid rate. Huge volumes of web, sensory and transactional data are continuously generated everyday as data streams, which need to be analyzed online as they arrive. Analysis of data streams have been researched extensively because of its emerging, imminent, and broad applications. One of the important method is clustering have been widely studied in the data mining community. Many existing data mining methods cannot be applied directly on streaming data because of the fact that the data needs to be mined in single pass. Furthermore, in data stream processing temporal locality is also quite important, because the essential patterns in the data may change and therefore, the clusters in the past history may no longer remain relevant to the future. In this paper we explore various issues and challenges on clustering data streams.

Related papers

A Study on Clustering in Data Stream

zaranaben Gajjar

In recent years, advances in hardware technology have facilitated new ways of collecting data continuously. Tremendous and potentially infinite volumes of data streams are often generated by real time systems, internet traffic, financial market, communication network, remote sensors, and other environments. Analyzing huge data sets and extracting valuable pattern in many applications are interesting for researchers. We identify two techniques for huge data bases mining. One is to streaming data and apply mining techniques whereas second is to solve this problem directly with competent algorithms. The main problem in data stream mining means growing data is more difficult to detect in this techniques therefore unsupervised methods should be applied. However, clustering techniques can indication us to determine hidden information.

Log In

DATA STREAM CLUSTERING ISSUES AND CHALLENGES-A SURVEY

Sign up for access to the world's latest research

Abstract

Related papers

Related topics

Related papers