New Incremental Privacy-Preserving Clustering Protocols

SAEED SAMET

New Incremental Privacy-Preserving Clustering Protocols

SAEED SAMET

2013, Lecture notes on software engineering

visibility

…

description

5 pages

link

1 file

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

We consider the problem of data clustering on streamed data, when the number of transactions is growing very quickly, or when data is distributed among several parties and their privacy is a concern. In this paper we present two new protocols for incremental privacy-preserving k-means clustering, which is a very popular data mining method, when data is distributed, horizontally or vertically, among multiple parties. At the end of each protocol, each party, without revealing its own private data, receives the final result of the clustering algorithm. Also, to improve efficiency, previous knowledge is used to incrementally update the centers and membership of each cluster.

Saeed Samet

Proceedings of the International Conference on …, 2007

Extracting meaningful and valuable knowledge from databases is often done by various data mining algorithms. Nowadays, databases are distributed among two or more parties because of different reasons such as physical and geographical restrictions and the most important issue is privacy. Related data is normally maintained by more than one organization, each of which wants to keep its individual information private. Thus, privacy-preserving techniques and protocols are designed to perform data mining on distributed environments when privacy is highly concerned. Cluster analysis is a technique in data mining, by which data can be divided into some meaningful clusters, and it has an important role in different fields such as bio-informatics, marketing, machine learning, climate and medicine. k-means Clustering is a prominent algorithm in this category which creates a one-level clustering of data. In this paper we introduce privacy-preserving protocols for this algorithm, along with a protocol for Secure comparison, known as the Millionaires' Problem, as a sub-protocol, to handle the clustering of horizontally or vertically partitioned data among two or more parties.

Log In

New Incremental Privacy-Preserving Clustering Protocols

Sign up for access to the world's latest research

Abstract

Related papers

Related papers

Related topics