Clustering is the unsupervised classification of patterns (observations, data items, or feature vectors) into groups (clusters). The clustering problem has been addressed in many contexts and by researchers in many disciplines; this... more
Finding useful patterns in large datasets has attracted considerable interest recently, and one of the most widely st,udied problems in this area is the identification of clusters, or deusel y populated regions, in a multi-dir nensi onal... more
Clustering, in data mining, is useful to discover distribution patterns in the underlying data. Clustering algorithms usually employ a distance metric based (e.g., euclidean) similarity measure in order to partition the database such that... more
Hierarchical clustering is a widely used method for detecting clusters in genomic data. Clusters are defined by cutting branches off the dendrogram. A common but inflexible method uses a constant height cutoff value; this method exhibits... more
| Discovering association rules is one of the most important task in data mining. Many e cient algorithms have been proposed in the literature. The most noticeable are Apriori, Mannila's algorithm, Partition, Sampling and DIC, that are... more
Traditional problem determination techniques rely on static dependency models that are difficult to generate accurately in today's large, distributed, and dynamic application environments such as e-commerce systems. In this paper, we... more
Data clustering is an important technique for exploratory data analysis, and has been studied for several years. It has been shown to be useful in many practical domains such as data classification and image processing. Recently, there... more
The graph construction procedure essentially determines the potentials of those graph-oriented learning algorithms for image analysis. In this paper, we propose a process to build the so-called directed 1 -graph, in which the vertices... more
Partitioning a data set and extracting hidden structure from the data arises in different application areas of pattern recognition, speech and image processing. Pairwise data clustering is a combinatorial optimization method for data... more
This paper describes the application of adaptive neuro-fuzzy inference system (ANFIS) model for classification of electroencephalogram (EEG) signals. Decision making was performed in two stages: feature extraction using the wavelet... more
We explore the idea of evidence accumulation for combining the results of multiple clusterings. Initially, n d−dimensional data is decomposed into a large number of compact clusters; the K-means algorithm performs this decomposition, with... more
This paper presents a computational paradigm called Data Driven Markov Chain Monte Carlo (DDMCMC) for image segmentation in the Bayesian statistical framework. The paper contributes to image seymentation in three aspects. Firstly, it... more
Data clustering describes a set of frequently employed techniques in exploratory data analysis to extract "natural" group structure in data. Such groupings need to be validated to separate the signal in the data from spurious structure.... more
This paper tries to reconcile two sets of apparently contradictory results. One is the positive link, postulated in literature, between place attachment and civic activity, the other is the sociological claim that there is a negative... more
Background: Data clustering analysis has been extensively applied to extract information from gene expression profiles obtained with DNA microarrays. To this aim, existing clustering approaches, mainly developed in computer science, have... more
The user has requested enhancement of the downloaded file.
This paper explores basic aspects of the immune system and proposes a novel immune network model with the main goals of clustering and filtering unlabeled numerical data sets. It is not our concern to reproduce with confidence any immune... more
Despite its benefit in a wide range of applications, data mining techniques also have raised a number of ethical issues. Some such issues include those of privacy, data security, intellectual property rights, and many others. In this... more
The Fuping Complex, located within the central zone of the North China Craton, is composed of amphibolite to granulite facies orthogneisses, interleaved with minor supracrustal rocks at similar metamorphic grade. The oldest components... more
We have developed a simple and expandable procedure for classification and validation of extracellular data based on a probabilistic model of data generation. This approach relies on an empirical characterization of the recording noise.... more
Until now, neighbor-embedding-based (NE) algorithms for super-resolution (SR) have carried out two independent processes to synthesize high-resolution (HR) image patches. In the first process, neighbor search is performed using the... more
Veins formed in a variety of rock types and tectonic environments were measured at seven field locations to determine a general scaling relationship between the length and opening displacement (aperture). For these naturally formed... more
Group work is widespread in education. The growing use of online tools supporting group work generates huge amounts of data. We aim to exploit this data to support mirroring: presenting useful high-level views of information about the... more
. A density-based unsupervised clustering approach for detecting natural patterns in data further denoted as NP is presented, and its performance is illustrated for data sets with different types of clusters. NP works for arbitrary... more
We present a novel optimization framework for unsupervised texture segmentation that relies on statistical tests as a measure of homogeneity. Texture segmentation is formulated as a data clustering problem based on sparse proximity data.... more
Automatic human motion tracking in video sequences is one of the most frequently tackled tasks in computer vision community. The goal of human motion capture is to estimate the joints angles of human body at any time. However, this is one... more
We study the spatial structure and sub-structure of regions rich in Hipparcos stars with blue BT − VT colours. These regions, which comprise large stellar complexes, OB associations, and young open clusters, are tracers of on-going star... more
In this paper, we propose a new approach of fault detection and diagnosis combining a Neural Nonlinear Principal Component Analysis (NNLPCA) and Partial Least Square (PLS). We have made a comparative study between the Linear Principal... more
A resampling scheme for clustering with similarity to bootstrap aggregation (bagging) is presented. Bagging is used to improve the quality of pathbased clustering, a data clustering method that can extract elongated structures from data... more
Clustering techniques have received attention in many fields of study such as engineering, medicine, biology and data mining. The aim of clustering is to collect data points. The K-means algorithm is one of the most common techniques used... more
A modified cluster analysis method has been developed to identify spatial patterns of planetary flow regimes and to study transitions between them. This method has been applied first to a simple deterministic model and second to northern... more
Background-Advances in multi-parameter flow cytometry (FCM) now allow for the independent detection of larger numbers of fluorochromes on individual cells, generating data with increasingly higher dimensionality. The increased complexity... more
A fuzzy rule can have the shape of an ellipsoid in the input-output state space of a system. Then an additive fuzzy system approximates a function by covering its graph with ellipsoidal rule patches. It averages rule patches that overlap.... more
Optimization based pattern discovery has emerged as an important field in knowledge discovery and data mining (KDD), and has been used to enhance the efficiency and accuracy of clustering, classification, association rules and outlier... more
We describe the use of principle component analysis (PCA) to serve as a prefilter for cluster analysis or pattern récognition analysis of soft x-ray spectromicroscopy data. Cluster analysis provides a method to group régions with common... more
Conventional clustering algorithms utilize a single criterion that may not conform to the diverse shapes of the underlying clusters. We offer a new clustering approach that uses multiple clustering objective functions simultaneously. The... more
We present efficient fixed-parameter algorithms for the NP-complete edge modification problems Cluster Editing and Cluster Deletion. Here, the goal is to make the fewest changes to the edge set of an input graph such that the new graph is... more
We consider the problem of determining the structure of high-dimensional data without prior knowledge of the number of clusters. Data are represented by a finite mixture model based on the generalized Dirichlet distribution. The... more
Home literacy surveys were collected from the primary caregiver of 1,044 2-to 5-year-old children (M = 49.32 months, SD = 9.36) representing a wide range of socioeconomic backgrounds and types of early educational programs or child care.... more
Microarray analysis using clustering algorithms can suffer from lack of inter-method consistency in assigning related gene-expression profiles to clusters. Obtaining a consensus set of clusters from a number of clustering methods should... more
An electronic nose (e-nose), the Cyrano Sciences' Cyranose 320, comprising an array of thirty-two polymer carbon black composite sensors has been used to identify six species of bacteria responsible for eye infections when present at... more
Dimension reduction in today's vector space based information retrieval system is essential for improving computational efficiency in handling massive amounts of data. A mathematical framework for lower dimensional representation of text... more
Proterozoic subduction metasomatism subcontinental lithospheric mantle post-collisional ore deposit lead isotopes laser-ablation ICP-MS The global supply of Mo and much of Cu and Au comes from porphyry-type ore deposits associated with... more