Papers by Serhat Selcuk Bucak

IEEE transactions on pattern analysis and machine intelligence, 2014
Multiple kernel learning (MKL) is a principled approach for selecting and combining kernels for a... more Multiple kernel learning (MKL) is a principled approach for selecting and combining kernels for a given recognition task. A number of studies have shown that MKL is a useful tool for object recognition, where each image is represented by multiple sets of features and MKL is applied to combine different feature sets. We review the state-of-the-art for MKL, including different formulations and algorithms for solving the related optimization problems, with the focus on their applications to object recognition. One dilemma faced by practitioners interested in using MKL for object recognition is that different studies often provide conflicting results about the effectiveness and efficiency of MKL. To resolve this, we conduct extensive experiments on standard datasets to evaluate various approaches to MKL for object recognition. We argue that the seemingly contradictory conclusions offered by studies are due to different experimental setups. The conclusions of our study are: (i) given a s...
Online video scene clustering by competitive incremental NMF
Signal, Image and Video Processing, 2011
... ORIGINAL PAPER Online video scene clustering by competitive incremental NMF Serhat Selcuk Buc... more ... ORIGINAL PAPER Online video scene clustering by competitive incremental NMF Serhat Selcuk Bucak ?? Bilge Gunsel ... Let data matrix V = (v1, v2,..., vm) contain m data col-umn vectors, each of which corresponds to a video frame with n pixels. ...

2007 IEEE International Conference on Image Processing, 2007
Nonnegative Matrix Factorization (NMF) is a powerful decomposition tool which has been used in se... more Nonnegative Matrix Factorization (NMF) is a powerful decomposition tool which has been used in several content representation applications recently. However, there are some difficulties in implementing NMF in on-line video applications. This paper introduces an incremental NMF (INMF) without deviating from conventional NMF's main objective function, which is minimizing the reconstruction error. The proposed algorithm is capable of modeling dynamic content of the video; thus controls contribution of the subsequent observations to the NMF representation properly. It is shown that the INMF preserves additive, parts-based representation capability of the NMF with a low computational load while offering dimension reduction. Experimental results are given to compare the reconstruction performances of the conventional and incremental NMF. In addition, video scene change detection and dynamic video content representation by INMF are investigated. Test results demonstrate that the INMF can be used as a powerful on-line factorization tool.
Incremental clustering via nonnegative matrix factorization
2008 19th International Conference on Pattern Recognition, 2008
Nonnegative matrix factorization (NMF) has been shown to be an efficient clustering tool. However... more Nonnegative matrix factorization (NMF) has been shown to be an efficient clustering tool. However, NMF`s batch nature necessitates recomputation of whole basis set for new samples. Although NMF is a powerful content representation tool, this limits the use of ...

Mathematical Problems in Engineering, 2008
Nonnegative matrix factorization (NMF) is a promising approach for local feature extraction in fa... more Nonnegative matrix factorization (NMF) is a promising approach for local feature extraction in face recognition tasks. However, there are two major drawbacks in almost all existing NMF-based methods. One shortcoming is that the computational cost is expensive for large matrix decomposition. The other is that it must conduct repetitive learning, when the training samples or classes are updated. To overcome these two limitations, this paper proposes a novel incremental nonnegative matrix factorization (INMF) for face representation and recognition. The proposed INMF approach is based on a novel constraint criterion and our previous block strategy. It thus has some good properties, such as low computational complexity, sparse coefficient matrix. Also, the coefficient column vectors between different classes are orthogonal. In particular, it can be applied to incremental learning. Two face databases, namely FERET and CMU PIE face databases, are selected for evaluation. Compared with PCA...

2010 IEEE International Conference on Data Mining, 2010
Consider a typical recommendation problem. A company has historical records of products sold to a... more Consider a typical recommendation problem. A company has historical records of products sold to a large customer base. These records may be compactly represented as a sparse customer-times-product "who-bought-what" binary matrix. Given this matrix, the goal is to build a model that provides recommendations for which products should be sold next to the existing customer base. Such problems may naturally be formulated as collaborative filtering tasks. However, this is a one-class setting, that is, the only known entries in the matrix are one-valued. If a customer has not bought a product yet, it does not imply that the customer has a low propensity to potentially be interested in that product. In the absence of entries explicitly labeled as negative examples, one may resort to considering unobserved customer-product pairs as either missing data or as surrogate negative instances. In this paper, we propose an approach to explicitly deal with this kind of ambiguity by instead treating the unobserved entries as optimization variables. These variables are optimized in conjunction with learning a weighted, low-rank nonnegative matrix factorization (NMF) of the customer-product matrix, similar to how Transductive SVMs implement the low-density separation principle for semi-supervised learning. Experimental results show that our approach gives significantly better recommendations in comparison to various competing alternatives on one-class collaborative filtering tasks.

2009 IEEE 12th International Conference on Computer Vision, 2009
Multi-label learning is useful in visual object recognition when several objects are present in a... more Multi-label learning is useful in visual object recognition when several objects are present in an image. Conventional approaches implement multi-label learning as a set of binary classification problems, but they suffer from imbalanced data distributions when the number of classes is large. In this paper, we address multi-label learning with many classes via a ranking approach, termed multi-label ranking. Given a test image, the proposed scheme aims to order all the object classes such that the relevant classes are ranked higher than the irrelevant ones. We present an efficient algorithm for multi-label ranking based on the idea of block coordinate descent. The proposed algorithm is applied to visual object recognition. Empirical results on the PASCAL VOC 2006 and data sets show promising results in comparison to the state-of-the-art algorithms for multi-label learning.

This paper is motivated by the industrial research problem of designing a real-world recommender ... more This paper is motivated by the industrial research problem of designing a real-world recommender system for a large Information Technology (IT) company. Given historical records of client purchases, compactly represented as a sparse client-times-product “who-bought-what” binary matrix, the goal is to build a model that provides recommendations for what products should be sold next to the existing client base. Such a problem may naturally be formulated as a collaborative filtering task. However, this is a one-class setting, that is, if a client has not bought a product yet, it does not imply that the client has a low propensity to potentially buy that product later. In the absence of explicitly labeled negative examples, one may resort to considering zero-valued client-product pairs as either missing data or as surrogate negative instances. In this paper, we outline an approach to explicitly deal with this kind of ambiguity by instead treating zero-valued pairs as optimization variab...

Recent studies have shown that multiple kernel learning is very effective for object recognition,... more Recent studies have shown that multiple kernel learning is very effective for object recognition, leading to the popularity of kernel learning in computer vision problems. In this work, we develop an efficient algorithm for multi-label multiple kernel learning (ML-MKL). We assume that all the classes under consideration share the same combination of kernel functions, and the objective is to find the optimal kernel combination that benefits all the classes. Although several algorithms have been developed for ML-MKL, their computational cost is linear in the number of classes, making them unscalable when the number of classes is large, a challenge frequently encountered in visual object recognition. We address this computational challenge by developing a framework for ML-MKL that combines the worst-case analysis with stochastic approximation. Our analysis shows that the complexity of our algorithm is O(m1/3√lnm), where m is the number of classes. Empirical studies with object recognit...
Collaborative Filtering on Spare Datasets with Matrix Factorizations

Pattern Recognition, May 1, 2009
In this paper we introduce an incremental non-negative matrix factorization (INMF) scheme in orde... more In this paper we introduce an incremental non-negative matrix factorization (INMF) scheme in order to overcome the difficulties that conventional NMF has in online processing of large data sets. The proposed scheme enables incrementally updating its factors by reflecting the influence of each observation on the factorization appropriately. This is achieved via a weighted cost function which also allows controlling the memorylessness of the factorization. Unlike conventional NMF, with its incremental nature and weighted cost function the INMF scheme successfully utilizes adaptability to dynamic data content changes with a lower computational complexity. Test results reported for two video applications, namely background modeling in video surveillance and clustering, demonstrate that INMF is capable of online representing data content while reducing dimension significantly.
… Artımlı Negatif Olmayan Matris Ayrıştırma ile Arka Plan Modelleme Incremental Nonnegative Matrix Factorization for Background Modeling in Surveillance Video
ABSTRACT

Recent studies have shown that multiple kernel learning is very effective for object recognition,... more Recent studies have shown that multiple kernel learning is very effective for object recognition, leading to the popularity of kernel learning in computer vision problems. In this work, we develop an efficient algorithm for multi-label multiple kernel learning (ML-MKL). We assume that all the classes under consideration share the same combination of kernel functions, and the objective is to find the optimal kernel combination that benefits all the classes. Although several algorithms have been developed for ML-MKL, their computational cost is linear in the number of classes, making them unscalable when the number of classes is large, a challenge frequently encountered in visual object recognition. We address this computational challenge by developing a framework for ML-MKL that combines the worst-case analysis with stochastic approximation. Our analysis shows that the complexity of our algorithm is O(m 1/3 √ lnm), where m is the number of classes. Empirical studies with object recognition show that while achieving similar classification accuracy, the proposed method is significantly more efficient than the state-of-the-art algorithms for ML-MKL.
Multiple kernel and multi-label learning for image categorization
Apparatus and method for performing visual search

CVPR 2011, 2011
We consider a special type of multi-label learning where class assignments of training examples a... more We consider a special type of multi-label learning where class assignments of training examples are incomplete. As an example, an instance whose true class assignment is (c 1 , c 2 , c 3) is only assigned to class c 1 when it is used as a training sample. We refer to this problem as multi-label learning with incomplete class assignment. Incompletely labeled data is frequently encountered when the number of classes is very large (hundreds as in MIR Flickr dataset) or when there is a large ambiguity between classes (e.g., jet vs plane). In both cases, it is difficult for users to provide complete class assignments for objects. We propose a ranking based multi-label learning framework that explicitly addresses the challenge of learning from incompletely labeled data by exploiting the group lasso technique to combine the ranking errors. We present a learning algorithm that is empirically shown to be efficient for solving the related optimization problem. Our empirical study shows that the proposed framework is more effective than the state-ofthe-art algorithms for multi-label learning in dealing with incompletely labeled data.

Mid-level feature based local descriptor selection for image search
2013 Visual Communications and Image Processing (VCIP), 2013
ABSTRACT The objective in developing compact descriptors for visual image search is building an i... more ABSTRACT The objective in developing compact descriptors for visual image search is building an image retrieval system that works efficiently and effectively under bandwidth and memory constraints. Selecting local descriptors to be processed, and sending them to the server for matching is an integral part of such a system. One such image search and retrieval system is the Compact Descriptors for Visual Search (CDVS) standardization test model being developed by MPEG which has an efficient local descriptor selection criteria. However, all the existing selection parameters in CDVS are based on low-level features. In this paper, we propose two “mid-level” local descriptor selection criteria: Visual Meaning Score (VMS), and Visual Vocabulary Score (VVS) which can be seamlessly integrated into the existing CDVS framework. A mid-level criteria explicitly allows selection of local descriptors closer to a given set of images. Both VMS and VVS are based on visual words (patches) of images, and provide significant gains over the current CDVS standard in terms of matching accuracy, and have very low implementation cost.

Low complexity image matching using color based SIFT
2013 Visual Communications and Image Processing (VCIP), 2013
ABSTRACT Image matching and search is gaining significant commercial importance nowadays due to v... more ABSTRACT Image matching and search is gaining significant commercial importance nowadays due to various applications it enables such as augmented reality, image-queries for internet search, etc. Many researchers have effectively used color information in an image to improve its matching accuracy. These techniques, however, cannot be directly used for large scale mobile visual search applications that pose strict constraints on the size of the extracted features, computational resources and the system accuracy. To overcome this limitation, we propose a new and effective technique to incorporate color information that can use the SIFT extraction technique. We conduct our experiments on a large dataset containing around 33, 000 images that is currently being investigated in the MPEG-Compact Descriptors for Visual Search Standard and show substantial improvement compared to baseline.
Uploads
Papers by Serhat Selcuk Bucak