Skip to main content

Roberto Brunelli

Followers

8

Following

3

Co-authors

2

Public Views

Joaquim Llisterri

Universitat Autònoma de Barcelona

Wake Forest University

Luis Garcia Fanlo

Universidad de Buenos Aires

Andrea R Olinger

University of Louisville

University of Gothenburg

Waldemar Ferreira Netto

Universidade de São Paulo

JNTU,Anantapur

Sarun Nakthanom

Sukhothai Thammathirat Open University - Thailand

Richard A. Wright

University of Washington

Cristian Arriagada Garcia

Universidad de Tarapacá de Arica (UTA)

Interests

Uploads

Papers by Roberto Brunelli

Efficient Image Retrieval by Examples

Advances in Visual Information Management, 2000

Google, Inc. (search). ...

Estimation of Head Pose

Computers in the Human Interaction Loop, 2009

In building proactive systems for interacting with users by analyzing and recognizing scenes and ... more In building proactive systems for interacting with users by analyzing and recognizing scenes and settings, an important task is to deal with people’s occupations: Not only do their locations or identities become important, but their looking direction and orientation are crucial cues to determine everybody’s intentions and actions. The understanding of interaction partners or targeted objects is relevant in deciding

Automatic person recognition by acoustic and geometric features

Machine Vision and Applications, 1995

Abstract|The paper describes a multisensorial person identi cation system: visual and acoustic cu... more Abstract|The paper describes a multisensorial person identi cation system: visual and acoustic cues are used jointly for person identi cation. A simple approach, based on the fusion of the lists of scores produced independently by a speaker recognition system and a face recognition system, is presented. Experiments are reported which s h o w that integration of visual and acoustic information enhances both performance and reliability of the separate systems. Finally two n e t work architectures, based on radial basis function theory, a r e proposed to describe integration at di erent levels of abstraction.

A Survey on the Automatic Indexing of Video Data

Journal of Visual Communication and Image Representation, 1999

Today a considerable amount of video data in multimedia databases requires sophisticated indices ... more Today a considerable amount of video data in multimedia databases requires sophisticated indices for its effective use. Manual indexing is the most effective method to do this, but it is also the slowest and the most expensive. Automated methods have then to be developed. This paper surveys several approaches and algorithms that have been recently proposed to automatically structure audio-visual data, both for annotation and access. C 1999 Academic Press

Stochastic minimization with adaptive memory

Journal of Computational and Applied Mathematics, 1995

In this paper a nondeterministic minimization algorithm is presented. A common feature of random ... more In this paper a nondeterministic minimization algorithm is presented. A common feature of random search algorithms is that little or no use is made of information on the local structure of the function to be minimized. While this can be justified when the function has a very complicated microstructure, it results in an unnecessary loss of efficiency when the landscape is smooth but anisotropic. To overcome this deficiency, we propose a random minimization algorithm with adaptive memory: the algorithm decides by itself how much of the information gathered through the process of minimizing the function can be successfully used to guide the search. Extensive experiments (minimization of quadratic forms, computation of the minimum eigenvalue of positive definite quadratic forms of high dimensionality, eigenvalue computation in Hilbert spaces and fitting of data by superposition of Gaussians) show that efficiency is increased and that the algorithm is able to adapt quickly to the current landscape.

Person identification using multiple cues

IEEE Transactions on Pattern Analysis and Machine Intelligence, 1995

Image retrieval by examples

IEEE Transactions on Multimedia, 2000

A currently relevant research field in information sciences is the management of non-traditional ... more A currently relevant research field in information sciences is the management of non-traditional distributed multimedia databases. Two related key issues are achieving an efficient content-based query by example retrieval and a fast response time. This paper presents the architecture of a distributed image retrieval system which provides novel solutions to these key issues. In particular, a way to quantify the effectiveness of low level visual descriptors in database query tasks is presented. The results are also used to improve the system response time, an important issue when querying very large databases. A new mechanism to adapt system query strategies to user behavior is also introduced to improve the effectiveness of relevance feedback and overall system response time. Finally, the issue of browsing multiple distributed databases is considered and a solution proposed using multidimensional scaling techniques.

SpotIt!An Interactive Identikit System

Graphical Models and Image Processing, 1996

Abstract| In this paper we present a system for browsing large mug-shot databases and the creatio... more Abstract| In this paper we present a system for browsing large mug-shot databases and the creation identikits of photographic quality. The two functions are interrelated: the available database provides direct feedback to the user building the identikit and the identikit itself can be used as an access key to the image database. SpotIt! provides a virtually unlimited set of alternative features that can be browsed e ciently in the appropriate context, interactive holistic feature modi cation coupled to syntactic access to a feature database, and quantitative, automatic computation of face similarities, providing real-time feedback of the system which constantly shows the most promising matches to the identikit being built.

On random minimization of functions

Biological Cybernetics, 1991

Query simplification and strategy selection for image retrieval

Abstract While many systems are currently available supporting the query-by-example paradigm for ... more Abstract While many systems are currently available supporting the query-by-example paradigm for image retrieval, some key issues, such as the effective introduction of relevance feedback and the automatic selection of an optimal search strategy, need further investigation. This paper discusses query splitting as a way to improve the effectiveness of relevance feedback techniques based on feature weighting. The possibility of selecting among a set of image retrieval strategies the one which optimizes speed and quality on ...

Ontology learning in multimedia information extraction from product catalogues

We propose a methodology for extracting multimedia information from product catalogues empowered ... more We propose a methodology for extracting multimedia information from product catalogues empowered by the synergetic use and extension of a domain ontology. The use of domain ontologies in this context additionally opens up innovative ways of catalogue use. The method is characterized by incrementally feeding and exploiting the ontology during an information extraction process, implemented by the semantic annotation of the analysed document, and by providing support for detecting existing similar ontologies to enable reuse of (parts of) them.

COMPASS: An image retrieval system for distributed databases

Abstract A currently relevant research field in information sciences is the management of non-tra... more Abstract A currently relevant research field in information sciences is the management of non-traditional distributed multimedia databases. Three related key issues are achieving an effective content-based query-by-example retrieval, a fast response time and adapting image comparison strategies to user needs. This paper presents COMPASS, a distributed image retrieval system which provides novel solutions to these key issues. Statistical methods are used to quantify image descriptor effectiveness, to simplify user queries and ...

On the use of histograms for image retrieval

Abstract This paper analyzes the use of histograms of low-level image features, such as color and... more

Multimedia Information Extraction in Ontology-based Semantic Annotation of Product Catalogues

Proc. of the 3rd Italian Semantic Web Workshop-SWAP 2006, 2006

Abstract—The demand for efficient methods for extracting knowledge from multimedia content has le... more Abstract—The demand for efficient methods for extracting knowledge from multimedia content has led to a growing research community investigating the convergence of multimedia and knowledge technologies. In this paper we describe a methodology for extracting multimedia information from product catalogues empowered by the synergetic use and extension of a domain ontology. The methodology was implemented in the Trade Fair Advanced Semantic Annotation Pipeline of the VIKE-framework. Index Terms—Semantic ...

Enabling a knowledge supply chain: From content resources to ontologies

In Proceedings of the ESWC 2006 Workshop on Mastering the Gap: From Information Extraction to Semantic Representation, Budva, Montenegro, Jun 12, 2006

Abstract. Semantic annotation of content is a crucial building block of making the Semantic Web f... more Abstract. Semantic annotation of content is a crucial building block of making the Semantic Web fly. The (semi-) automatic support of the underlying semantic knowledge supply chain requires contributions from different research disciplines and well-defined pipelines, which step-by-step create such annotations from raw content objects. This paper presents an annotation pipeline that has been designed and implemented as part of the VIKEF project. A clear structuring of the pipeline, the selection of adequate representation formats for the ...

Person recognition using acoustic and visual cues

Abstract| A person recognition system that makes use of acoustic and visual features is described... more Abstract| A person recognition system that makes use of acoustic and visual features is described. The system combines features at a score level and is capable of performing either an identi cation or a rejection. The improved performance of the integrated system with respect to the separate subsystems (acoustic and visual) is quanti ed.

Recognition system, particularly for recognising people

Tracking Visitors in a Museum

by Francesco Tobia and Roberto Brunelli

Cognitive Technologies, 2007

The most basic, low-level information about people is their location in the environment. This inf... more

Joint Bayesian tracking of head location and pose from low-resolution video

This paper presents a visual particle lter for jointly track- ing the position of a person and he... more This paper presents a visual particle lter for jointly track- ing the position of a person and her head pose. The resulting informa- tion may be used to support automatic analysis of interactive people behaviour, by supporting proxemics analysis and providing dynamic in- formation on focus of attention. A pose-sensitive visual likelihood is pro- posed which models the appearance of

Synthetic movies for computer vision applications

This paper presents a real time graphical simulator based on a client-server architecture. The re... more This paper presents a real time graphical simulator based on a client-server architecture. The rendering engine, supported by a specialized client application for the automatic generation of goal oriented motion of synthetic characters, is used to produce realistic image sequences for extensive performance assessment of computer vision algorithms for people tracking.

Efficient Image Retrieval by Examples

Advances in Visual Information Management, 2000

Google, Inc. (search). ...

Estimation of Head Pose

Computers in the Human Interaction Loop, 2009

In building proactive systems for interacting with users by analyzing and recognizing scenes and ... more In building proactive systems for interacting with users by analyzing and recognizing scenes and settings, an important task is to deal with people’s occupations: Not only do their locations or identities become important, but their looking direction and orientation are crucial cues to determine everybody’s intentions and actions. The understanding of interaction partners or targeted objects is relevant in deciding

Automatic person recognition by acoustic and geometric features

Machine Vision and Applications, 1995

Abstract|The paper describes a multisensorial person identi cation system: visual and acoustic cu... more Abstract|The paper describes a multisensorial person identi cation system: visual and acoustic cues are used jointly for person identi cation. A simple approach, based on the fusion of the lists of scores produced independently by a speaker recognition system and a face recognition system, is presented. Experiments are reported which s h o w that integration of visual and acoustic information enhances both performance and reliability of the separate systems. Finally two n e t work architectures, based on radial basis function theory, a r e proposed to describe integration at di erent levels of abstraction.

A Survey on the Automatic Indexing of Video Data

Journal of Visual Communication and Image Representation, 1999

Today a considerable amount of video data in multimedia databases requires sophisticated indices ... more Today a considerable amount of video data in multimedia databases requires sophisticated indices for its effective use. Manual indexing is the most effective method to do this, but it is also the slowest and the most expensive. Automated methods have then to be developed. This paper surveys several approaches and algorithms that have been recently proposed to automatically structure audio-visual data, both for annotation and access. C 1999 Academic Press

Stochastic minimization with adaptive memory

Journal of Computational and Applied Mathematics, 1995

In this paper a nondeterministic minimization algorithm is presented. A common feature of random ... more In this paper a nondeterministic minimization algorithm is presented. A common feature of random search algorithms is that little or no use is made of information on the local structure of the function to be minimized. While this can be justified when the function has a very complicated microstructure, it results in an unnecessary loss of efficiency when the landscape is smooth but anisotropic. To overcome this deficiency, we propose a random minimization algorithm with adaptive memory: the algorithm decides by itself how much of the information gathered through the process of minimizing the function can be successfully used to guide the search. Extensive experiments (minimization of quadratic forms, computation of the minimum eigenvalue of positive definite quadratic forms of high dimensionality, eigenvalue computation in Hilbert spaces and fitting of data by superposition of Gaussians) show that efficiency is increased and that the algorithm is able to adapt quickly to the current landscape.

Person identification using multiple cues

IEEE Transactions on Pattern Analysis and Machine Intelligence, 1995

Image retrieval by examples

IEEE Transactions on Multimedia, 2000

A currently relevant research field in information sciences is the management of non-traditional ... more A currently relevant research field in information sciences is the management of non-traditional distributed multimedia databases. Two related key issues are achieving an efficient content-based query by example retrieval and a fast response time. This paper presents the architecture of a distributed image retrieval system which provides novel solutions to these key issues. In particular, a way to quantify the effectiveness of low level visual descriptors in database query tasks is presented. The results are also used to improve the system response time, an important issue when querying very large databases. A new mechanism to adapt system query strategies to user behavior is also introduced to improve the effectiveness of relevance feedback and overall system response time. Finally, the issue of browsing multiple distributed databases is considered and a solution proposed using multidimensional scaling techniques.

SpotIt!An Interactive Identikit System

Graphical Models and Image Processing, 1996

Abstract| In this paper we present a system for browsing large mug-shot databases and the creatio... more Abstract| In this paper we present a system for browsing large mug-shot databases and the creation identikits of photographic quality. The two functions are interrelated: the available database provides direct feedback to the user building the identikit and the identikit itself can be used as an access key to the image database. SpotIt! provides a virtually unlimited set of alternative features that can be browsed e ciently in the appropriate context, interactive holistic feature modi cation coupled to syntactic access to a feature database, and quantitative, automatic computation of face similarities, providing real-time feedback of the system which constantly shows the most promising matches to the identikit being built.

On random minimization of functions

Biological Cybernetics, 1991

Query simplification and strategy selection for image retrieval

Abstract While many systems are currently available supporting the query-by-example paradigm for ... more Abstract While many systems are currently available supporting the query-by-example paradigm for image retrieval, some key issues, such as the effective introduction of relevance feedback and the automatic selection of an optimal search strategy, need further investigation. This paper discusses query splitting as a way to improve the effectiveness of relevance feedback techniques based on feature weighting. The possibility of selecting among a set of image retrieval strategies the one which optimizes speed and quality on ...

Ontology learning in multimedia information extraction from product catalogues

We propose a methodology for extracting multimedia information from product catalogues empowered ... more We propose a methodology for extracting multimedia information from product catalogues empowered by the synergetic use and extension of a domain ontology. The use of domain ontologies in this context additionally opens up innovative ways of catalogue use. The method is characterized by incrementally feeding and exploiting the ontology during an information extraction process, implemented by the semantic annotation of the analysed document, and by providing support for detecting existing similar ontologies to enable reuse of (parts of) them.

COMPASS: An image retrieval system for distributed databases

Abstract A currently relevant research field in information sciences is the management of non-tra... more Abstract A currently relevant research field in information sciences is the management of non-traditional distributed multimedia databases. Three related key issues are achieving an effective content-based query-by-example retrieval, a fast response time and adapting image comparison strategies to user needs. This paper presents COMPASS, a distributed image retrieval system which provides novel solutions to these key issues. Statistical methods are used to quantify image descriptor effectiveness, to simplify user queries and ...

On the use of histograms for image retrieval

Abstract This paper analyzes the use of histograms of low-level image features, such as color and... more

Multimedia Information Extraction in Ontology-based Semantic Annotation of Product Catalogues

Proc. of the 3rd Italian Semantic Web Workshop-SWAP 2006, 2006

Abstract—The demand for efficient methods for extracting knowledge from multimedia content has le... more Abstract—The demand for efficient methods for extracting knowledge from multimedia content has led to a growing research community investigating the convergence of multimedia and knowledge technologies. In this paper we describe a methodology for extracting multimedia information from product catalogues empowered by the synergetic use and extension of a domain ontology. The methodology was implemented in the Trade Fair Advanced Semantic Annotation Pipeline of the VIKE-framework. Index Terms—Semantic ...

Enabling a knowledge supply chain: From content resources to ontologies

In Proceedings of the ESWC 2006 Workshop on Mastering the Gap: From Information Extraction to Semantic Representation, Budva, Montenegro, Jun 12, 2006

Abstract. Semantic annotation of content is a crucial building block of making the Semantic Web f... more Abstract. Semantic annotation of content is a crucial building block of making the Semantic Web fly. The (semi-) automatic support of the underlying semantic knowledge supply chain requires contributions from different research disciplines and well-defined pipelines, which step-by-step create such annotations from raw content objects. This paper presents an annotation pipeline that has been designed and implemented as part of the VIKEF project. A clear structuring of the pipeline, the selection of adequate representation formats for the ...

Person recognition using acoustic and visual cues

Abstract| A person recognition system that makes use of acoustic and visual features is described... more Abstract| A person recognition system that makes use of acoustic and visual features is described. The system combines features at a score level and is capable of performing either an identi cation or a rejection. The improved performance of the integrated system with respect to the separate subsystems (acoustic and visual) is quanti ed.

Recognition system, particularly for recognising people

Tracking Visitors in a Museum

by Francesco Tobia and Roberto Brunelli

Cognitive Technologies, 2007

The most basic, low-level information about people is their location in the environment. This inf... more

Joint Bayesian tracking of head location and pose from low-resolution video

This paper presents a visual particle lter for jointly track- ing the position of a person and he... more This paper presents a visual particle lter for jointly track- ing the position of a person and her head pose. The resulting informa- tion may be used to support automatic analysis of interactive people behaviour, by supporting proxemics analysis and providing dynamic in- formation on focus of attention. A pose-sensitive visual likelihood is pro- posed which models the appearance of

Synthetic movies for computer vision applications

This paper presents a real time graphical simulator based on a client-server architecture. The re... more This paper presents a real time graphical simulator based on a client-server architecture. The rendering engine, supported by a specialized client application for the automatic generation of goal oriented motion of synthetic characters, is used to produce realistic image sequences for extensive performance assessment of computer vision algorithms for people tracking.