Skip to main content

Georgia Koutrika

Followers

31

Following

16

Co-authors

16

Public Views

Guillermo R Simari

Universidad Nacional del Sur

UFCG - Universidade Federal de Campina Grande

Cesar Galindolegaria

University of South Carolina

Grigoris Karvounarakis

University of Waterloo

Ernesto Compatangelo

University of Aberdeen

Madalina Croitoru

Université Montpellier 2 - Sciences et Techniques du Languedoc

Politecnico di Milano

Interests

Uploads

Papers by Georgia Koutrika

Generalized Précis Queries for Logical Database Subset Creation

As a large fraction of available information resides in databases, the need for facilitating acce... more As a large fraction of available information resides in databases, the need for facilitating access for the large majority of users becomes increasingly more important. Précis queries are free-form queries that generate entire multirelation databases, which are logical subsets of existing ones. A logical subset contains not only items directly related to the given query selections but also items implicitly related to them in various ways with the purpose of providing to the user much greater insight into the original data. This paper is concerned with the definition and generation of logical database subsets based on précis queries under a generalized perspective that removes several restrictions of previous work and handles queries containing multiple terms combined using the operators AN D, OR, and NOT .

Setting the Foundations of Digital Libraries: The DELOS Manifesto

by Georgia Koutrika and Seamus Ross

D-Lib Magazine

The term "Digital Libraries" corresponds to a very complex notion with several diverse aspects an... more The term "Digital Libraries" corresponds to a very complex notion with several diverse aspects and cannot be captured by a simple definition. A robust model of Digital Libraries encapsulating the richness of these perspectives is required. This need has led to the drafting of The Digital Library Manifesto, the aim of which is to set the foundations and identify the cornerstone concepts within the universe of Digital Libraries, facilitating the integration of research results and proposing better ways of developing appropriate systems. The Manifesto is a result of the collaborative work of members of the European Union co-funded DELOS Network of Excellence on Digital Libraries. 1 It exploits the collective understanding that has been acquired, over more than a decade, on Digital Libraries by European research groups active in the Digital Library field, both within DELOS and outside, as well as by other groups around the world. This article presents the core parts of the Manifesto that introduce the entities of discourse of the Digital Library universe.

PrefDB: Supporting Preferences as First-Class Citizens in Relational Databases

IEEE Transactions on Knowledge and Data Engineering, 2014

Abstract In this paper, we argue that preference-aware query processing needs to be pushed closer... more Abstract In this paper, we argue that preference-aware query processing needs to be pushed closer to the DBMS. We introduce a preference-aware relational data model that extends database tuples with preferences and an extended algebra that captures the essence of processing queries with preferences. Based on a set of algebraic properties and a cost model that we propose, we provide several query optimization strategies for extended query plans. Further, we describe a query execution algorithm that blends preference ...

Pr&#233;cis: The Essence of a Query Answer

22nd International Conference on Data Engineering (ICDE'06), 2006

Wide spread use of database systems in modern society has brought the need to provide inexperienc... more Wide spread use of database systems in modern society has brought the need to provide inexperienced users with the ability to easily search a database with no specific knowledge of a query language. Several recent research efforts have focused on supporting keyword-based searches over relational databases. This paper presents an alternative proposal and introduces the idea of précis queries. These are free-form queries whose answer (a précis) is a synthesis of results, containing not only information directly related to the query selections but also information implicitly related to them in various ways. Our approach to précis queries includes two additional novelties: (a) queries do not generate individual relations but entire multi-relation databases; and (b) query results are personalized to user-specific and/or domain requirements. We develop a framework and system architecture for supporting such queries in the context of a relational database system and describe algorithms that implement the required functionality. Finally, we present a set of experimental results that evaluate the proposed algorithms and show the potential of this work.

Constrained optimalities in query personalization

Proceedings of the 2005 ACM SIGMOD international conference on Management of data - SIGMOD '05, 2005

... algorithms are potentially applicable: genetic algorithms [5], simulated annealing [10], tabu... more ... algorithms are potentially applicable: genetic algorithms [5], simulated annealing [10], tabu search [4], etc. ... Personalized Query Construction: After &amp;amp;amp;#x27;CQP State Space Search&amp;amp;amp;#x27; has selected the optimal ... can be effectively combined into one sub-query, but investigating the details of ...

Personalization of queries in database systems

Proceedings. 20th International Conference on Data Engineering, 2004

As information becomes available in increasing amounts to a wide spectrum of users, the need for ... more

Personalized Queries under a Generalized Preference Model

21st International Conference on Data Engineering (ICDE'05), 2005

Query Personalization is the process of dynamically enhancing a query with related user preferenc... more Query Personalization is the process of dynamically enhancing a query with related user preferences stored in a user profile with the aim of providing personalized answers. The underlying idea is that different users may find different things relevant to a search due to different preferences. Essential ingredients of query personalization are: (a) a model for representing and storing preferences in user profiles, and (b) algorithms for the generation of personalized answers using stored preferences. Modeling the plethora of preference types is a challenge. In this paper, we present a preference model that combines expressivity and concision. In addition, we provide efficient algorithms for the selection of preferences related to a query, and an algorithm for the progressive generation of personalized results, which are ranked based on user interest. Several classes of ranking functions are provided for this purpose. We present results of experiments both synthetic and with real users (a) demonstrating the efficiency of our algorithms, (b) showing the benefits of query personalization, and (c) providing insight as to the appropriateness of the proposed ranking functions.

An Enhanced Search Interface for Information Discovery from Digital Libraries

Lecture Notes in Computer Science, 2006

Libraries, museums, and other organizations make their electronic contents available to a growing... more Libraries, museums, and other organizations make their electronic contents available to a growing number of users on the Web. A large fraction of the information published is stored in structured or semi-structured form. However, most users have no specific knowledge of schemas or structured query languages for accessing information stored in (relational or XML) databases. Under these circumstances, the need

Synthesizing structured text from logical database subsets

Proceedings of the 11th international conference on Extending database technology Advances in database technology - EDBT '08, 2008

In the classical database world, information access has been based on a paradigm that involves st... more In the classical database world, information access has been based on a paradigm that involves structured, schema-aware, queries and tabular answers. In the current environment, however, where information prevails in most activities of society, serving people, applications, and devices in dramatically increasing numbers, this paradigm has proved to be very limited. On the query side, much work has been done on moving towards keyword queries over structured data. In our previous work, we have touched the other side as well, and have proposed a paradigm that generates entire databases in response to keyword queries. In this paper, we continue in the same direction and propose synthesizing textual answers in response to queries of any kind over structured data. In particular, we study the transformation of a dynamically-generated logical database subset into a narrative through a customizable, extensible, and templatebased process. In doing so, we exploit the structured nature of database schemas and describe three generic translation modules for different formations in the schema, called unary, split, and join modules. We have implemented the proposed translation procedure into our own database front end and have performed several experiments evaluating the textual answers generated as several features and parameters of the system are varied. We have also conducted a set of experiments measuring the effectiveness of such answers on users. The overall results are very encouraging and indicate the promise that our approach has for several applications.

Generalized Pr�cis Queries for Logical Database Subset Creation

2007 IEEE 23rd International Conference on Data Engineering, 2007

As a large fraction of available information resides in databases, the need for facilitating acce... more As a large fraction of available information resides in databases, the need for facilitating access for the large majority of users becomes increasingly more important. Précis queries are free-form queries that generate entire multirelation databases, which are logical subsets of existing ones. A logical subset contains not only items directly related to the given query selections but also items implicitly related to them in various ways with the purpose of providing to the user much greater insight into the original data. This paper is concerned with the definition and generation of logical database subsets based on précis queries under a generalized perspective that removes several restrictions of previous work and handles queries containing multiple terms combined using the operators AN D, OR, and NOT .

Report on the First International Workshop on Exploratory Search in Databases and the Web (ExploreDB 2014)

by Kostas Stefanidis, Georgia Koutrika, and Laks Lakshmanan

ACM SIGMOD Record, 2014

Third int'l workshop on "personalized access, profile management, and context awareness in databases" (PersDB 2009)

ACM SIGMOD Record, 2010

Proceedings of the 2010 international conference on Management of data - SIGMOD '10, 2010

We study recommendations in applications where there are temporal patterns in the way items are c... more We study recommendations in applications where there are temporal patterns in the way items are consumed or watched. For example, a student who has taken the Advanced Algorithms course is more likely to be interested in Convex Optimization, but a student who has taken Convex Optimization need not be interested in Advanced Algorithms in the future. Similarly, a person who

On the selection of tags for tag clouds

Proceedings of the fourth ACM international conference on Web search and data mining - WSDM '11, 2011

Abstract We examine the creation of a tag cloud for exploring and understanding a set of objects ... more Abstract We examine the creation of a tag cloud for exploring and understanding a set of objects (eg, web pages, documents). In the first part of our work, we present a formal system model for reasoning about tag clouds. We then present metrics that capture the structural properties of a tag cloud, and we briefly present a set of tag selection algorithms that are used in current sites (eg, del. icio. us, Flickr, Technorati) or that have been described in recent work. In order to evaluate the results of these algorithms, we devise a novel ...

Proceedings of the 2012 international conference on Management of Data - SIGMOD '12, 2012

In this demonstration we present a preference-aware relational query answering system, termed Pre... more In this demonstration we present a preference-aware relational query answering system, termed PrefDB. The key novelty of PrefDB is the use of an extended relational data model and algebra that allow expressing different flavors of preferential queries. Furthermore, unlike existing approaches that either treat the DBMS as a black box or require modifications of the database core, PrefDB's hybrid implementation enables operator-level query optimizations without being obtrusive to the database engine. We showcase the flexibility and efficiency of PrefDB using PrefDBAdmin, a graphical tool that we have built aiming at assisting application designers in the task of building, testing and tuning queries with preferences.

MANIFEST DIGITÁLNYCH KNIŽNÍC

by Georgia Koutrika and Seamus Ross

Can social bookmarking improve web search?

Proceedings of the international conference on Web search and web data mining - WSDM '08, 2008

Page 1. Can Social Bookmarking Improve Web Search? ... Our dataset represents about forty million... more

Proceedings of the 35th SIGMOD international conference on Management of data - SIGMOD '09, 2009

Abstract Special-purpose social sites can offer valuable services to well-defined, closed, commun... more Abstract Special-purpose social sites can offer valuable services to well-defined, closed, communities, eg, in a university or in a corporation. The purpose of this demo is to show the challenges, special features and potential of a focused social system in action through CourseRank, a course evaluation and planning social system.

Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology - EDBT '09, 2009

Abstract In this demo, we show data clouds that summarize the results of keyword searches over st... more

Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology - EDBT '09, 2009

Abstract Keyword searches are attractive because they facilitate users searching structured datab... more Abstract Keyword searches are attractive because they facilitate users searching structured databases. On the other hand, tag clouds are popular for navigation and visualization purposes over unstructured data because they can highlight the most significant concepts and hidden relationships in the underlying content dynamically. In this paper, we propose coupling the flexibility of keyword searches over structured data with the summarization and navigation capabilities of tag clouds to help users access a database. We propose using ...

Generalized Précis Queries for Logical Database Subset Creation

As a large fraction of available information resides in databases, the need for facilitating acce... more As a large fraction of available information resides in databases, the need for facilitating access for the large majority of users becomes increasingly more important. Précis queries are free-form queries that generate entire multirelation databases, which are logical subsets of existing ones. A logical subset contains not only items directly related to the given query selections but also items implicitly related to them in various ways with the purpose of providing to the user much greater insight into the original data. This paper is concerned with the definition and generation of logical database subsets based on précis queries under a generalized perspective that removes several restrictions of previous work and handles queries containing multiple terms combined using the operators AN D, OR, and NOT .

Setting the Foundations of Digital Libraries: The DELOS Manifesto

by Georgia Koutrika and Seamus Ross

D-Lib Magazine

The term "Digital Libraries" corresponds to a very complex notion with several diverse aspects an... more The term "Digital Libraries" corresponds to a very complex notion with several diverse aspects and cannot be captured by a simple definition. A robust model of Digital Libraries encapsulating the richness of these perspectives is required. This need has led to the drafting of The Digital Library Manifesto, the aim of which is to set the foundations and identify the cornerstone concepts within the universe of Digital Libraries, facilitating the integration of research results and proposing better ways of developing appropriate systems. The Manifesto is a result of the collaborative work of members of the European Union co-funded DELOS Network of Excellence on Digital Libraries. 1 It exploits the collective understanding that has been acquired, over more than a decade, on Digital Libraries by European research groups active in the Digital Library field, both within DELOS and outside, as well as by other groups around the world. This article presents the core parts of the Manifesto that introduce the entities of discourse of the Digital Library universe.

PrefDB: Supporting Preferences as First-Class Citizens in Relational Databases

IEEE Transactions on Knowledge and Data Engineering, 2014

Abstract In this paper, we argue that preference-aware query processing needs to be pushed closer... more Abstract In this paper, we argue that preference-aware query processing needs to be pushed closer to the DBMS. We introduce a preference-aware relational data model that extends database tuples with preferences and an extended algebra that captures the essence of processing queries with preferences. Based on a set of algebraic properties and a cost model that we propose, we provide several query optimization strategies for extended query plans. Further, we describe a query execution algorithm that blends preference ...

Pr&#233;cis: The Essence of a Query Answer

22nd International Conference on Data Engineering (ICDE'06), 2006

Wide spread use of database systems in modern society has brought the need to provide inexperienc... more Wide spread use of database systems in modern society has brought the need to provide inexperienced users with the ability to easily search a database with no specific knowledge of a query language. Several recent research efforts have focused on supporting keyword-based searches over relational databases. This paper presents an alternative proposal and introduces the idea of précis queries. These are free-form queries whose answer (a précis) is a synthesis of results, containing not only information directly related to the query selections but also information implicitly related to them in various ways. Our approach to précis queries includes two additional novelties: (a) queries do not generate individual relations but entire multi-relation databases; and (b) query results are personalized to user-specific and/or domain requirements. We develop a framework and system architecture for supporting such queries in the context of a relational database system and describe algorithms that implement the required functionality. Finally, we present a set of experimental results that evaluate the proposed algorithms and show the potential of this work.

Constrained optimalities in query personalization

Proceedings of the 2005 ACM SIGMOD international conference on Management of data - SIGMOD '05, 2005

... algorithms are potentially applicable: genetic algorithms [5], simulated annealing [10], tabu... more ... algorithms are potentially applicable: genetic algorithms [5], simulated annealing [10], tabu search [4], etc. ... Personalized Query Construction: After &amp;amp;amp;#x27;CQP State Space Search&amp;amp;amp;#x27; has selected the optimal ... can be effectively combined into one sub-query, but investigating the details of ...

Personalization of queries in database systems

Proceedings. 20th International Conference on Data Engineering, 2004

As information becomes available in increasing amounts to a wide spectrum of users, the need for ... more

Personalized Queries under a Generalized Preference Model

21st International Conference on Data Engineering (ICDE'05), 2005

Query Personalization is the process of dynamically enhancing a query with related user preferenc... more Query Personalization is the process of dynamically enhancing a query with related user preferences stored in a user profile with the aim of providing personalized answers. The underlying idea is that different users may find different things relevant to a search due to different preferences. Essential ingredients of query personalization are: (a) a model for representing and storing preferences in user profiles, and (b) algorithms for the generation of personalized answers using stored preferences. Modeling the plethora of preference types is a challenge. In this paper, we present a preference model that combines expressivity and concision. In addition, we provide efficient algorithms for the selection of preferences related to a query, and an algorithm for the progressive generation of personalized results, which are ranked based on user interest. Several classes of ranking functions are provided for this purpose. We present results of experiments both synthetic and with real users (a) demonstrating the efficiency of our algorithms, (b) showing the benefits of query personalization, and (c) providing insight as to the appropriateness of the proposed ranking functions.

An Enhanced Search Interface for Information Discovery from Digital Libraries

Lecture Notes in Computer Science, 2006

Libraries, museums, and other organizations make their electronic contents available to a growing... more Libraries, museums, and other organizations make their electronic contents available to a growing number of users on the Web. A large fraction of the information published is stored in structured or semi-structured form. However, most users have no specific knowledge of schemas or structured query languages for accessing information stored in (relational or XML) databases. Under these circumstances, the need

Synthesizing structured text from logical database subsets

Proceedings of the 11th international conference on Extending database technology Advances in database technology - EDBT '08, 2008

In the classical database world, information access has been based on a paradigm that involves st... more In the classical database world, information access has been based on a paradigm that involves structured, schema-aware, queries and tabular answers. In the current environment, however, where information prevails in most activities of society, serving people, applications, and devices in dramatically increasing numbers, this paradigm has proved to be very limited. On the query side, much work has been done on moving towards keyword queries over structured data. In our previous work, we have touched the other side as well, and have proposed a paradigm that generates entire databases in response to keyword queries. In this paper, we continue in the same direction and propose synthesizing textual answers in response to queries of any kind over structured data. In particular, we study the transformation of a dynamically-generated logical database subset into a narrative through a customizable, extensible, and templatebased process. In doing so, we exploit the structured nature of database schemas and describe three generic translation modules for different formations in the schema, called unary, split, and join modules. We have implemented the proposed translation procedure into our own database front end and have performed several experiments evaluating the textual answers generated as several features and parameters of the system are varied. We have also conducted a set of experiments measuring the effectiveness of such answers on users. The overall results are very encouraging and indicate the promise that our approach has for several applications.

Generalized Pr�cis Queries for Logical Database Subset Creation

2007 IEEE 23rd International Conference on Data Engineering, 2007

As a large fraction of available information resides in databases, the need for facilitating acce... more As a large fraction of available information resides in databases, the need for facilitating access for the large majority of users becomes increasingly more important. Précis queries are free-form queries that generate entire multirelation databases, which are logical subsets of existing ones. A logical subset contains not only items directly related to the given query selections but also items implicitly related to them in various ways with the purpose of providing to the user much greater insight into the original data. This paper is concerned with the definition and generation of logical database subsets based on précis queries under a generalized perspective that removes several restrictions of previous work and handles queries containing multiple terms combined using the operators AN D, OR, and NOT .

Report on the First International Workshop on Exploratory Search in Databases and the Web (ExploreDB 2014)

by Kostas Stefanidis, Georgia Koutrika, and Laks Lakshmanan

ACM SIGMOD Record, 2014

Third int'l workshop on "personalized access, profile management, and context awareness in databases" (PersDB 2009)

ACM SIGMOD Record, 2010

Proceedings of the 2010 international conference on Management of data - SIGMOD '10, 2010

We study recommendations in applications where there are temporal patterns in the way items are c... more We study recommendations in applications where there are temporal patterns in the way items are consumed or watched. For example, a student who has taken the Advanced Algorithms course is more likely to be interested in Convex Optimization, but a student who has taken Convex Optimization need not be interested in Advanced Algorithms in the future. Similarly, a person who

On the selection of tags for tag clouds

Proceedings of the fourth ACM international conference on Web search and data mining - WSDM '11, 2011

Abstract We examine the creation of a tag cloud for exploring and understanding a set of objects ... more Abstract We examine the creation of a tag cloud for exploring and understanding a set of objects (eg, web pages, documents). In the first part of our work, we present a formal system model for reasoning about tag clouds. We then present metrics that capture the structural properties of a tag cloud, and we briefly present a set of tag selection algorithms that are used in current sites (eg, del. icio. us, Flickr, Technorati) or that have been described in recent work. In order to evaluate the results of these algorithms, we devise a novel ...

Proceedings of the 2012 international conference on Management of Data - SIGMOD '12, 2012

In this demonstration we present a preference-aware relational query answering system, termed Pre... more In this demonstration we present a preference-aware relational query answering system, termed PrefDB. The key novelty of PrefDB is the use of an extended relational data model and algebra that allow expressing different flavors of preferential queries. Furthermore, unlike existing approaches that either treat the DBMS as a black box or require modifications of the database core, PrefDB's hybrid implementation enables operator-level query optimizations without being obtrusive to the database engine. We showcase the flexibility and efficiency of PrefDB using PrefDBAdmin, a graphical tool that we have built aiming at assisting application designers in the task of building, testing and tuning queries with preferences.

MANIFEST DIGITÁLNYCH KNIŽNÍC

by Georgia Koutrika and Seamus Ross

Can social bookmarking improve web search?

Proceedings of the international conference on Web search and web data mining - WSDM '08, 2008

Page 1. Can Social Bookmarking Improve Web Search? ... Our dataset represents about forty million... more

Proceedings of the 35th SIGMOD international conference on Management of data - SIGMOD '09, 2009

Abstract Special-purpose social sites can offer valuable services to well-defined, closed, commun... more Abstract Special-purpose social sites can offer valuable services to well-defined, closed, communities, eg, in a university or in a corporation. The purpose of this demo is to show the challenges, special features and potential of a focused social system in action through CourseRank, a course evaluation and planning social system.

Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology - EDBT '09, 2009

Abstract In this demo, we show data clouds that summarize the results of keyword searches over st... more

Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology - EDBT '09, 2009

Abstract Keyword searches are attractive because they facilitate users searching structured datab... more Abstract Keyword searches are attractive because they facilitate users searching structured databases. On the other hand, tag clouds are popular for navigation and visualization purposes over unstructured data because they can highlight the most significant concepts and hidden relationships in the underlying content dynamically. In this paper, we propose coupling the flexibility of keyword searches over structured data with the summarization and navigation capabilities of tag clouds to help users access a database. We propose using ...