Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2010, IFIP Advances in Information and Communication Technology
Enterprise information integration (EII) requires an accurate, precise and complete understanding of the disparate data sources, the needs of the information consumers, and how these map to the business concepts of the enterprise. In practice, such integration takes place in context of any enterprise information system. In the paper we explain various approaches to EII, its architectures as well as its association to enterprise application integration. We justify why XML technology contributes to finding sufficiently powerful support for EII. We present some features of the XML technology, mainly its database part, and show how it is usable in EII.
2004
Abstract. The paper proposes a high-level approach for the e-enterprise modelling, able to respond to the new performance criteria, to the consideration of the whole life cycle of a product, process orientation, etc. The proposed model is based on GERAM methodology, UML modelling language and XML-based technologies. At the implementation level, we propose different solutions based on open standards and software applications. Keywords: e-enterprise, integration, XML, modelling.
Lecture Notes in Computer Science, 2002
This paper describes the e-XML component suite, a modular product for integrating heterogeneous data sources under an XML schema and querying in real-time the integrated information using XQuery, the emerging W3C standard for XML query. We describe the two main components of the suite, i.e., the repository for warehousing XML and the mediator for distributed query processing. We also discuss some typical applications.
Selected Readings on Database Technologies and Applications
This chapter introduces different alternatives to store and manage jointly relational and eXtensible Markup Language (XML) data sources. Nowadays, businesses are transformed in e-business and have to manage large data volumes and from heterogeneous sources. To manage large amounts of information, Database Management Systems (DBMS) continue to be one of the most used tools, and the most extended model is the relational one. On the other side, XML has reached the de facto standard to present and exchange information between businesses on the Web. Therefore, it could be necessary to use tools as mediators to integrate these two different data to a common format like XML, since it is the main data format on the Web. First, a classification of the main tools and systems where this problem is handled is made, with their advantages and disadvantages. The objective will be to propose a new system to solve the integration business information problem.
2003
The eXtensible Markup Language (XML) offers many important benefits and improvements over its predecessor, HTML. But, articles have appeared about XML with exaggerated claims of it being a "Rosetta Stone" with "miraculuous ways" to almost automatically provide information integration. These claims are actually being believed by some executives. It is almost surprising that no one has claimed that XML can cure cancer and provide world peace! In reality, XML must face many of the same challenges that plagued Electronic Data Interchange (EDI) and database integration efforts of the past. To a large extent, there are both managerial and technical challenges -much related to the difficulties of attaining universally accepted semantically-rich standards. In this paper, these challenges will be discussed with specific emphasis on the issue of dealing with a real-world with multiple "contexts." Some promising research directions, some overlapping with the "semantic web" effort, will be presented.
This paper describes what the implementation of XML technology in information systems. In the information system, its can work as a time format XML data exchange, temporal databases, integration of information resources or other internet. In the exchange of data needed to be formatted files between servers. XML also supports the temporal extension as a representation of the historical database. In addition to supporting temporal, XML can also be implemented based on the framework that can serve as Internet information resources integration. All of this implementation is applied to the efficiency in information systems.
Journal of Information Science, 2005
Data warehousing has been widely adopted by contemporary enterprises. For inter-organizational information sharing, the need cannot be over-emphasized to conduct researches on the integration of heterogeneous data warehouses to overcome the challenging situations today. That makes it urgent to establish a systematic integration methodology for integrating heterogeneous data warehouses via the Internet or proprietary extranets. Traditionally, researchers usually employed a canonical format as the integration medium for logical data integrations among heterogeneous systems. In this paper, to fully utilize the power of the Internet, we propose a framework and develop a prototype to integrate heterogeneous data warehouses by XML technologies. We first formally define the elements in data warehousing and discuss various semantic conflicts occurring among heterogeneous data cubes. Then, we propose the system architecture and related resolution procedures for all kinds of semantic conflict...
Database systems are well-known for consistent storage, retrieval, and manipulation of data. At the same time, the Extensible Markup Language (XML) is generally accepted as data description language for both web-based information systems and electronic data interchange between different organizations. Since database systems form the backbone of essentially any information system, the integration of XML and database systems is a must. Data model heterogeneity and schema heterogeneity, however, makes this a challenging task, for example when mapping XML documents to relational database systems (RDBS). This paper focuses on data model heterogeneity and provides an in-depth comparison of concepts available in RDBS and XML schema specification languages, comprising XML DTD and XML Schema. Such an analysis provides the basis for developing appropriate middleware bridging the gap between XML and RDBS.
Label Streams, Semantics Utilization and Data Query Technologies
In XML Data Integration, data/metadata merging and query processing are indispensable. Specifically, merging integrates multiple disparate (heterogeneous and autonomous) input data sources together for further usage, while query processing is one main reason why the data need to be integrated in the first place. Besides, when supported with appropriate user feedback techniques, queries can also provide contexts in which conflicts among the input sources can be interpreted and resolved. The flexibility of XML structure provides opportunities for alleviating some of the difficulties that other less flexible data types face in the presence of uncertainty; yet, this flexibility also introduces new challenges in merging multiple sources and query processing over integrated data. In this chapter, the authors discuss two alternative ways XML data/schema can be integrated: conflict-eliminating (where the result is cleaned from any conflicts that the different sources might have with each ot...
Proceedings of the Seventh International Conference on Enterprise Information Systems, 2005
Nowadays, special with the Internet explosion, enterprises have to work with data from heterogeneous sources, such as data from conventional databases, or from new sources of Internet world like XML or HTML documents. Organizations have to work with these different data sources at the same time, so, it's necessary to find some way to integrate this heterogeneous information.In this paper we are going to centre in two main types of data, conventional data from relational databases, and the new web data format XML. Traditional relational database continues being the main data store and XML has become the main format to exchange and representation data on the web. At the end our purpose would be that the necessary data in each moment were in the same and common format, in XML, because this is the most used format on the web.This paper proposes an efficient environment for accessing relational databases from a web perspective using XML. Our environment defines a query system based on XML for relational databases, called XBD. XBD has a full XML appearance, query language and query results are in XML format. For the end user it is similar to query a XML document. This system includes a model to adapt any relational database in order it could be queried in two new query languages, derived from XSL and XQuery languages, and a software tool to implement the functionality of the XBD environment.
in the past decade, research works in heterogeneous database integration have established a good and solid framework to alleviate this task. However, there are still works that need to be accomplished to make these achievements easily implementable. In our project, we shall develop a software tool using XML for integrating and querying disparate heterogeneous information as unified XML views.
2001
The paper describes a prototype tool, named DIXSE, which supports the integration of XML Document Type Definitions (DTDs) into a common conceptual schema. The mapping from each individual DTD into the common schema is used to automatically generate wrappers for XML documents, which conform to a given DTD. These wrappers are used to populate the common conceptual schema thereby achieving data integration for XML documents.
Malaysian Journal of Computer Science, 2005
Over the past few years, XML has become the undisputable lingua franca standard both for semi-structured data representation and exchange format over the Internet, and also content management in various e-business worlds, especially the B2B and B2C enterprise applications. However, most of these organisations still rely heavily on existing relational database management systems (RDBMS) to store and manage their structured data for daily critical business transactions. In fact, major database vendors, which also happen to be the giant software companies like Microsoft, IBM and Oracle, have ventured and taken great initiatives in researching and providing for a single solution to integrate these semi-structured XML data with structured data in relational databases. Most importantly, it is estimated that during the next few years to come, more than 75% of e-business applications will implement XML technologies in their applications. Consequently, as more software applications are rapidly beginning to implement XML, there should be a growing need for XML middleware to efficiently integrating XML data at the front-end with a RDBMS at the back-end. Hence, this research is aimed at providing a generic XML-based framework, which is known as JXDB, that allows a user to use XML for dealing with semi-structured data for creating, accessing or updating to existing heterogeneous relational databases that store structured data and vice versa. JXDB is designed to provide a generic and extensive XML middleware framework for integration between XML documents and heterogeneous relational databases.
INCOSE International Symposium, 2001
To interoperate data sources that differ structurally and semantically particular problems occur, for example, problems of changing schema in data sources will affect the integrated schema. In this paper, conflict between heterogeneous systems is investigated and existing approaches to integration are reviewed. We propose XMInt, an XML Mediated Integration model which employs mediation and wrapping techniques as the main components for the integration of databases and legacy systems. With XMInt, a mediator acts as an intermediate medium transforming queries to subqueries, integrating result data and resolving conflicts. Wrappers then transform sub-queries to specific local queries so that each local system is able to understand the queries. This framework is currently being developed to make the integration process more widely accessible by using standard tools. An initial prototype is included to demonstrate the model.
Journal of The Chinese Institute of Industrial Engineers, 2005
Data exchange between enterprise resource planning (ERP) systems in a supply chain system needs to fulfill requirements of both schema integration and message translation. Since ERP systems with relational database systems are developed independently, schema conflicts between databases is a common problem for schema integration. Thus, supply chain partners need to preserve the data integrity of transacting messages with respect to the ERP database systems. The core technique of this kind of data exchange is matching XML documents and relational database schemas. An interactive tool is proposed here to simplify the integration and implementation efforts of data exchange between heterogeneous database systems in the supply chain system. The interactive tool is Web-enabled and it applies XML and XSL to translate trading messages. In order to store XML documents into the relational database, the structure and content of the XML document should be matched with the database schemas and verified by the users. The data contents of the XML document need to be matched not only with data types of the database but also with relational constraints to preserve the data integrity. XML schema and XSL transformation are applied to translate data and solve schema conflicts between XML documents and the database system. Using this interactive tool, the matching methods provide mapping suggestions, and it relies on users with specific schema knowledge to provide final verification of the matching results. This tool provides an environment for administrators and designers of heterogeneous database systems to communicate data requirements, to resolve semantic conflicts, and to preserve data integrity.
GI Jahrestagung (2), 2004
This paper addresses some classical problems to integrate data sources that are heterogeneous in structure with possibly redundant data along a real world example integrating three bibliographic data sources. We show how XML-technology can be applied for the data integration process in a straightforward manner to populate a data warehouse, how an ontology can be used as common schema, and how a generic, declarative approach can increase flexibility and scalability. Our procedure enables more advanced query functionality for integrated data sources.
2009
XML is becoming one of the main technological integredients of the Internet. It is now accepted as the standard for information exchange. XML-based data integration system, which enables sharing and cooperation with legacy data sources, arises as a more and more important data service provider on the web. These services can provide the users with a uniform interface to a multitude of data sources such as relational databases, XML files, text files, delimited files, Excel files, etc. Users can thus focus on what they want, rather than think about how to obtain the answers. Therefore, users do not have to carry on the tedious tasks such as finding the relevant data sources, interacting with each data source in isolation using the local interface and combining data from multiple data sources. Users are always expecting better query performance and data consistency from the data integration systems. This work proposes an approach to support constraints and triggers in the XML-based data...
Proceedings of the 2005 ACM SIGMOD international conference on Management of data - SIGMOD '05, 2005
As XML has evolved from a document markup language to a widely-used format for exchange of structured and semistructured data, managing large amounts of XML data has become increasingly important. A number of companies, including both established database vendors and startups, have recently announced new XML database systems or new XML functionality integrated into existing database systems. This tutorial will provide an insight into how XML functionality fits into relational database management systems as seen by three major relational vendors: IBM, Microsoft and Oracle.
Lecture Notes in Computer Science, 2003
Various XML instances from different data sources can model the same object of the real world. Query processing or view definition over these sources demands instance integration. In this context, integration means to identify which data instances represent the same object of the real world, as well as to solve ambiguities of representation of this object. The entity identification problem in XML is more complex than in structured databases. XML data, as originally considered, necessarily do not have the identification notion of primary key or object identifier. Thus, it is necessary the adoption of a mechanism that identifies the instances at the moment of data integration. This paper presents a proposal for identifiers attribution to XML instances, based on the use of Skolem functions and XPath recommendation, as proposed by W3C. Integration of XML Data 75 2.1 State of the Art Universal Key This is the simplest method for data integration. It is based on the existence of a common key between the instances to be integrated [2, 14, 16, 17, 23]. However, this approach is restricted, since the sources not always have a common key, as XML data. Key Equivalence Specified by the User This approach requires that the user specifies equivalence between the instances, for example, using a mapping table of the local identifiers from each source to the global identifiers in the integrated system. This technique is used in [1, 18, 20]. The disadvantage of this technique is that the mapping table can be considerable and present difficult maintenance, handled by the database administrator, not in a automatic way.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.