Academia.eduAcademia.edu

XML Similarity Detection and Measures

Models, Methods, and Applications

Abstract

XML similarity detection plays an important role in facilitating many applications such as data integration, document classification/clustering, querying, and change management. In this chapter, we present an overview on XML document syntactic and semantic similarity/distance measures along with existing research related to XML similarity detection. The measures are classified into two main categories: structural similarity, and structural and content similarity. We review similarity detection approaches proposed in the literature and discuss some of the challenges and future directions for research on XML similarity detection and related fields.