Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2009, … and Informatics, 2009. …
…
1 page
1 file
XML has become the standard for data representation on the web. This expansion in reputation has prompted the need for a technique to access XML documents. Many techniques have been proposed to tackle the problem of mining XML data. We study the ...
Lecture Notes in Computer Science, 2002
The eXtensible Markup Language (XML) rapidly emerged as a standard for representing and exchanging information. The fastgrowing amount of available XML data sets a pressing need for languages and tools to manage collections of XML documents, as well as to mine interesting information out of them. Although the data mining community has not yet rushed into the use of XML, there have been some proposals to exploit XML. However, in practice these proposals mainly rely on more or less traditional relational databases with an XML interface. In this paper, we introduce association rules from native XML documents and discuss the new challenges and opportunities that this topic sets to the data mining community. More specifically, we introduce an extension of XQuery for mining association rules. This extension is used throughout the paper to better define association rule mining within XML and to emphasize its implications in the XML context.
2012
XML has become the standard for data representation on the internet. This expansion in reputation has prompt the need for a technique to access XML documents for particular information and to manipulate repositories of documents represented in XML to find specific documents. Having the ability to extract information from XML data would answer the problem of mining the web contents which is a very useful and required power nowadays. Efforts are made to develop a new tool or method for extracting information from XML data directly without any preprocessing or post processing of the XML documents. Association rules express the probability of the existing of a set of items when another set of items exists. It searches for similarities among large database. “Web mining” refer to how we can apply the traditional mining techniques that works on relational data and bind it to new data input represented in XML data which might be semi structure or unstructured. There are several techniques t...
Microcomputer Applications, 2004
In recent years XML has became very popular for representing semistructured data and a standard for data exchange over the web. Mining XML data from the web is becoming increasingly important. Several encouraging attempts at developing methods for mining XML data have been proposed. However, efficiency and simplicity are still a barrier for further development. Normally, pre-processing or post-processing are required for mining XML data, such as transforming the data from XML format to relational format. In this paper, we show that extracting association rules from XML documents without any preprocessing or post-processing using XQuery is possible and analyze the XQuery implementation of the well-known Apriori algorithm. In addition, we suggest features that need to be added into XQuery in order to make the implementation of the Apriori algorithm more efficient.
2009
Abstract The increasing amount of very large XML datasets available to casual users is a most challenging problem for our community, and calls for an appropriate support to efficiently gather knowledge from these data. Data mining, already widely applied to extract frequent correlations of values from both structured and semi-structured datasets, is the appropriate tool for knowledge elicitation. In this work we describe an approach to extract Tree-based association rules from XML documents.
2008
The inherent flexibilities of XML in both structure and semantics makes mining from XML data a complex task with more challenges compared to traditional association rule mining in relational databases. In this paper, we propose a new model for the effective extraction of generalized association rules form a XML document collection. We directly use frequent subtree mining techniques in the discovery process and do not ignore the tree structure of data in the final rules. The frequent subtrees based on the user provided support are split to complement subtrees to form the rules. We explain our model within multi-steps from data preparation to rule generation.
2009
Abstract The role of the eXtensible Markup Language (XML) is becoming very important in the research fields focusing on the representation, the exchange, and the integration of information coming from different data sources and containing information related to various contexts such as, for example, medical and biological data.
Models, Methods, and Applications
In this work we describe the TreeRuler tool, which makes it possible for inexperienced users to access huge XML (or relational) datasets. TreeRuler encompasses two main features: (1) it mines all the frequent association rules from input documents without any a-priori specification of the desired results, and (2) it provides quick, summarized, thus often approximate answers to user’s queries, by using the previously mined knowledge. TreeRuler has been developed in the scenario of the Odyssey EU project dealing with information about crimes, both for the relational and XML data model. In this chapter we mainly focus on the objectives, strategies, and difficulties encountered in the XML context.
International Journal of Computer Engineering in Research Trends, 2018
XML is globally accepted format for sending the data on internet and between different applications which are running on different platforms and architectures. Due to this, the huge amount of data on the internet is in XML. Thus researchers are attracted toward XML to identify interesting findings and patterns from these documents. Many data mining algorithms have been applied to XML including clustering, classification and association rules. In this paper association, rule mining on XML document is studied. This can be used to identify what work is done in the stated field and how we can extend it further in future.
14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings., 2002
The recent success of XML as a standard to represent semi-structured data, and the increasing amount of available XML data, pose new challenges to the data mining community. In this paper we present the XMINE operator a tool we developed to extract XML association rules for XML documents. The operator, that is based on XPath and inspired by the syntax of XQuery, allows us to express complex mining tasks, compactly and intuitively. XMINE can be used to specify indifferently (and simultaneously) mining tasks both on the content and on the structure of the data, since the distinction in XML is slight.
Revealing issues with current framework is itself a critical assignment. A review taken out for revealing issues related with Association standard mining on XML data. Preparatory essential ideas of Association rule mining is given in this work. Mining enormous amount of data, association rule mining have been demonstrated a powerful idea. Amid late years, the vast majority of the overall information exchanges are finished with XML (eXtensible Markup Language). Numerous empowering techniques have been distinguished and produced for mining XML data. In this paper, the idea of XML data examination is compressed and its importance towards association rule extraction has been represented. We have cantered a variety of strategies and methodologies of the examination, which are useful and set apart as the imperative field of XML data investigation. This work gives a study of different association rule strategies connected effectively on XML information since last one decade.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.
Database and Expert Systems …, 2003
International Conference on Tools with Artificial Intelligence, 2000
… in Knowledge Discovery and Data Mining, 2004
Proceedings of the Third International Conference on Web Information Systems and Technologies, 2007
Expert Systems with Applications, 2012
2014 Iranian Conference on Intelligent Systems (ICIS), 2014
Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007), 2007
International Journal of Engineering Research and Technology (IJERT), 2012