QUT ePrints

A Study of XML models for data mining : representations, methods, and issues

Kutty, Sangeetha, Nayak, Richi, & Tran, Tien (2011) A Study of XML models for data mining : representations, methods, and issues. In Tagarelli, Andrea (Ed.) XML Data Mining : Models, Methods, and Applications. IGI Global Publishing, Hershey, PA, pp. 1-28.

    Abstract

    With the increasing number of XML documents in varied domains, it has become essential to identify ways of finding interesting information in these documents. Data mining techniques have been used to derive this information. Because XML documents are semi-structured, the model chosen to represent them affects how they can be mined. Hence, in this chapter we present an overview of the various models of XML documents, how these models have been used for mining, and some of the issues and challenges associated with them. In addition, the chapter provides some insights into future models of XML documents that effectively capture their two important features, namely structure and content, for mining.

    ID Code: 47870
    Item Type: Book Chapter
    Keywords: XML, STRUCTURE, CONTENT, DATA MINING
    DOI: 10.4018/978-1-61350-356-0
    ISBN: 9781613503560
    Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > DATA FORMAT (080400)
    Divisions: Past > QUT Faculties & Divisions > Faculty of Science and Technology
    Past > Schools > Mathematical Sciences
    Copyright Owner: Copyright 2012 IGI Global Publishing
    Deposited On: 21 Dec 2011 08:20
    Last Modified: 29 Dec 2011 15:54