XCLS: A Fast and Effective Clustering Algorithm for Heterogenous XML Documents

Nayak, Richi & Xu, Sumei (2006) XCLS: A Fast and Effective Clustering Algorithm for Heterogenous XML Documents. Lecture Notes in Computer Science, pp. 292-302.

View at publisher


We present a novel clustering algorithm to group the XML documents by similar structures. We introduce a Level structure format to represent the XML documents for efficient processing. We develop a global criterion function that do not require the pair-wise similarity to be computed between two individual documents, rather measures the similarity at clustering level utilising structural information of the XML documents. The experimental analysis shows the method to be fast and accurate.

Impact and interest:

7 citations in Scopus
Search Google Scholar™
5 citations in Web of Science®

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 10161
Item Type: Journal Article
Refereed: Yes
DOI: 10.1007/11731139_35
ISBN: 3540332065
Divisions: Past > QUT Faculties & Divisions > Faculty of Science and Technology
Copyright Owner: Copyright 2006 (please consult authors)
Copyright Statement: Conference proceedings published, by Springer Verlag, will be available via SpringerLink. http://www.springerlink.com
Deposited On: 15 Oct 2007 00:00
Last Modified: 29 Feb 2012 13:21

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page