Clustering with random indexing K-tree and XML structure

De Vries, Christopher M., Geva, Shlomo, & De Vine, Lance (2010) Clustering with random indexing K-tree and XML structure. In Focused Retrieval and Evaluation : Proceedings of 8th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2009, Springer, Brisbane, Queensland, pp. 407-415.

This is the latest version of this eprint.

View at publisher


This paper describes the approach taken to the clustering task at INEX 2009 by a group at the Queensland University of Technology. The Random Indexing (RI) K-tree has been used with a representation that is based on the semantic markup available in the INEX 2009 Wikipedia collection. The RI K-tree is a scalable approach to clustering large document collections. This approach has produced quality clustering when evaluated using two different methodologies.

Impact and interest:

2 citations in Scopus
2 citations in Web of Science®
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

Full-text downloads:

168 since deposited on 01 Aug 2010
7 in the past twelve months

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.

ID Code: 33656
Item Type: Conference Paper
Refereed: Yes
Additional Information: See additional URL to download software.
Additional URLs:
DOI: 10.1007/978-3-642-14556-8_40
ISBN: 9783642145551
ISSN: 0302-9743
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Pattern Recognition and Data Mining (080109)
Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > LIBRARY AND INFORMATION STUDIES (080700) > Information Retrieval and Web Search (080704)
Divisions: Past > Schools > Computer Science
Past > QUT Faculties & Divisions > Faculty of Science and Technology
Current > Research Centres > High Performance Computing and Research Support
Copyright Owner: Copyright 2010 Springer
Copyright Statement:

This is the author-version of the work.

Conference proceedings published, by Springer Verlag, will be available via

Deposited On: 01 Aug 2010 23:57
Last Modified: 29 Feb 2012 14:17

Available Versions of this Item

  • Clustering with random indexing K-tree and XML structure. (deposited 01 Aug 2010 23:57) [Currently Displayed]

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page