QUT ePrints

Clustering with random indexing K-tree and XML structure

De Vries, Christopher M., Geva, Shlomo, & De Vine, Lance (2010) Clustering with random indexing K-tree and XML structure. In Focused Retrieval and Evaluation : Proceedings of 8th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2009, Springer, Brisbane, Queensland, pp. 407-415.

Abstract

This paper describes the approach taken to the clustering task at INEX 2009 by a group at the Queensland University of Technology. The Random Indexing (RI) K-tree has been used with a representation that is based on the semantic markup available in the INEX 2009 Wikipedia collection. The RI K-tree is a scalable approach to clustering large document collections. This approach has produced quality clustering when evaluated using two different methodologies.

Impact and interest:

2 citations in Scopus
Search Google Scholar™
2 citations in Web of Science®

Citation counts are sourced monthly from the Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citation counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

Full-text downloads:

132 since deposited on 01 Aug 2010
18 in the past twelve months

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints, as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.

ID Code: 33656
Item Type: Conference Paper
Additional Information: See additional URL to download software.
Additional URLs:
DOI: 10.1007/978-3-642-14556-8_40
ISBN: 9783642145551
ISSN: 0302-9743
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Pattern Recognition and Data Mining (080109)
Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > LIBRARY AND INFORMATION STUDIES (080700) > Information Retrieval and Web Search (080704)
Divisions: Past > Schools > Computer Science
Past > QUT Faculties & Divisions > Faculty of Science and Technology
Current > Research Centres > High Performance Computing and Research Support
Copyright Owner: Copyright 2010 Springer
Copyright Statement: This is the author-version of the work. The conference proceedings, published by Springer Verlag, will be available via http://www.springer.de/comp/lncs/
Deposited On: 02 Aug 2010 09:57
Last Modified: 01 Mar 2012 00:17
