A virtual evaluation track for cross language link discovery

, Trotman, Andrew, & (2009) A virtual evaluation track for cross language link discovery. In Zhai, C, Zobel, J, Allan, J, Aslam, A, & Sanderson, M (Eds.) Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery (ACM), United States, pp. 1-7.

Description

Wikipedia has become the most popular online source of encyclopedic information. The English Wikipedia collection, like some other language collections, is extensively linked. As a multilingual collection, however, Wikipedia is only very weakly linked: there are few cross-language or cross-dialect links (see, for example, Chinese dialects). To link the multilingual Wikipedia as a single collection, automated cross-language link discovery systems are needed, that is, systems that identify anchor texts in one language and targets in another. The evaluation of link discovery approaches within the English version of Wikipedia has been examined in the INEX Link-the-Wiki track since 2007, while both CLEF and NTCIR have emphasized the investigation and evaluation of cross-language information retrieval. In this position paper we propose a new virtual evaluation track: Cross Language Link Discovery (CLLD). The track will initially examine cross-language linking of Wikipedia articles. This virtual track will not be tied to any one forum; instead, we hope it can be connected to each of (at least) CLEF, NTCIR, and INEX, as it will cover ground currently studied by each. The aim is to establish a virtual evaluation environment supporting continuous assessment and evaluation, and a forum for the exchange of research ideas. It will be free from the difficulties of scheduling and synchronizing groups of collaborating researchers, and it will reduce the need to travel across the globe in order to share knowledge. We aim to electronically publish peer-reviewed publications arising from CLLD in a similar fashion: online, with open access, and without fixed submission deadlines.

ID Code: 46170
Item Type: Chapter in Book, Report or Conference volume (Conference contribution)
ORCID iD: Geva, Shlomo (orcid.org/0000-0003-1340-2802)
Measurements or Duration: 7 pages
Keywords: Cross Language, Evaluation, Information Retrieval, Link Discovery
ISBN: 978-1-60558-483-6
Pure ID: 31896515
Divisions: Past > QUT Faculties & Divisions > Faculty of Science and Technology
Past > QUT Faculties & Divisions > Science & Engineering Faculty
Current > Research Centres > Australian Research Centre for Aerospace Automation
Copyright Owner: Consult author(s) regarding copyright matters
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons Licence (or other specified licence), then refer to the licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright, please provide details by email to qut.copyright@qut.edu.au
Deposited On: 26 Sep 2011 06:36
Last Modified: 03 Mar 2024 09:56