QUT ePrints

The effectiveness of cross-lingual link discovery

Tang, Ling-Xiang, Itakura, Kelly, Geva, Shlomo, Trotman, Andrew , & Xu, Yue (2011) The effectiveness of cross-lingual link discovery. In Kishida, Kazuaki, Sanderson, Mark, Webber, William, Kando, Noriko, Ishikawa, Noriko, & Sugimoto, Miho (Eds.) Proceedings of The Fourth International Workshop on Evaluating Information Access, National Insitute of Informatics (Japan), National Institute of Informatics, Tokyo, pp. 1-8.

View at publisher

Abstract

This paper describes the evaluation in benchmarking the effectiveness of cross-lingual link discovery (CLLD). Cross lingual link discovery is a way of automatically finding prospective links between documents in different languages, which is particularly helpful for knowledge discovery of different language domains.

A CLLD evaluation framework is proposed for system performance benchmarking. The framework includes standard document collections, evaluation metrics, and link assessment and evaluation tools. The evaluation methods described in this paper have been utilised to quantify the system performance at NTCIR-9 Crosslink task. It is shown that using the manual assessment for generating gold standard can deliver a more reliable evaluation result.

Impact and interest:

Citation countsare sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 49128
Item Type: Conference Paper
Keywords: Wikipedia, Cross-lingual link discovery, Assessment, Evaluation framework, Assessment tool, Evaluation metrics
ISBN: 9784860490577
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Natural Language Processing (080107)
Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > COMPUTER SOFTWARE (080300) > Open Software (080306)
Divisions: Current > Schools > School of Electrical Engineering & Computer Science
Current > QUT Faculties and Divisions > Science & Engineering Faculty
Copyright Owner: © 2011 National Institute of Informatics
Deposited On: 14 Mar 2012 08:28
Last Modified: 14 Mar 2012 08:28

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page