The effectiveness of cross-lingual link discovery

Tang, Ling-Xiang, Itakura, Kelly, Geva, Shlomo, Trotman, Andrew, & Xu, Yue (2011) The effectiveness of cross-lingual link discovery. In Kishida, Kazuaki, Sanderson, Mark, Webber, William, Kando, Noriko, Ishikawa, Noriko, & Sugimoto, Miho (Eds.) Proceedings of The Fourth International Workshop on Evaluating Information Access, National Insitute of Informatics (Japan), National Institute of Informatics, Tokyo, pp. 1-8.

View at publisher


This paper describes the evaluation in benchmarking the effectiveness of cross-lingual link discovery (CLLD). Cross lingual link discovery is a way of automatically finding prospective links between documents in different languages, which is particularly helpful for knowledge discovery of different language domains.

A CLLD evaluation framework is proposed for system performance benchmarking. The framework includes standard document collections, evaluation metrics, and link assessment and evaluation tools. The evaluation methods described in this paper have been utilised to quantify the system performance at NTCIR-9 Crosslink task. It is shown that using the manual assessment for generating gold standard can deliver a more reliable evaluation result.

Impact and interest:

Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 49128
Item Type: Conference Paper
Refereed: Yes
Keywords: Wikipedia, Cross-lingual link discovery, Assessment, Evaluation framework, Assessment tool, Evaluation metrics
ISBN: 9784860490577
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Natural Language Processing (080107)
Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > COMPUTER SOFTWARE (080300) > Open Software (080306)
Divisions: Current > Schools > School of Electrical Engineering & Computer Science
Current > QUT Faculties and Divisions > Science & Engineering Faculty
Copyright Owner: © 2011 National Institute of Informatics
Deposited On: 13 Mar 2012 22:28
Last Modified: 13 Mar 2012 22:28

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page