Query expansion using term relationships in language models for information retrieval

Bai, Jing, Song, Dawei, Bruza, Peter D., Nie, Jian-Yun, & Cao, Guihong (2005) Query expansion using term relationships in language models for information retrieval. In Herzog, O., Schek, H-J., Chowdhury, A., & Teiken, W. (Eds.) Proceedings of the 14th ACM international conference on Information and knowledge management - CIKM '05, Association for Computing Machinery, pp. 688-695.


Language Modeling (LM) has been successfully applied to Information Retrieval (IR). However, most existing LM approaches rely only on term occurrences in documents, queries and document collections. In traditional unigram-based models, terms (or words) are usually considered to be independent. In some recent studies, dependence models have been proposed to incorporate term relationships into LM, so that links can be created between words in the same sentence, and term relationships (e.g. synonymy) can be used to expand the document model. In this study, we further extend this family of dependence models in two ways: (1) term relationships are used to expand the query model instead of the document model, so that the query expansion process can be naturally implemented; (2) we exploit more sophisticated inferential relationships extracted with Information Flow (IF). Information flow relationships are not simply pairwise term relationships like those used in previous studies, but hold between a set of terms and another term. They allow for context-dependent query expansion. Our experiments conducted on TREC collections show that we can obtain large and significant improvements with our approach. This study shows that LM is an appropriate framework to implement effective query expansion.
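
The idea of expanding the query model with relationships that go from a set of terms to another term can be illustrated with a minimal sketch. This is not the method or the formulas of the paper, which this record does not reproduce. It assumes a simple linear interpolation between the original query model and an expansion model. The function names, the interpolation weight lam, and the relationship table with its probabilities are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def ml_model(terms):
    """Maximum-likelihood unigram model: relative frequency of each term."""
    counts = Counter(terms)
    total = sum(counts.values())
    return {t: c / total for t, c in counts.items()}


def expand_query_model(query_terms, relations, lam=0.5):
    """Interpolate the original query model with a relationship-based one.

    relations maps a frozenset of premise terms to {expansion_term: weight}.
    A relationship fires only when all of its premise terms occur in the
    query, which is what makes the expansion context-dependent.
    lam is an assumed interpolation weight, not a value from the paper.
    """
    original = ml_model(query_terms)
    query_set = set(query_terms)

    # Accumulate expansion weight from every relationship whose premise
    # is fully contained in the query.
    expansion = Counter()
    for premise, targets in relations.items():
        if premise <= query_set:
            for term, weight in targets.items():
                expansion[term] += weight

    total = sum(expansion.values())
    if total == 0:
        # No relationship applies: fall back to the unexpanded query model.
        return original

    # Normalize the expansion weights into a probability distribution.
    expanded = {t: w / total for t, w in expansion.items()}
    vocab = set(original) | set(expanded)
    return {
        t: lam * original.get(t, 0.0) + (1 - lam) * expanded.get(t, 0.0)
        for t in vocab
    }


# Hypothetical relationship table; the weights are made up for illustration.
relations = {
    frozenset({"java"}): {"coffee": 0.5, "programming": 0.5},
    frozenset({"java", "compiler"}): {"programming": 0.9, "bytecode": 0.6},
}
```

With this toy table, the query "java" alone gives "coffee" and "programming" equal weight, while the query "java compiler" also triggers the second relationship, so "programming" outweighs "coffee" and "bytecode" enters the query model. A pairwise table keyed on single terms could not make that distinction.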

Impact and interest:

128 citations in Scopus

Full-text downloads:

84 since deposited on 01 Nov 2011
5 in the past twelve months

ID Code: 46752
Item Type: Conference Paper
Refereed: Yes
DOI: 10.1145/1099554.1099725
ISBN: 1-59593-140-6
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > INFORMATION SYSTEMS (080600)
Divisions: Past > QUT Faculties & Divisions > Faculty of Science and Technology
Past > Schools > Information Systems
Copyright Owner: The authors
Deposited On: 01 Nov 2011 00:26
Last Modified: 01 Mar 2012 00:52
