Is the unigram relevance model term independent? Classifying term dependencies in query expansion
Symonds, Michael, Bruza, Peter D., Zuccon, Guido, Sitbon, Laurianne, & Turner, Ian (2012) Is the unigram relevance model term independent? Classifying term dependencies in query expansion. In ADCS 2012, ACM, University of Otago, Dunedin.
This paper develops a framework for classifying term dependencies in query expansion with respect to the role terms play in structural linguistic associations. The framework is used to classify and compare the query expansion terms produced by the unigram and positional relevance models. As the unigram relevance model does not explicitly model term dependencies in its estimation process it is often thought to ignore dependencies that exist between words in natural language.
The framework presented in this paper is underpinned by two types of linguistic association, namely syntagmatic and paradigmatic associations. It was found that syntagmatic associations were a more prevalent form of linguistic association used in query expansion. Paradoxically, it was the unigram model that exhibited this association more than the positional relevance model. This surprising finding has two potential implications for information retrieval models:
(1) if linguistic associations underpin query expansion, then a probabilistic term dependence assumption based on position is inadequate for capturing them;
(2) the unigram relevance model captures more term dependency information than its underlying theoretical model suggests, so its normative position as a baseline that ignores term dependencies should perhaps be reviewed.
Impact and interest:
Citation counts are sourced monthly from and citation databases.
These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.
Citations counts from theindexing service can be viewed at the linked Google Scholar™ search.
Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.
|Item Type:||Conference Paper|
|Keywords:||Structural Linguistics, Information Retrieval, Word Associations, Relevance Models|
|Subjects:||Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > INFORMATION SYSTEMS (080600) > Information Systems Theory (080611)
Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > OTHER INFORMATION AND COMPUTING SCIENCES (089900)
Australian and New Zealand Standard Research Classification > LANGUAGES COMMUNICATION AND CULTURE (200000) > LINGUISTICS (200400) > Computational Linguistics (200402)
|Divisions:||Past > QUT Faculties & Divisions > Faculty of Science and Technology
Current > Schools > School of Information Systems
Current > Schools > School of Mathematical Sciences
Current > QUT Faculties and Divisions > Science & Engineering Faculty
Past > Schools > School of Information Systems
|Copyright Owner:||Copyright 2012 ACM|
|Copyright Statement:||Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.
ADCS ’12 December 5-6, 2012, Dunedin, New Zealand.
Copyright 2012 ACM 978-1-4503-1411-4/12/12 ...$15.00.
|Deposited On:||12 Nov 2012 23:42|
|Last Modified:||26 Mar 2014 02:15|
Repository Staff Only: item control page