Patterns of query reformulation during web searching

Jansen, Bernard, Booth, Danielle, & (2009) Patterns of query reformulation during web searching. Journal of the Association for Information Science and Technology, 60(7), pp. 1358-1371.

View at publisher

Description

Query reformulation is a key user behavior during Web search. Our research goal is to develop predictive models of query reformulation during Web searching. This article reports results from a study in which we automatically classified the query-reformulation patterns for 964,780 Web searching sessions, composed of 1,523,072 queries, to predict the next query reformulation. We employed an n-gram modeling approach to describe the probability of users transitioning from one query-reformulation state to another to predict their next state. We developed first-, second-, third-, and fourth-order models and evaluated each model for accuracy of prediction, coverage of the dataset, and complexity of the possible pattern set. The results show that Reformulation and Assistance account for approximately 45% of all query reformulations; furthermore, the results demonstrate that the first- and second-order models provide the best predictability, between 28 and 40% overall and higher than 70% for some patterns. Implications are that the n-gram approach can be used for improving searching systems and searching assistance.

Impact and interest:

128 citations in Scopus
81 citations in Web of Science®
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 28889
Item Type: Contribution to Journal (Journal Article)
Refereed: Yes
Measurements or Duration: 14 pages
Keywords: Assisted Searching, Human Computer Interaction, Predictive Models, Query Refinement, Search Behaviour
DOI: 10.1002/asi.21071
ISSN: 1532-2882
Pure ID: 31958039
Divisions: Past > QUT Faculties & Divisions > Faculty of Education
Past > QUT Faculties & Divisions > Faculty of Science and Technology
Copyright Owner: Consult author(s) regarding copyright matters
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au
Deposited On: 25 Nov 2009 22:58
Last Modified: 11 Jun 2024 09:14