Defining a session on Web search engines

Jansen, Bernard J., Spink, Amanda H., Blakely, Chris, & Koshman, Sherry (2007) Defining a session on Web search engines. Journal of the American Society for Information Science and Technology, 58(6), pp. 862-871.


Abstract

Detecting query reformulations within a session by a Web searcher is an important area of research for designing more helpful searching systems and targeting content to particular users. Methods explored by other researchers include both qualitative approaches (i.e., the use of human judges to manually analyze query patterns, usually on small samples) and nondeterministic algorithms, which typically use large amounts of training data to predict query modification during sessions. In this article, we explore three alternative methods for detection of session boundaries. All three methods are computationally straightforward and therefore easily implemented for detection of session changes. We examine 2,465,145 interactions from 534,507 users of Dogpile.com on May 6, 2005. We compare session analysis using (a) Internet Protocol address and cookie; (b) Internet Protocol address, cookie, and a temporal limit on intrasession interactions; and (c) Internet Protocol address, cookie, and query reformulation patterns. Overall, our analysis shows that defining sessions by query reformulation along with Internet Protocol address and cookie provides the best measure, resulting in an 82% increase in the count of sessions. Regardless of the method used, the mean session length was fewer than three queries, and the mean session duration was less than 30 min. Searchers most often modified their query by changing query terms (nearly 23% of all query modifications) rather than adding or deleting terms. The implication is that, for measuring searching traffic, unique sessions may be a better indicator than the common metric of unique visitors. This research also sheds light on the more complex aspects of Web searching involving query modifications and may lead to advances in searching tools.
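
The three session definitions compared in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Interaction` record, the function names, the 30-minute cutoff passed to `by_time_gap`, and the "no shared terms means a new session" rule in `by_reformulation` are all assumptions made for illustration, since the abstract does not specify the temporal limit or the reformulation classification rules.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class Interaction:
    """One logged search interaction (hypothetical record layout)."""
    ip: str
    cookie: str
    timestamp: float  # seconds since an arbitrary epoch
    query: str


def user_key(rec: Interaction) -> tuple:
    # A "user" is identified by the IP address and cookie pair.
    return (rec.ip, rec.cookie)


def count_sessions(
    records: Iterable[Interaction],
    starts_new_session: Callable[[Interaction, Interaction], bool],
) -> int:
    """Count sessions; the rule decides boundaries within one user's log."""
    ordered = sorted(records, key=lambda r: (r.ip, r.cookie, r.timestamp))
    sessions = 0
    prev = None
    for cur in ordered:
        # A change of user always opens a session; otherwise ask the rule.
        if prev is None or user_key(prev) != user_key(cur) or starts_new_session(prev, cur):
            sessions += 1
        prev = cur
    return sessions


def by_identity(prev: Interaction, cur: Interaction) -> bool:
    # Method (a): IP address and cookie only, so one session per user.
    return False


def by_time_gap(limit_seconds: float) -> Callable[[Interaction, Interaction], bool]:
    # Method (b): also split when the gap between interactions exceeds a limit.
    def rule(prev: Interaction, cur: Interaction) -> bool:
        return cur.timestamp - prev.timestamp > limit_seconds
    return rule


def by_reformulation(prev: Interaction, cur: Interaction) -> bool:
    # Method (c), simplified assumption: a query sharing no terms with the
    # previous query is treated as a new topic and therefore a new session.
    prev_terms = set(prev.query.lower().split())
    cur_terms = set(cur.query.lower().split())
    return prev_terms.isdisjoint(cur_terms)


# Made-up sample log: two users, five interactions.
log = [
    Interaction("10.0.0.1", "c1", 0, "cheap flights"),
    Interaction("10.0.0.1", "c1", 60, "cheap flights paris"),
    Interaction("10.0.0.1", "c1", 4000, "paris flights hotel"),
    Interaction("10.0.0.1", "c1", 4100, "python tutorial"),
    Interaction("10.0.0.2", "c2", 10, "weather"),
]

sessions_a = count_sessions(log, by_identity)
sessions_b = count_sessions(log, by_time_gap(30 * 60))  # 30 min is an assumed cutoff
sessions_c = count_sessions(log, by_reformulation)
```

On this toy log, method (a) finds one session per user, while methods (b) and (c) each split the first user's activity once, at different points: (b) at the long pause, (c) at the topic change. Counts on real logs depend heavily on the chosen cutoff and on how reformulations are classified.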

Impact and interest:

89 citations in Scopus
Search Google Scholar™
45 citations in Web of Science®

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citation counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 47875
Item Type: Journal Article
Refereed: Yes
DOI: 10.1002/asi.20564
ISSN: 1532-2882
Divisions: Current > Research Centres > Office of Education Research
Current > QUT Faculties and Divisions > Faculty of Education
Copyright Owner: Wiley
Deposited On: 20 Dec 2011 08:56
Last Modified: 29 Feb 2012 13:35
