Information Seeking on the Web by Women in IT Professions
Chun Wei Choo
Faculty of Information Studies
University of Toronto
140, St. George Street
Toronto, Ontario
Canada M5S 3G6
Christine Marton
School of Library, Archival and Information Studies
1956 Main Mall
The University of British Columbia
Vancouver, B.C.
Canada V6T 1Z1
Information Seeking on the Web by Women in IT Professions
Abstract
The paper develops a behavioral model of Web information seeking that identifies four complementary modes of information seeking: undirected viewing, conditioned viewing, informal search, and formal search. In each mode of viewing or searching, users would adopt distinctive patterns of browser moves: starting, chaining, browsing, differentiating, monitoring, and extracting. We apply the model empirically to analyze the Web information seeking behavior of 24 women in IT professions over a two-week period. Our results show that participants engaged in all four modes of information seeking on the Web, and that each mode may be characterized by certain browser actions. Overall, the study suggests that a behavioral approach that links information seeking modes (goals and reasons for browsing and searching) to moves (actions used to find and view information) may be helpful in understanding Web-based information seeking.
Keywords: Information seeking, WWW, women, Internet searching, Internet browsing.
Introduction
This paper presents findings of a research study on how women in IT professions seek information on the Web as part of their daily work. We begin with the premise that in order to analyze information seeking, we need to examine the context of the overall information task. That is, we need to understand the information needs, goals, and intended information use that motivate the information seeking in the first place. Furthermore, since we are interested in information seeking on the Web, we take advantage of the WebÕs ability to generate data about itself and how it is being used. Specifically, we analyze log files that capture data about how users employ their Web browsers to look for information. Our research is driven by the belief that a more complete understanding of information seeking behavior may be gained by linking the general contexts of information seeking tasks with the detailed sequences of Web-based actions and events that constitute the information seeking itself.
Research on Information Seeking on the Web
Catledge and Pitkow (1995) modified the source code for a version of Xmosaic to generate a client-side log file that showed user navigation strategies and interface selections. They determined session boundaries by measuring the time between each event for all events, and adopted the heuristic that a lapse of 25.5 minutes or greater indicated the end of a Òsession.Ó This heuristic is currently the most-commonly used for delimiting sessions. The study found that Web pages users bookmarked did not match the most-popular sites visited as a whole from the group. Only 2% of Web pages were either saved locally or printed. Catledge and Pitkow also hypothesized that users in their study categorized as ÒbrowsersÓ spent less time on a Web page than Òsearchers.Ó
Tauscher and Greenberg (1997a, 1997b) focused on the history mechanisms that Web browsers use to manage recently-requested Web pages, and also modified the XMosaic browser to collect Web browsing data for over six weeks from 23 participants. They found that 58% of the pages visited during a Web browsing session were re-visits. Users not only accessed a few pages frequently (60% once, and 19% twice), they also browsed in very small clusters of pages. Thus, Web browsing is a Òrecurring systemÉ where users predominantly repeat activities they had invoked before, while still selecting new actions from the many that are possible.Ó (1997a, p. 400) People explained that they revisited Web pages because Òthe information contained by them changes; they wish to explore the page further; the page has a special purpose (e.g. search engine, home page); they are authoring a page; or the page is on a path to another revisited page.Ó (1997a, p. 400).
In a well known study, Huberman, Pirolli, Pitkow and Lukose (1998) analyzed AOL (America Online) usage data collected over a five-day period, and found a mathematical Òlaw of surfing, É that determines the probability distribution of the depthÑthat is, the number of pages a user visits within a Web site.Ó (p. 95) They started with a probabilistic model of the number of links a user might follow on a Web site. Next they calculated a value for the current page and related this to the value of the next page to be accessed. When the cost of moving to the next Web page is more than its expected value, the user stops Web surfing. Huberman et al also examined Web server logs of the Xerox external Web site in order to obtain a bounded set of Web page requests. Cookies were used to track the paths of individual users as they surfed the site. By applying the surfing model with a spreading activation algorithm, they were able to predict the number of requests for each Web page in a Web site.
Hšlscher and Strube (2000) investigated the effects of domain-knowledge and web-knowledge on searching behavior and outcomes. Through two experiments, they found that participants who could rely on both types of expertise were overall most successful in their searches. Participants strong in domain knowledge but lacking in Web searching knowledge relied heavily on terminology and avoided the use of search operators and modifiers. Participants with lower levels of knowledge were less flexible in their search strategies and tended to return to earlier stages of their searches rather than trying new approaches.
Cockburn and McKenzie (2001) analyzed four months of client-side log data that recorded user actions with Netscape Navigator, including page title, URL and time of each page visit, how often they visited each page, how long they spent at each page, the growth and content of bookmark collections, and other aspects of user interaction. The results showed that (1) re-visiting web pages is a much more prevalent activity than previously reported (approximately 81% of pages have been previously visited by the user), (2) most pages are visited for a surprisingly short period of time, (3) users maintain large (and possibly overwhelming) bookmark collections, and (4) there is a marked lack of commonality in the pages visited by different users.
Cothey (2002) analyzed the real world Web information searching behavior of 206 college students over a 10-month period. The study used a longitudinal transaction log analysis of the URLs accessed during 5,431 user days of Web information searching to detect changes in information searching behavior associated with increased experience of using the Web. Contrary to expectations, the study found that as users became more experienced, they adopted a more passive or browsing approach to Web information searching, and grew more eclectic in their selection of Web hosts.
Kim and Allen (2002) investigated the effects of task, cognitive style, and search engine on searching activities and outcomes. Two experiments generated nearly 160 completed searches by 80 individuals. The study found that of the three variables, only task showed a significant main effect. (Participants were assigned to a Òknown-itemÓ search task and a ÒsubjectÓ search task.) Moreover, task interacted significantly with both search engine and cognitive abilities to influence how users searched the Web. Thus, although the flexibility of the Web search engines allows different users to complete different search tasks successfully, search activities and outcomes were strongly influenced by task effects, and the techniques used and the efficiency of searches depend on how well the individual searcher fits with the specific task.
Towards a New Behavioral Model of Information Seeking on the Web
Modes of Browsing and Searching
Marchionini (1995) reviewed the research on browsing and observed that Òthere seems to be agreement on three general types of browsing that may be differentiated by the object of search (the information needed) and by the systematicity of tactics used.Ó (p. 106) Directed browsing occurs when browsing is systematic, focused, and directed by a specific object or target. Semi-directed browsing occurs when browsing is predictive or generally purposeful: the target is less definite and browsing is less systematic. Finally, undirected browsing occurs when there is no real goal and very little focus. In a similar vein, Wilson (1997) identifies four categories of information seeking and acquisition. Passive attention, where there may be no information-seeking intended, but information acquisition may take place nevertheless. Passive search, when one type of search results in the acquisition of information that happens to be relevant to the individual. Active search, where an individual actively seeks out information. Ongoing search, where active searching has already established the basic framework of ideas, beliefs, but occasional continuing search is carried out to update or expand one's framework.
In organization science, a comparable categorization of modes of organizational scanning or ÒbrowsingÓ has been developed, based on both empirical and theoretical research. The initial field work of Aguilar (1967) and the subsequent theoretical expansion by Weick and Daft (1983) suggest that organizations scan in four distinct modes: undirected viewing, conditioned viewing, informal search, formal search (Figure 1). In undirected viewing, the individual is exposed to information with no specific informational need in mind. The overall purpose is to scan broadly in order to detect signals of change early. Many and varied sources of information are used, and large amounts of information are browsed. In conditioned viewing, the individual directs viewing to information about selected topics or to certain types of information. The individual has isolated a number of areas of potential concern from undirected viewing, and is now sensitized to assess the significance of developments in those areas. During informal search, the individual actively looks for information to deepen the knowledge and understanding of a specific issue. It is informal in that it involves a relatively limited and unstructured effort. During formal search, the individual makes a deliberate or planned effort to obtain specific information or types of information about a particular issue. Search is formal because it is structured according to some procedure or methodology. The overall purpose is to systematically retrieve information in order to provide a basis for developing a decision or course of action.
Scanning Modes
Information Need
Information
SeekingInformation Use
Undirected
Viewing
General areas of interest; specific need to be revealed
¥ Scan broadly a diversity of sources, taking advantage of whatÕs easily accessible
Serendipitous discovery
ÒBrowsingÓ
Conditioned
Viewing
Able to recognize topics of interest
¥ Browse in pre-selected sources on pre-specified topics of interest
Increase knowledge about topics of interest
ÒLearningÓ
Informal
Search
Able to formulate simple queries
¥ Search is focused on area or topic, but a good-enough search is satisfactory
Increase knowledge on area within narrow boundaries
ÒSatisficingÓ
Formal
Search
Able to specify targets in detail
¥ Systematic gathering of information about an entity, following some method or procedure
Formal use of information for decision-, policy-making
ÒRetrievingÓ
Figure 1. Modes of Scanning
EllisÕ Model of Information Seeking Behaviors
Ellis (1989), Ellis et al (1993), and Ellis and Haugan (1997) developed a general model of information seeking behaviors based on the information seeking patterns of social scientists, research physicists and chemists, and engineers and research scientists in an industrial firm. One version of the model describes six categories of information seeking activities as generic: starting, chaining, browsing, differentiating, monitoring, and extracting.
Starting comprises those activities that form the initial search for information Ñ identifying sources of interest that could serve as starting points of the search. While viewing the initial sources, they are likely to point to, suggest, or recommend additional sources or references. Following up on these new leads from an initial source is the activity of Chaining. Having located sources and documents, Browsing is semi-directed viewing in areas of potential search. The individual often simplifies browsing by looking through tables of contents, lists of titles and names, abstracts and summaries. During Differentiating, the individual selects and prioritizes from among the sources scanned by noticing differences between the nature and quality of the information offered. Monitoring is keeping abreast of developments in an area by regularly following a small number of what are perceived to be core sources. Extracting is systematically working through a particular source or sources in order to identify material of interest. Retrospective searching is labor intensive, and is more likely when there is a need for comprehensive or historical information on a topic.
Ellis (1989) thought that hypertext-based systems would have the capabilities to implement functions indicated by his behavioral model. Indeed, most of the information seeking behavior categories in EllisÕ model are already being supported by common Web browser software. Thus, an individual could begin surfing the Web from one of a few favourite pages or sites (starting); follow links to related information resources Ñ in both backward and forward linking directions (chaining); scan the Web pages of the sources selected (browsing); bookmark useful sources for future reference and visits (differentiating); subscribe to e-mail based services that alert the user of new information or developments (monitoring); and search a particular source or site for all information on that site on a particular topic (extracting). Plausible extensions of the activities to Web information seeking (labeled Web Moves), are compared with the original formulations (Literature Search Moves) in Figure 2 below.
Starting
Chaining
Browsing
Differentiating
Monitoring
Extracting
Literature Search Moves
(Ellis et al)Identifying sources of interest
Following up references found in given material
Scanning tables of contents or headings
Assessing or restricting information according to their usefulness
Receiving regular reports or summaries from selected sources
Systematically working a source to identify material of interest
Anticipated Web Moves
Identifying websites containing or pointing to information of interest
Following links on starting pages to other content-related sites
Scanning top-level pages: lists, headings,
site mapsSelecting useful pages by bookmarking, printing, copying, etc. Choosing differentiated, pre-selected site
Receiving site updates via alerts or subscriptions
Revisiting ÔfavoriteÕ sitesSystematically searches a local site to extract information of interest at that site
Figure 2. Information Seeking Behaviors and Web Moves
Towards a Behavioral Model of Information Seeking on the Web
Aguilar's modes of scanning and Ellis's seeking behaviors may be combined in a behavioral model of information seeking on the Web. The model hypothesizes that there are four main modes of information seeking on the Web: undirected viewing, conditioned viewing, informal search, and formal search. For each mode, certain browser actions are likely to occur more frequently, based on our extension of the work of Ellis.
Starting
Chaining
Browsing
Differentiating
Monitoring
Extracting
Undirected
ViewingIdentifying selecting starting pages, sites
Following links on initial pages
Conditioned
Viewing
Browsing entry pages, headings,
site mapsBookmarking, printing, copying
Going directly to known siteRevisiting ÔfavoriteÕ or bookmarked sites; alerts, subscriptions
Informal
Search
Bookmarking, printing, copying
Going directly to known siteRevisiting ÔfavoriteÕ or bookmarked sites; alerts, subscriptions
Using (local) search engines to extract information
Formal Search
Revisiting ÔfavoriteÕ or bookmarked sites; alerts, subscriptions
Using search engines to extract information
Figure 3. Behavioral Modes and Moves of Information Seeking on the Web
In the undirected viewing mode on the Web, we may expect to see many instances of starting and chaining. Starting occurs when people begin their web use at general information or omnibus sites to start their viewing (such as news, newspaper, or magazine sites). Chaining occurs when people notice items of interest, and follow links to more information on those items. In the conditioned viewing mode on the Web, we expect browsing, differentiating, and monitoring to be common. Differentiating occurs as viewers select Web sites or pages that they expect to provide relevant information. Sites may be differentiated based on prior personal visits, or recommendations by others. Differentiated sites may be bookmarked. At these sites, users browse by looking through tables of contents, site maps, or list of items and categories. Users may also monitor differentiated sites by returning regularly, or by signing up for alerts. During informal search on the Web, we expect differentiating and extracting to be typical. Extracting is relatively ÒinformalÓ in that it makes use of keywords and basic search features, sometimes limiting the search to content at a particular selected site. Users may also monitor sites for new content. During formal search on the Web, we expect user to employ search engines that cover the Web comprehensively, and that provide a powerful set of search features to focus retrieval. In order not to miss any important information, there is a willingness to spend more time in the search. Formal search may also be supported by monitoring activity (e.g. e-mail alerts and subscriptions).
Research Methods
Participants
Twenty four participants from twenty organizations took part in the study. The primary vehicle for publicizing the study was the Internet, through announcements on IT mailing lists (particularly listservs of women in IT associations), and web sites. Participants were also recruited using print flyers that were distributed at meetings of a local womenÕs Internet group (Toronto WebGrrls, renamed Digital Eve), and a media design lecture held at the University of Toronto. Notices were posted on bulletin boards in various buildings in downtown Toronto that rented space to IT companies. The study focuses on women in IT professions for two reasons. First, women form an increasingly important and numerous group in the IT skilled labor force as well as in the larger community of proficient Web users. Second, most participants of Web studies of this kind are predominantly male. (We have conducted an earlier, related study that analyzed Web information seeking by 34 male IT professionals [Choo, Detlor and Turnbull, 2000a])
ParticipantsÕ organizations included telecommunications and IT firms, small and medium web design and development businesses, community organizations, government agencies, and libraries. All users in this study employed the Web as an integral part of their work activities. In most cases, participants were connected to the Internet through LAN, DSL or cable modems, and used relatively high-powered machines. Many of the participants would be generally regarded as technically proficient Web users.
Data Collection
Two methods of data collection were employed: a WebTracker application that recorded Web browser actions; and personal interviews with participants. History files and bookmark files were also collected.
Web Tracking
To observe participantsÕ actual information-seeking behavior on the Web in an unobtrusive manner, custom-developed WebTracker software was installed on their computers to record two weeks of continuous Web activity. A ten-day recording period was deemed sufficient since IT workers make heavy use of the Web on a daily basis. WebTracker is a Windows application that logs a Web userÕs browser actions during a session of Web use (Fig. 4)
![]()
Figure 4. WebTracker Application Screen with Privacy/Security Notice
WebTracker runs in the background and gathers menu, button, and keyboard choices from the browser, including ÒOpen URL or File,Ó ÒReload,Ó ÒBack,Ó ÒForward,Ó ÒAdd to Bookmarks,Ó ÒGo to Bookmark,Ó ÒPrint,Ó and ÒCopyÓ (Fig. 5). Because URL calls and menu selections were date-time stamped, we were able to reconstruct sequences of browser actions.
STARTUP
8/25/00 9:36
LINK_TO
8/25/00 9:51
http://intra.mbs.gov.on.ca/,"MBS INTRANET","master"
LINK_TO
8/25/00 9:52
http://jobs.gc.ca/home_e.shtml,"PSC - Recruitment Menu","master"
key copy
8/25/00 9:52
http://jobs.gc.ca/home_e.shtml,"PSC - Recruitment Menu","master"
LINK_TO
8/25/00 10:36
http://intra.gov.on.ca/,"Welcome to the Ontario Public Service Intranet","August"
LINK_TO
8/25/00 10:37
http://intra.cpb.gov.on.ca/,"Corporate Policy Branch Intranet","August"
LINK_TO
8/25/00 11:11
http://www.interland.com/hosting/dedicated/windows/default.asp,"Windows 2000 Plans and Pricing - Interland Web Solutions","August"
button print
8/25/00 11:36
http://www.interland.com/hosting/dedicated/windows/default.asp,"Windows 2000 Plans and Pricing - Interland Web Solutions","August"
LINK_TO
8/25/00 11:37
http://intra.cpb.gov.on.ca/,"Corporate Policy Branch Intranet","August"
key forward
8/25/00 12:06
http://intra.cpb.gov.on.ca/html/sites.html,"Corporate Policy Branch Intranet","August"
LINK_TO
8/25/00 12:06
http://www.bgbrownc.on.ca/,"BorderManager Information Alert","August"
LINK_TO
8/25/00 12:06
http://www.gbrownc.on.ca/sitemap.html,"George Brown College: Site Map","August"
LINK_TO
8/25/00 12:07
http://www.gbrownc.on.ca/News/faq.html#reg,"Guide to Students' most frequently asked questions","August"
button back
8/25/00 12:08
http://www.gbrownc.on.ca/News/faq.html#reg,"Guide to Students' most frequently asked questions","August"
LINK_TO
8/25/00 12:08
http://www.gbrownc.on.ca/News/faq.html,"Guide to Students' most frequently asked questions","August"
menu forward
8/25/00 14:39
http://www.gbrownc.on.ca/News/faq.html,"Guide to Students' most frequently asked questions","August"
LINK_TO
8/25/00 14:39
http://intra.cpb.gov.on.ca/,"Corporate Policy Branch Intranet","August"
button reload
8/25/00 14:39
http://intra.cpb.gov.on.ca/html/sites.html,"Corporate Policy Branch Intranet","August"
LINK_TO
8/25/00 14:40
http://intra.managersmap.gov.on.ca/scripts/index.asp,"CSB Manager's Map Log In","August"
Post-WebTracker Interview
After the tracking logs had been collected, each participant was interviewed. During interviews, the WebTracker log files and bookmark file were reviewed with the participant. The researcher presented the participant with printouts of log files and discussed the context of the information seeking episode, sites visited, and Web browser actions. The interview included these questions:
Can you briefly describe the context behind your use of Web-based information sources?
By Web-based information sources, I am referring to websites and communication sources available on the Web (Web-based chatrooms and bulletin boards), as well as communication sources accessed through a Web browser, such as newsgroups, and listservs (mailing lists).
Why did you visit the Web-based information sources shown in the tracking log?
Did you find the information you were looking for?
Did you run into difficulties looking for this information?
How did you use the information once it was found?
Was this an episode of browsing or searching?
If browsing: Which of the following best fits your description of this episode?
a) ÒI was broadly browsing a variety of websites/listservs/chatrooms/newsgroups/boards, taking advantage of what was easily accessible.Ó
b) ÒI was browsing the same website/listserv/chatroom/newsgroup/boards I usually visit for this type of information.Ó
If searching: Which of the following best fits your description of this episode?
a) ÒI was searching for information in an unstructured way that was Ôgood enoughÕ to help me solve a problem or move me along.Ó
b) ÒI was searching for information in a structured way to help me make a decision or set a course of action.Ó
Additionally, participants were asked to recount incidents where they found information on the Web that was valuable and led them to make a significant decision or change the way they did things. While participants were often unable to recall specific incidents, their elaborations of the Webtracker log data revealed many significant episodes of information seeking that were consequential. For the most part, they found that information on the Web verified or added to information they already possessed from other sources. Participants frequently commented that the Web was the only information source that could provide them with current information on software and IT developments.
Data Analysis
Data analysis proceeded in two stages. First, significant episodes of Web-based information seeking were identified from the personal interview transcripts as well as the WebTracker logs. Each episode was analyzed according to its information need, amount of effort, number of web sources consulted, and information use. Using the framework developed in Section 3.1, an episode would be categorized, where applicable, as one of the four modes of scanning (undirected/conditioned viewing; informal/formal searching). WebTracker logs were also examined to identify additional viewing or searching episodes. Two criteria were used to select episodes: the episode consumed a substantial amount of time and effort; or the episode was a frequently repeated activity.
For each episode identified in the first stage, corresponding sections of the WebTracker log were examined to determine the browser-based actions that best characterized each episode. Using the framework developed in Section 3.2, browser moves were categorized, where applicable, as starting, chaining, browsing, differentiating, monitoring, or extracting.
Results and Discussion
A total of eighty episodes of ÔsignificantÕ information seeking were identified and categorized using the framework developed in Section 3. The majority of the episodes were classified as conditioned viewing (31) and informal search (21). A smaller number of episodes were undirected viewing (14) and formal search (14).
Of the 80 episodes, 14 were categorized as undirected viewing. The most common example of undirected viewing consisted of visits to general news websites such as those of my.yahoo.com, my.netscape.com, and major newspapers (Table 1). Some participants have customized these pages as their start-up pages. General news sites acted as gateways to information covering many different subject areas, and provided an efficient way of surveying current developments without a specific information need in mind.
In the undirected viewing episodes, WebTracker data indicated that the most frequently occurring moves were starting and chaining. Thus, participants began at favorite starting pages (news or portal sites) and followed interesting links.
Table 1. Undirected Viewing Examples
Information Seeking in the
Undirected Viewing ModeBrowser Moves:
Starting, ChainingKept up to date by scanning ca.my.yahoo.com page daily.
Took one afternoon to set up my.yahoo page as start up page. Clicked on 1 link per day.
Kept up to date with news in torontostar.com about Toronto.
Scanned and followed links on site to stories of interest.
Kept up to date with daily news in my.netscape.com.
Had long used personalized my.netscape.com as gateway.
Kept up to date with news in my.yahoo:
Ògo there periodically throughout the day É just look at news and things.ÓBrowsed my.yahoo.com.
Followed links in dailynews.yahoo.com.Kept up to date with news sites canoe.ca, Globe & Mail, Toronto Star. Read headlines.
Thirty one episodes were categorized as conditioned viewing, forming the largest group. The most common examples were regular return visits to bookmarked sites, and browsing from a selected page that contained links to sites of interest (Table 2). Thus, participants regularly visited the websites of software and hardware vendors, competitors, web development resources, and jobsites. Several participants subscribed to newsletters and discussion groups, and returned periodically to look at these resources.
In the conditioned viewing episodes, the most frequently occurring moves were differentiating, browsing, and monitoring. Thus, participants selected a bookmarked page/site, or entered the URL of a site they had committed to memory (differentiating). These sites/pages were then examined to locate new content of interest (browsing). The most important characteristic of conditioned viewing was that participants regularly or periodically returned to their selected or differentiated sites/pages to check for new information (monitoring).
Table 2. Conditioned Viewing Examples
Information Seeking in the
Conditioned Viewing ModeBrowser Moves:
Differentiating, Browsing, MonitoringBrowsed and looked up postings within discussion group on Framemaker in egroups.com.
Subscribed to discussion group.
Looked for website mentioned in discussion group posting.Used alerts.web.ca to view alerts on womenÕs issues.
Browsed in alerts.web.ca
Followed links.Looked at breaking news in PC World. Looked for items of interest (e.g. news about Oracle).
E-mail subscription. Browsed links in pcworld.com/news/ articles. Followed links to 3 articles at different times, days.
Read up on topics of interest (eg. WML, wireless, web design).
Twenty one episodes were categorized as informal search. The most common examples of informal search were when participants made use of specific query terms such as names of companies, products or technologies to perform simple searches on easily accessible search engines (Table 3). The use of the Google search engine was common. There were several examples of selecting search engines that were specific to the site site (e.g. a search engine maintained by an organization that indexed only its own web pages). Thus, participants used the search engine on the website of ACM, Compaq, and IBM to do local searches.
In the informal search episodes, the most frequently observed moves were differentiating and localized extracting. Thus, participants went directly to selected sites where they expected to find the needed information (differentiating). Searching at these sites would make use of local search engines that only retrieved information from those sites (localized extracting). Some participants returned to specific sites to repeat their informal searches (monitoring).
Table 3. Informal Search Examples
Information Seeking in the
Informal Search ModeBrowser Moves:
Differentiating, Extracting/Local SearchSearched for new director of research funding agency.
Simple Google name search.
Also searched for books authored.Searched for info on GIS software tools, and what others are using.
Searched Google using simple text string.
Searched for ISO 9000 and ACM Professional Chapter Toolkit.
ACM site search using local search engine.
Looked for groupware product info, led to vendor site.
Simple Google keyword search.
Linked to product pages.Preparing article for newsletter.
Looked for websites.Looked in Association site.
Found page on ÔMedspeak.Õ
Fourteen episodes were categorized as formal search. Here, participants were intending to use the information formally (e.g. to write reports, to make decisions), and were prepared to invest effort in a more thorough search. A handful of search episodes had an elapsed time of more than two hours, and a couple were spread over two days (Table 4).
In the formal search episodes, the most frequently occurring move was extracting. Here, extracting was more extended and extensive than the localized extracting of informal searching. Thus, participants systematically worked through a number of search engines or meta search engines so as to find (all) important information about a topic or item. Formal searches were often accompanied by additional information-extracting activities such as printing and saving pages, downloading PDF documents and PowerPoint slides.
Table 4. Formal Search Examples
Information Seeking in the
Formal Search ModeBrowser Moves:
ExtractingSearched for info on Òhow to hold meetingÓ to organize and facilitate a meeting.
Extended, multiple searches (food, themes, contests). Printed pages. Elapsed time: 2 hours.
Searched for telephone numbers in province needed to test new feature on the provincial telephone switch.
Searched Canada 411 for province phone numbers of several businesses.
Elapsed time: 75 minutes.Searched for documents on Cable versus DSL evaluation. Found market research reports from IDC, Forrester.
Searched internal host, back/ forward moves. Printed several PDF documents.
Elapsed time: 3+ hours.Collected information on enterprise security software for project, to write report, make recommendation.
Searched internal search engine as well as that on vendor site. To Òquote chapter and verse.Ó
Over 2 days.Searched for internet filtering/ blocking software to be installed on library public terminals.
Figure 6 shows the distribution of information seeking episodes found in this study. Episodes were categorized according to their information needs, seeking, and use. It was possible to categorize all episodes into one of the four modes of viewing and searching. Moreover, each mode could be characterized by frequently observed browser actions, as predicted by the model developed in Section 3 (note however comments on monitoring and extracting later in this section).
![]()
Figure 6. Information Seeking Modes and Browser Moves (2002)
In this study, the largest category of information seeking episodes was conditioned viewing (thirty one instances out of a total of eighty). Differentiating and monitoring moves were frequently observed in the conditioned viewing episodes. Thus, participants were able to recognize topics of special interest (differentiating), and they returned periodically to selected sites for new information (monitoring). Participants selected a bookmark or entered a URL to revisit a site; or they would subscribe to mailing lists, or email newsletters about new content on interested sites. Contrary to the modelÕs predictions, there were no instances of monitoring during formal searching episodes.
The phenomenon of returning to familiar websites has been noted in a number of recent field studies. In their analysis of client-side log data, Cockburn and McKenzie (2000) observed that users were re-visiting web pages more frequently: ÒMore surprisingly, the revisitation rate has increased from approximately 60 to 81%; on average, users have previously seen four out of five pages that they visit.Ó (p. 921) Cothey (2002) noted a trend among more experienced Web users: ÒÉ as Web users become more experienced they rely less on formal querying (typically using search engine) to obtain their Web-based information. However, as users become more experienced, so their Web usage was more sporadic, which suggests possible greater selectivity. It appears that each user may inhabit an individual niche of Web hosts that becomes more distinctive as the user becomes more experienced.Ó (p. 77) Cothey suggests that users adopt a more passive or browsing approach to Web information searching as they gained experience.
Extracting as a Web activity implies searching through one or more sites in order to locate information of interest at those sites. In this study, most episodes of extracting employed basic searching strategies. For the most part, search formulations were relatively simple, with advanced features such as Boolean operators, and word truncation or proximity operators rarely utilized. This was the case even when participants appeared to be working in the formal search mode. There were no instances of participants accessing search-engine help instruction pages to improve their searches. The phenomenon of Òsimple searchingÓ has been observed in a number of recent studies. For example, Jansen, Spink and Saracevic (2000) analyzed the transaction log of over 50,000 queries by 18,000 users of the Excite search engine, and found that most queries were short (a query contained 2.21 terms on average, with two in three containing one or two terms); and that Boolean and more advanced operators were seldom used (one in 18 users used any Boolean capabilities, and only one in 190 queries used nested logic).
Summary
The research presented here suggests that people who use the Web as an information resource to support their daily work activities engage in a range of complementary modes of information seeking, varying from undirected viewing that does not pursue a specific information need, to formal searching that retrieves focused information for action or decision making. Each mode of information seeking on the Web is distinguished by the nature of information needs, information seeking tactics, and the purpose of information use. The information seeking tactics characterizing each mode are revealed by recurrent sequences of browser actions initiated by the information seeker. At an operational level, we may postulate ways of supporting and enhancing the information seeking of Web users as they engage in these four modes of information seeking (Table 5, adapted from Choo, Detlor, Turnbull, 2000b).
Table 5. Enhancing Web Use
Web Use Mode
Enhancing Web Use
Undirected viewing:
starting and chaining¥ Introduce systems that can search or recommend new starting-sites or "similar-to" sites
¥ Encourage people in a group to share bookmarks, Web pages, URLs
¥ Design corporate portals to support undirected, broad-based viewing
Conditioned viewing: browsing, differentiating, monitoring
¥ Train users to evaluate and escalate priority or importance of information
¥ Make it easy for users to share Web-based information via email or in online forums
¥ Introduce users to services that allow them to subscribe to and be notified about new content on Web pages
Informal search: differentiating, monitoring,
extracting¥ Pre-select high quality sources and search engines for quick, informal searches
¥ Prepackage good search strategies developed by subject matter experts; allow users to view these strategies and learn from them
¥ Educate users on how to evaluate information provenance and quality
Formal search:
extracting¥ Educate users about full range of information sources that should be considered for comprehensive search: print, online, human sources
¥ Educate users about when to use: commercial online database services; the library or information resource center; information brokers/professionals
¥ Train users on advanced search techniques: narrowing or broadening a search; balancing precision and recall; backward and forward chaining.
Overall, the study suggests that a behavioral framework that links modes (strategies and reasons for viewing and searching) and moves (actions used to find and use information) may be helpful in analyzing Web-based information seeking. The study also suggests that multiple, complementary methods of collecting qualitative and quantitative data may be integrated within a single study to compose a more nuanced portrayal of how individuals seek and use Web-based information in their natural work settings.
(This research is supported by a grant from the Social Sciences and Humanities Research Council of Canada.)
References
Aguilar, F. J. (1967) Scanning the Business Environment, Macmillan Co., New York, NY.
Catledge, L. D. and J. E. Pitkow. (1995) ÒCharacterizing Browsing Strategies in the World Wide Web.Ó Available at http://www.igd.fhg.de/www/www95/papers/80/userpatterns/
UserPatterns.Paper4.formatted.html
Choo, C. W., Detlor, B. and Turnbull, D. (2000a) "Working the Web: An Empirical Model of Web Use." Presented at 33rd Hawaii International Conference on Systems Science, Muai, Hawaii, 2000.
Choo, C. W., Detlor, B. and Turnbull, D. (2000b) Web Work: Information Seeking and Knowledge Work on the World Wide Web, Kluwer Academic Publishers, Dordrecht, Netherlands.
Cockburn, A. and McKenzie, B. (2001) What Do Web Users Do? An Empirical Analysis of Web Use. International Journal of Human-Computer Studies, Vol 54 No 6, 903-922.
Cothey, V. (2002) A Longitudinal Study of World Wide Web Users' Information-searching Behavior. Journal of American Society for Information Science & Technology, Vol 53 No 2, 67-78.
Daft, R. L. and Weick, K. E. (1984) Toward a Model of Organizations as Interpretation Systems. Academy of Management Review, Vol 9 No 2, 284-295.
Ellis, D. and Haugan, M. (1997) Modelling the Information Seeking Patterns of Engineers and Research Scientists in an Industrial Environment. Journal of Documentation, Vol 53 No 4, 384-403.
Ellis, D., Cox, D. and Hall, K. (1993) A Comparison of the Information Seeking Patterns of Researchers in the Physical and Social Sciences. Journal of Documentation, Vol 49 No 4, 356-369.
Ellis, D. (1989) A Behavioural Model for Information Retrieval System Design. Journal of Information Science, Vol 15 No 4/5, 237-247.
Hšlscher, C. and Strube, G. (2000) Web Search Behavior of Internet Experts and Newbies. Computer Networks, Vol 33 No 1-6, 337-346.
Huberman, B. A., Pirolli, P. L., Pitkow, J. E. and Lukose, R. M. (1998) Strong Regularities in World Wide Web Surfing. Science, Vol 280 No 5360.
Jansen, B. J., Spink, A. and Saracevic, T. (2000) Real Life, Real Users, and Real Needs: A Study and Analysis of User Queries on the Web. Information Processing & Management, Vol 36 No 2, 207-228.
Kim, K.-S. and Allen, B. (2002) Cognitive and Task Influences on Web Searching Behavior. Journal of American Society for Information Science & Technology, Vol 53 No 2, 109-119.
Marchionini, G. M. (1995) Information Seeking in Electronic Environments, Cambridge University Press, Cambridge, UK.
Tauscher, L. and Greenberg, S. (1997) How People Revisit Web Pages: Empirical Findings and Implications for the Design of History Systems. International Journal of Human-Computer Studies, Vol 47 No 97-137.
Tauscher, L. and Greenberg, S. (1997) In CHI 97 Human Factors in Computing Systems (Ed, Pemberton, S.) CHI 97, Atlanta, Georgia, USA, pp. 399-406.
Weick, K. E. and Daft, R. L. (1983) In Organizational Effectiveness: A Comparison of Multiple Models (Eds, Cameron, K. S. and Whetten, D. A.) Academic Press, New York, NY, pp. 71-93.
Wilson, T. D. (1997) Information Behaviour: An Interdisciplinary Perspective. Information Processing & Management, Vol 33 No 4, 551-572.