{"title":"Interactive query formulation","authors":"Nina Wacholder","doi":"10.1002/aris.2011.1440450111","DOIUrl":null,"url":null,"abstract":"Query formulation (QF) is the stage of the interactive information access process in which an information seeker translates an information need into a query and submits the query to an information access system such as a search engine (e.g., Google) or a library database (e.g., Factiva or PubMed). The system performs some computation to match the query with the documents most likely to be relevant to the query and returns a ranked list of relevant documents to the information seeker. QF is a key component of the information access process (Bates, 1986; Belkin, Cool, Croft, & Callan, 1993; Blair & Maron, 1985; Hall, Negus, & Dancy, 1972; Kuhlthau, 1991; Marchionini, 1995; Oddy, 1977). It is also very hard (Beghtol, 1986; Belkin, Oddy, & Brooks, 1991, 1982; Swanson, 1960; Taylor, 1968; ter Hofstede, Proper, & van der Weide, 1996). The difficulty of QF is an impediment to development of optimally effective systems—if only users would formulate better queries, information retrieval systems could do a much better job of returning relevant documents. In addition to being difficult to perform, QF is difficult to study. There are at least three reasons. First, the QF process takes place inside the human mind. The user’s mental representation of the information need and the process of translating that need into a query are not visible to outside observers. Selfreporting, although useful for studying some aspects of information seeking, is of limited use for studying mental representations and cognitive processes that take place below the level of consciousness (Wilson, 1994). Therefore the QF process must be studied indirectly, by analysis of queries or by other externally observable behaviors. Second, the impact of user characteristics on QF is hard to isolate. It is very difficult to assess the effectiveness of the product of QF—the queries—outside of the context of particular users. For example, in her review of literature on the impact of domain expertise, Wildemuth (2004) concludes that it is possible to generalize about the impact of domain expertise on the vocabulary used in the queries but not about the impact of domain expertise on effectiveness of results. Designing CHAPTER 4","PeriodicalId":55509,"journal":{"name":"Annual Review of Information Science and Technology","volume":"45 1","pages":"157-196"},"PeriodicalIF":0.0000,"publicationDate":"2013-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/aris.2011.1440450111","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annual Review of Information Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/aris.2011.1440450111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31
Abstract
Query formulation (QF) is the stage of the interactive information access process in which an information seeker translates an information need into a query and submits the query to an information access system such as a search engine (e.g., Google) or a library database (e.g., Factiva or PubMed). The system performs some computation to match the query with the documents most likely to be relevant to the query and returns a ranked list of relevant documents to the information seeker. QF is a key component of the information access process (Bates, 1986; Belkin, Cool, Croft, & Callan, 1993; Blair & Maron, 1985; Hall, Negus, & Dancy, 1972; Kuhlthau, 1991; Marchionini, 1995; Oddy, 1977). It is also very hard (Beghtol, 1986; Belkin, Oddy, & Brooks, 1991, 1982; Swanson, 1960; Taylor, 1968; ter Hofstede, Proper, & van der Weide, 1996). The difficulty of QF is an impediment to development of optimally effective systems—if only users would formulate better queries, information retrieval systems could do a much better job of returning relevant documents. In addition to being difficult to perform, QF is difficult to study. There are at least three reasons. First, the QF process takes place inside the human mind. The user’s mental representation of the information need and the process of translating that need into a query are not visible to outside observers. Selfreporting, although useful for studying some aspects of information seeking, is of limited use for studying mental representations and cognitive processes that take place below the level of consciousness (Wilson, 1994). Therefore the QF process must be studied indirectly, by analysis of queries or by other externally observable behaviors. Second, the impact of user characteristics on QF is hard to isolate. It is very difficult to assess the effectiveness of the product of QF—the queries—outside of the context of particular users. For example, in her review of literature on the impact of domain expertise, Wildemuth (2004) concludes that it is possible to generalize about the impact of domain expertise on the vocabulary used in the queries but not about the impact of domain expertise on effectiveness of results. Designing CHAPTER 4