FIRST: Flexible Information Retrieval System for Text
R. Dattola
Journal of the American Society for Information Science and Technology, 35(1), pp. 9-14
Published: 2007-09-06
DOI: https://doi.org/10.1002/asi.4630300103
Citation count: 29
Abstract
An on‐line document retrieval system is described that combines a data base management system with automatic processing of natural language queries and abstracts. Each document record consists of an abstract, from which index terms are automatically extracted, along with bibliographic and descriptive information. The data base management system stores the bibliographic and descriptive information and provides direct access to documents with specified bibliographic or descriptor items. Methods originally developed in the SMART project are used for abstract analysis: a stemming algorithm, a cosine function for query‐document comparisons, ranked output, and a clustered document collection. Searches are entered and performed on‐line, with output consisting of document abstracts ranked in decreasing order of similarity to the query. Additional facilities include off‐line searches, selective dissemination of information (SDI), and display of data base statistics. Future plans and improvements are also discussed.
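The abstract-analysis steps named above (stemming, cosine comparison, ranked output) can be illustrated with a minimal sketch. This is not the FIRST or SMART implementation: the function names (`crude_stem`, `term_vector`, `cosine`, `rank`), the raw term-frequency weighting, and the toy suffix stripper are all assumptions made for illustration, and clustering is left out.

```python
import math
import re
from collections import Counter


def crude_stem(word):
    """Strip a few common suffixes.

    Illustrative stand-in only; this is NOT the SMART stemming algorithm.
    """
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def term_vector(text):
    """Map text to a bag of stemmed terms with raw term-frequency weights."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(crude_stem(t) for t in tokens)


def cosine(a, b):
    """Cosine of the angle between two sparse term vectors."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, abstracts):
    """Return (doc_id, score) pairs in decreasing order of similarity.

    `abstracts` is a hypothetical dict mapping document ids to abstract text.
    Ties are broken by document id so the order is deterministic.
    """
    q = term_vector(query)
    scored = [(cosine(q, term_vector(text)), doc_id)
              for doc_id, text in abstracts.items()]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(doc_id, score) for score, doc_id in scored]
```

Calling `rank("document retrieval", {...})` on a small dictionary of abstracts returns the document ids with their scores, best match first, which mirrors the ranked output the abstract describes. A production system would normally use a real stemmer and a weighting scheme other than raw term counts.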