先生:一个统计信息检索系统

C. D. Parsons
{"title":"先生:一个统计信息检索系统","authors":"C. D. Parsons","doi":"10.1145/800257.808921","DOIUrl":null,"url":null,"abstract":"This paper describes the techniques and results of an information retrieval system utilizing an IBM 7094 installation at Phillips Petroleum Company. The Statistical Information Retrieval (SIR) system employs a -and-ldquo;co-ordinate concept-and-rdquo; with a logic based on the statistical probability that a desired document will be abstracted with a relatively high percentage of the identical or synonym keywords used in posing an inquiry concerning the document subjects. This approach to retrieval allows the use of an unlimited vocabulary, thus eliminating the need for a dictionary or thesaurus. The SIR programming incorporates a unique computing technique, vector manipulations and a search strategy which permit the system to operate efficiently on a large-scale computer. A bibliography and a representative example of the SIR program input and output are contained in an Appendix.","PeriodicalId":167902,"journal":{"name":"Proceedings of the 1964 19th ACM national conference","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1964-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SIR: A statistical information retrieval system\",\"authors\":\"C. D. Parsons\",\"doi\":\"10.1145/800257.808921\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes the techniques and results of an information retrieval system utilizing an IBM 7094 installation at Phillips Petroleum Company. The Statistical Information Retrieval (SIR) system employs a -and-ldquo;co-ordinate concept-and-rdquo; with a logic based on the statistical probability that a desired document will be abstracted with a relatively high percentage of the identical or synonym keywords used in posing an inquiry concerning the document subjects. This approach to retrieval allows the use of an unlimited vocabulary, thus eliminating the need for a dictionary or thesaurus. The SIR programming incorporates a unique computing technique, vector manipulations and a search strategy which permit the system to operate efficiently on a large-scale computer. A bibliography and a representative example of the SIR program input and output are contained in an Appendix.\",\"PeriodicalId\":167902,\"journal\":{\"name\":\"Proceedings of the 1964 19th ACM national conference\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1964-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 1964 19th ACM national conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/800257.808921\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1964 19th ACM national conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/800257.808921","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文介绍了菲利普斯石油公司利用IBM 7094安装的信息检索系统的技术和结果。统计资料检索(SIR)系统采用“和”的概念和“和”的概念。使用基于统计概率的逻辑,期望的文档将被抽象为在提出关于文档主题的查询时使用相对较高百分比的相同或同义词关键字。这种检索方法允许使用无限的词汇表,从而消除了对字典或同义词库的需要。SIR编程结合了一种独特的计算技术,矢量操作和搜索策略,允许系统在大型计算机上有效地运行。参考书目和SIR程序输入和输出的代表性示例包含在附录中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
SIR: A statistical information retrieval system
This paper describes the techniques and results of an information retrieval system utilizing an IBM 7094 installation at Phillips Petroleum Company. The Statistical Information Retrieval (SIR) system employs a -and-ldquo;co-ordinate concept-and-rdquo; with a logic based on the statistical probability that a desired document will be abstracted with a relatively high percentage of the identical or synonym keywords used in posing an inquiry concerning the document subjects. This approach to retrieval allows the use of an unlimited vocabulary, thus eliminating the need for a dictionary or thesaurus. The SIR programming incorporates a unique computing technique, vector manipulations and a search strategy which permit the system to operate efficiently on a large-scale computer. A bibliography and a representative example of the SIR program input and output are contained in an Appendix.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信