Intelligent Searching using Sentence Context

A. Chickinsky
Published in: 2008 IEEE Conference on Technologies for Homeland Security (2008-05-12)
DOI: 10.1109/THS.2008.4534428 (https://doi.org/10.1109/THS.2008.4534428)
Citations: 3

Abstract

Intelligent Searching using Sentence Context
Fusion centers have access to terabytes of information from businesses and from federal, state, and local governments. The information ranges from computer-generated databases to collections of notes and transcripts of interviews performed by law enforcement personnel. Searching notes and transcripts is difficult and time-consuming because people do not use a common set of phrases. Phrases vary with past experience, place of birth, and generation. Search engines try to compensate for these differences by performing context searches, which replace specific words in the search request with other predetermined words. One can reduce false positives with an intelligent search based on grammar and English sentence structure. Intelligent sentence searching converts each document into a set of simple sentences that use only words in a predefined dictionary. These simple sentences capture the essence of the document. The conversion methodology uses synonyms, idiomatic expressions, grammar, patterns of speech, and word location to create a searchable index. Because the dictionary is limited and most ambiguities are eliminated, searches can be free of false positives. This paper describes the sentence context methodology, examples, and test results for a representative law enforcement report.
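The abstract gives no implementation details, so the following is only a minimal sketch of the general idea: reduce each sentence to a sequence of words from a small controlled dictionary, then index documents by those simplified sentences. The dictionary, the idiom and synonym tables, and the function names `simplify`, `build_index`, and `search` are all hypothetical and are not taken from the paper. Exact matching on an ordered word tuple stands in for the grammar-based analysis the paper describes.

```python
import re
from collections import defaultdict

# Hypothetical controlled vocabulary: only these words survive simplification.
DICTIONARY = {"suspect", "steal", "car", "flee", "police", "arrest"}

# Hypothetical idiom table (multi-word phrases, applied before tokenizing).
IDIOMS = {"took off": "flee", "made off with": "steal", "picked up": "arrest"}

# Hypothetical synonym table mapping surface words to dictionary words.
SYNONYMS = {
    "perp": "suspect", "perpetrator": "suspect",
    "stole": "steal", "swiped": "steal",
    "vehicle": "car", "sedan": "car", "automobile": "car",
    "fled": "flee", "ran": "flee",
    "officers": "police", "cops": "police",
    "arrested": "arrest",
}


def simplify(sentence):
    """Reduce a sentence to an ordered tuple of dictionary words."""
    text = sentence.lower()
    for idiom, word in IDIOMS.items():
        text = re.sub(r"\b" + re.escape(idiom) + r"\b", word, text)
    tokens = re.findall(r"[a-z]+", text)
    mapped = (SYNONYMS.get(token, token) for token in tokens)
    # Words outside the dictionary are dropped; word order is preserved.
    return tuple(word for word in mapped if word in DICTIONARY)


def build_index(documents):
    """Map each simplified sentence to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for sentence in re.split(r"[.!?]+", text):
            key = simplify(sentence)
            if key:
                index[key].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing a sentence with the same simplified form."""
    return index.get(simplify(query), set())


documents = {
    "r1": "The perp swiped a sedan. He took off before officers arrived.",
    "r2": "Officers picked up the suspect.",
}
index = build_index(documents)
print(search(index, "The suspect stole a car"))    # {'r1'}
print(search(index, "Police arrested the suspect"))  # {'r2'}
print(search(index, "The car stole the suspect"))  # set()
```

Because the key keeps word order, the last query does not match the first report even though it contains the same dictionary words. A plain synonym-substitution keyword search would return that report as a false positive.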