{"title":"基于句子上下文的智能搜索","authors":"A. Chickinsky","doi":"10.1109/THS.2008.4534428","DOIUrl":null,"url":null,"abstract":"Fusion centers have access to terra bytes of information from both businesses and federal, state and local governments. The information ranges from computer generated databases to collections of notes with transcript of interviews performed by law enforcement personnel. Searching notes and transcripts is difficult and time consuming because humans do not use a comment set of phrases. Phrases vary due to past experiences, origin of birth and generational differences. Search engines try to compensate for these differences by performing context searches. Context searches replace specific words in the search request with other predetermined words. One can reduce false positives with an intelligent search based on grammar and English sentence structure. Intelligent sentence searching converts the each document into a set of simple sentences using only words in the predefined dictionary. These simple sentences capture the essence of the document. The conversion methodology uses synonyms, idiomatic expressions, grammar, patterns of speech and word location to create a searchable index. Because of the limited dictionary and elimination of most ambiguities, searches can be free of false positives. This paper describes the sentence context methodology, examples, and test results for a representative law enforcement report.","PeriodicalId":366416,"journal":{"name":"2008 IEEE Conference on Technologies for Homeland Security","volume":"47-48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Intelligent Searching using Sentence Context\",\"authors\":\"A. Chickinsky\",\"doi\":\"10.1109/THS.2008.4534428\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fusion centers have access to terra bytes of information from both businesses and federal, state and local governments. The information ranges from computer generated databases to collections of notes with transcript of interviews performed by law enforcement personnel. Searching notes and transcripts is difficult and time consuming because humans do not use a comment set of phrases. Phrases vary due to past experiences, origin of birth and generational differences. Search engines try to compensate for these differences by performing context searches. Context searches replace specific words in the search request with other predetermined words. One can reduce false positives with an intelligent search based on grammar and English sentence structure. Intelligent sentence searching converts the each document into a set of simple sentences using only words in the predefined dictionary. These simple sentences capture the essence of the document. The conversion methodology uses synonyms, idiomatic expressions, grammar, patterns of speech and word location to create a searchable index. Because of the limited dictionary and elimination of most ambiguities, searches can be free of false positives. This paper describes the sentence context methodology, examples, and test results for a representative law enforcement report.\",\"PeriodicalId\":366416,\"journal\":{\"name\":\"2008 IEEE Conference on Technologies for Homeland Security\",\"volume\":\"47-48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE Conference on Technologies for Homeland Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/THS.2008.4534428\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Conference on Technologies for Homeland Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/THS.2008.4534428","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fusion centers have access to terra bytes of information from both businesses and federal, state and local governments. The information ranges from computer generated databases to collections of notes with transcript of interviews performed by law enforcement personnel. Searching notes and transcripts is difficult and time consuming because humans do not use a comment set of phrases. Phrases vary due to past experiences, origin of birth and generational differences. Search engines try to compensate for these differences by performing context searches. Context searches replace specific words in the search request with other predetermined words. One can reduce false positives with an intelligent search based on grammar and English sentence structure. Intelligent sentence searching converts the each document into a set of simple sentences using only words in the predefined dictionary. These simple sentences capture the essence of the document. The conversion methodology uses synonyms, idiomatic expressions, grammar, patterns of speech and word location to create a searchable index. Because of the limited dictionary and elimination of most ambiguities, searches can be free of false positives. This paper describes the sentence context methodology, examples, and test results for a representative law enforcement report.