语义学和Goffman间接方法的实验

Ann Helmuth Shimko
{"title":"语义学和Goffman间接方法的实验","authors":"Ann Helmuth Shimko","doi":"10.1016/0020-0271(74)90045-X","DOIUrl":null,"url":null,"abstract":"<div><p>Goffman's indirect method partitions a group of documents into classes based on the number of attributes each pair of documents have in common. The classes produced depend on the threshold value chosen to generate the classes. The indirect method was used here to classify a set of one hundred and nine medical documents, using the index terms assigned by <em>Index Medicus</em>. The attributes used in the indirect method were defined in three ways. First, only exact matches between index terms were considered as attributes in common, resulting in a very fine partition. Next, MeSH tree structure was used to approximate the relationships between index terms, so that the attributes in common were words within a small semantic distance of each other. This produced a broader partition, but several of the documents related in the exact word match dropped below the threshold. To compensate for this, a third definition was used, to give extra weight to exact match relationships. This produced a reasonable classification with all classes nameable. Graphic representations of the three partitions illustrate the structure of the set of documents.</p></div>","PeriodicalId":100670,"journal":{"name":"Information Storage and Retrieval","volume":"10 11","pages":"Pages 387-392"},"PeriodicalIF":0.0000,"publicationDate":"1974-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0020-0271(74)90045-X","citationCount":"1","resultStr":"{\"title\":\"An experiment with semantics and Goffman's indirect method\",\"authors\":\"Ann Helmuth Shimko\",\"doi\":\"10.1016/0020-0271(74)90045-X\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Goffman's indirect method partitions a group of documents into classes based on the number of attributes each pair of documents have in common. The classes produced depend on the threshold value chosen to generate the classes. The indirect method was used here to classify a set of one hundred and nine medical documents, using the index terms assigned by <em>Index Medicus</em>. The attributes used in the indirect method were defined in three ways. First, only exact matches between index terms were considered as attributes in common, resulting in a very fine partition. Next, MeSH tree structure was used to approximate the relationships between index terms, so that the attributes in common were words within a small semantic distance of each other. This produced a broader partition, but several of the documents related in the exact word match dropped below the threshold. To compensate for this, a third definition was used, to give extra weight to exact match relationships. This produced a reasonable classification with all classes nameable. Graphic representations of the three partitions illustrate the structure of the set of documents.</p></div>\",\"PeriodicalId\":100670,\"journal\":{\"name\":\"Information Storage and Retrieval\",\"volume\":\"10 11\",\"pages\":\"Pages 387-392\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1974-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1016/0020-0271(74)90045-X\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Storage and Retrieval\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/002002717490045X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Storage and Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/002002717490045X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

Goffman的间接方法根据每对文档共有的属性数量将一组文档划分为若干类。生成的类取决于为生成类而选择的阈值。本文采用间接分类法对109份医学文献进行分类,并使用index Medicus指定的索引项。间接方法中使用的属性以三种方式定义。首先,只有索引项之间的精确匹配才被认为是公共属性,从而产生非常精细的分区。其次,利用MeSH树结构近似索引项之间的关系,使共有属性为彼此语义距离较小的词。这产生了一个更大的分区,但是与精确单词匹配相关的几个文档降到了阈值以下。为了弥补这一点,使用了第三个定义,赋予精确匹配关系额外的权重。这产生了一个合理的分类,所有的类都是可命名的。三个分区的图形表示说明了文档集的结构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An experiment with semantics and Goffman's indirect method

Goffman's indirect method partitions a group of documents into classes based on the number of attributes each pair of documents have in common. The classes produced depend on the threshold value chosen to generate the classes. The indirect method was used here to classify a set of one hundred and nine medical documents, using the index terms assigned by Index Medicus. The attributes used in the indirect method were defined in three ways. First, only exact matches between index terms were considered as attributes in common, resulting in a very fine partition. Next, MeSH tree structure was used to approximate the relationships between index terms, so that the attributes in common were words within a small semantic distance of each other. This produced a broader partition, but several of the documents related in the exact word match dropped below the threshold. To compensate for this, a third definition was used, to give extra weight to exact match relationships. This produced a reasonable classification with all classes nameable. Graphic representations of the three partitions illustrate the structure of the set of documents.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信