利用自然性评价医学本体系统的形式化方法

Y. J. An, Kuo-Chuan Huang, Soon Ae Chun, J. Geller
{"title":"利用自然性评价医学本体系统的形式化方法","authors":"Y. J. An, Kuo-Chuan Huang, Soon Ae Chun, J. Geller","doi":"10.4018/jcmam.2010072001","DOIUrl":null,"url":null,"abstract":"Ontologies, terminologies and vocabularies are popular repositories for collecting the terms used in a domain. It may be expected that in the future more such ontologies will be created for domain experts. However, there is increasing interest in making the language of experts understandable to casual users. For example, cancer patients often research their cases on the Web. The authors consider the problem of objectively evaluating the quality of ontologies (QoO). This article formalizes the notion of naturalness as a component of QoO and quantitatively measures naturalness for well-known ontologies (UMLS, WordNet, OpenCyc) based on their concepts, IS-A relationships and semantic relationships. To compute numeric values characterizing the naturalness of an ontology, this article defines appropriate metrics. As absolute numbers in such a pursuit are often meaningless, we concentrate on using relative naturalness metrics. That allows us to say that a certain ontology is relatively more natural than another one. DOI: 10.4018/jcmam.2010072001 IGI PUBLISHING This paper appears in the publication, International Journal of Computational Models and Algorithms in Medicine, Volume 1, Issue 1 edited by Aryya Gangopadhyay © 2010, IGI Global 701 E. Chocolate Avenue, Hershey PA 17033-1240, USA Tel: 717/533-8845; Fax 717/533-8661; URL-http://www.igi-global.com ITJ 5526 2 International Journal of Computational Models and Algorithms in Medicine, 1(1), 1-18, January-March 2010 Copyright © 2010, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. factory result, but the user might be satisfied with the search result for the more general term Penicillin. Finding broader or narrower concepts of a given concept is an important technique, which is recommended as a Web search strategy. According to Kalfoglou & Hu (2006), application ontologies are converging with the Web. Thus the knowledge provided by ontologies should be filtered dynamically by understanding the needs of Web users. There are several well-known ontologies, which many researchers have used and referenced, such as UMLS, WordNet and OpenCyc. Some researchers have presented modified or enriched ontological models by adding new types and trimming some detailed relationships from existing ontologies (Stone et al., 2004). On the other hand, research that investigates these ontologies not only from the view point of experts but also from the perspective of casual users is rare. Assessing difficulties in understanding and using ontologies for emerging user communities on the Semantic Web should be conducted as a stage of implementing the Semantic Web (Finin et al., 2007). In his original work on ontologies, Gruber (1993) stressed that ontologies are about knowledge sharing. We raise the question whether existing ontologies are constructed so that they may succeed at knowledge sharing. Zeng et al. (2005) showed that communication through terminologies can be significantly facilitated if words labeling concepts are comprehensible to users. Finding concepts which are likely to be recognized by users is a trend in ontology engineering, which is different from the traditional approach of building terminologies understandable mainly by experts of a domain. We are focusing on an ontology’s role, that is, knowledge sharing supported by an explicit specification of a conceptualization. The key idea of naturalness is based on the need for making terminologies understandable, as described in previous research (An et al., 2006). Some researchers (Staab and Maedche, 2000) have made efforts in making explicit the meaning of some semantic relationships in the form of axioms. However, this declarative knowledge with universal truths about concepts cannot provide answers for all the forms of knowledge inquiries (Mizoguchi, 2004). It is widely assumed that ontologies represent information in a form that is at least similar to how human knowledge is represented (Smith, 1982). Note that the distinction between primitive and defined concepts (Baneyx et al., 2005) is not employed in this research. It is easy to give precise definitions in mathematically-oriented domains. However, in real world applications this is often not the case. To many researchers, an ontology concept is a meaningless label, unless it is given a definition. However, any definition itself will contain logical symbols and other labels. Logical symbols do not cause a problem because they are domain independent. However, how can the defining labels themselves be defined? This leads to an infinite regression or circular definitions. Thus, we assume that, at some level, labels have to be understandable by being known to the recipient (program or human). As there are labels that are better known, what we call “more natural,” and labels that are less well known to humans, we prefer to use the more natural labels even for programs. We note that the meaning and the naturalness of concepts are orthogonal factors, as will be explained in more detail below. The author of a document usually has very little choice concerning the meaning he needs to get across. However, he can choose the most natural term for a specific meaning. In related work about the quality of ontologies, Ram & Park (2004) have focused on semantic interoperability. To enhance the semantics of ontologies, a methodology and analysis have been presented in (Supekar et al., 2004; Brachman, 1992). The ontologies that these researchers presented are domain-specific, so the sizes of the ontologies used are relatively small. In a broader view, some existing ontologies are described in (Noy & Hafner, 1997), however, numeric values and classifications of the ontologies are discussed without being supported by a mathematical model. 16 more pages are available in the full version of this document, which may be purchased using the \"Add to Cart\" button on the product's webpage: www.igi-global.com/article/formal-approach-evaluatingmedical-ontology/38941?camid=4v1 This title is available in InfoSci-Journals, InfoSci-Journal Disciplines Medicine, Healthcare, and Life Science. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=2","PeriodicalId":162417,"journal":{"name":"Int. J. Comput. Model. Algorithms Medicine","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"A Formal Approach to Evaluating Medical Ontology Systems using Naturalness\",\"authors\":\"Y. J. An, Kuo-Chuan Huang, Soon Ae Chun, J. Geller\",\"doi\":\"10.4018/jcmam.2010072001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ontologies, terminologies and vocabularies are popular repositories for collecting the terms used in a domain. It may be expected that in the future more such ontologies will be created for domain experts. However, there is increasing interest in making the language of experts understandable to casual users. For example, cancer patients often research their cases on the Web. The authors consider the problem of objectively evaluating the quality of ontologies (QoO). This article formalizes the notion of naturalness as a component of QoO and quantitatively measures naturalness for well-known ontologies (UMLS, WordNet, OpenCyc) based on their concepts, IS-A relationships and semantic relationships. To compute numeric values characterizing the naturalness of an ontology, this article defines appropriate metrics. As absolute numbers in such a pursuit are often meaningless, we concentrate on using relative naturalness metrics. That allows us to say that a certain ontology is relatively more natural than another one. DOI: 10.4018/jcmam.2010072001 IGI PUBLISHING This paper appears in the publication, International Journal of Computational Models and Algorithms in Medicine, Volume 1, Issue 1 edited by Aryya Gangopadhyay © 2010, IGI Global 701 E. Chocolate Avenue, Hershey PA 17033-1240, USA Tel: 717/533-8845; Fax 717/533-8661; URL-http://www.igi-global.com ITJ 5526 2 International Journal of Computational Models and Algorithms in Medicine, 1(1), 1-18, January-March 2010 Copyright © 2010, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. factory result, but the user might be satisfied with the search result for the more general term Penicillin. Finding broader or narrower concepts of a given concept is an important technique, which is recommended as a Web search strategy. According to Kalfoglou & Hu (2006), application ontologies are converging with the Web. Thus the knowledge provided by ontologies should be filtered dynamically by understanding the needs of Web users. There are several well-known ontologies, which many researchers have used and referenced, such as UMLS, WordNet and OpenCyc. Some researchers have presented modified or enriched ontological models by adding new types and trimming some detailed relationships from existing ontologies (Stone et al., 2004). On the other hand, research that investigates these ontologies not only from the view point of experts but also from the perspective of casual users is rare. Assessing difficulties in understanding and using ontologies for emerging user communities on the Semantic Web should be conducted as a stage of implementing the Semantic Web (Finin et al., 2007). In his original work on ontologies, Gruber (1993) stressed that ontologies are about knowledge sharing. We raise the question whether existing ontologies are constructed so that they may succeed at knowledge sharing. Zeng et al. (2005) showed that communication through terminologies can be significantly facilitated if words labeling concepts are comprehensible to users. Finding concepts which are likely to be recognized by users is a trend in ontology engineering, which is different from the traditional approach of building terminologies understandable mainly by experts of a domain. We are focusing on an ontology’s role, that is, knowledge sharing supported by an explicit specification of a conceptualization. The key idea of naturalness is based on the need for making terminologies understandable, as described in previous research (An et al., 2006). Some researchers (Staab and Maedche, 2000) have made efforts in making explicit the meaning of some semantic relationships in the form of axioms. However, this declarative knowledge with universal truths about concepts cannot provide answers for all the forms of knowledge inquiries (Mizoguchi, 2004). It is widely assumed that ontologies represent information in a form that is at least similar to how human knowledge is represented (Smith, 1982). Note that the distinction between primitive and defined concepts (Baneyx et al., 2005) is not employed in this research. It is easy to give precise definitions in mathematically-oriented domains. However, in real world applications this is often not the case. To many researchers, an ontology concept is a meaningless label, unless it is given a definition. However, any definition itself will contain logical symbols and other labels. Logical symbols do not cause a problem because they are domain independent. However, how can the defining labels themselves be defined? This leads to an infinite regression or circular definitions. Thus, we assume that, at some level, labels have to be understandable by being known to the recipient (program or human). As there are labels that are better known, what we call “more natural,” and labels that are less well known to humans, we prefer to use the more natural labels even for programs. We note that the meaning and the naturalness of concepts are orthogonal factors, as will be explained in more detail below. The author of a document usually has very little choice concerning the meaning he needs to get across. However, he can choose the most natural term for a specific meaning. In related work about the quality of ontologies, Ram & Park (2004) have focused on semantic interoperability. To enhance the semantics of ontologies, a methodology and analysis have been presented in (Supekar et al., 2004; Brachman, 1992). The ontologies that these researchers presented are domain-specific, so the sizes of the ontologies used are relatively small. In a broader view, some existing ontologies are described in (Noy & Hafner, 1997), however, numeric values and classifications of the ontologies are discussed without being supported by a mathematical model. 16 more pages are available in the full version of this document, which may be purchased using the \\\"Add to Cart\\\" button on the product's webpage: www.igi-global.com/article/formal-approach-evaluatingmedical-ontology/38941?camid=4v1 This title is available in InfoSci-Journals, InfoSci-Journal Disciplines Medicine, Healthcare, and Life Science. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=2\",\"PeriodicalId\":162417,\"journal\":{\"name\":\"Int. J. Comput. Model. Algorithms Medicine\",\"volume\":\"67 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Comput. Model. Algorithms Medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/jcmam.2010072001\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Comput. Model. Algorithms Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/jcmam.2010072001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

本体、术语和词汇表是收集领域中使用的术语的常用存储库。可以预期,在未来将为领域专家创建更多这样的本体。然而,越来越多的人热衷于让专家的语言能够被普通用户理解。例如,癌症患者经常在网上研究他们的病例。作者考虑了客观评价本体质量的问题。本文将自然性的概念形式化为QoO的一个组成部分,并基于众所周知的本体(UMLS、WordNet、OpenCyc)的概念、IS-A关系和语义关系,定量地度量它们的自然性。为了计算表征本体自然度的数值,本文定义了适当的度量。因为在这种追求中,绝对数字通常是没有意义的,所以我们专注于使用相对自然度度量。这就允许我们说某个本体论相对来说比另一个更自然。DOI: 10.4018 / jcmam.2010072001这篇论文发表在国际医学计算模型和算法杂志上,第1卷,第1期,由Aryya Gangopadhyay编辑©2010,IGI Global 701 E. Chocolate Avenue, Hershey PA 17033-1240, USA Tel: 717/533-8845;传真717/533 - 8661;URL-http://www.igi-global.com ITJ 5526 2国际医学计算模型与算法杂志,1(1),1- 18,2010年1- 3月版权所有©2010,IGI Global。未经IGI Global书面许可,禁止以印刷或电子形式复制或分发。但是用户可能会对更通用的术语Penicillin的搜索结果感到满意。查找给定概念的更宽或更窄的概念是一项重要的技术,建议将其作为Web搜索策略。根据Kalfoglou & Hu(2006)的说法,应用程序本体正在与Web融合。因此,本体提供的知识应该通过理解Web用户的需求来进行动态过滤。有几个著名的本体,许多研究人员已经使用和引用,如UMLS, WordNet和OpenCyc。一些研究人员提出了修改或丰富的本体论模型,增加新的类型,并从现有的本体论中删除一些详细的关系(Stone et al., 2004)。另一方面,既从专家的角度,又从休闲用户的角度来调查这些本体的研究很少。评估理解和使用语义Web上新兴用户社区本体的困难应该作为实现语义Web的一个阶段(Finin et al., 2007)。格鲁伯(Gruber, 1993)在本体论的原创著作中强调本体论是关于知识共享的。我们提出了一个问题,即现有的本体是否被构造成能够成功地进行知识共享。Zeng等人(2005)的研究表明,如果词语标注概念对用户来说是可理解的,那么通过术语进行交流就会大大便利。寻找可能被用户识别的概念是本体工程的一个趋势,这与传统的主要由一个领域的专家可以理解的术语构建方法不同。我们关注的是本体的角色,即由概念化的明确规范支持的知识共享。正如之前的研究所描述的那样,自然性的关键思想是基于使术语易于理解的需求(An et al., 2006)。一些研究者(Staab和Maedche, 2000)已经努力以公理的形式明确一些语义关系的意义。然而,这种具有关于概念的普遍真理的陈述性知识不能为所有形式的知识查询提供答案(沟口,2004)。人们普遍认为,本体表示信息的形式至少与人类知识的表示方式相似(Smith, 1982)。请注意,本研究没有使用原始概念和定义概念之间的区别(Baneyx et al., 2005)。在面向数学的领域中,很容易给出精确的定义。然而,在现实世界的应用程序中,情况往往并非如此。对于许多研究者来说,本体概念是一个没有意义的标签,除非给它一个定义。然而,任何定义本身都将包含逻辑符号和其他标签。逻辑符号不会引起问题,因为它们是领域独立的。但是,如何定义定义标签本身呢?这导致了无限回归或循环定义。因此,我们假设,在某种程度上,标签必须是可理解的,因为收件人(程序或人)知道。因为有些标签是我们更熟悉的,我们称之为“更自然”的,有些标签是人类不太熟悉的,我们更喜欢使用更自然的标签,甚至对于程序也是如此。 我们注意到,概念的意义和自然性是正交因素,下面将更详细地解释这一点。一份文件的作者通常没有多少选择的余地来表达他需要表达的意思。然而,他可以选择最自然的术语来表达特定的含义。在本体质量的相关工作中,Ram和Park(2004)关注语义互操作性。为了增强本体的语义,Supekar et al., 2004;Brachman, 1992)。这些研究人员提出的本体是特定于领域的,因此所使用的本体的大小相对较小。从更广泛的角度来看,(Noy & Hafner, 1997)中描述了一些现有的本体,然而,在没有数学模型支持的情况下,讨论了本体的数值和分类。本文档的完整版还有16页,可通过产品网页上的“添加到购物车”按钮购买:www.igi-global.com/article/formal-approach-evaluatingmedical-ontology/38941?camid=4v1。本标题可在infosci -期刊、infosci -期刊学科医学、医疗保健和生命科学中找到。向您的图书管理员推荐此产品:www.igi-global.com/e-resources/libraryrecommendation/?id=2
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Formal Approach to Evaluating Medical Ontology Systems using Naturalness
Ontologies, terminologies and vocabularies are popular repositories for collecting the terms used in a domain. It may be expected that in the future more such ontologies will be created for domain experts. However, there is increasing interest in making the language of experts understandable to casual users. For example, cancer patients often research their cases on the Web. The authors consider the problem of objectively evaluating the quality of ontologies (QoO). This article formalizes the notion of naturalness as a component of QoO and quantitatively measures naturalness for well-known ontologies (UMLS, WordNet, OpenCyc) based on their concepts, IS-A relationships and semantic relationships. To compute numeric values characterizing the naturalness of an ontology, this article defines appropriate metrics. As absolute numbers in such a pursuit are often meaningless, we concentrate on using relative naturalness metrics. That allows us to say that a certain ontology is relatively more natural than another one. DOI: 10.4018/jcmam.2010072001 IGI PUBLISHING This paper appears in the publication, International Journal of Computational Models and Algorithms in Medicine, Volume 1, Issue 1 edited by Aryya Gangopadhyay © 2010, IGI Global 701 E. Chocolate Avenue, Hershey PA 17033-1240, USA Tel: 717/533-8845; Fax 717/533-8661; URL-http://www.igi-global.com ITJ 5526 2 International Journal of Computational Models and Algorithms in Medicine, 1(1), 1-18, January-March 2010 Copyright © 2010, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. factory result, but the user might be satisfied with the search result for the more general term Penicillin. Finding broader or narrower concepts of a given concept is an important technique, which is recommended as a Web search strategy. According to Kalfoglou & Hu (2006), application ontologies are converging with the Web. Thus the knowledge provided by ontologies should be filtered dynamically by understanding the needs of Web users. There are several well-known ontologies, which many researchers have used and referenced, such as UMLS, WordNet and OpenCyc. Some researchers have presented modified or enriched ontological models by adding new types and trimming some detailed relationships from existing ontologies (Stone et al., 2004). On the other hand, research that investigates these ontologies not only from the view point of experts but also from the perspective of casual users is rare. Assessing difficulties in understanding and using ontologies for emerging user communities on the Semantic Web should be conducted as a stage of implementing the Semantic Web (Finin et al., 2007). In his original work on ontologies, Gruber (1993) stressed that ontologies are about knowledge sharing. We raise the question whether existing ontologies are constructed so that they may succeed at knowledge sharing. Zeng et al. (2005) showed that communication through terminologies can be significantly facilitated if words labeling concepts are comprehensible to users. Finding concepts which are likely to be recognized by users is a trend in ontology engineering, which is different from the traditional approach of building terminologies understandable mainly by experts of a domain. We are focusing on an ontology’s role, that is, knowledge sharing supported by an explicit specification of a conceptualization. The key idea of naturalness is based on the need for making terminologies understandable, as described in previous research (An et al., 2006). Some researchers (Staab and Maedche, 2000) have made efforts in making explicit the meaning of some semantic relationships in the form of axioms. However, this declarative knowledge with universal truths about concepts cannot provide answers for all the forms of knowledge inquiries (Mizoguchi, 2004). It is widely assumed that ontologies represent information in a form that is at least similar to how human knowledge is represented (Smith, 1982). Note that the distinction between primitive and defined concepts (Baneyx et al., 2005) is not employed in this research. It is easy to give precise definitions in mathematically-oriented domains. However, in real world applications this is often not the case. To many researchers, an ontology concept is a meaningless label, unless it is given a definition. However, any definition itself will contain logical symbols and other labels. Logical symbols do not cause a problem because they are domain independent. However, how can the defining labels themselves be defined? This leads to an infinite regression or circular definitions. Thus, we assume that, at some level, labels have to be understandable by being known to the recipient (program or human). As there are labels that are better known, what we call “more natural,” and labels that are less well known to humans, we prefer to use the more natural labels even for programs. We note that the meaning and the naturalness of concepts are orthogonal factors, as will be explained in more detail below. The author of a document usually has very little choice concerning the meaning he needs to get across. However, he can choose the most natural term for a specific meaning. In related work about the quality of ontologies, Ram & Park (2004) have focused on semantic interoperability. To enhance the semantics of ontologies, a methodology and analysis have been presented in (Supekar et al., 2004; Brachman, 1992). The ontologies that these researchers presented are domain-specific, so the sizes of the ontologies used are relatively small. In a broader view, some existing ontologies are described in (Noy & Hafner, 1997), however, numeric values and classifications of the ontologies are discussed without being supported by a mathematical model. 16 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the product's webpage: www.igi-global.com/article/formal-approach-evaluatingmedical-ontology/38941?camid=4v1 This title is available in InfoSci-Journals, InfoSci-Journal Disciplines Medicine, Healthcare, and Life Science. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=2
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信