{"title":"什么才是好概念?","authors":"Naren Khatwani, James Geller","doi":"arxiv-2409.06150","DOIUrl":null,"url":null,"abstract":"A good medical ontology is expected to cover its domain completely and\ncorrectly. On the other hand, large ontologies are hard to build, hard to\nunderstand, and hard to maintain. Thus, adding new concepts (often multi-word\nconcepts) to an existing ontology must be done judiciously. Only \"good\"\nconcepts should be added; however, it is difficult to define what makes a\nconcept good. In this research, we propose a metric to measure the goodness of\na concept. We identified factors that appear to influence goodness judgments of\nmedical experts and combined them into a single metric. These factors include\nconcept name length (in words), concept occurrence frequency in the medical\nliterature, and syntactic categories of component words. As an added factor, we\nused the simplicity of a term after mapping it into a specific foreign\nlanguage. We performed Bayesian optimization of factor weights to achieve\nmaximum agreement between the metric and three medical experts. The results\nshowed that our metric had a 50.67% overall agreement with the experts, as\nmeasured by Krippendorff's alpha.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"6 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"What makes a good concept anyway ?\",\"authors\":\"Naren Khatwani, James Geller\",\"doi\":\"arxiv-2409.06150\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A good medical ontology is expected to cover its domain completely and\\ncorrectly. On the other hand, large ontologies are hard to build, hard to\\nunderstand, and hard to maintain. Thus, adding new concepts (often multi-word\\nconcepts) to an existing ontology must be done judiciously. Only \\\"good\\\"\\nconcepts should be added; however, it is difficult to define what makes a\\nconcept good. In this research, we propose a metric to measure the goodness of\\na concept. We identified factors that appear to influence goodness judgments of\\nmedical experts and combined them into a single metric. These factors include\\nconcept name length (in words), concept occurrence frequency in the medical\\nliterature, and syntactic categories of component words. As an added factor, we\\nused the simplicity of a term after mapping it into a specific foreign\\nlanguage. We performed Bayesian optimization of factor weights to achieve\\nmaximum agreement between the metric and three medical experts. The results\\nshowed that our metric had a 50.67% overall agreement with the experts, as\\nmeasured by Krippendorff's alpha.\",\"PeriodicalId\":501281,\"journal\":{\"name\":\"arXiv - CS - Information Retrieval\",\"volume\":\"6 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Information Retrieval\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.06150\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06150","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A good medical ontology is expected to cover its domain completely and
correctly. On the other hand, large ontologies are hard to build, hard to
understand, and hard to maintain. Thus, adding new concepts (often multi-word
concepts) to an existing ontology must be done judiciously. Only "good"
concepts should be added; however, it is difficult to define what makes a
concept good. In this research, we propose a metric to measure the goodness of
a concept. We identified factors that appear to influence goodness judgments of
medical experts and combined them into a single metric. These factors include
concept name length (in words), concept occurrence frequency in the medical
literature, and syntactic categories of component words. As an added factor, we
used the simplicity of a term after mapping it into a specific foreign
language. We performed Bayesian optimization of factor weights to achieve
maximum agreement between the metric and three medical experts. The results
showed that our metric had a 50.67% overall agreement with the experts, as
measured by Krippendorff's alpha.