{"title":"使用自动词汇生成和传递文本挖掘与阿育吠陀有关的假设生成","authors":"Harsha Gopal Goud Vaka, S. Mukhopadhyay","doi":"10.1109/NBiS.2009.30","DOIUrl":null,"url":null,"abstract":"Automated extraction of knowledge from voluminous documents is a vast research area. Text mining is a promising approach for extracting knowledge from unstructured textual documents. The objective of this paper is to mine documents pertaining to Ayurveda, which are retrieved from PubMed into a databank, and find novel transitive associations among biological objects. This paper discusses the extraction of biological objects from the databank using an Automated Vocabulary Discovery (AVD) algorithm. A text-mining process is described for finding transitive (novel) associations among the extracted biological objects. The text mining algorithm, in addition to identifying novel associations (termed hypotheses), also assigns a numerical significance score to them. The expectation is that those with higher score have greater likelihood of being true than those with lower scores. Experimental results as well as their validation results are presented, demonstrating that the method has the potential to predict novel and interesting true associations.","PeriodicalId":312802,"journal":{"name":"2009 International Conference on Network-Based Information Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Hypotheses Generation Pertaining to Ayurveda Using Automated Vocabulary Generation and Transitive Text Mining\",\"authors\":\"Harsha Gopal Goud Vaka, S. Mukhopadhyay\",\"doi\":\"10.1109/NBiS.2009.30\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automated extraction of knowledge from voluminous documents is a vast research area. Text mining is a promising approach for extracting knowledge from unstructured textual documents. The objective of this paper is to mine documents pertaining to Ayurveda, which are retrieved from PubMed into a databank, and find novel transitive associations among biological objects. This paper discusses the extraction of biological objects from the databank using an Automated Vocabulary Discovery (AVD) algorithm. A text-mining process is described for finding transitive (novel) associations among the extracted biological objects. The text mining algorithm, in addition to identifying novel associations (termed hypotheses), also assigns a numerical significance score to them. The expectation is that those with higher score have greater likelihood of being true than those with lower scores. Experimental results as well as their validation results are presented, demonstrating that the method has the potential to predict novel and interesting true associations.\",\"PeriodicalId\":312802,\"journal\":{\"name\":\"2009 International Conference on Network-Based Information Systems\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Conference on Network-Based Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NBiS.2009.30\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Network-Based Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NBiS.2009.30","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hypotheses Generation Pertaining to Ayurveda Using Automated Vocabulary Generation and Transitive Text Mining
Automated extraction of knowledge from voluminous documents is a vast research area. Text mining is a promising approach for extracting knowledge from unstructured textual documents. The objective of this paper is to mine documents pertaining to Ayurveda, which are retrieved from PubMed into a databank, and find novel transitive associations among biological objects. This paper discusses the extraction of biological objects from the databank using an Automated Vocabulary Discovery (AVD) algorithm. A text-mining process is described for finding transitive (novel) associations among the extracted biological objects. The text mining algorithm, in addition to identifying novel associations (termed hypotheses), also assigns a numerical significance score to them. The expectation is that those with higher score have greater likelihood of being true than those with lower scores. Experimental results as well as their validation results are presented, demonstrating that the method has the potential to predict novel and interesting true associations.