Chidchanok Choksuchat, Suphaksa Ngamphak, Benjaporn Maneesaeng, Yuwathida Chiwpreechar, C. Chantrapornchai
{"title":"Parallel health tourism information extraction and ontology storage","authors":"Chidchanok Choksuchat, Suphaksa Ngamphak, Benjaporn Maneesaeng, Yuwathida Chiwpreechar, C. Chantrapornchai","doi":"10.1109/JCSSE.2014.6841873","DOIUrl":null,"url":null,"abstract":"Health tourism is now popular and being promoted in Thailand since it is one of the growing industry. Health tourism information is scattered around in many places especially, in the websites. A health tourism service provider may be in many forms such as in the hotel, as a separated business, as a hospital etc. Each service provider may have its own website as well as the websites from common providers such as Tripadvisor, Agoda, or Atsiam, etc. Each website provides different information about just one service provider. In this work, we are interested in information gathering process for health tourism. We introduce the use of parallel Java platform to gather the tourism information particularly, using a Java concurrent program and merge the information using MapReduce. The information we gather are preprocessed and combined with the information manually collected. Google Refine is used to merge all the information into single health tourism ontology.","PeriodicalId":331610,"journal":{"name":"2014 11th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"121 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 11th International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCSSE.2014.6841873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Health tourism is now popular and being promoted in Thailand since it is one of the growing industry. Health tourism information is scattered around in many places especially, in the websites. A health tourism service provider may be in many forms such as in the hotel, as a separated business, as a hospital etc. Each service provider may have its own website as well as the websites from common providers such as Tripadvisor, Agoda, or Atsiam, etc. Each website provides different information about just one service provider. In this work, we are interested in information gathering process for health tourism. We introduce the use of parallel Java platform to gather the tourism information particularly, using a Java concurrent program and merge the information using MapReduce. The information we gather are preprocessed and combined with the information manually collected. Google Refine is used to merge all the information into single health tourism ontology.