Asty Nabilah ’Izzaturrahmah, F. Nhita, I. Kurniawan
{"title":"支持向量机在基于文本的药品审查内容GERD检测中的实现","authors":"Asty Nabilah ’Izzaturrahmah, F. Nhita, I. Kurniawan","doi":"10.1109/ICADEIS52521.2021.9702073","DOIUrl":null,"url":null,"abstract":"GERD or Gastroesophageal Reflux Disease is a situation when the reflux of stomach contents leads to unpleasant symptoms and/or complications. The prevalence range of GERD is approximately 18.1% to 27.8% in North America, 8.8% to 25.9% in Europe, 2.5% to 7.8% in East Asia, 8.7% to 33.1% in the Middle East, 11.6% in Australia, and 23.0% in South America. The numbers may seem small, but GERD will lead to several complications including esophagitis, peptic stricture, and Barrett’s esophagus if left untreated. The most common diagnostic test for the assessment of GERD along with its possible complications is the upper gastrointestinal endoscopy, or esophagogastroduodenoscopy (EGD). However, endoscopy has several risks. Disease detection using machine learning can be done and is needed due to the increment in medical data, new detection, and diagnostic modalities being developed. One of the machine learning algorithms often used in text classification is Support Vector Machine (SVM). This research applies SVM to do text-based classification, classifying data into two classes, namely GERD patient” and “not GERD patient using drug review data. The best model has 91.32% accuracy, 91% f1-score, and 91.32% AUC score with unigram as the n-gram range, and RBF with C is 1000, and gamma auto as the SVM kernel.","PeriodicalId":422702,"journal":{"name":"2021 International Conference Advancement in Data Science, E-learning and Information Systems (ICADEIS)","volume":"43 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Implementation of Support Vector Machine on Text-based GERD Detection by using Drug Review Content\",\"authors\":\"Asty Nabilah ’Izzaturrahmah, F. Nhita, I. Kurniawan\",\"doi\":\"10.1109/ICADEIS52521.2021.9702073\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"GERD or Gastroesophageal Reflux Disease is a situation when the reflux of stomach contents leads to unpleasant symptoms and/or complications. The prevalence range of GERD is approximately 18.1% to 27.8% in North America, 8.8% to 25.9% in Europe, 2.5% to 7.8% in East Asia, 8.7% to 33.1% in the Middle East, 11.6% in Australia, and 23.0% in South America. The numbers may seem small, but GERD will lead to several complications including esophagitis, peptic stricture, and Barrett’s esophagus if left untreated. The most common diagnostic test for the assessment of GERD along with its possible complications is the upper gastrointestinal endoscopy, or esophagogastroduodenoscopy (EGD). However, endoscopy has several risks. Disease detection using machine learning can be done and is needed due to the increment in medical data, new detection, and diagnostic modalities being developed. One of the machine learning algorithms often used in text classification is Support Vector Machine (SVM). This research applies SVM to do text-based classification, classifying data into two classes, namely GERD patient” and “not GERD patient using drug review data. The best model has 91.32% accuracy, 91% f1-score, and 91.32% AUC score with unigram as the n-gram range, and RBF with C is 1000, and gamma auto as the SVM kernel.\",\"PeriodicalId\":422702,\"journal\":{\"name\":\"2021 International Conference Advancement in Data Science, E-learning and Information Systems (ICADEIS)\",\"volume\":\"43 6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference Advancement in Data Science, E-learning and Information Systems (ICADEIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICADEIS52521.2021.9702073\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference Advancement in Data Science, E-learning and Information Systems (ICADEIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICADEIS52521.2021.9702073","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Implementation of Support Vector Machine on Text-based GERD Detection by using Drug Review Content
GERD or Gastroesophageal Reflux Disease is a situation when the reflux of stomach contents leads to unpleasant symptoms and/or complications. The prevalence range of GERD is approximately 18.1% to 27.8% in North America, 8.8% to 25.9% in Europe, 2.5% to 7.8% in East Asia, 8.7% to 33.1% in the Middle East, 11.6% in Australia, and 23.0% in South America. The numbers may seem small, but GERD will lead to several complications including esophagitis, peptic stricture, and Barrett’s esophagus if left untreated. The most common diagnostic test for the assessment of GERD along with its possible complications is the upper gastrointestinal endoscopy, or esophagogastroduodenoscopy (EGD). However, endoscopy has several risks. Disease detection using machine learning can be done and is needed due to the increment in medical data, new detection, and diagnostic modalities being developed. One of the machine learning algorithms often used in text classification is Support Vector Machine (SVM). This research applies SVM to do text-based classification, classifying data into two classes, namely GERD patient” and “not GERD patient using drug review data. The best model has 91.32% accuracy, 91% f1-score, and 91.32% AUC score with unigram as the n-gram range, and RBF with C is 1000, and gamma auto as the SVM kernel.