International Journal of Advances in Intelligent Informatics最新文献

筛选
英文 中文
Analysis and review of the possibility of using the generative model as a compression technique in DNA data storage: review and future research agenda 分析和回顾在DNA数据存储中使用生成模型作为压缩技术的可能性:回顾和未来的研究议程
International Journal of Advances in Intelligent Informatics Pub Date : 2023-11-01 DOI: 10.26555/ijain.v9i3.1063
Muhammad Rafi Muttaqin, Yeni Herdiyeni, Agus Buono, Karlisa Priandana, Iskandar Zulkarnaen Siregar
{"title":"Analysis and review of the possibility of using the generative model as a compression technique in DNA data storage: review and future research agenda","authors":"Muhammad Rafi Muttaqin, Yeni Herdiyeni, Agus Buono, Karlisa Priandana, Iskandar Zulkarnaen Siregar","doi":"10.26555/ijain.v9i3.1063","DOIUrl":"https://doi.org/10.26555/ijain.v9i3.1063","url":null,"abstract":"The amount of data in this world is getting higher, and overwriting technology also has severe challenges. Data growth is expected to grow to 175 ZB by 2025. Data storage technology in DNA is an alternative technology with potential in information storage, mainly digital data. One of the stages of storing information on DNA is synthesis. This synthesis process costs very high, so it is necessary to integrate compression techniques for digital data to minimize the costs incurred. One of the models used in compression techniques is the generative model. This paper aims to see if compression using this generative model allows it to be integrated into data storage methods on DNA. To this end, we have conducted a Systematic Literature Review using the PRISMA method in selecting papers. We took the source of the papers from four leading databases and other additional databases. Out of 2440 papers, we finally decided on 34 primary papers for detailed analysis. This systematic literature review (SLR) presents and categorizes based on research questions, namely discussing machine learning methods applied in DNA storage, identifying compression techniques for DNA storage, knowing the role of deep learning in the compression process for DNA storage, knowing how generative models are associated with deep learning, knowing how generative models are applied in the compression process, and knowing latent space can be formed. The study highlights open problems that need to be solved and provides an identified research direction.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135112501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of code smells using machine learning techniques combined with data-balancing methods 使用机器学习技术结合数据平衡方法检测代码气味
International Journal of Advances in Intelligent Informatics Pub Date : 2023-11-01 DOI: 10.26555/ijain.v9i3.981
Nasraldeen Alnor Adam Khleel, Károly Nehéz
{"title":"Detection of code smells using machine learning techniques combined with data-balancing methods","authors":"Nasraldeen Alnor Adam Khleel, Károly Nehéz","doi":"10.26555/ijain.v9i3.981","DOIUrl":"https://doi.org/10.26555/ijain.v9i3.981","url":null,"abstract":"Code smells are prevalent issues in software design that arise when implementation or design principles are violated. These issues manifest as symptoms or anomalies in the source code. Timely identification of code smells plays a crucial role in enhancing software quality and facilitating software maintenance. Previous studies have shown that code smell detection can be accomplished through the utilization of machine learning (ML) methods. However, despite their increasing popularity, research suggests that the suitability of these methods are not always appropriate due to the problem of imbalanced data. Consequently, the effectiveness of ML models may be negatively affected. This study aims to propose a novel method for detecting code smells by employing five ML algorithms, namely decision tree (DT), k-nearest neighbors (K-NN), support vector machine (SVM), XGboost (XGB), and multi-layer perceptron (MLP). Additionally, to tackle the challenge of imbalanced data, the proposed method incorporates the random oversampling technique. Experiments were conducted in this study using four datasets that encompassed code smells, specifically god-class, data-class, long-method, and feature-envy. The experimental outcomes were evaluated and compared using various performance metrics. Upon comparing the outcomes of our models on both the balanced and original datasets, we found that the XGB model achieved the highest accuracy of 100% for detecting the data class and long method on the original datasets. In contrast, the highest accuracy of 100% was obtained for the data class and long method using DT, SVM, and XGB models on the balanced datasets. According to the empirical findings, there is significant promise in using ML techniques for the accurate prediction of code smells.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"30 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135112497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fragile watermarking for image authentication using dyadic walsh ordering 使用二进沃尔什排序的图像认证脆弱水印
International Journal of Advances in Intelligent Informatics Pub Date : 2023-11-01 DOI: 10.26555/ijain.v9i3.1017
Prajanto Wahyu Adi, Adi Wibowo, Guruh Aryotejo, Ferda Ernawan
{"title":"Fragile watermarking for image authentication using dyadic walsh ordering","authors":"Prajanto Wahyu Adi, Adi Wibowo, Guruh Aryotejo, Ferda Ernawan","doi":"10.26555/ijain.v9i3.1017","DOIUrl":"https://doi.org/10.26555/ijain.v9i3.1017","url":null,"abstract":"A digital image is subjected to the most manipulation. This is driven by the easy manipulating process through image editing software which is growing rapidly. These problems can be solved through the watermarking model as an active authentication system for the image. One of the most popular methods is Singular Value Decomposition (SVD) which has good imperceptibility and detection capabilities. Nevertheless, SVD has high complexity and can only utilize one singular matrix S, and ignore two orthogonal matrices. This paper proposes the use of the Walsh matrix with dyadic ordering to generate a new S matrix without the orthogonal matrices. The experimental results showed that the proposed method was able to reduce computational time by 22% and 13% compared to the SVD-based method and similar methods based on the Hadamard matrix respectively. This research can be used as a reference to speed up the computing time of the watermarking methods without compromising the level of imperceptibility and authentication.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135112511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic literature review of dermoscopic pigmented skin lesions classification using convolutional neural network (CNN) 基于卷积神经网络(CNN)的皮肤镜下色素性皮肤病变分类系统文献综述
International Journal of Advances in Intelligent Informatics Pub Date : 2023-10-15 DOI: 10.26555/ijain.v9i3.961
Erwin Setyo Nugroho, Igi Ardiyanto, Hanung Adi Nugroho
{"title":"Systematic literature review of dermoscopic pigmented skin lesions classification using convolutional neural network (CNN)","authors":"Erwin Setyo Nugroho, Igi Ardiyanto, Hanung Adi Nugroho","doi":"10.26555/ijain.v9i3.961","DOIUrl":"https://doi.org/10.26555/ijain.v9i3.961","url":null,"abstract":"The occurrence of pigmented skin lesions (PSL), including melanoma, are rising, and early detection is crucial for reducing mortality. To assist Pigmented skin lesions, including melanoma, are rising, and early detection is crucial in reducing mortality. To aid dermatologists in early detection, computational techniques have been developed. This research conducted a systematic literature review (SLR) to identify research goals, datasets, methodologies, and performance evaluation methods used in categorizing dermoscopic lesions. This review focuses on using convolutional neural networks (CNNs) in analyzing PSL. Based on specific inclusion and exclusion criteria, the review included 54 primary studies published on Scopus and PubMed between 2018 and 2022. The results showed that ResNet and self-developed CNN were used in 22% of the studies, followed by Ensemble at 20% and DenseNet at 9%. Public datasets such as ISIC 2019 were predominantly used, and 85% of the classifiers used were softmax. The findings suggest that the input, architecture, and output/feature modifications can enhance the model's performance, although improving sensitivity in multiclass classification remains a challenge. While there is no specific model approach to solve the problem in this area, we recommend simultaneously modifying the three clusters to improve the model's performance.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136184544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of multi-class arrhythmia using heuristic and deep neural network on edge device 基于边缘设备的启发式深度神经网络多类心律失常检测
International Journal of Advances in Intelligent Informatics Pub Date : 2023-10-15 DOI: 10.26555/ijain.v9i3.1061
Arief Kurniawan, Eko Mulyanto Yuniarno, Eko Setijadi, Mochamad Yusuf Alsagaff, Gijsbertus Jacob Verkerke, I Ketut Eddy Purnama
{"title":"Detection of multi-class arrhythmia using heuristic and deep neural network on edge device","authors":"Arief Kurniawan, Eko Mulyanto Yuniarno, Eko Setijadi, Mochamad Yusuf Alsagaff, Gijsbertus Jacob Verkerke, I Ketut Eddy Purnama","doi":"10.26555/ijain.v9i3.1061","DOIUrl":"https://doi.org/10.26555/ijain.v9i3.1061","url":null,"abstract":"Heart disease is a heart condition that sometimes causes a person to die suddenly. One indication is a rhythm disorder known as arrhythmia. Multi-class Arrhythmia Detection has followed: QRS complex detection procedure and arrhythmia classification based on the QRS complex morphology. We proposed an edge device that detects QRS complexes based on variance analysis (QVAT) and the arrhythmia classification based on the QRS complex spectrogram. The classifier uses two-dimensional convolutional neural network (2D CNN) deep learning. We use a single board computer and neural network compute stick to implement the edge device. The outcomes are a prototype device cardiologists use as a supporting tool for analysing ECG signals, and patients can also use it for self-tests to figure out their heart health. To evaluate the performance of our edge device, we tested using the MIT-BIH database because other methods also use the data. The QVAT sensitivity and predictive positive are 99.81% and 99.90%, respectively. Our classifier's accuracy, sensitivity, predictive positive, specificity, and F1-score are 99.82%, 99.55%, 99.55%, 99.89%, and 99.55%, respectively. The experiment result of arrhythmia classification shows that our method outperforms the others. Still, for r-peak detection, the QVAT implemented in an edge device is comparable to the other methods. In future work, we can improve the performance of r-peak detection using the double-check algorithm in QVAT and cross-check the QRS complex detection by adding 1 class to the classifier, namely the non-QRS class.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136184559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting and monitoring the development stages of wild flowers and plants using computer vision: approaches, challenges and opportunities 利用计算机视觉检测和监测野生花卉和植物的发育阶段:方法、挑战和机遇
International Journal of Advances in Intelligent Informatics Pub Date : 2023-10-15 DOI: 10.26555/ijain.v9i3.1012
João Videira, Pedro Dinis Gaspar, Vasco Nuno da Gama de Jesus Soares, João Manuel Leitão Pires Caldeira
{"title":"Detecting and monitoring the development stages of wild flowers and plants using computer vision: approaches, challenges and opportunities","authors":"João Videira, Pedro Dinis Gaspar, Vasco Nuno da Gama de Jesus Soares, João Manuel Leitão Pires Caldeira","doi":"10.26555/ijain.v9i3.1012","DOIUrl":"https://doi.org/10.26555/ijain.v9i3.1012","url":null,"abstract":"Wild flowers and plants play an important role in protecting biodiversity and providing various ecosystem services. However, some of them are endangered or threatened and are entitled to preservation and protection. This study represents a first step to develop a computer vision system and a supporting mobile app for detecting and monitoring the development stages of wild flowers and plants, aiming to contribute to their preservation. It first introduces the related concepts. Then, surveys related work and categorizes existing solutions presenting their key features, strengths, and limitations. The most promising solutions and techniques are identified. Insights on open issues and research directions in the topic are also provided. This paper paves the way to a wider adoption of recent results in computer vision techniques in this field and for the proposal of a mobile application that uses YOLO convolutional neural networks to detect the stages of development of wild flowers and plants.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136184743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Secure medical image watermarking based on reversible data hiding with Arnold's cat map 安全医学图像水印基于可逆数据隐藏与阿诺德的猫地图
International Journal of Advances in Intelligent Informatics Pub Date : 2023-10-15 DOI: 10.26555/ijain.v9i3.1029
Aulia Arham, Novia Lestari
{"title":"Secure medical image watermarking based on reversible data hiding with Arnold's cat map","authors":"Aulia Arham, Novia Lestari","doi":"10.26555/ijain.v9i3.1029","DOIUrl":"https://doi.org/10.26555/ijain.v9i3.1029","url":null,"abstract":"The process of restoring medical images to their original form after the extraction process in application watermarking is crucial for ensuring their authenticity. Inaccurate diagnoses can occur due to distortions in medical images from conventional data embedding applications. To address this issue, reversible data hiding (RDH) method has been proposed by several researchers in recent years to embed data in medical images. After the extraction process, images can be restored to their original form with a reversible data-hiding method. In the past few years, several RDH methods have been rapidly developed, which are based on the concept of difference expansion (DE). However, it is crucial to pay attention to the security of the medical image watermarking method, the embedded data with RDH method can be easily modified, accessed, and altered by unauthorized individuals if they know the employed method. This research suggests a new approach to secure the RDH method through the use of Chaotic Map-based Arnold's Cat Map algorithms on the medical images. Data embedding was performed on random medical images using a DE method. Four gray-scale medical image modalities were used to assess the proposed method's efficacy. In our approach, we can incorporate capacity up to 0.62 bpp while maintaining a visual quality up to 41.02 dB according to PSNR and 0.9900 according to SSIM. The results indicated that it can enhance the security of the RDH method while retaining the ability to embed data and preserving the visual appearance of the medical images.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"1167 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136185887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of sleep stage classification using feature importance of EEG signal for big data healthcare 基于脑电信号特征重要性的睡眠阶段分类在大数据医疗中的评价
International Journal of Advances in Intelligent Informatics Pub Date : 2023-10-15 DOI: 10.26555/ijain.v9i3.1008
Mera Kartika Delimayanti, Mauldy Laya, Anggi Mardiyono, Bambang Warsuta, Reisa Siva Nandika, Mohammad Reza Faisal
{"title":"Evaluation of sleep stage classification using feature importance of EEG signal for big data healthcare","authors":"Mera Kartika Delimayanti, Mauldy Laya, Anggi Mardiyono, Bambang Warsuta, Reisa Siva Nandika, Mohammad Reza Faisal","doi":"10.26555/ijain.v9i3.1008","DOIUrl":"https://doi.org/10.26555/ijain.v9i3.1008","url":null,"abstract":"Sleep analysis is widely and experimentally considered due to its importance to body health care. Since its sufficiency is essential for a healthy life, people often spend almost a third of their lives sleeping. In this case, a similar sleep pattern is not practiced by every individual, regarding pure healthiness or disorders such as insomnia, apnea, bruxism, epilepsy, and narcolepsy. Therefore, this study aims to determine the classification patterns of sleep stages, using big data for health care. This used a high-dimensional FFT extraction algorithm, as well as a feature importance and tuning classifier, to develop accurate classification. The results showed that the proposed method led to more accurate classification than previous techniques. This was because the previous experiments had been conducted with the feature selection model, with accuracy implemented as a performance evaluation. Meanwhile, the EEG Sleep Stages classification model in this present report was composed of the feature selection and importance of the extraction stage. The previous and present experiments also reached the highest values of accuracy, with the Random Forest and SVM models using 2000 and 3000 features (87.19% and 89.19%, respectively. In this article, we proposed an analysis that the feature importance subsequently influenced the model's accuracy. This was because the proposed method was easily fine-tuned and optimized for each subject to improve sensitivity and reduce false negative occurrences.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136184740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning pest detection on Indonesian red chili pepper plant based on fine-tuned YOLOv5 基于微调YOLOv5的印尼红辣椒植物深度学习害虫检测
International Journal of Advances in Intelligent Informatics Pub Date : 2023-10-15 DOI: 10.26555/ijain.v9i3.864
Indra Agustian, Ruvita Faurina, Sahrial Ihsani Ishak, Ferzha Putra Utama, Kusmea Dinata Dinata, Novalio Daratha
{"title":"Deep learning pest detection on Indonesian red chili pepper plant based on fine-tuned YOLOv5","authors":"Indra Agustian, Ruvita Faurina, Sahrial Ihsani Ishak, Ferzha Putra Utama, Kusmea Dinata Dinata, Novalio Daratha","doi":"10.26555/ijain.v9i3.864","DOIUrl":"https://doi.org/10.26555/ijain.v9i3.864","url":null,"abstract":".This research developed a pest detection model for Indonesian red chili pepper based on fine-tuned YOLOv5. Indonesian red chili pepper is the third largest vegetable commodity produced in Indonesia. Pest attacks disrupt the quantity and quality of crop yields. To control pests effectively, it is necessary to detect the type of pest correctly. A viable solution is to leverage computer vision and deep learning technologies. However, no previous studies have developed a pest detection model for Indonesian red chili pepper based on this technology. YOLOv5 is a variant of the YOLO object detection algorithm, which has major advantages in terms of computation cost and execution speed. The dataset comprises 4,994 image files collected from a chili plantation in Bengkulu province, Indonesia, covering 4 different classes and a total of 10,683 pests. The image is 1216 x1216 px with the smallest, largest, and average object dimensions of 2%, 35%, and 4% of the image dimensions. The training model used is fine-tuning YOLOv5s with variations of patience as an early stop parameter of 100, 200, and 300. The evaluation of the trained model is based on train loss, validation loss, and mAP@0.5:0.95, the best-trained model is the 445th epoch on patience 100 with the best confidence value of 0.321 and the highest TF1 of 0.74. From the best-trained model testing on the test dataset, the mAP@0.5 performance for all classes is 81.3%. The model not only detected large pests but was also able to detect objects that were small in size compared to the image size. The best-trained model's best mAP@0.5 performance and speed are 82.6% and 20 ms/image, or 50 fps on NVIDIA P100 GPU.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136185888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multidisciplinary classification for Indonesian scientific articles abstract using pre-trained BERT model 使用预训练的 BERT 模型对印尼科学文章摘要进行多学科分类
International Journal of Advances in Intelligent Informatics Pub Date : 2023-07-08 DOI: 10.26555/ijain.v9i2.1051
Antonius Angga Kurniawan, S. Madenda, Setia Wirawan, Ruddy J. Suhatril
{"title":"Multidisciplinary classification for Indonesian scientific articles abstract using pre-trained BERT model","authors":"Antonius Angga Kurniawan, S. Madenda, Setia Wirawan, Ruddy J. Suhatril","doi":"10.26555/ijain.v9i2.1051","DOIUrl":"https://doi.org/10.26555/ijain.v9i2.1051","url":null,"abstract":"Scientific articles now have multidisciplinary content. These make it difficult for researchers to find out relevant information. Some submissions are irrelevant to the journal's discipline. Categorizing articles and assessing their relevance can aid researchers and journals. Existing research still focuses on single-category predictive outcomes. Therefore, this research takes a new approach by applying a multidisciplinary classification for Indonesian scientific article abstracts using a pre-trained BERT model, showing the relevance between each category in an abstract. The dataset used was 9,000 abstracts with 9 disciplinary categories. On the dataset, text preprocessing is performed. The classification model was built by combining the pre-trained BERT model with Artificial Neural Network. Fine-tuning the hyperparameters is done to determine the most optimal hyperparameter combination for the model. The hyperparameters consist of batch size, learning rate, number of epochs, and data ratio. The best hyperparameter combination is a learning rate of 1e-5, batch size 32, epochs 3, and data ratio 9:1, with a validation accuracy value of 90.8%. The confusion matrix results of the model are compared with the confusion matrix results by experts. In this case, the highest accuracy result obtained by the model is 99.56%. A software prototype used the most accurate model to classify new data, displaying the top two prediction probabilities and the dominant category. This research produces a model that can be used to solve Indonesian text classification-related problems.","PeriodicalId":52195,"journal":{"name":"International Journal of Advances in Intelligent Informatics","volume":"10 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139361355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信