{"title":"A Schema Extraction of Document-Oriented Database for Data Warehouse","authors":"A. Istiqamah, Kemas Wiharja","doi":"10.21108/ijoict.v7i2.584","DOIUrl":"https://doi.org/10.21108/ijoict.v7i2.584","url":null,"abstract":"The data warehouse is a well-known solution for analyzing business data from heterogeneous sources. Unfortunately, a data warehouse can only analyze structured data. Nowadays, however, thanks to the popularity of social media and the ease of creating data on the web, we are experiencing a flood of unstructured data. Therefore, we need an approach that can \"structure\" the unstructured data into structured data that can be processed by the data warehouse. To do this, we propose a schema extraction approach using Google Cloud Platform that will create a schema from unstructured data. Based on our experiment, our approach successfully produces a schema from unstructured data. To the best of our knowledge, we are the first to use Google Cloud Platform for extracting a schema. We also prove that our approach helps the database developer to understand the unstructured data better.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121271068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Effect of Number of Factors and Data on Monthly Weather Classification Performance Using Artificial Neural Networks","authors":"Shofura Shofura, Sri Suryani M.Si, L. Salma, S. Harini","doi":"10.21108/ijoict.v7i2.602","DOIUrl":"https://doi.org/10.21108/ijoict.v7i2.602","url":null,"abstract":"Current weather-related research focuses only on weather prediction based on raw data, and generally uses 4 factors: average temperature, solar radiation, air pressure, and wind. In this research, monthly weather prediction is done using 5 factors, where the additional factor is rainfall in the previous period. In contrast to previous prediction research, the prediction process carried out in this study emphasizes the modeling of training data according to the desired prediction model. These two things distinguish this research from previous studies. The prediction model used in this study is a classification-based prediction model, namely the Artificial Neural Network (ANN) method combined with the backpropagation algorithm for calculating the weights of the network. The data used are meteorological data from 2010 to 2018 in the Bogor area, where data from 2010 to 2016 are used as training data, and data from 2017 to 2018 are used as test data. The results of this study indicate that the design of the model with the use of data for 6 years with feature data of 5 factors has an accuracy rate of 83.33%.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127715594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Classification of Hadith Topic of Indonesian Translation Using K-Nearest Neighbor and Chi-Square","authors":"Ghinaa Zain Nabiilah, Said Al Faraby, Mahendra Dwifebri Purbolaksono","doi":"10.21108/ijoict.v7i2.573","DOIUrl":"https://doi.org/10.21108/ijoict.v7i2.573","url":null,"abstract":"Hadith is, besides the Qur'an, the main way of life for Muslims, and it can be applied in everyday life. Hadith also contains all the words or deeds of the Prophet Muhammad which are used as a source of the law of Islam. Therefore, many readers, especially Muslims, are interested in studying hadith. However, the large number of hadiths makes it difficult for readers or those who are still unfamiliar with Islam to read them. Therefore, we conducted a study to classify hadith textually based on the type of teaching, so that readers can get an overview or other reference in reading and searching for hadith based on the type of teaching more easily. This study uses KNN as the classifier and chi-square as feature selection. We also carried out several test scenarios, including implementing stopword removal modifications in preprocessing and experimenting with selecting k values for KNN to determine the best performance. The best performance was obtained by using the value of k = 7 on KNN without implementing chi-square and with stopword removal modification, with a Hamming loss value of 0.1042, or about 89.58% of the data correctly classified.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127145117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"STL Decomposition and SARIMA Model: The Case for Estimating Value-at-Risk of Covid-19 Increment Rate in DKI Jakarta","authors":"Agnes Zahrani, Aniq A. Rohmawati, Siti Sa’adah","doi":"10.21108/ijoict.v7i2.553","DOIUrl":"https://doi.org/10.21108/ijoict.v7i2.553","url":null,"abstract":"In this research, we propose an extreme value measure, the Value-at-Risk (VaR) based on Seasonal Trend Loess (STL) Decomposition and Seasonal Autoregressive Integrated Moving Average (SARIMA) models, which is more sensitive to the seasonality of extreme values than the conventional VaR. We consider the problem of seasonality and extreme values in forecasting the increment rate of Covid-19. For stakeholders, government, and regulators, VaR estimation can be implemented to face an extreme wave of new positive Covid-19 cases in the future and to minimize possible losses in terms of financial and human resources. Specifically, the estimation of VaR is developed with the difference lying in the parameter estimators of the STL and SARIMA models. The VaR has a coverage probability close to 1-α. Thus, we propose to set α as a parameter to estimate VaR. Consequently, the performance of VaR will depend not only on the model parameters but also on α. Our aim is to estimate VaR with minimum α based on the correct VaR value. Numerical analysis is carried out to illustrate the estimative VaR.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"1043 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123331195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Toxic Comment Classification on Social Media Using Support Vector Machine and Chi Square Feature Selection","authors":"N. Azzahra, D. Murdiansyah, K. Lhaksmana","doi":"10.21108/ijoict.v7i1.552","DOIUrl":"https://doi.org/10.21108/ijoict.v7i1.552","url":null,"abstract":"The use of social media in society continues to increase over time, and the ease of access and familiarity of social media make it easier for irresponsible users to do unethical things such as spreading hatred, defamation, radicalism, pornography, and so on. Although there are regulations that govern all activities on social media, these regulations are still not working effectively. In this study, we conducted a classification of toxic comments containing unethical matters using the SVM method with TF-IDF as the feature extraction and Chi Square as the feature selection. The best performance in the experiments was obtained by the SVM model with a linear kernel, without implementing Chi Square, and using stemming and stopword removal, with an F1-score of 76.57%.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131477736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Neural Network on Stock Prediction using the Stock Prices Feature and Indonesian Financial News Titles","authors":"Nur Ghaniaviyanto Ramadhan, Imelda Atastina","doi":"10.21108/ijoict.v7i1.544","DOIUrl":"https://doi.org/10.21108/ijoict.v7i1.544","url":null,"abstract":"Stocks are the most popular investments among entrepreneurs and other investors. When investing in stocks, these investors tend to learn how to invest correctly and when the right time is. The question of how to invest in shares correctly can be addressed with a variety of existing basic theories, but the question of when the right time is needs further study. This paper proposes stock price prediction using stock data indicators and financial headline data in Bahasa Indonesia. The machine learning model used is a multi-layer perceptron neural network (MLP-NN), with a highest accuracy of 80%.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128297050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparative Analysis of QoE Multipath TCP Congestion Control LIA, CUBIC, and WVEGAS on Video Streaming","authors":"S. Karimah, Fiqqih Maulana Susanto, Aji G. Putrada","doi":"10.21108/ijoict.v7i1.534","DOIUrl":"https://doi.org/10.21108/ijoict.v7i1.534","url":null,"abstract":"Transmission Control Protocol (TCP) is a type of protocol that allows a collection of computers to communicate and exchange data within a network. Nowadays electronic devices such as tablets, personal computers, and smartphones can use more than one network at the same time, but this is not supported by the characteristics of TCP, which can only use one path on the network. To solve this condition, there are several new generations of standardized network protocols. Multipath TCP is a development of TCP, a new-generation network protocol that allows traffic to use multiple paths in the network. In addition to being able to use multiple paths, Multipath TCP offers several congestion control algorithms, including LIA, CUBIC, and WVEGAS. The tests conducted in this study compare the performance of the LIA, CUBIC, and WVEGAS congestion control algorithms in improving the quality of video streaming. From the test results, CUBIC is better than WVEGAS and LIA because its QoS and QoE video streaming test results are better than the others in all testing environments.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127508248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Forecasting the COVID-19 Increment Rate in DKI Jakarta Using Non-Robust STL Decomposition and SARIMA Model","authors":"Rosmelina Deliani Satrisna, Aniq A. Rohmawati, Siti Sa’adah","doi":"10.21108/ijoict.v7i1.554","DOIUrl":"https://doi.org/10.21108/ijoict.v7i1.554","url":null,"abstract":"The coronavirus disease known as COVID-19, which first appeared in Wuhan, China, has now troubled many countries, and its spread is very fast and wide. Data on daily confirmed COVID-19 cases were collected from the DKI Jakarta province between early May 2020 and late January 2021. The daily increase in confirmed COVID-19 cases has a percentage of the value of increase in total cases. In this study, modeling and forecasting of the increment rate in the daily number of new COVID-19 cases in DKI Jakarta was carried out using the Seasonal-Trend Loess (STL) Decomposition and Seasonal Autoregressive Integrated Moving Average (SARIMA) models. STL Decomposition is an algorithm developed to help decompose a time series, and a technique that considers seasonal and non-stationary observations. The best forecasting accuracy is obtained by STL-ARIMA, with MAPE and MSE error values of only 0.15. This proposed approach can be used for consideration by the DKI Jakarta government in making policies for handling COVID-19, as well as by the public in adhering to health protocols.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123020447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Performance Comparison of Several Range-based Techniques for Indoor Localization Based on RSSI","authors":"Dwi Joko Suroso, F. Y. M. Adiyatma, Ahmad Eko Kurniawan, P. Cherntanomwong","doi":"10.21108/ijoict.v7i1.550","DOIUrl":"https://doi.org/10.21108/ijoict.v7i1.550","url":null,"abstract":"The classical range-based technique for position estimation is still reliably used for indoor localization. Trilateration and multilateration, which use three or more references to locate the indoor object, are two common examples. These techniques use at least three intersection-locations of the references' distance and conclude that the intersection is the object's position. However, some challenges appear when using a simple power-to-distance parameter, i.e., the received signal strength indicator (RSSI). RSSI is known for its fluctuating values when used as the localization parameter. Improvements to the classical range-based techniques have been proposed, namely the min-max and iRingLA algorithms. These methods use approximation in a bounding box and in rings for min-max and iRingLA, respectively. This paper compares the performance of min-max and iRingLA with multilateration as the classical method. We found that min-max gives the best performance, and in some positions, iRingLA gives the best accuracy error. Hence, the approximation method can be promising for indoor localization, especially when using a simple and straightforward RSSI parameter.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129130371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Classification of Dengue Hemorrhagic Fever (DHF) Spread in Bandung using Hybrid Naïve Bayes, K-Nearest Neighbor, and Artificial Neural Network Methods","authors":"Fatri Nurul Inayah, Sri Suryani Prasetiyowati, Yuliant sibaroni","doi":"10.21108/ijoict.v7i1.562","DOIUrl":"https://doi.org/10.21108/ijoict.v7i1.562","url":null,"abstract":"Dengue fever is a dangerous disease caused by the dengue virus. One of the factors contributing to dengue fever is living in the tropics, so cases of dengue fever in Indonesia, especially in the Bandung Regency area, will continue to show high numbers. Therefore, information is needed on the spread of this disease, requiring accuracy and speed of diagnosis as early prevention. To compile this information, classification can be done using a combination of the Naïve Bayes, K-Nearest Neighbor (KNN), and Artificial Neural Network (ANN) methods to build predictions of the classification of dengue fever. The data used in this Final Project are a dataset of areas affected by the spread of dengue fever in Bandung Regency in the 2012-2018 period. The hybrid classifier with the voting method can improve accuracy, reaching an accuracy level of 90% in the classification of dengue fever.","PeriodicalId":137090,"journal":{"name":"International Journal on Information and Communication Technology (IJoICT)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125601237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}