Adam Huda Nugraha, Achmad Benny Mutiara, Dewi Agushinta Rahayu
{"title":"CONCEPTUAL REGIONAL ORIGIN RECOGNITION USING CNN CONVOUTION NEURAL NETWORK ON BANDUNG, BOGOR AND CIREBON REGIONAL ACCENTS","authors":"Adam Huda Nugraha, Achmad Benny Mutiara, Dewi Agushinta Rahayu","doi":"10.56127/ijml.v2i2.696","DOIUrl":null,"url":null,"abstract":"Sound detection is a challenge in machine learning due to the noisy nature of signals, and the small amount of (labeled) data that is usually available. The need for sound detection in Indonesia is quite important because there are many community organizations that form groups according to the land of their origin. Especially in big cities, where people from various tribes gather and exchange cultures. However, it has a disadvantage that affects these tribes, namely the loss of the original culture of certain areas. The Sundanese are the object of this research, including Bandung, Bogor and Cirebon. Voice data is divided into 2 types, namely male and female, each region consists of 50 respondents with 25 male and female voices with a maximum voting time of 1 minute. The method used is CNN architecture based on supervised learning, preprocessing using MFCC (Mel Frequency Cepstral Coefficients) to obtain feature extraction from voice data. CNN architecture is carried out 3 times convolution with max pooling and dropout on each convolution.","PeriodicalId":155984,"journal":{"name":"International Journal Multidisciplinary Science","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal Multidisciplinary Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56127/ijml.v2i2.696","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Sound detection is a challenge in machine learning due to the noisy nature of signals, and the small amount of (labeled) data that is usually available. The need for sound detection in Indonesia is quite important because there are many community organizations that form groups according to the land of their origin. Especially in big cities, where people from various tribes gather and exchange cultures. However, it has a disadvantage that affects these tribes, namely the loss of the original culture of certain areas. The Sundanese are the object of this research, including Bandung, Bogor and Cirebon. Voice data is divided into 2 types, namely male and female, each region consists of 50 respondents with 25 male and female voices with a maximum voting time of 1 minute. The method used is CNN architecture based on supervised learning, preprocessing using MFCC (Mel Frequency Cepstral Coefficients) to obtain feature extraction from voice data. CNN architecture is carried out 3 times convolution with max pooling and dropout on each convolution.
声音检测在机器学习中是一个挑战,因为信号有噪声,而且通常可用的(标记的)数据很少。在印度尼西亚,需要进行声音检测是非常重要的,因为有许多社区组织根据其原籍地组成群体。尤其是在大城市,来自不同部落的人们聚集在一起交流文化。然而,它对这些部落有一个不利的影响,即某些地区原有文化的丧失。巽他人是本次研究的对象,包括万隆、茂物和锡伯伦。语音数据分为男性和女性两种类型,每个区域由50个受访者组成,男性和女性声音各25个,最长投票时间为1分钟。使用的方法是基于监督学习的CNN架构,使用MFCC (Mel Frequency Cepstral Coefficients)进行预处理,从语音数据中获得特征提取。CNN架构进行了3次卷积,每次卷积最大池化和dropout。