{"title":"基于人工智能的民族音乐风格迁移算法","authors":"Beini Liu","doi":"10.61707/dstapd75","DOIUrl":null,"url":null,"abstract":"This paper introduces a novel approach that is designed for the purpose of enhancing the migration of Ethnic Music (EM) styles into diverse musical content using Artificial Intelligence (AI) techniques. EM is considered to be rich in cultural significance. It is characterized by unique scales, rhythms, and instruments, so it presents specific challenges in style transfer, mainly in terms of preserving the authenticity and integrity of traditional sounds while, at the same time, trying to introduce new stylistic elements. The proposed Migration Algorithm (MA) employs a combination of techniques such as Short-Time Fourier Transform (STFT) for preprocessing, Feature Extraction (FE) using Mel scale filtering and Mel-Frequency Cepstral Coefficients (MFCC), and feature transformation modified convolutional neural networks (STLCNet for style and CONCNet for content). The output from both networks is combined using the mathematical theory of evidence, and its output spectrogram is reconstructed into an audio signal using the phase gradient heap integration (PGHI) algorithm. An additional 60,000 recordings have been included in the Smithsonian Folkways Collection for testing. The approach succeeded significantly on key metrics, including KL Divergence, Signal-to-Distortion Ratio (SDR), and Signal-to-Noise Ratio (SNR), and the empirical findings proved that type of transfer accuracy, quality of sound, and text integrity were all properly addressed. ","PeriodicalId":508212,"journal":{"name":"International Journal of Religion","volume":"7 9","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ethnic Music Style Migration Algorithm Based on Artificial Intelligence\",\"authors\":\"Beini Liu\",\"doi\":\"10.61707/dstapd75\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper introduces a novel approach that is designed for the purpose of enhancing the migration of Ethnic Music (EM) styles into diverse musical content using Artificial Intelligence (AI) techniques. EM is considered to be rich in cultural significance. It is characterized by unique scales, rhythms, and instruments, so it presents specific challenges in style transfer, mainly in terms of preserving the authenticity and integrity of traditional sounds while, at the same time, trying to introduce new stylistic elements. The proposed Migration Algorithm (MA) employs a combination of techniques such as Short-Time Fourier Transform (STFT) for preprocessing, Feature Extraction (FE) using Mel scale filtering and Mel-Frequency Cepstral Coefficients (MFCC), and feature transformation modified convolutional neural networks (STLCNet for style and CONCNet for content). The output from both networks is combined using the mathematical theory of evidence, and its output spectrogram is reconstructed into an audio signal using the phase gradient heap integration (PGHI) algorithm. An additional 60,000 recordings have been included in the Smithsonian Folkways Collection for testing. The approach succeeded significantly on key metrics, including KL Divergence, Signal-to-Distortion Ratio (SDR), and Signal-to-Noise Ratio (SNR), and the empirical findings proved that type of transfer accuracy, quality of sound, and text integrity were all properly addressed. \",\"PeriodicalId\":508212,\"journal\":{\"name\":\"International Journal of Religion\",\"volume\":\"7 9\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Religion\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.61707/dstapd75\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Religion","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.61707/dstapd75","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Ethnic Music Style Migration Algorithm Based on Artificial Intelligence
This paper introduces a novel approach that is designed for the purpose of enhancing the migration of Ethnic Music (EM) styles into diverse musical content using Artificial Intelligence (AI) techniques. EM is considered to be rich in cultural significance. It is characterized by unique scales, rhythms, and instruments, so it presents specific challenges in style transfer, mainly in terms of preserving the authenticity and integrity of traditional sounds while, at the same time, trying to introduce new stylistic elements. The proposed Migration Algorithm (MA) employs a combination of techniques such as Short-Time Fourier Transform (STFT) for preprocessing, Feature Extraction (FE) using Mel scale filtering and Mel-Frequency Cepstral Coefficients (MFCC), and feature transformation modified convolutional neural networks (STLCNet for style and CONCNet for content). The output from both networks is combined using the mathematical theory of evidence, and its output spectrogram is reconstructed into an audio signal using the phase gradient heap integration (PGHI) algorithm. An additional 60,000 recordings have been included in the Smithsonian Folkways Collection for testing. The approach succeeded significantly on key metrics, including KL Divergence, Signal-to-Distortion Ratio (SDR), and Signal-to-Noise Ratio (SNR), and the empirical findings proved that type of transfer accuracy, quality of sound, and text integrity were all properly addressed.