Cong Minh Dinh, L. Do, Hyung-Jeong Yang, Soohyung Kim, Gueesang Lee
International Journal of Contents, volume 12 1, pages 53-61. Published 2016-12-28. DOI: 10.5392/IJOC.2016.12.4.053 (https://doi.org/10.5392/IJOC.2016.12.4.053)
Improved Lexicon-driven based Chord Symbol Recognition in Musical Images
Although optical music recognition systems have been developed extensively, they have mostly focused on musical symbols (notes, rests, etc.) while disregarding chord symbols. Recognizing chord symbols becomes difficult when the images are distorted or slurred, although this can be addressed using optical character recognition systems. Moreover, the appearance of outliers (lyrics, dynamics, etc.) increases the complexity of chord recognition. Therefore, we propose a new approach that addresses these issues. After binarization, un-distortion, and stave and lyric removal of a musical image, a rule-based method is applied to detect the potential regions of chord symbols. Next, a lexicon-driven approach is used to separate and recognize characters simultaneously and optimally. The score returned by the recognition process is then used to detect the outliers. The effectiveness of our system is demonstrated by the accuracy of the experimental results on two datasets with a variety of resolutions.
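The lexicon-driven step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes the candidate region has already been over-segmented into primitive pieces, and it uses a hypothetical toy scorer (`toy_char_score`, backed by the made-up `TEMPLATES` table) in place of a trained character classifier. The names `match_word`, `recognize`, `max_merge`, and `reject_below` are likewise illustrative. For each lexicon entry, dynamic programming finds the best way to group consecutive pieces into characters, so segmentation and recognition are decided together; the best average score is then thresholded to reject outliers such as lyrics.

```python
from typing import Callable, Optional, Sequence, Tuple

NEG_INF = float("-inf")

# Hypothetical templates: the primitive pieces that make up each character.
TEMPLATES = {"C": ("C",), "G": ("G",), "m": ("r", "n"), "7": ("7",)}


def toy_char_score(pieces: Tuple[str, ...], char: str) -> float:
    """Stand-in for a character classifier: high score on a template match."""
    return 0.9 if TEMPLATES.get(char) == pieces else 0.1


def match_word(
    segments: Sequence[str],
    word: str,
    char_score: Callable[[Tuple[str, ...], str], float],
    max_merge: int = 3,
) -> float:
    """Best total score for spelling `word` with all of `segments`.

    Each character consumes between 1 and `max_merge` consecutive pieces.
    Returns -inf when no such grouping exists.
    """
    n, m = len(segments), len(word)
    # best[i][j]: best score using the first i pieces for the first j characters
    best = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    best[0][0] = 0.0
    for j in range(1, m + 1):
        for i in range(1, n + 1):
            for k in range(1, min(max_merge, i) + 1):
                prev = best[i - k][j - 1]
                if prev == NEG_INF:
                    continue
                cand = prev + char_score(tuple(segments[i - k:i]), word[j - 1])
                if cand > best[i][j]:
                    best[i][j] = cand
    return best[n][m]


def recognize(
    segments: Sequence[str],
    lexicon: Sequence[str],
    char_score: Callable[[Tuple[str, ...], str], float] = toy_char_score,
    reject_below: float = 0.6,
    max_merge: int = 3,
) -> Tuple[Optional[str], float]:
    """Return (best lexicon word, average score), or (None, score) for outliers."""
    best_word, best_avg = None, NEG_INF
    for word in lexicon:
        if not word:
            continue
        total = match_word(segments, word, char_score, max_merge)
        if total == NEG_INF:
            continue
        avg = total / len(word)  # normalise so long words are not favoured
        if avg > best_avg:
            best_word, best_avg = word, avg
    if best_word is None or best_avg < reject_below:
        return None, best_avg
    return best_word, best_avg


if __name__ == "__main__":
    lexicon = ["C", "Cm", "Cm7", "C7", "G7"]
    print(recognize(["C", "r", "n", "7"], lexicon))  # chord-like region
    print(recognize(["l", "a"], lexicon))            # lyric-like region
```

In this sketch the character "m" is deliberately split into two pieces, so a correct reading is only reachable if the matcher merges them, which is the situation a lexicon-driven search is meant to handle. The rejection threshold is an arbitrary value chosen for the toy scores and would need tuning against a real classifier.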