{"title":"用于手语翻译的神经网络图像分割","authors":"Vegim Shala, E. Bytyçi","doi":"10.46338/ijetae0323_11","DOIUrl":null,"url":null,"abstract":"The use of neural networks to recognize and classify objects in images is a popular field in computer science. It is highly likely that an object in an image chosen for classification will have a representation matrix with significantly less pixels than the background or other elements of the image. As a result, the initial plan would be to divide or segment that object from the other portions of the image that are not essential for categorization. This also serves as the study's objective, for which we employ segmentation to separate the components essential to the classification procedure and assess any room for improvement in the final classification outcome. Mask Region Convolutional Neural Network was the model used for segmentation, and Convolutional Neural Network was the model used for classification. The study's findings demonstrate a notable improvement in the classification in the case of sign language. Further advancement of image segmentation models implies better more accurate results for classification models once they are combined. Keywords— Neural network, Image segmentation, Sign language, Classification, Mask Regional Convolutional Neural Network.","PeriodicalId":169403,"journal":{"name":"International Journal of Emerging Technology and Advanced Engineering","volume":"21 5","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Neural Network Image Segmentation for Sign Language Interpretation\",\"authors\":\"Vegim Shala, E. Bytyçi\",\"doi\":\"10.46338/ijetae0323_11\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of neural networks to recognize and classify objects in images is a popular field in computer science. It is highly likely that an object in an image chosen for classification will have a representation matrix with significantly less pixels than the background or other elements of the image. As a result, the initial plan would be to divide or segment that object from the other portions of the image that are not essential for categorization. This also serves as the study's objective, for which we employ segmentation to separate the components essential to the classification procedure and assess any room for improvement in the final classification outcome. Mask Region Convolutional Neural Network was the model used for segmentation, and Convolutional Neural Network was the model used for classification. The study's findings demonstrate a notable improvement in the classification in the case of sign language. Further advancement of image segmentation models implies better more accurate results for classification models once they are combined. Keywords— Neural network, Image segmentation, Sign language, Classification, Mask Regional Convolutional Neural Network.\",\"PeriodicalId\":169403,\"journal\":{\"name\":\"International Journal of Emerging Technology and Advanced Engineering\",\"volume\":\"21 5\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Emerging Technology and Advanced Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.46338/ijetae0323_11\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Emerging Technology and Advanced Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46338/ijetae0323_11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
利用神经网络对图像中的物体进行识别和分类是计算机科学的一个热门领域。选择用于分类的图像中的对象极有可能具有比图像的背景或其他元素少得多的像素表示矩阵。因此,最初的计划是将该对象与图像中不需要分类的其他部分分开或分割。这也是本研究的目标,为此,我们采用分割来分离分类过程中必不可少的组件,并评估最终分类结果的改进空间。Mask Region Convolutional Neural Network作为分割模型,Convolutional Neural Network作为分类模型。这项研究的结果表明,在手语的情况下,分类有了显著的改善。图像分割模型的进一步发展意味着分类模型结合后的结果会更好更准确。关键词:神经网络,图像分割,手语,分类,掩码区域卷积神经网络
Neural Network Image Segmentation for Sign Language Interpretation
The use of neural networks to recognize and classify objects in images is a popular field in computer science. It is highly likely that an object in an image chosen for classification will have a representation matrix with significantly less pixels than the background or other elements of the image. As a result, the initial plan would be to divide or segment that object from the other portions of the image that are not essential for categorization. This also serves as the study's objective, for which we employ segmentation to separate the components essential to the classification procedure and assess any room for improvement in the final classification outcome. Mask Region Convolutional Neural Network was the model used for segmentation, and Convolutional Neural Network was the model used for classification. The study's findings demonstrate a notable improvement in the classification in the case of sign language. Further advancement of image segmentation models implies better more accurate results for classification models once they are combined. Keywords— Neural network, Image segmentation, Sign language, Classification, Mask Regional Convolutional Neural Network.