Maria Masood, F. Azam, Muhammad Waseem Anwar, Jalees Ur Rahman
{"title":"基于深度学习的乌尔都语情感分析框架","authors":"Maria Masood, F. Azam, Muhammad Waseem Anwar, Jalees Ur Rahman","doi":"10.1109/ICoDT255437.2022.9787451","DOIUrl":null,"url":null,"abstract":"In recent times, Sentiment analysis has become a significant means for framing a successful business and can be very helpful in predicting customer trends to help organizations in their decision-making process. Though many software applications are available in the market for text analysis, one of the major limitations of such applications is that they are developed for rich languages like English, German, Spanish, Arabic, etc. and less popular languages like Urdu, Hindi, Roman Urdu are neglected due to lack of availability of resources. Therefore, this research project will provide an implementation of sentiment analysis in the Urdu language. Firstly, preprocessing is performed and a small-scale manual dictionary of 830 Urdu stem words is introduced for stemming. Then a deep learning-based framework of LSTM is used for Urdu sentiment analysis. Experimental results show high classification accuracy of 86.03% and 0.89 F1 Score with the use of LSTM that captures sequence information effectively to analyze sentiments than the conventional supervised machine learning approaches.","PeriodicalId":291030,"journal":{"name":"2022 2nd International Conference on Digital Futures and Transformative Technologies (ICoDT2)","volume":"252 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Deep-learning based framework for sentiment analysis in Urdu language\",\"authors\":\"Maria Masood, F. Azam, Muhammad Waseem Anwar, Jalees Ur Rahman\",\"doi\":\"10.1109/ICoDT255437.2022.9787451\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent times, Sentiment analysis has become a significant means for framing a successful business and can be very helpful in predicting customer trends to help organizations in their decision-making process. Though many software applications are available in the market for text analysis, one of the major limitations of such applications is that they are developed for rich languages like English, German, Spanish, Arabic, etc. and less popular languages like Urdu, Hindi, Roman Urdu are neglected due to lack of availability of resources. Therefore, this research project will provide an implementation of sentiment analysis in the Urdu language. Firstly, preprocessing is performed and a small-scale manual dictionary of 830 Urdu stem words is introduced for stemming. Then a deep learning-based framework of LSTM is used for Urdu sentiment analysis. Experimental results show high classification accuracy of 86.03% and 0.89 F1 Score with the use of LSTM that captures sequence information effectively to analyze sentiments than the conventional supervised machine learning approaches.\",\"PeriodicalId\":291030,\"journal\":{\"name\":\"2022 2nd International Conference on Digital Futures and Transformative Technologies (ICoDT2)\",\"volume\":\"252 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 2nd International Conference on Digital Futures and Transformative Technologies (ICoDT2)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICoDT255437.2022.9787451\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Digital Futures and Transformative Technologies (ICoDT2)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICoDT255437.2022.9787451","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep-learning based framework for sentiment analysis in Urdu language
In recent times, Sentiment analysis has become a significant means for framing a successful business and can be very helpful in predicting customer trends to help organizations in their decision-making process. Though many software applications are available in the market for text analysis, one of the major limitations of such applications is that they are developed for rich languages like English, German, Spanish, Arabic, etc. and less popular languages like Urdu, Hindi, Roman Urdu are neglected due to lack of availability of resources. Therefore, this research project will provide an implementation of sentiment analysis in the Urdu language. Firstly, preprocessing is performed and a small-scale manual dictionary of 830 Urdu stem words is introduced for stemming. Then a deep learning-based framework of LSTM is used for Urdu sentiment analysis. Experimental results show high classification accuracy of 86.03% and 0.89 F1 Score with the use of LSTM that captures sequence information effectively to analyze sentiments than the conventional supervised machine learning approaches.