{"title":"基于非核心学习的情感分析","authors":"Mahmudul Hasan, Ishrak Islam, K. Hasan","doi":"10.1109/ECACE.2019.8679298","DOIUrl":null,"url":null,"abstract":"Text sentiment detection for a particular language other than English is one of the challenging tasks presently. The reasons are; it needs a large dataset, language has no specific structure, one word has a different meaning, and it is hard for even human to understand the connotation of particular words. There exists several proposed architecture for detecting emotions in the Bengali language using machine learning and deep learning approaches, but they are not accurate enough to predict the perfect emotion of the sentence. And there is still no standalone architecture is available that can extract the sentiments hidden inside of a sentence in different languages. In this paper, we are proposing an abstract model that can enable sentiment analysis without any restriction of using a fixed language somewhat applicable to any language. With the use of natural language processing, we have extracted the features, and these features are then fed to different machine learning models for classification. As our main concern was to build up a general model, this model is confined to binary classification, i.e., positive and negative. Apart from this, In our system architecture, we have implemented stochastic gradient descent for optimization. So our model can be called out of core learning model where the model can be updated when new user data is inserted without training the whole model. For the evaluation of the performance of our model, we have trained the estimators against Bangla translated IMDB review dataset and calculated different evaluation metrics for our estimators. The dataset is translated into Bangla using google translator.","PeriodicalId":226060,"journal":{"name":"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Sentiment Analysis Using Out of Core Learning\",\"authors\":\"Mahmudul Hasan, Ishrak Islam, K. Hasan\",\"doi\":\"10.1109/ECACE.2019.8679298\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text sentiment detection for a particular language other than English is one of the challenging tasks presently. The reasons are; it needs a large dataset, language has no specific structure, one word has a different meaning, and it is hard for even human to understand the connotation of particular words. There exists several proposed architecture for detecting emotions in the Bengali language using machine learning and deep learning approaches, but they are not accurate enough to predict the perfect emotion of the sentence. And there is still no standalone architecture is available that can extract the sentiments hidden inside of a sentence in different languages. In this paper, we are proposing an abstract model that can enable sentiment analysis without any restriction of using a fixed language somewhat applicable to any language. With the use of natural language processing, we have extracted the features, and these features are then fed to different machine learning models for classification. As our main concern was to build up a general model, this model is confined to binary classification, i.e., positive and negative. Apart from this, In our system architecture, we have implemented stochastic gradient descent for optimization. So our model can be called out of core learning model where the model can be updated when new user data is inserted without training the whole model. For the evaluation of the performance of our model, we have trained the estimators against Bangla translated IMDB review dataset and calculated different evaluation metrics for our estimators. The dataset is translated into Bangla using google translator.\",\"PeriodicalId\":226060,\"journal\":{\"name\":\"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ECACE.2019.8679298\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECACE.2019.8679298","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text sentiment detection for a particular language other than English is one of the challenging tasks presently. The reasons are; it needs a large dataset, language has no specific structure, one word has a different meaning, and it is hard for even human to understand the connotation of particular words. There exists several proposed architecture for detecting emotions in the Bengali language using machine learning and deep learning approaches, but they are not accurate enough to predict the perfect emotion of the sentence. And there is still no standalone architecture is available that can extract the sentiments hidden inside of a sentence in different languages. In this paper, we are proposing an abstract model that can enable sentiment analysis without any restriction of using a fixed language somewhat applicable to any language. With the use of natural language processing, we have extracted the features, and these features are then fed to different machine learning models for classification. As our main concern was to build up a general model, this model is confined to binary classification, i.e., positive and negative. Apart from this, In our system architecture, we have implemented stochastic gradient descent for optimization. So our model can be called out of core learning model where the model can be updated when new user data is inserted without training the whole model. For the evaluation of the performance of our model, we have trained the estimators against Bangla translated IMDB review dataset and calculated different evaluation metrics for our estimators. The dataset is translated into Bangla using google translator.