{"title":"带遗忘机制的自适应半监督意见分类器","authors":"Max Zimmermann, Eirini Ntoutsi, M. Spiliopoulou","doi":"10.1145/2554850.2555039","DOIUrl":null,"url":null,"abstract":"Opinion stream classification methods face the challenge of learning with a limited amount of labeled data: inspecting and labeling opinions is a tedious task, so systems analyzing opinions must devise mechanisms that label the arriving stream of opinionated documents with minimal human intervention. We propose an opinion stream classifier that only uses a seed of labeled documents as input and thereafter adapts itself, as it reads documents with unknown labels. Since the stream of opinions is subject to concept drift, we use two adaptation mechanisms: forward adaptation, where the classifier incorporates to the training set only those un-labeled documents that it considers informative enough in comparison to those seen thus far; and backward adaptation, where the classifier gradually forgets old documents by eliminating them from the model. We evaluate our method on opinionated tweets and show that it performs comparably or even better than a fully supervised baseline.","PeriodicalId":285655,"journal":{"name":"Proceedings of the 29th Annual ACM Symposium on Applied Computing","volume":"199 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Adaptive semi supervised opinion classifier with forgetting mechanism\",\"authors\":\"Max Zimmermann, Eirini Ntoutsi, M. Spiliopoulou\",\"doi\":\"10.1145/2554850.2555039\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Opinion stream classification methods face the challenge of learning with a limited amount of labeled data: inspecting and labeling opinions is a tedious task, so systems analyzing opinions must devise mechanisms that label the arriving stream of opinionated documents with minimal human intervention. We propose an opinion stream classifier that only uses a seed of labeled documents as input and thereafter adapts itself, as it reads documents with unknown labels. Since the stream of opinions is subject to concept drift, we use two adaptation mechanisms: forward adaptation, where the classifier incorporates to the training set only those un-labeled documents that it considers informative enough in comparison to those seen thus far; and backward adaptation, where the classifier gradually forgets old documents by eliminating them from the model. We evaluate our method on opinionated tweets and show that it performs comparably or even better than a fully supervised baseline.\",\"PeriodicalId\":285655,\"journal\":{\"name\":\"Proceedings of the 29th Annual ACM Symposium on Applied Computing\",\"volume\":\"199 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-03-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 29th Annual ACM Symposium on Applied Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2554850.2555039\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 29th Annual ACM Symposium on Applied Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2554850.2555039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adaptive semi supervised opinion classifier with forgetting mechanism
Opinion stream classification methods face the challenge of learning with a limited amount of labeled data: inspecting and labeling opinions is a tedious task, so systems analyzing opinions must devise mechanisms that label the arriving stream of opinionated documents with minimal human intervention. We propose an opinion stream classifier that only uses a seed of labeled documents as input and thereafter adapts itself, as it reads documents with unknown labels. Since the stream of opinions is subject to concept drift, we use two adaptation mechanisms: forward adaptation, where the classifier incorporates to the training set only those un-labeled documents that it considers informative enough in comparison to those seen thus far; and backward adaptation, where the classifier gradually forgets old documents by eliminating them from the model. We evaluate our method on opinionated tweets and show that it performs comparably or even better than a fully supervised baseline.