An Active Learning Based Support Tool for Extracting Hints of Tourism Development from Blog Articles

M. Tokuhisa, H. Shahana, M. Murata, J. Murakami
{"title":"An Active Learning Based Support Tool for Extracting Hints of Tourism Development from Blog Articles","authors":"M. Tokuhisa, H. Shahana, M. Murata, J. Murakami","doi":"10.1109/IIAI-AAI.2012.29","DOIUrl":null,"url":null,"abstract":"The present paper proposes a tool to help analysts make tourism development ideas while reading blog articles. Since reading the entire text of an article is time consuming, it is useful to extract from the blog articles significant sentences that are relevant to tourism development. The proposed tool extracts such sentences using a support vector machine (SVM) and an active learning method. In the first learning step, the proposed tool is trained using corpora that include hint-tags. The analyst then provides target blog articles to the tool and receives sentences as the results of the SVM classification. Some of these sentences are analyzed manually in order to annotate new hint-tags. In the second learning step, both the original corpora and the annotation results are used. Finally, the analyst reads plausible sentences extracted from the second classification of the target articles. In the experiments, we confirmed that the proposed active learning method provides better results than the simple learning method.","PeriodicalId":103053,"journal":{"name":"2012 IIAI International Conference on Advanced Applied Informatics","volume":"1648 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IIAI International Conference on Advanced Applied Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAI-AAI.2012.29","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The present paper proposes a tool to help analysts make tourism development ideas while reading blog articles. Since reading the entire text of an article is time consuming, it is useful to extract from the blog articles significant sentences that are relevant to tourism development. The proposed tool extracts such sentences using a support vector machine (SVM) and an active learning method. In the first learning step, the proposed tool is trained using corpora that include hint-tags. The analyst then provides target blog articles to the tool and receives sentences as the results of the SVM classification. Some of these sentences are analyzed manually in order to annotate new hint-tags. In the second learning step, both the original corpora and the annotation results are used. Finally, the analyst reads plausible sentences extracted from the second classification of the target articles. In the experiments, we confirmed that the proposed active learning method provides better results than the simple learning method.
从博客文章中提取旅游发展线索的主动学习支持工具
本文提出了一个工具来帮助分析人员在阅读博客文章的同时做出旅游发展的想法。由于阅读一篇文章的全文是很耗时的,所以从博客文章中提取与旅游发展相关的重要句子是有用的。该工具使用支持向量机(SVM)和主动学习方法提取此类句子。在第一个学习步骤中,使用包含提示标签的语料库对提出的工具进行训练。然后,分析人员向工具提供目标博客文章,并接收作为SVM分类结果的句子。其中一些句子是手动分析的,以便注释新的提示标签。在第二步学习中,同时使用原始语料库和标注结果。最后,分析人员阅读从目标文章的第二类中提取的似是而非的句子。在实验中,我们证实了所提出的主动学习方法比简单学习方法有更好的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信