在线论坛建议挖掘中建议揭示句子的自动提取

A. Wicaksono, Sung-Hyon Myaeng
{"title":"在线论坛建议挖掘中建议揭示句子的自动提取","authors":"A. Wicaksono, Sung-Hyon Myaeng","doi":"10.1145/2479832.2479857","DOIUrl":null,"url":null,"abstract":"Web forums often contain explicit key learnings gleaned from people's experiences since they are platforms for personal communications on sharing information with others. One of the key learnings contained inWeb forums is often expressed in the form of advice. As part of human experience mining from Web resources, we aim to provide a methodology to extract advice-revealing sentences from Web forums due to its usefulness, especially in travel domain. Instead of viewing the problem as a simple classification, we define it as a sequence labeling problem using various features. We identify three different types of features (i.e., syntactic features, context features, and sentence informativeness) and propose a new way of using Hidden Markov Model (HMM) for labeling sequential sentences, which in our experiment gave the best performance for our task. Moreover, the sentence informativeness score serves as an important feature for this task. It is worth noting that this work is the first attempt to extract advice-revealing sentences from Web forums.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"123 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":"{\"title\":\"Automatic extraction of advice-revealing sentences foradvice mining from online forums\",\"authors\":\"A. Wicaksono, Sung-Hyon Myaeng\",\"doi\":\"10.1145/2479832.2479857\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Web forums often contain explicit key learnings gleaned from people's experiences since they are platforms for personal communications on sharing information with others. One of the key learnings contained inWeb forums is often expressed in the form of advice. As part of human experience mining from Web resources, we aim to provide a methodology to extract advice-revealing sentences from Web forums due to its usefulness, especially in travel domain. Instead of viewing the problem as a simple classification, we define it as a sequence labeling problem using various features. We identify three different types of features (i.e., syntactic features, context features, and sentence informativeness) and propose a new way of using Hidden Markov Model (HMM) for labeling sequential sentences, which in our experiment gave the best performance for our task. Moreover, the sentence informativeness score serves as an important feature for this task. It is worth noting that this work is the first attempt to extract advice-revealing sentences from Web forums.\",\"PeriodicalId\":388497,\"journal\":{\"name\":\"Proceedings of the seventh international conference on Knowledge capture\",\"volume\":\"123 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"24\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the seventh international conference on Knowledge capture\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2479832.2479857\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the seventh international conference on Knowledge capture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2479832.2479857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 24

摘要

网络论坛通常包含从人们的经验中收集到的明确的关键知识,因为它们是与他人分享信息的个人交流平台。web论坛中包含的关键知识之一通常以建议的形式表示。作为从Web资源中挖掘人类经验的一部分,我们的目标是提供一种从Web论坛中提取建议揭示句子的方法,因为它很有用,特别是在旅游领域。我们不把这个问题看作一个简单的分类问题,而是把它定义为一个使用各种特征的序列标记问题。我们识别了三种不同类型的特征(即句法特征、上下文特征和句子信息性),并提出了一种使用隐马尔可夫模型(HMM)标记顺序句子的新方法,该方法在我们的实验中为我们的任务提供了最佳性能。此外,句子信息得分是该任务的一个重要特征。值得注意的是,这项工作是首次尝试从网络论坛中提取建议揭示语句。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automatic extraction of advice-revealing sentences foradvice mining from online forums
Web forums often contain explicit key learnings gleaned from people's experiences since they are platforms for personal communications on sharing information with others. One of the key learnings contained inWeb forums is often expressed in the form of advice. As part of human experience mining from Web resources, we aim to provide a methodology to extract advice-revealing sentences from Web forums due to its usefulness, especially in travel domain. Instead of viewing the problem as a simple classification, we define it as a sequence labeling problem using various features. We identify three different types of features (i.e., syntactic features, context features, and sentence informativeness) and propose a new way of using Hidden Markov Model (HMM) for labeling sequential sentences, which in our experiment gave the best performance for our task. Moreover, the sentence informativeness score serves as an important feature for this task. It is worth noting that this work is the first attempt to extract advice-revealing sentences from Web forums.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信