Human Values Estimation on News Articles through BERT-extracted Opinion Expressions

Yihong Han, Yoko Nishihara, Junjie Shan
{"title":"Human Values Estimation on News Articles through BERT-extracted Opinion Expressions","authors":"Yihong Han, Yoko Nishihara, Junjie Shan","doi":"10.1109/TAAI57707.2022.00026","DOIUrl":null,"url":null,"abstract":"This paper proposes a human values estimation method with an opinion extraction approach. We assume that opinion sentences reflect human values the most. For a given article, the proposed method extracts the opinion sentences from the texts and estimates the human values included in the opinion sentences. The opinion sentence extraction is conducted by classifying each sentence as an opinion sentence or a non-opinion sentence. The proposed method concatenates sentences from the same article to extend the input texts as an upsampling approach while estimating the human values. The upsampling approach enriches the information volume of the training data. The distribution of human values from each opinion sentence shows the article's general human values. We conducted two evaluation experiments. We used the editorial articles from Mainichi Newspaper as the corpus data. The first experiment evaluated the performance of opinion sentence extraction. The training accuracy of opinion sentence extraction was 92%. In the evaluation test, the model reached a $F_{1}$ score of 85.5%. The results showed that opinion sentences could be extracted with high accuracy. The second experiment evaluated the performance of human values estimation. There are five categories for human values estimation. The experiment was conducted with the same editorial articles from Mainichi Newspaper as the corpus data. We applied a training data enhancement approach by increasing the sentence number of training input and achieved a training accuracy up of over 50%. The results showed that the human values of opinion sentences could be estimated with high accuracy.","PeriodicalId":111620,"journal":{"name":"2022 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAAI57707.2022.00026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper proposes a human values estimation method with an opinion extraction approach. We assume that opinion sentences reflect human values the most. For a given article, the proposed method extracts the opinion sentences from the texts and estimates the human values included in the opinion sentences. The opinion sentence extraction is conducted by classifying each sentence as an opinion sentence or a non-opinion sentence. The proposed method concatenates sentences from the same article to extend the input texts as an upsampling approach while estimating the human values. The upsampling approach enriches the information volume of the training data. The distribution of human values from each opinion sentence shows the article's general human values. We conducted two evaluation experiments. We used the editorial articles from Mainichi Newspaper as the corpus data. The first experiment evaluated the performance of opinion sentence extraction. The training accuracy of opinion sentence extraction was 92%. In the evaluation test, the model reached a $F_{1}$ score of 85.5%. The results showed that opinion sentences could be extracted with high accuracy. The second experiment evaluated the performance of human values estimation. There are five categories for human values estimation. The experiment was conducted with the same editorial articles from Mainichi Newspaper as the corpus data. We applied a training data enhancement approach by increasing the sentence number of training input and achieved a training accuracy up of over 50%. The results showed that the human values of opinion sentences could be estimated with high accuracy.
基于bert提取意见表达的新闻文章人文价值评估
本文提出了一种基于意见提取的人的价值估计方法。我们认为意见句最能反映人类的价值观。对于给定的文章,该方法从文本中提取意见句,并估计意见句中包含的人类价值。意见句提取是通过将每个句子分类为意见句或非意见句来进行的。该方法将同一篇文章中的句子连接起来,作为一种上采样方法来扩展输入文本,同时估计人的价值。上采样方法丰富了训练数据的信息量。每个观点句的人文价值分布显示了文章的一般人文价值。我们进行了两次评估实验。我们使用每日新闻的社论文章作为语料库数据。第一个实验评估了意见句提取的性能。意见句提取的训练正确率为92%。在评价测试中,模型达到了85.5%的$F_{1}$得分。结果表明,该方法能够以较高的准确率提取意见句。第二个实验评估了人类价值估计的性能。人类价值估计有五种类型。实验采用《每日新闻》的社论文章作为语料库数据。我们采用了训练数据增强的方法,通过增加训练输入的句子数,使训练准确率提高了50%以上。结果表明,该方法能够较准确地估计意见句的人的价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信