Automatic Text Summarization using Maximum Marginal Relevance for Health Ethics Protocol Document in Bahasa

Doni Putra Purbawa, Malikhah, Ratih Nur Esti Anggraini, R. Sarno
{"title":"Automatic Text Summarization using Maximum Marginal Relevance for Health Ethics Protocol Document in Bahasa","authors":"Doni Putra Purbawa, Malikhah, Ratih Nur Esti Anggraini, R. Sarno","doi":"10.1109/ICTS52701.2021.9607951","DOIUrl":null,"url":null,"abstract":"The rapid development of science and technology in the health sector cannot be separated from the support of health research results. Before the research with human as a subject is conducted, every health research in Indonesia is required to make a health ethics protocol document in Bahasa which must comply with basic ethical principles. To determine whether the health research ethics protocol document has met the ethical principles, the health research ethics protocol document will be reviewed by a competent reviewer. The health research ethics protocol document consists of several parts and has a large number of pages, so to conduct a review, reviewers need a long time to understand and analyze the health research ethics protocol document. To reduce the review time, an automatic text summarization (ATS) is needed. ATS extracts important information in health research ethics protocol documents and presents it to reviewers. This research uses cosine similarity and Maximum Marginal Relevance (MMR) and TextRank to summarize the document. The MMR method is considered to have more stable results than TextRank based on the ROUGE evaluation results. The evaluation of result with the ROUGE Toolkit showed F-score value of 19.92% for document 1 and 10.98% for document 2 using MMR.","PeriodicalId":6738,"journal":{"name":"2021 13th International Conference on Information & Communication Technology and System (ICTS)","volume":"31 1","pages":"324-329"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Information & Communication Technology and System (ICTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTS52701.2021.9607951","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The rapid development of science and technology in the health sector cannot be separated from the support of health research results. Before the research with human as a subject is conducted, every health research in Indonesia is required to make a health ethics protocol document in Bahasa which must comply with basic ethical principles. To determine whether the health research ethics protocol document has met the ethical principles, the health research ethics protocol document will be reviewed by a competent reviewer. The health research ethics protocol document consists of several parts and has a large number of pages, so to conduct a review, reviewers need a long time to understand and analyze the health research ethics protocol document. To reduce the review time, an automatic text summarization (ATS) is needed. ATS extracts important information in health research ethics protocol documents and presents it to reviewers. This research uses cosine similarity and Maximum Marginal Relevance (MMR) and TextRank to summarize the document. The MMR method is considered to have more stable results than TextRank based on the ROUGE evaluation results. The evaluation of result with the ROUGE Toolkit showed F-score value of 19.92% for document 1 and 10.98% for document 2 using MMR.
基于最大边际相关性的卫生伦理协议文件自动文本摘要
卫生科学技术的快速发展离不开卫生研究成果的支持。在进行以人类为研究对象的研究之前,印度尼西亚的每项卫生研究都必须制定一份卫生伦理协议文件,该文件必须符合基本的伦理原则。为确定卫生研究伦理方案文件是否符合伦理原则,卫生研究伦理方案文件将由有资格的审稿人进行审查。卫生研究伦理协议文件由几个部分组成,并且有大量的页面,因此要进行审查,审稿人需要很长时间来理解和分析卫生研究伦理协议文件。为了减少复习时间,需要自动文本摘要(ATS)。ATS从卫生研究伦理协议文件中提取重要信息并提交给审稿人。本研究使用余弦相似度和最大边际相关性(MMR)和TextRank来总结文档。基于ROUGE评价结果,MMR方法被认为比TextRank方法具有更稳定的结果。ROUGE Toolkit对结果的评价显示,MMR对文献1和文献2的f评分值分别为19.92%和10.98%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信