Content development using N-gram model in custom writing style

J. Dhar, Vipul Gandhi
DOI: 10.1109/INCITE.2016.7857630 (https://doi.org/10.1109/INCITE.2016.7857630)
Published: 2016-10-01
Journal: 下一代 ("Next Generation"), indexed via Semantic Scholar
Citations: 0

Abstract

Amateur writers often struggle, and frequently make errors, when they write in a style different from their own. Readers lose interest as a result, and the author's intended meaning is sometimes misinterpreted. This work addresses the problem by ranking the best available word choices for the writing style the author wants to adopt. Our methodology lets authors choose among default, formal, and literary styles. Authors can also draw on words and stylistic traces from their own past writing by building a custom corpus of their write-ups. We propose a spell checker and an N-gram-based statistical model, combined with a corpus-based technique, to achieve these objectives. A 4-gram model (N-gram of rank 4) with backoff smoothing gave the best results in our work. To demonstrate the method's effectiveness, we tested it on real-time data, and the performance evaluation produced satisfactory results.
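
The abstract does not say which backoff variant the authors used, so the following is only a minimal sketch of the general idea: count N-grams up to order 4 in a style corpus, then rank candidate next words, backing off to shorter contexts when the longer one is unseen. It assumes the simple "stupid backoff" scheme with a fixed discount `alpha = 0.4`; the function names `train_ngrams` and `rank_next_words`, the toy corpus, and the discount value are all illustrative assumptions, not the paper's implementation.

```python
from collections import Counter, defaultdict


def train_ngrams(tokens, max_order=4):
    """Count, for every context of length 0..max_order-1, the words that follow it."""
    counts = defaultdict(Counter)  # context tuple -> Counter of next words
    for i, word in enumerate(tokens):
        for order in range(1, max_order + 1):
            start = i - (order - 1)
            if start < 0:
                break  # not enough preceding tokens for this order
            counts[tuple(tokens[start:i])][word] += 1
    return counts


def rank_next_words(counts, history, max_order=4, alpha=0.4, top_k=3):
    """Rank candidate next words with stupid backoff (an assumed variant).

    Each word is scored by its relative frequency in the longest context
    where it was observed, multiplied by alpha once per backoff step.
    """
    context = tuple(history[-(max_order - 1):]) if max_order > 1 else ()
    scores = {}
    penalty = 1.0
    while True:
        followers = counts.get(context)
        if followers:
            total = sum(followers.values())
            for word, c in followers.items():
                # Keep the score from the longest matching context only.
                scores.setdefault(word, penalty * c / total)
        if not context:
            break
        context = context[1:]  # drop the oldest word and back off
        penalty *= alpha
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]


# Toy "style corpus"; a real one would be the default, formal, literary,
# or custom corpus the author selects.
corpus = "we propose a model . we propose a method . we test a model .".split()
model = train_ngrams(corpus)
print(rank_next_words(model, ["we", "test", "a"]))
```

Swapping in a different corpus changes the ranking without touching the code, which is the sense in which one model can serve several writing styles.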