A Machine Learning Model for the Dating of Ancient Chinese Texts

2019 International Conference on Asian Language Processing (IALP) Pub Date : 2019-11-01 DOI:10.1109/IALP48816.2019.9037653

Xuejin Yu, W. Huangfu

引用次数: 2

Abstract

This paper, with the intent of solving the issues on the dating of ancient Chinese texts, takes advantage of the Long-Short Term Memory Network (LSTM) to analyze and process the character sequence in ancient Chinese. In this model, each character is transformed into a high-dimensional vector, and then vectors and the non-linear relationships among them are read and analyzed by LSTM, which finally achieve the dating tags. Experimental results show that the LSTM has a strong ability to date the ancient texts, and the precision reaches about 95% in our experiments. Thus, the proposed model offers an effective method on how to date the ancient Chinese texts. It also inspires us to actively improve the time-consuming analysis tasks in the Chinese NLP field.

查看原文本刊更多论文

中国古代文献年代测定的机器学习模型

本文利用长短期记忆网络(LSTM)对古汉语汉字序列进行分析和处理，旨在解决古汉语文本的年代测定问题。在该模型中，将每个字符转换成一个高维向量，然后通过LSTM对向量及其之间的非线性关系进行读取和分析，最终实现日期标记。实验结果表明，LSTM具有较强的古文本定年能力，在我们的实验中准确率达到95%左右。因此，该模型为中国古代文献的年代确定提供了一种有效的方法。这也激励我们积极改进中国NLP领域耗时的分析任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 International Conference on Asian Language Processing (IALP)

自引率

0.00%

发文量