{"title":"Evaluation of the Coherence of Ukrainian Texts Using a Transformer Architecture","authors":"A. Kramov, S. Pogorilyy","doi":"10.1109/ATIT50783.2020.9349355","DOIUrl":null,"url":null,"abstract":"The coherence of a text interprets the thematic connection between the different parts of the text. According to the lack of the unified structure of a text and the necessity to estimate the semantic meaning of its parts, different machine learning methods, namely, deep learning techniques should be used to evaluate the coherence of a text. In this paper, different baseline and state-of-the-art methods of the coherence evaluation based on deep learning paradigms have been analyzed in detail. According to the disability of analyzed methods to take into account both word ordering and long-distant relations within a text, a new Transformer-based neural network model has been proposed. The principle of work and advantages of a self-attention mechanism has been considered. The training of the proposed model has been performed on the corpus of Ukrainian news. In order to estimate the accuracy of the model and to take into account the imbalance of classes, corresponding metrics have been calculated with further detection of an appropriate threshold value of an output. The effectiveness of the model has been examined on the document discrimination and insertion tasks. The results obtained may indicate that the proposed model outperforms baseline methods and can be used to estimate the coherence of both Ukrainian texts and foreign language documents.","PeriodicalId":312916,"journal":{"name":"2020 IEEE 2nd International Conference on Advanced Trends in Information Theory (ATIT)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 2nd International Conference on Advanced Trends in Information Theory (ATIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ATIT50783.2020.9349355","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The coherence of a text interprets the thematic connection between the different parts of the text. According to the lack of the unified structure of a text and the necessity to estimate the semantic meaning of its parts, different machine learning methods, namely, deep learning techniques should be used to evaluate the coherence of a text. In this paper, different baseline and state-of-the-art methods of the coherence evaluation based on deep learning paradigms have been analyzed in detail. According to the disability of analyzed methods to take into account both word ordering and long-distant relations within a text, a new Transformer-based neural network model has been proposed. The principle of work and advantages of a self-attention mechanism has been considered. The training of the proposed model has been performed on the corpus of Ukrainian news. In order to estimate the accuracy of the model and to take into account the imbalance of classes, corresponding metrics have been calculated with further detection of an appropriate threshold value of an output. The effectiveness of the model has been examined on the document discrimination and insertion tasks. The results obtained may indicate that the proposed model outperforms baseline methods and can be used to estimate the coherence of both Ukrainian texts and foreign language documents.