{"title":"保留元音和谐语言翻译文本质量的符号化分析。","authors":"Kazuya Hayata","doi":"10.3390/e27090984","DOIUrl":null,"url":null,"abstract":"<p><p>To date, the ordinal pattern-based method has been applied to problems in natural and social sciences. We report, for the first time to our knowledge, an attempt to apply this methodology to a topic in the humanities. Specifically, in an effort to investigate the applicability of the methodology in analyzing the quality of texts that are translated into a language preserving the so-called vowel harmony, computed results are presented for the metrics of divergence between the back-translated and the original texts. As a specific language we focus on Japanese, and as metrics the Hellinger distance as well as the chi-square statistic are employed. Here, the former is a typical information-theoretical measure that can be quantified in natural unit, nat for short, while the latter is useful for performing a non-parametric testing of a null hypothesis with a significance level. The methods are applied to three cases: a Japanese novel along with a translated version available, the Preamble to the Constitution of Japan, and seventeen translations of an opening paragraph of a famous American detective story, which include thirteen human and four machine translations using DeepL and Google Translate. Numerical results aptly show unexpectedly high scores of the machine translations, but it still might be too soon to speculate on their unconditional potentialities. Both our attempt and results are not only novel but are also expected to make a contribution toward an interdisciplinary study between physics and linguistics.</p>","PeriodicalId":11694,"journal":{"name":"Entropy","volume":"27 9","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12468654/pdf/","citationCount":"0","resultStr":"{\"title\":\"Symbolic Analysis of the Quality of Texts Translated into a Language Preserving Vowel Harmony.\",\"authors\":\"Kazuya Hayata\",\"doi\":\"10.3390/e27090984\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>To date, the ordinal pattern-based method has been applied to problems in natural and social sciences. We report, for the first time to our knowledge, an attempt to apply this methodology to a topic in the humanities. Specifically, in an effort to investigate the applicability of the methodology in analyzing the quality of texts that are translated into a language preserving the so-called vowel harmony, computed results are presented for the metrics of divergence between the back-translated and the original texts. As a specific language we focus on Japanese, and as metrics the Hellinger distance as well as the chi-square statistic are employed. Here, the former is a typical information-theoretical measure that can be quantified in natural unit, nat for short, while the latter is useful for performing a non-parametric testing of a null hypothesis with a significance level. The methods are applied to three cases: a Japanese novel along with a translated version available, the Preamble to the Constitution of Japan, and seventeen translations of an opening paragraph of a famous American detective story, which include thirteen human and four machine translations using DeepL and Google Translate. Numerical results aptly show unexpectedly high scores of the machine translations, but it still might be too soon to speculate on their unconditional potentialities. Both our attempt and results are not only novel but are also expected to make a contribution toward an interdisciplinary study between physics and linguistics.</p>\",\"PeriodicalId\":11694,\"journal\":{\"name\":\"Entropy\",\"volume\":\"27 9\",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12468654/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Entropy\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.3390/e27090984\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PHYSICS, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Entropy","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.3390/e27090984","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
Symbolic Analysis of the Quality of Texts Translated into a Language Preserving Vowel Harmony.
To date, the ordinal pattern-based method has been applied to problems in natural and social sciences. We report, for the first time to our knowledge, an attempt to apply this methodology to a topic in the humanities. Specifically, in an effort to investigate the applicability of the methodology in analyzing the quality of texts that are translated into a language preserving the so-called vowel harmony, computed results are presented for the metrics of divergence between the back-translated and the original texts. As a specific language we focus on Japanese, and as metrics the Hellinger distance as well as the chi-square statistic are employed. Here, the former is a typical information-theoretical measure that can be quantified in natural unit, nat for short, while the latter is useful for performing a non-parametric testing of a null hypothesis with a significance level. The methods are applied to three cases: a Japanese novel along with a translated version available, the Preamble to the Constitution of Japan, and seventeen translations of an opening paragraph of a famous American detective story, which include thirteen human and four machine translations using DeepL and Google Translate. Numerical results aptly show unexpectedly high scores of the machine translations, but it still might be too soon to speculate on their unconditional potentialities. Both our attempt and results are not only novel but are also expected to make a contribution toward an interdisciplinary study between physics and linguistics.
期刊介绍:
Entropy (ISSN 1099-4300), an international and interdisciplinary journal of entropy and information studies, publishes reviews, regular research papers and short notes. Our aim is to encourage scientists to publish as much as possible their theoretical and experimental details. There is no restriction on the length of the papers. If there are computation and the experiment, the details must be provided so that the results can be reproduced.