识别一个人的母语

PROBLEMS IN PROGRAMMING Pub Date : 2022-12-01 DOI:10.15407/pp2022.03-04.271

Y.S. Lazorenko, I. Sinitsyn, V.L. Shevchenko

{"title":"识别一个人的母语","authors":"Y.S. Lazorenko, I. Sinitsyn, V.L. Shevchenko","doi":"10.15407/pp2022.03-04.271","DOIUrl":null,"url":null,"abstract":"With the great increase of population movement, caused either by temporarily needs due to travelling or by long-term ones due to work, etc., there appears a need to improve the processes of movement control and identification of groups of people. The primary need is to identify if a person belongs to a certain nationality or territory of primary residence. In addition, it can be useful for scientific social and political researches, as well as for the field of tourism and entertainment. Such identification of people by language environment can also help to find out more about the cultural environment, which will make possible to predict the preferences of entire groups of customers better. The analysis of the audio recording of the speaker’s speech was divided into 3 stages, each of which contains a defined step-by-step instruction for the processes of data preparation and handling. Firstly, the sound itself was analyzed, because voice and pronunciation are one of the fastest and most effective tools for identifying a person or a group of people. Then the audio recording was converted into text and an analysis of the lexical composition of the studied fragment of the conversation was done. At the end, the results of the program, got in the previous two stages, were compared and the complex evaluations were done according to the criterias determined in the theoretical researches. The mentioned criterias were used with weighting factors assigned to them. As a result, an assumption about the speech environment of the speaker was given by the program. The work also describes the factors which affects the formation of pronunciation and the change of various intonations, the relationship between text and sound. Some dependencies between sound parameters and the pronunciation of sounds in the Russian and Ukrainian languages were mathematically formalized. The topic of the work is wide, as it considers not only lexical, but also acoustic features of speech. In this way, not only changeable or situational qualities are taken into account, for example, the vocabulary of the conversation, but also those ones that are related to the person regardless of the context of the conversation, his or her emotional state or what language he or she is speaking at the very moment. The criterias considered in the work can help to make assumptions about the speaker’s «natural», most familiar language, in other words the language he or she uses most of the time in life.","PeriodicalId":313885,"journal":{"name":"PROBLEMS IN PROGRAMMING","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification of the native language of a person\",\"authors\":\"Y.S. Lazorenko, I. Sinitsyn, V.L. Shevchenko\",\"doi\":\"10.15407/pp2022.03-04.271\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the great increase of population movement, caused either by temporarily needs due to travelling or by long-term ones due to work, etc., there appears a need to improve the processes of movement control and identification of groups of people. The primary need is to identify if a person belongs to a certain nationality or territory of primary residence. In addition, it can be useful for scientific social and political researches, as well as for the field of tourism and entertainment. Such identification of people by language environment can also help to find out more about the cultural environment, which will make possible to predict the preferences of entire groups of customers better. The analysis of the audio recording of the speaker’s speech was divided into 3 stages, each of which contains a defined step-by-step instruction for the processes of data preparation and handling. Firstly, the sound itself was analyzed, because voice and pronunciation are one of the fastest and most effective tools for identifying a person or a group of people. Then the audio recording was converted into text and an analysis of the lexical composition of the studied fragment of the conversation was done. At the end, the results of the program, got in the previous two stages, were compared and the complex evaluations were done according to the criterias determined in the theoretical researches. The mentioned criterias were used with weighting factors assigned to them. As a result, an assumption about the speech environment of the speaker was given by the program. The work also describes the factors which affects the formation of pronunciation and the change of various intonations, the relationship between text and sound. Some dependencies between sound parameters and the pronunciation of sounds in the Russian and Ukrainian languages were mathematically formalized. The topic of the work is wide, as it considers not only lexical, but also acoustic features of speech. In this way, not only changeable or situational qualities are taken into account, for example, the vocabulary of the conversation, but also those ones that are related to the person regardless of the context of the conversation, his or her emotional state or what language he or she is speaking at the very moment. The criterias considered in the work can help to make assumptions about the speaker’s «natural», most familiar language, in other words the language he or she uses most of the time in life.\",\"PeriodicalId\":313885,\"journal\":{\"name\":\"PROBLEMS IN PROGRAMMING\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PROBLEMS IN PROGRAMMING\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15407/pp2022.03-04.271\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PROBLEMS IN PROGRAMMING","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15407/pp2022.03-04.271","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着由于旅行或工作等原因造成的临时需要或长期需要造成的人口流动的大量增加，似乎需要改进对人口群体的流动控制和识别过程。主要需要是确定一个人是否属于某一国籍或主要居住地。此外，它还可以用于科学、社会和政治研究，以及旅游和娱乐领域。这种通过语言环境对人的识别也有助于更多地了解文化环境，从而更好地预测整个客户群的偏好。演讲者演讲录音的分析分为3个阶段，每个阶段都包含一个明确的分步指令，用于数据准备和处理过程。首先，对声音本身进行分析，因为声音和发音是识别一个人或一群人最快、最有效的工具之一。然后将录音转换为文本，并对所研究的对话片段的词汇组成进行分析。最后，对前两阶段的方案结果进行了比较，并根据理论研究确定的准则进行了综合评价。使用上述标准并赋予其权重因子。因此，程序给出了一个关于说话人的语言环境的假设。文章还论述了影响语音形成的因素和各种语调的变化，文声关系。在俄语和乌克兰语中，声音参数和发音之间的一些依赖关系被数学形式化了。这项工作的主题很广泛，因为它不仅考虑了词汇，而且考虑了语音的声学特征。通过这种方式，不仅考虑了可变的或情境性的品质，例如，谈话的词汇，而且还考虑了那些与谈话的上下文、他或她的情绪状态或他或她当时说的是什么语言无关的与人有关的品质。工作中考虑的标准可以帮助对说话者的“自然”，最熟悉的语言做出假设，换句话说，他或她在生活中大部分时间使用的语言。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Identification of the native language of a person

With the great increase of population movement, caused either by temporarily needs due to travelling or by long-term ones due to work, etc., there appears a need to improve the processes of movement control and identification of groups of people. The primary need is to identify if a person belongs to a certain nationality or territory of primary residence. In addition, it can be useful for scientific social and political researches, as well as for the field of tourism and entertainment. Such identification of people by language environment can also help to find out more about the cultural environment, which will make possible to predict the preferences of entire groups of customers better. The analysis of the audio recording of the speaker’s speech was divided into 3 stages, each of which contains a defined step-by-step instruction for the processes of data preparation and handling. Firstly, the sound itself was analyzed, because voice and pronunciation are one of the fastest and most effective tools for identifying a person or a group of people. Then the audio recording was converted into text and an analysis of the lexical composition of the studied fragment of the conversation was done. At the end, the results of the program, got in the previous two stages, were compared and the complex evaluations were done according to the criterias determined in the theoretical researches. The mentioned criterias were used with weighting factors assigned to them. As a result, an assumption about the speech environment of the speaker was given by the program. The work also describes the factors which affects the formation of pronunciation and the change of various intonations, the relationship between text and sound. Some dependencies between sound parameters and the pronunciation of sounds in the Russian and Ukrainian languages were mathematically formalized. The topic of the work is wide, as it considers not only lexical, but also acoustic features of speech. In this way, not only changeable or situational qualities are taken into account, for example, the vocabulary of the conversation, but also those ones that are related to the person regardless of the context of the conversation, his or her emotional state or what language he or she is speaking at the very moment. The criterias considered in the work can help to make assumptions about the speaker’s «natural», most familiar language, in other words the language he or she uses most of the time in life.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

PROBLEMS IN PROGRAMMING

自引率

0.00%

发文量