{"title":"Identifying Errors in Russian Web Corpora","authors":"M. Khokhlova","doi":"10.2478/jazcas-2022-0021","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0021","url":null,"abstract":"Abstract The explosion of the Web leads to the production of large amounts of texts and inevitably influences their quality. Errors that tend to occur more often can distort results, especially when texts are used for scientific purposes, in language teaching or learning. Hence, there is a need to examine the existing corpora based on web texts and to clean up the data, which may contain such “noisy” fragments. In our study, we deal with the problem of errors and analyze the Aranea Russicum Maximum corpus. Among such errors, we can name, above all, encoding errors, incorrect font types, as well as segments written in other languages. These phenomena result in incorrect morphological analysis and lemmatization, frequency distortion, as well as the fact that lexical units cannot be found and therefore displayed to corpus users. The paper focuses on the errors, describes their types and outlines possible ways to eliminate them.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125571600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Equal language communication","authors":"J. Dolník","doi":"10.2478/jazcas-2022-0025","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0025","url":null,"abstract":"Abstract The pragmatic reason for supporting the promotion of equal language communication is that this communication is part of a social event that aims to develop an inclusive society. However, this support also has an anthropological reason, which is based on the reconstruction of the origin of man as a being of his kind and his humanization. His humanization is based on continuous self-realization and rational social solidarity, consisting in the fact that the actors of social interaction are committed to creating a lively environment for parity self-realization. Such self-realization is governed by the principle of equality differentiation, which includes the contextual neutralization of relevant differences between actors. Transparency is essential for overcoming equality barriers in language communication, but it is important to perceive the opposition between levelling and cultivated equality communication. This contrast draws attention to the idea of optimally transparent language, which leads the actors to set the optimal ratio between transparent and non-transparent forms of expression. Since the humanization of individuals and society corresponds to cultivated equality communication, the order of the day is to increase the volume of language communication in society with such a degree of non-transparency (negativity) that stimulates the interpretive activity of the actors. When examining equal language communication with regard to its impact on the cultural level of society, one of the relevant issues is the adaptation of the relationship between positive and negative transparency to current communication events.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126988802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Contours of style and stylistics from Jozef Mistrík to the digital era","authors":"Oľga Orgoňová","doi":"10.2478/jazcas-2022-0026","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0026","url":null,"abstract":"Abstract The aim of the study is to follow the metamorphoses of the notion of style (above all) in Slovak stylistics within the last about 30 years. From the initial recapitulation of the traditional ideas of structural linguistics as a science aimed at non-contextual linguistic models of the manners of shaping the texts, the perspective of viewing style is being shifted to viewing it as a socially conditioned speech lingual activity. The culmination is represented by the profiling of style as a construction of a contextually conditioned cultural reflection of text in context. The current communicatively conditioned concepts of style anchored in discourse reality reflect the updated needs of people of the digital era. A non-negligible prerequisite for effectively grasping style (especially in interaction) is the competence of the acting people to interact optimally and in an assimilation-accommodation manner with the surrounding world in the name of harmony, understanding and inclusive behaviour.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116790268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Kladenští type as a problem of automatic morphological analysis","authors":"Klára Osolsobe, Hana Zizková","doi":"10.2478/jazcas-2022-0011","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0011","url":null,"abstract":"Abstract The aim of our paper is to demonstrate the procedures by which the data needed to refine tools for automatic morphological analysis of Czech can be obtained using a corpus, namely the Araneum Bohemicum IV Maximum (Czech, 20.03) 7.10 G web corpus of the ARANEA series and Araneum Bohemicum Maximum (Czech, 15.04) 3.20 G (hereinafter Araneum). Particularly, we will focus on propria of the Kladenští type, i.e., substantivized adjectives denoting groups of persons according to affiliation. The goal of the probe into the Aranea web corpus is: 1) a corpus-based description of frequented properties of the Kladenští type, which can be used as a starting point for rule disambiguation; 2) creating a list of the most frequent lemmas belonging to the Kladenští type, which can then be included into dictionaries of automatic morphological analyzers (e.g. the MorfFlex dictionary by Hajič and Hlaváčová). We believe that the probe can help improve the results of tools for automatic morphological analysis of Czech.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127541383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Developmental Stylistics","authors":"Lucie Saicová Římalová","doi":"10.2478/jazcas-2022-0028","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0028","url":null,"abstract":"Abstract Developmental stylistics, a sub-discipline of stylistics, studies the development of and changes in stylistic competence during a speaker’s life. This developmental perspective is under-used in stylistic research. The article summarises current definitions of developmental stylistics and proposes a set of new and complex parameters for a discipline that studies stylistic competence at all stages of life, in both production and reception, and in typical and less typical contexts. It discusses the main factors which make ‘developmental’ research within stylistics problematic, and which may contribute to the fragmented nature of the discipline. The wide range of topics within developmental research often calls for an interdisciplinary approach. The great variability in the way speakers communicate and interact with one another, the limited capacity to make generalisations from the research, and the analysis of various types of ‘imperfect’ language all present significant methodological challenges.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130172322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Chinese Language Word Embeddings Based on the Corpus Hanku","authors":"R. Garabík","doi":"10.2478/jazcas-2022-0023","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0023","url":null,"abstract":"Abstract Vector models based on word embeddings are an indispensable part of advanced Natural Language Processing research and language analysis. We describe several Chinese language (Pǔtōnghuà) word embeddings, the differences from “western” language models caused by specific orthographic and linguistic features of the written Chinese language, and introduce a publicly available web interface for querying the vector models, aimed at linguistically or pedagogically oriented users.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126170012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The faces of style and stylistics","authors":"S. Gajda","doi":"10.2478/jazcas-2022-0024","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0024","url":null,"abstract":"Abstract The starting point for the author’s considerations is the conception of style as the humanistic structure of text, which he uses as the foundation of integrating stylistic synthesis. The synthesis is to be characterized by the following: 1) holism (moderately and functionally directed, brought down to a system in which relations connecting different manners of the existence and functioning of language in text are exposed); 2) integrationism (integrating knowledge of language reality, primarily of the functioning of language in text); 3) intermethodologism and transdisciplinariness. Acceptance of this conception of style offers – regarding the understanding of it – a very broad and open cognitive space. There may appear in it different frameworks which present diverse faces of style and stylistics. Their cognitive use enriches the integrative synthesis.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129016717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Non-literary communicative acts in literary style","authors":"P. Mareș","doi":"10.2478/jazcas-2022-0031","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0031","url":null,"abstract":"Abstract Many literary texts prove a tendency to simulate features of diverse communicative acts, styles and genres that are primarily applied in non-literary spheres of communication (for instance journalism, administration, science). The aim of this paper is to suggest a possible theoretical framework for this phenomenon and to describe some ways of simulating non-literary styles in literary texts. The aforementioned simulating can be seen as a form of interdiscursivity that introduces external elements into literary texts. The functioning of it is conditioned by a process of decontextualization and recontextualization that is signalled by various paratextual and textual cues. Taking three Czech prosaic works as examples, the concrete use of this form of interdiscursivity is illustrated in the last part of the paper.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128149020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparative Corpus-Driven Study of Prepositional Semantics in Russian and Czech","authors":"V. Zakharov","doi":"10.2478/jazcas-2022-0020","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0020","url":null,"abstract":"Abstract This paper deals with prepositions with causal meaning in Russian and Czech. In Slavic languages prepositions are closely connected to cases. Russian and Czech prepositions have many common features. Prepositions show a relation in space or time or a special relationship between two or more people, places, things or situations. In the current paper we are dealing with causal relations. There are different ways to express them. Among these means, the most common are prepositional-case forms and complex sentences with a subordinate causal part. We analyze the repertoire of causal prepositions in both languages and describe their statistical representation in corpora. Another task is to reveal translation equivalents between two languages.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128042296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Some observations on the composition by blending in contemporary French from Petit Robert","authors":"Radka Mudrochová","doi":"10.2478/jazcas-2022-0015","DOIUrl":"https://doi.org/10.2478/jazcas-2022-0015","url":null,"abstract":"Abstract The objective of this article is to analyze the composition by amalgam in current French by focusing on the one hand on the notion of amalgam in linguistics and on the other hand on the use and the frequency of use of the chosen amalgams in the diatopic variation of French. The notion of amalgam and / or of portmanteau word does not seem obvious and the explanations or definitions offered by dictionaries as well as by works on lexicology are not unanimous and differ from one another. Before presenting the results of a more detailed research, we therefore find it essential to frame the contribution in a theoretical context dealing with the notion of amalgam, or even of portmanteau word, which allows us to better understand the whole problem.","PeriodicalId":262732,"journal":{"name":"Journal of Linguistics/Jazykovedný casopis","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134450456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}