CogniTextes最新文献

筛选
英文 中文
Non-representativeness in corpora: perils, pitfalls and challenges 社团的非代表性:危险、陷阱和挑战
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/COGNITEXTES.1772
T. Egan
{"title":"Non-representativeness in corpora: perils, pitfalls and challenges","authors":"T. Egan","doi":"10.4000/COGNITEXTES.1772","DOIUrl":"https://doi.org/10.4000/COGNITEXTES.1772","url":null,"abstract":"This article presents and discusses some problems of representativeness that the author has encountered in over twenty years of corpus-based research. It argues that the inclusion in a general corpus of certain text types, such as grammar treatises or works of historical fiction, can lessen the representativeness of the data, especially if the corpus is designed to reflect the linguistic production, as opposed to the linguistic reception, of a speech community. It is argued that less emphasis should be placed on reception in the compilation of general corpora. Also addressed are problems relating to the comparison of texts in different languages, as well as two solutions that have been proposed to counter these problems. The arguments are illustrated with examples from both contemporary and historical corpora.","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48009696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A rejoinder to Maarten Lemmens’s paper ‘In defence of frequency generalisations and usage-based linguistics. An answer to Frederick Newmeyer’s “Conversational corpora : when big is beautiful”’ 对Maarten Lemmens论文“为频率概括和基于用法的语言学辩护”的回应。对弗雷德里克·纽迈耶的《会话语料库:当大的是美的时候》的回答
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/COGNITEXTES.1657
F. Newmeyer
{"title":"A rejoinder to Maarten Lemmens’s paper ‘In defence of frequency generalisations and usage-based linguistics. An answer to Frederick Newmeyer’s “Conversational corpora : when big is beautiful”’","authors":"F. Newmeyer","doi":"10.4000/COGNITEXTES.1657","DOIUrl":"https://doi.org/10.4000/COGNITEXTES.1657","url":null,"abstract":"1. Some introductory remarks First, I must express my heart-felt gratitude to Maarten Lemmens for writing a collegial, thought-provoking, and informative reply to my position paper. I learned a great deal from it and can honestly say that if I pursue the issues that I brought up in my piece I will have much recourse to Lemmens’s reply. That said, I found this an extremely difficult rejoinder to write. The reason for my difficulty derives from the great disconnect between the content and claim...","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44226630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Énonciation, corpus et representativité : le cas de « along » 发音、语料库和代表性:以“along”为例
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/COGNITEXTES.1517
Graham Ranger
{"title":"Énonciation, corpus et representativité : le cas de « along »","authors":"Graham Ranger","doi":"10.4000/COGNITEXTES.1517","DOIUrl":"https://doi.org/10.4000/COGNITEXTES.1517","url":null,"abstract":"La representativite d’un corpus est largement tributaire du rapport entre le corpus et la langue, ou variete de langue, dont le corpus represente un echantillon. Ce rapport ne peut s’evaluer de maniere satisfaisante, car la langue cible ne peut etre saisie que via un ensemble fini d’occurrences, en d’autres mots, un corpus. Nous preconisons par consequent d’evaluer la representativite selon les objectifs specifiques que l’on se fixe. Dans le cadre enonciatif, les marqueurs d’une langue sont decrits en termes d’un invariant, ou forme schematique (FS), qui est configuree par les operations des marqueurs associes, pour aboutir a des valeurs situees en contexte. L’elaboration de la FS se fait sur la base d’exemples authentiques, etudies en contexte et soumis par le linguiste aux manipulations, jugements d’acceptabilite, etc. avec la part d’intuition que ce processus implique. Cet article defend l’utilisation des methodologies de la linguistique de corpus dans un cadre enonciatif, afin de verifier et d’orienter la modelisation theorique du langage. Cette methode sera exposee par une etude de cas du marqueur “along” qui s’appuie sur des donnees issues du British National Corpus, mettant en evidence differentes valeurs contextuelles du marqueur, derivables d’une FS, sur la base de mesures d’associativite.","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48126935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How large should a dense corpus be for reliable studies in early language acquisition ? 对于早期语言习得的可靠研究,密集语料库应该有多大?
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/COGNITEXTES.1483
C. Parisse
{"title":"How large should a dense corpus be for reliable studies in early language acquisition ?","authors":"C. Parisse","doi":"10.4000/COGNITEXTES.1483","DOIUrl":"https://doi.org/10.4000/COGNITEXTES.1483","url":null,"abstract":"Dense corpora have been put forward as necessary tools for corpus studies of language acquisition. Despite their great interest, they are not yet frequently used, probably because of the high cost involved in their creation. The goal of the present study was to predict the optimal size of a dense longitudinal corpus when used to infer, manually or automatically, the details of lexical or syntactic development in child language. The results show that corpora of at least 30 to 40 one-hour recordings are necessary, but that longer corpora using the same protocol provide little new information. Dense corpora are indeed very useful, but do not need to be overly large to study grammatical development. This has important consequences for corpus-building projects, which can be optimized. The existence of a limit to the amount of information provided by large corpora also has important consequences for linguistic theory, as this helps locate the threshold between learning frozen forms and generalizing knowledge about language structure.","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47112820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corpora and Representativeness: Where to go from now? 企业与代表性:从现在开始该何去何从?
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/cognitextes.1311
S. Raineri, C. Debras
{"title":"Corpora and Representativeness: Where to go from now?","authors":"S. Raineri, C. Debras","doi":"10.4000/cognitextes.1311","DOIUrl":"https://doi.org/10.4000/cognitextes.1311","url":null,"abstract":"Twentieth-century structuralist and generative linguists argued that the study of the language system (langue, competence) must be separated from the study of language use (parole, performance). For Saussure or Chomsky, no generalizations about language could be made based on the observation of patterns, regularities and rules of language performance. For Saussure, “Il n’y a donc rien de collectif dans la parole ; les manifestations en sont individuelles et momentanees. Ici il n’y a rien de p...","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47214498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
In defense of frequency generalizations and usage-based linguistics. An answer to Frederick Newmeyer’s “Conversational corpora : when big is beautiful” 为频率概括和基于用法的语言学辩护。对弗雷德里克·纽迈耶的《会话语料库:当大就是美的时候》的回答
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/COGNITEXTES.1616
M. Lemmens
{"title":"In defense of frequency generalizations and usage-based linguistics. An answer to Frederick Newmeyer’s “Conversational corpora : when big is beautiful”","authors":"M. Lemmens","doi":"10.4000/COGNITEXTES.1616","DOIUrl":"https://doi.org/10.4000/COGNITEXTES.1616","url":null,"abstract":"In his paper “Conversational corpora : when big is beautiful”, Newmeyer sets himself the goal of evaluating the relationship between corpus size and conclusions drawn from corpora regarding questions of grammatical theory. He formulates a strong critique against corpus research based on too small (conversational) corpora and in doing so, explicitly rejects the usage-based approach to language in which they are embedded. He argues that, unless they are based on large (conversational) corpora, frequency analyses do not give sufficiently reliable analyses compared to introspection-based analyses. In this response, I will counter some of the critique that Newmeyer levels against usage-based (or frequency-based) models, showing that, first of all, his criticism needs to be reevaluated and secondly, frequency-based analyses (and a usage-based approach more generally) do imply a radically different view on grammar which surpasses some of the shortcomings of introspection-based models.","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49198850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Handling Sign Language handshapes annotation with the typannot typefont 使用typannot typefont处理手语手型注释
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/COGNITEXTES.1401
Patrick Doan, D. Boutet, AdrienContesse, Claudia S. Bianchini, Claire Danet, Morgane Rébulard, J. Dauphin, Léa Chèvrefils, Chloé Thomas, Mathieu Réguer
{"title":"Handling Sign Language handshapes annotation with the typannot typefont","authors":"Patrick Doan, D. Boutet, AdrienContesse, Claudia S. Bianchini, Claire Danet, Morgane Rébulard, J. Dauphin, Léa Chèvrefils, Chloé Thomas, Mathieu Réguer","doi":"10.4000/COGNITEXTES.1401","DOIUrl":"https://doi.org/10.4000/COGNITEXTES.1401","url":null,"abstract":"Le systeme typographique Typannot, presente ici, permet de transcrire les formes des signes des Langues des Signes (LS). La structure generale de cette police est exposee dans ce papier. Trois niveaux d’information sont encodes : le parametre, les parties composant le parametre, les caracteristiques de chacune des parties. Afin de les visualiser a un niveau typographique, nous avons adopte quatre principes de conception : genericite, lisibilite, modularite et inscriptibilite. Ensemble, ils nous guident dans la representation et l’integration des trois niveaux d’information. Le systeme peut transcrire precisement un signe LS et afficher la transcription de maniere flexible grâce a deux modes de representation graphique : une forme generique et une forme composee. Ces deux formes visent a faciliter la transcription et ont le potentiel d’etre utilisees dans d’autres pratiques (par exemple : la lexicographie ou l’ecriture LS). Le fonctionnement de ce systeme typographique est ici decrit a travers le parametre de la configuration (conformation des doigts dans la main) ; les autres parametres suivent les memes principes de construction. Un clavier virtuel presentant plusieurs interfaces, en developpement, permet de composer les glyphes en combinant des caracteres. Quelques resultats d’analyse faite avec des transcriptions sous Typannot sont presentes. Ils montrent la granularite fine de la transcription.","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45685420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Baicchi, Annalisa, Digonnet, Rémi & Sandford, Jodi L. (Eds.), Sensory Perceptions in Language, Embodiment and Epistemology Baicchi,Annalisa,Digonnet,Rémi&Sandford,Jodi L.(编辑),语言中的感官感知,体现和认识论
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/cognitextes.1871
Wenjie Hong
{"title":"Baicchi, Annalisa, Digonnet, Rémi & Sandford, Jodi L. (Eds.), Sensory Perceptions in Language, Embodiment and Epistemology","authors":"Wenjie Hong","doi":"10.4000/cognitextes.1871","DOIUrl":"https://doi.org/10.4000/cognitextes.1871","url":null,"abstract":"Sensory organs can be viewed as a bridge connecting the human body with the environment, thus allowing interactions between our body and its surroundings. Sensory experience is therefore crucial to our understanding of the world and linguistic processes involved in it. The present book, Sensory perceptions in Language, Embodiment and Epistemology, represents a major contribution to highlighting how sensory perceptions shape our linguistic representation of the world from philosophical, lingui...","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43099639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Acknowledgement to Newmeyer 向Newmeyer致谢
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/COGNITEXTES.1664
M. Lemmens
{"title":"Acknowledgement to Newmeyer","authors":"M. Lemmens","doi":"10.4000/COGNITEXTES.1664","DOIUrl":"https://doi.org/10.4000/COGNITEXTES.1664","url":null,"abstract":"In this short statement, I would like to express my appreciation for the positive and constructive discussion that I feel this “Newmeyer-Lemmens tandem paper” has produced. First of all, I am quite happy that my response has proven useful to Newmeyer and would like to thank him for his kind acknowledgement. His critique that my response addresses bigger issues than the one his original paper envisaged is quite true. His article does indeed not talk about innateness and the poverty of the sti...","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42987136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
De l’uniformité du discours politique : analyse bibliométrique et linguistique de la catégorisation des discours politiques 政治话语一致性:政治话语分类的文献计量学与语言学分析
CogniTextes Pub Date : 2019-06-17 DOI: 10.4000/COGNITEXTES.1337
Julien Perrez, François Randour, Min Reuchamps
{"title":"De l’uniformité du discours politique : analyse bibliométrique et linguistique de la catégorisation des discours politiques","authors":"Julien Perrez, François Randour, Min Reuchamps","doi":"10.4000/COGNITEXTES.1337","DOIUrl":"https://doi.org/10.4000/COGNITEXTES.1337","url":null,"abstract":"Il existe une longue tradition de recherches linguistiques sur le discours politique, mais rares sont les reflexions sur ce que recouvre la notion de discours politique. Dans les etudes, les corpus mobilises emanent majoritairement des elites politiques (debats presidentiels, discours electoraux…), laissant d’autres formes de discours politiques, comme les discours mediatiques portant sur des sujets politiques ou les discours citoyens sous-representes. Dans ce contexte, cette contribution poursuit un double objectif. Tout d’abord, celui de comprendre quels types de discours sont categorises comme politiques dans les recherches en linguistique et quelles en sont les caracteristiques (types d’acteurs, thematiques, etc.). Pour repondre a ces questions, cette contribution propose une analyse bibliometrique basee sur la methode PRISMA portant sur un echantillon de 172 articles scientifiques issus de la base de donnees Scopus. Dans un deuxieme temps, nous posons la question de savoir dans quelle mesure la notion de discours politique renvoie a une realite uniforme d’un point de vue linguistique. Pour repondre a cette deuxieme question, nous etudions les caracteristiques formelles de trois sous-genres de discours politiques (debats parlementaires, debats televises et corpus citoyens) afin d’evaluer leur degre de divergence. Les resultats de ces analyses revelent une reelle difference entre ces trois corpus et nous permettent de mieux delimiter les contours de ce qui pourrait constituer le genre politique et ses registres textuels.","PeriodicalId":53774,"journal":{"name":"CogniTextes","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48673429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信