{"title":"Data-driven dependency parsing of Vedic Sanskrit","authors":"Oliver Hellwig, Sebastian Nehrdich, Sven Sellmer","doi":"10.1007/s10579-023-09636-5","DOIUrl":"https://doi.org/10.1007/s10579-023-09636-5","url":null,"abstract":"","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":"40 5","pages":"1173 - 1206"},"PeriodicalIF":2.7,"publicationDate":"2023-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41292134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Usage disambiguation of Turkish discourse connectives","authors":"Kezban Başıbüyük, Deniz Zeyrek","doi":"10.1007/s10579-022-09614-3","DOIUrl":"https://doi.org/10.1007/s10579-022-09614-3","url":null,"abstract":"","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":"1 1","pages":"1-34"},"PeriodicalIF":2.7,"publicationDate":"2023-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44000936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mining culture from professional discourse: a lexicon-based hybrid method","authors":"Yubin Qian","doi":"10.1007/s10579-022-09630-3","DOIUrl":"https://doi.org/10.1007/s10579-022-09630-3","url":null,"abstract":"","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":"1 1","pages":"1-17"},"PeriodicalIF":2.7,"publicationDate":"2023-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41462214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
W. Santos, Rafael Lage de Oliveira, Ivandré Paraboni
{"title":"SetembroBR: a social media corpus for depression and anxiety disorder prediction","authors":"W. Santos, Rafael Lage de Oliveira, Ivandré Paraboni","doi":"10.1007/s10579-022-09633-0","DOIUrl":"https://doi.org/10.1007/s10579-022-09633-0","url":null,"abstract":"","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":"1 1","pages":"1-28"},"PeriodicalIF":2.7,"publicationDate":"2023-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46624516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Camiel Colruyt, Orphée De Clercq, Thierry Desot, Véronique Hoste
{"title":"EventDNA: a dataset for Dutch news event extraction as a basis for news diversification.","authors":"Camiel Colruyt, Orphée De Clercq, Thierry Desot, Véronique Hoste","doi":"10.1007/s10579-022-09623-2","DOIUrl":"https://doi.org/10.1007/s10579-022-09623-2","url":null,"abstract":"<p><p>News organizations increasingly tailor their news offering to the reader through personalized recommendation algorithms. However, automated recommendation algorithms reflect a commercial logic based on calculated relevance to the user, rather than aiming at a well-informed citizenry. In this paper, we introduce the EventDNA corpus, a dataset of 1773 Dutch-language news articles annotated with information on entities, news events and IPTC Media Topic codes, with the ultimate goal to outline a recommendation algorithm that uses news event diversity rather than previous reading behaviour as a key driver for personalized news recommendation. We describe the EventDNA annotation guidelines, which are inspired by the well-known ERE framework and conclude that it is not practical to apply a fixed event typology such as used in ERE to an unrestricted data context. The corpus and related source code is made available at https://github.com/NewsDNA-LT3/.github.</p>","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":"57 1","pages":"189-221"},"PeriodicalIF":2.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9672586/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9136049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Andrej Pančur, Michał Rudolf, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Çağrı Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana D de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevičius, Tomas Krilavičius, Roberts Darǵis, Orsolya Ring, Ruben van Heusden, Maarten Marx, Darja Fišer
{"title":"The ParlaMint corpora of parliamentary proceedings.","authors":"Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Andrej Pančur, Michał Rudolf, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Çağrı Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana D de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevičius, Tomas Krilavičius, Roberts Darǵis, Orsolya Ring, Ruben van Heusden, Maarten Marx, Darja Fišer","doi":"10.1007/s10579-021-09574-0","DOIUrl":"https://doi.org/10.1007/s10579-021-09574-0","url":null,"abstract":"<p><p>This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17 European national parliaments with half a billion words. The corpora are uniformly encoded, contain rich meta-data about 11 thousand speakers, and are linguistically annotated following the Universal Dependencies formalism and with named entities. Samples of the corpora and conversion scripts are available from the project's GitHub repository, and the complete corpora are openly available via the CLARIN.SI repository for download, as well as through the NoSketch Engine and KonText concordancers and the Parlameter interface for on-line exploration and analysis.</p>","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":"57 1","pages":"415-448"},"PeriodicalIF":2.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8807381/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9190380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nico Blokker, André Blessing, Erenay Dayanik, Jonas Kuhn, Sebastian Padó, Gabriella Lapesa
{"title":"Between welcome culture and border fence: A dataset on the European refugee crisis in German newspaper reports.","authors":"Nico Blokker, André Blessing, Erenay Dayanik, Jonas Kuhn, Sebastian Padó, Gabriella Lapesa","doi":"10.1007/s10579-023-09641-8","DOIUrl":"https://doi.org/10.1007/s10579-023-09641-8","url":null,"abstract":"<p><p>Newspaper reports provide a rich source of information on the unfolding of public debates, which can serve as basis for inquiry in political science. Such debates are often triggered by critical events, which attract public attention and incite the reactions of political actors: crisis sparks the debate. However, due to the challenges of reliable annotation and modeling, few large-scale datasets with high-quality annotation are available. This paper introduces <i>DebateNet2.0</i>, which traces the political discourse on the 2015 European refugee crisis in the German quality newspaper <i>taz</i>. The core units of our annotation are political claims (requests for specific actions to be taken) and the actors who advance them (politicians, parties, etc.). Our contribution is twofold. First, we document and release <i>DebateNet2.0</i> along with its companion R package, mardyR. Second, we outline and apply a Discourse Network Analysis (DNA) to <i>DebateNet2.0</i>, comparing two crucial moments of the policy debate on the \"refugee crisis\": the migration flux through the Mediterranean in April/May and the one along the Balkan route in September/October. We guide the reader through the methods involved in constructing a discourse network from a newspaper, demonstrating that there is not one single discourse network for the German migration debate, but multiple ones, depending on the research question through the associated choices regarding political actors, policy fields and time spans.</p>","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":"57 1","pages":"121-153"},"PeriodicalIF":2.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9924208/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9192305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Urban, Evelyn Michelle Aguilar Panchi, Saetbyul Lee, Evgenia Brodetsky
{"title":"CINWA (database of terminology for cultivated plants in indigenous languages of northwestern South America): introducing a resource for research in ethnobiology, anthropology, historical linguistics, and interdisciplinary research on the neolithic transition in South America","authors":"M. Urban, Evelyn Michelle Aguilar Panchi, Saetbyul Lee, Evgenia Brodetsky","doi":"10.1007/s10579-022-09628-x","DOIUrl":"https://doi.org/10.1007/s10579-022-09628-x","url":null,"abstract":"","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":" ","pages":""},"PeriodicalIF":2.7,"publicationDate":"2022-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45561225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Understanding conversational interaction in multiparty conversations: the EVA Corpus","authors":"I. Mlakar, D. Verdonik, Simona Majhenic, M. Rojc","doi":"10.1007/s10579-022-09627-y","DOIUrl":"https://doi.org/10.1007/s10579-022-09627-y","url":null,"abstract":"","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":"57 1","pages":"641 - 671"},"PeriodicalIF":2.7,"publicationDate":"2022-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42107793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Book Review: the Routledge Handbook of Translation and Ethics","authors":"Hongzheng Li, Ruojin Wang","doi":"10.1007/s10579-022-09626-z","DOIUrl":"https://doi.org/10.1007/s10579-022-09626-z","url":null,"abstract":"","PeriodicalId":49927,"journal":{"name":"Language Resources and Evaluation","volume":"1 1","pages":"1 - 4"},"PeriodicalIF":2.7,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44377368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}