J. Lang. Model.: Latest Articles, Page 10

An informal discovery procedure for two-level rules

J. Lang. Model. Pub Date : 2013-07-22 DOI: 10.15398/jlm.v1i1.62

K. Koskenniemi

Citations: 6

Journal of Language Modelling

J. Lang. Model. Pub Date : 2012-12-18 DOI: 10.15398/jlm.v0i1.64

A. Przepiórkowski

Abstract: Welcome to the inaugural issue of the Journal of Language Modelling (JLM), a free open-access peer-reviewed journal aiming to help bridge the gap between theoretical linguistics and natural language processing (NLP). Setting up a new journal is not a trivial task, and running it possibly for decades requires determination and perseverance, so any such enterprise should not be taken up lightly. The publication of this issue has been preceded by years of growing conviction that there is no appropriate forum for the exchange of ideas between theoretical, formal and computational linguists. Many conversations with our colleagues – both linguists and NLP practitioners – convinced us that such a forum is indeed needed. Ideally, JLM papers should be accessible to many readers of such periodicals as Natural Language and Linguistic Theories, Journal of Linguistics, Language or Lingua on one hand, and Computational Linguistics, Journal of Natural Language Processing, Journal of Logic, Language and Information or Language Resources and Evaluation, on the other. The affinity to another relatively young journal, Linguistic Issues in Language Technology, should also be clear. On the map of the main linguistic and NLP conferences, we see JLM as close to conferences devoted to constraint-based and formal linguistic theories (HPSG, LFG, TAG, Construction Grammar; Dependency Grammar in general and Meaning-Text Theory in particular; etc.), the Formal Grammar conference at ESSLLI, COLING, Treebanks and Linguistic Theories, etc., but also to LREC (Language Resources and Evaluation Conference), TSD (Text, Speech and Dialogue) or the xTAL series of conferences (see Jap-

Citations: 4

Slovak Morphosyntactic Tagset

J. Lang. Model. Pub Date : 2012-12-18 DOI: 10.15398/jlm.v0i1.35

R. Garabík, M. Šimková

Citations: 10

Derivational and Semantic Relations of Croatian Verbs

J. Lang. Model. Pub Date : 2012-12-18 DOI: 10.15398/jlm.v0i1.34

Krešimir Šojat, Matea Srebačić, Marko Tadić

Citations: 21

The Case for the Journal's Use of a CC-BY License

J. Lang. Model. Pub Date : 2012-12-18 DOI: 10.15398/jlm.v0i1.58

S. Shieber

Citations: 0

Exploiting Prosody for Syntactic Analysis in Automatic Speech Understanding

J. Lang. Model. Pub Date : 2012-12-18 DOI: 10.15398/jlm.v0i1.31

György Szaszák, A. Beke

Abstract: The relation between syntax and prosody is evident, even if the prosodic structure cannot be directly mapped to the syntactic one and vice versa. Syntax-to-prosody mapping is widely used in text-to-speech applications, but prosody-to-syntax mapping is mostly missing from automatic speech recognition/understanding systems. This paper presents an experiment towards filling this gap and evaluating whether a HMM-based automatic prosodic segmentation tool can be used to support the reconstruction of the syntactic structure directly from speech. Results show that up to 85% of syntactic clause boundaries and up to about 70% of embedded syntactic phrase boundaries could be identified based on the detection of phonological phrases. Recall rates do not depend further on syntactic layering, in other words, whether the phrase is multiply embedded or not. Clause boundaries can be well assigned to intonational phrase level in read speech and can be well separated from lower level syntactic phrases based on the type of the aligned phonological phrase(s). These findings can be exploited in speech understanding systems, allowing for the recovery of the skeleton of the syntactic structure, based purely on the speech signal.

Citations: 13

The Bulgarian National Corpus: Theory and Practice in Corpus Design

J. Lang. Model. Pub Date : 2012-12-18 DOI: 10.15398/jlm.v0i1.33

S. Koeva, I. Stoyanova, S. Leseva, Rositsa Dekova, Tsvetana Dimitrova, Ekaterina Tarpomanova

Abstract: The paper discusses several key concepts related to the development of corpora and reconsiders them in light of recent developments in NLP. On the basis of an overview of present-day corpora, we conclude that the dominant practices of corpus design do not utilise adequately the technologies and, as a result, fail to meet the demands of corpus linguistics, computational lexicology and computational linguistics alike. We proceed to lay out a data-driven approach to corpus design, which integrates the best practices of traditional corpus linguistics with the potential of the latest technologies allowing fast collection, automatic metadata description and annotation of large amounts of data. Thus, the gist of the approach we propose is that corpus design should be centred on amassing large amounts of mono- and multilingual texts and on providing them with a detailed metadata description and high-quality multi-level annotation. We go on to illustrate this concept with a description of the compilation, structuring, documentation, and annotation of the Bulgarian National Corpus (BulNC). At present it consists of a Bulgarian part of 979.6 million words, constituting the corpus kernel, and 33 Bulgarian-X language corpora, totalling 972.3 million words, 1.95 billion words altogether. The BulNC is supplied with a comprehensive metadata description, which allows us to organise the texts according to different principles. The Bulgarian part of the BulNC is automatically processed (tokenised and sentence split) and annotated at several levels: morphosyntactic tagging, lemmatisation, word-sense annotation, annotation of noun phrases and named entities. Some levels of annotation are also applied to the Bulgarian-English parallel corpus with the prospect of expanding multilingual annotation both in terms of linguistic levels and the number of languages for which it is available. We conclude with a brief evaluation of the quality of the corpus and an outline of its applications in NLP and linguistic research.

Citations: 24

A Personal Note on Open Access in Linguistics

J. Lang. Model. Pub Date : 2012-12-18 DOI: 10.15398/jlm.v0i1.52

Stefan Müller

Citations: 4

Henkin semantics for reasoning with natural language

J. Lang. Model. Pub Date : 1900-01-01 DOI: 10.15398/jlm.v3i2.113

Michael Hahn, F. Richter

Abstract: The frequency of intensional and non-first-order definable operators in natural languages constitutes a challenge for automated reasoning with the kind of logical translations that are deemed adequate by formal semanticists. Whereas linguists employ expressive higher-order logics in their theories of meaning, the most successful logical reasoning strategies with natural language to date rely on sophisticated first-order theorem provers and model builders. In order to bridge the fundamental mathematical gap between linguistic theory and computational practice, we present a general translation from a higher-order logic frequently employed in the linguistics literature, two-sorted Type Theory, to first-order logic under Henkin semantics. We investigate alternative formulations of the translation, discuss their properties, and evaluate the availability of linguistically relevant inferences with standard theorem provers in a test suite of inference problems stated in English. The results of the experiment indicate that translation from higher-order logic to first-order logic under Henkin semantics is a promising strategy for automated reasoning with natural languages. The paper is accompanied by the source code (cf. SUPP. FILES) of the grammar and reasoning architecture described in the paper.

Citations: 0

Simplicity and the form of grammars

J. Lang. Model. Pub Date : 1900-01-01 DOI: 10.15398/jlm.v9i1.257

N. Chomsky

Citations: 6