Adam Meyers, Zachary Glass, Angus B. Grieve-Smith, Yifan He, Shasha Liao, R. Grishman
{"title":"Jargon-Term Extraction by Chunking","authors":"Adam Meyers, Zachary Glass, Angus B. Grieve-Smith, Yifan He, Shasha Liao, R. Grishman","doi":"10.3115/v1/W14-6002","DOIUrl":"https://doi.org/10.3115/v1/W14-6002","url":null,"abstract":"NLP definitions of Terminology are usually application-dependent. IR terms are noun sequences that characterize topics. Terms can also be arguments for relations like abbreviation, definition or IS-A. In contrast, this paper explores techniques for extracting terms fitting a broader definition: noun sequences specific to topics and not well-known to naive adults. We describe a chunking-based approach, an evaluation, and applications to non-topic-specific relation extraction.","PeriodicalId":446117,"journal":{"name":"Proceedings of the COLING Workshop on Synchronic and Diachronic Approaches to Analyzing Technical Language","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126612476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Study of Scientific Writing: Comparing Theoretical Guidelines with Practical Implementation","authors":"Mark Kröll, Gunnar Schulze, Roman Kern","doi":"10.3115/v1/W14-6006","DOIUrl":"https://doi.org/10.3115/v1/W14-6006","url":null,"abstract":"Good scientific writing is a skill researchers seek to acquire. Textbook literature provides guidelines to improve scientific writing, for instance, “use active voice when describing your own work”. In this paper we investigate to what extent researchers adhere to textbook principles in their articles. In our analyses we examine a set of selected principles which (i) are general and (ii) verifiable by applying text mining and natural language processing techniques. We develop a framework to automatically analyse a large data set containing 14.000 scientific articles received from Mendeley and PubMed. We are interested in whether adhering to writing principles is related to scientific quality, scientific domain or gender and whether these relations change over time. Our results show (i) a clear relation between journal quality and scientific imprecision, i.e. journals with low impact factors exhibit higher numbers of imprecision indicators such as number of citation bunches and number of relativating words and (ii) that writing style partly depends on domain characteristics and preferences.","PeriodicalId":446117,"journal":{"name":"Proceedings of the COLING Workshop on Synchronic and Diachronic Approaches to Analyzing Technical Language","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134201496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}