Ekaterina Shutova, J. Kaplan, Simone Teufel, A. Korhonen
{"title":"A computational model of logical metonymy","authors":"Ekaterina Shutova, J. Kaplan, Simone Teufel, A. Korhonen","doi":"10.1145/2483969.2483973","DOIUrl":"https://doi.org/10.1145/2483969.2483973","url":null,"abstract":"The use of figurative language is ubiquitous in natural language texts and it is a serious bottleneck in automatic text understanding. A system capable of interpreting figurative expressions would be an invaluable addition to the real-world natural language processing (NLP) applications that need to access semantics, such as machine translation, opinion mining, question answering and many others. In this article we focus on one type of figurative language, logical metonymy, and present a computational model of its interpretation bringing together statistical techniques and the insights from linguistic theory. Compared to previous approaches this model is both more informative and more accurate. The system produces sense-level interpretations of metonymic phrases and then automatically organizes them into conceptual classes, or roles, discussed in the majority of linguistic literature on the phenomenon.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126613856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Word sense and semantic relations in noun compounds","authors":"Su Nam Kim, Timothy Baldwin","doi":"10.1145/2483969.2483971","DOIUrl":"https://doi.org/10.1145/2483969.2483971","url":null,"abstract":"In this article, we investigate word sense distributions in noun compounds (NCs). Our primary goal is to disambiguate the word sense of component words in NCs, based on investigation of “semantic collocation” between them. We use sense collocation and lexical substitution to build supervised and unsupervised word sense disambiguation (WSD) classifiers, and show our unsupervised learner to be superior to a benchmark WSD system. Further, we develop a word sense-based approach to interpreting the semantic relations in NCs.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124958810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning to detect english and hungarian light verb constructions","authors":"V. Vincze, T. IstvánNagy, János Zsibrita","doi":"10.1145/2483691.2483695","DOIUrl":"https://doi.org/10.1145/2483691.2483695","url":null,"abstract":"Light verb constructions consist of a verbal and a nominal component, where the noun preserves its original meaning while the verb has lost it (to some degree). They are syntactically flexible and their meaning can only be partially computed on the basis of the meaning of their parts, thus they require special treatment in natural language processing. For this purpose, the first step is to identify light verb constructions.\u0000 In this study, we present our conditional random fields-based tool—called FXTagger—for identifying light verb constructions. The flexibility of the tool is demonstrated on two, typologically different, languages, namely, English and Hungarian. As earlier studies labeled different linguistic phenomena as light verb constructions, we first present a linguistics-based classification of light verb constructions and then show that FXTagger is able to identify different classes of light verb constructions in both languages.\u0000 Different types of texts may contain different types of light verb constructions; moreover, the frequency of light verb constructions may differ from domain to domain. Hence we focus on the portability of models trained on different corpora, and we also investigate the effect of simple domain adaptation techniques to reduce the gap between the domains. Our results show that in spite of domain specificities, out-domain data can also contribute to the successful LVC detection in all domains.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126406016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modeling the internal variability of multiword expressions through a pattern-based method","authors":"M. Nissim, Andrea Zaninello","doi":"10.1145/2483691.2483696","DOIUrl":"https://doi.org/10.1145/2483691.2483696","url":null,"abstract":"The issue of internal variability of multiword expressions (MWEs) is crucial towards their identification and extraction in running text. We present a corpus-supported and computational study on Italian MWEs, aimed at defining an automatic method for modeling internal variation, exploiting frequency and part-of-speech (POS) information. We do so by deriving an XML-encoded lexicon of MWEs based on a manually compiled dictionary, which is then projected onto a a large corpus. Since a search for fixed forms suffers from low recall, while an unconstrained flexible search for lemmas yields a loss in precision, we suggest a procedure aimed at maximizing precision in the identification of MWEs within a flexible search. Our method builds on the idea that internal variability can be modelled via the novel introduction of variation patterns, which work over POS patterns, and can be used as working tools for controlling precision. We also compare the performance of variation patterns to that of association measures, and explore the possibility of using variation patterns in MWE extraction in addition to identification. Finally, we suggest that corpus-derived, pattern-related information can be included in the original MWE lexicon by means of an enriched coding and the creation of an XML-based repository of patterns.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126692126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Lexical semantic factors in the acceptability of english support-verb-nominalization constructions","authors":"Anthony R. Davis, Leslie Barrett","doi":"10.1145/2483691.2483694","DOIUrl":"https://doi.org/10.1145/2483691.2483694","url":null,"abstract":"We explore the properties of support-verb and nominalization (SVN) pairs in English, a type of multiword expression in which a semantically impoverished verb combines with a complement nominalization sharing an unexpressed role with the verb. This study follows others in seeking syntactic or lexical semantic factors correlated with the acceptability of these constructions. In particular, following recent work showing certain semantic verb class features to improve SVN classification [Tu and Roth 2011], we explore the possibility that support verbs and the verbal roots of nominalizations in acceptable SVN pairs are clustered according to the classes of Levin [1993]. We compare the compatibility correlation of these results with those of the Aktionsart-class-based proposal of Barrett and Davis [2002]. We find the evidence that Levin classes are a factor in the acceptability of SVN constructions to be equivocal, and conclude with a discussion of the reasons for this finding.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127223613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Carlos Ramisch, Aline Villavicencio, Valia Kordoni
{"title":"Introduction to the special issue on multiword expressions: From theory to practice and use","authors":"Carlos Ramisch, Aline Villavicencio, Valia Kordoni","doi":"10.1145/2483691.2483692","DOIUrl":"https://doi.org/10.1145/2483691.2483692","url":null,"abstract":"We are in 2013, and multiword expressions have been around for a while in the computational linguistics research community. Since the first ACL workshop on MWEs 12 years ago in Sapporo, Japan, much has been discussed, proposed, experimented, evaluated and argued about MWEs. And yet, they deserve the publication of a whole special issue of the ACM Transactions on Speech and Language Processing. But what is it about multiword expressions that keeps them in fashion? Who are the people and the institutions who perform and publish groundbreaking fundamental and applied research in this field? What is the place and the relevance of our lively research community in the bigger picture of computational linguistics? Where do we come from as a community, and most importantly, where are we heading? In this introductory article, we share our point of view about the answers to these questions and introduce the articles that compose the current special issue.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"398 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132179144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards content-level coherence with aspect-guided summarization","authors":"Renxian Zhang, Wenjie Li, D. Gao","doi":"10.1145/2442076.2442078","DOIUrl":"https://doi.org/10.1145/2442076.2442078","url":null,"abstract":"The TAC 2010 summarization track initiated a new task—aspect-guided summarization—that centers on textual aspects embodied as particular kinds of information of a text. We observe that aspect-guided summaries not only address highly specific user need, but also facilitate content-level coherence by using aspect information. In this article, we present a full-fledged approach to aspect-guided summarization with a focus on summary coherence. Our summarization approach depends on two prerequisite subtasks: recognizing aspect-bearing sentences in order to do sentence extraction, and modeling aspect-based coherence with an HMM model in order to predict a coherent sentence ordering. Using the manually annotated TAC 2010 and 2010 datasets, we validated the effectiveness of our proposed methods for those subtasks. Drawing on the empirical results, we proceed to develop an aspect-guided summarizer based on a simple but robust base summarizer. With sentence selection guided by aspect information, our system is one of the best on TAC 2011. With sentence ordering predicted by the aspect-based HMM model, the summaries achieve good coherence.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130076885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Managing information disparity in multilingual document collections","authors":"Kevin Duh, C. Yeung, Tomoharu Iwata, M. Nagata","doi":"10.1145/2442076.2442077","DOIUrl":"https://doi.org/10.1145/2442076.2442077","url":null,"abstract":"Information disparity is a major challenge with multilingual document collections. When documents are dynamically updated in a distributed fashion, information content among different language editions may gradually diverge. We propose a framework for assisting human editors to manage this information disparity, using tools from machine translation and machine learning. Given source and target documents in two different languages, our system automatically identifies information nuggets that are new with respect to the target and suggests positions to place their translations. We perform both real-world experiments and large-scale simulations on Wikipedia documents and conclude our system is effective in a variety of scenarios.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125477381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Syllable Specific Unit Selection Cost Functions for Text-to-Speech Synthesis","authors":"N. Narendra, K. S. Rao","doi":"10.1145/2382434.2382435","DOIUrl":"https://doi.org/10.1145/2382434.2382435","url":null,"abstract":"This paper presents the design and development of syllable specific unit selection cost functions for improving the quality of text-to-speech synthesis. Appropriate unit selection cost functions, namely concatenation cost and target cost, are proposed for syllable based synthesis. Concatenation costs are defined based on the type of segments present at the syllable joins. Proposed concatenation costs have shown significant reduction in perceptual discontinuity at syllable joins. Three-stage target cost formulation is proposed for selecting appropriate units from database. Subjective evaluation has shown improvement in the quality of speech at each stage.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125130805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Vassiliki Rentoumi, G. Vouros, V. Karkaletsis, Amalia Moser
{"title":"Investigating Metaphorical Language in Sentiment Analysis: A Sense-to-Sentiment Perspective","authors":"Vassiliki Rentoumi, G. Vouros, V. Karkaletsis, Amalia Moser","doi":"10.1145/2382434.2382436","DOIUrl":"https://doi.org/10.1145/2382434.2382436","url":null,"abstract":"Intuition dictates that figurative language and especially metaphorical expressions should convey sentiment. It is the aim of this work to validate this intuition by showing that figurative language (metaphors) appearing in a sentence drive the polarity of that sentence. Towards this target, the current article proposes an approach for sentiment analysis of sentences where figurative language plays a dominant role. This approach applies Word Sense Disambiguation aiming to assign polarity to word senses rather than tokens. Sentence polarity is determined using the individual polarities for metaphorical senses as well as other contextual information. Experimental evaluation shows that the proposed method achieves high scores in comparison with other state-of-the-art approaches tested on the same corpora. Finally, experimental results provide supportive evidence that this method is also well suited for corpora consisting of literal and figurative language sentences.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"490 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133318816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}