Transactions of the Association for Computational Linguistics: Latest Publications

Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection
IF 10.9, CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-18. DOI: 10.1162/tacl_a_00563
Weijia Xu, Sweta Agrawal, Eleftheria Briakou, Marianna J. Martindale, Marine Carpuat
{"title":"Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection","authors":"Weijia Xu, Sweta Agrawal, Eleftheria Briakou, Marianna J. Martindale, Marine Carpuat","doi":"10.1162/tacl_a_00563","DOIUrl":"https://doi.org/10.1162/tacl_a_00563","url":null,"abstract":"Neural sequence generation models are known to “hallucinate”, by producing outputs that are unrelated to the source text. These hallucinations are potentially harmful, yet it remains unclear in what conditions they arise and how to mitigate their impact. In this work, we first identify internal model symptoms of hallucinations by analyzing the relative token contributions to the generation in contrastive hallucinated vs. non-hallucinated outputs generated via source perturbations. We then show that these symptoms are reliable indicators of natural hallucinations, by using them to design a lightweight hallucination detector which outperforms both model-free baselines and strong classifiers based on quality estimation or large pre-trained models on manually annotated English-Chinese and German-English translation test beds.","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"11 1","pages":"546-564"},"PeriodicalIF":10.9,"publicationDate":"2023-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48824724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 13
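The detector described above checks how much of the generation is attributable to the source text. Below is a minimal Python sketch of that idea, assuming per-token attribution scores (from attention, gradients, or any other attribution method) are already available; the function names, the 0.4 threshold, and the toy data are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def source_contribution_ratio(attributions: np.ndarray, src_len: int) -> float:
    """Fraction of total attribution mass assigned to source tokens.

    `attributions` has shape (output_len, src_len + prefix_len): row t holds
    the attribution of each input token (source tokens plus previously
    generated tokens) to output token t. How these scores are obtained is
    left to the chosen attribution method.
    """
    total = np.abs(attributions).sum()
    src_mass = np.abs(attributions[:, :src_len]).sum()
    return float(src_mass / max(total, 1e-12))

def flag_hallucination(attributions: np.ndarray, src_len: int,
                       threshold: float = 0.4) -> bool:
    """Flag an output whose generation relied too little on the source.

    The threshold is a placeholder; in practice it would be tuned on
    contrastive hallucinated vs. non-hallucinated examples.
    """
    return source_contribution_ratio(attributions, src_len) < threshold

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy example: 6 source tokens, 4 target-prefix tokens, 5 output tokens.
    normal = rng.random((5, 10))
    detached = normal.copy()
    detached[:, :6] *= 0.05          # simulate an output that ignores the source
    print(flag_hallucination(normal, src_len=6))    # likely False
    print(flag_hallucination(detached, src_len=6))  # likely True
```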
Tracking Brand-Associated Polarity-Bearing Topics in User Reviews
IF 10.9, CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-03. DOI: 10.1162/tacl_a_00555
Runcong Zhao, Lin Gui, Hanqi Yan, Yulan He
{"title":"Tracking Brand-Associated Polarity-Bearing Topics in User Reviews","authors":"Runcong Zhao, Lin Gui, Hanqi Yan, Yulan He","doi":"10.1162/tacl_a_00555","DOIUrl":"https://doi.org/10.1162/tacl_a_00555","url":null,"abstract":"Monitoring online customer reviews is important for business organizations to measure customer satisfaction and better manage their reputations. In this paper, we propose a novel dynamic Brand-Topic Model (dBTM) which is able to automatically detect and track brand-associated sentiment scores and polarity-bearing topics from product reviews organized in temporally ordered time intervals. dBTM models the evolution of the latent brand polarity scores and the topic-word distributions over time by Gaussian state space models. It also incorporates a meta learning strategy to control the update of the topic-word distribution in each time interval in order to ensure smooth topic transitions and better brand score predictions. It has been evaluated on a dataset constructed from MakeupAlley reviews and a hotel review dataset. Experimental results show that dBTM outperforms a number of competitive baselines in brand ranking, achieving a good balance of topic coherence and uniqueness, and extracting well-separated polarity-bearing topics across time intervals.1","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"11 1","pages":"404-418"},"PeriodicalIF":10.9,"publicationDate":"2023-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47444673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
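dBTM evolves latent brand scores with Gaussian state space models. The sketch below shows only that state-space ingredient, as a one-dimensional random-walk Kalman filter over per-interval sentiment estimates; it is a simplified stand-in with assumed noise parameters, not dBTM's actual inference procedure.

```python
def random_walk_filter(observations, process_var=0.05, obs_var=0.2):
    """One-dimensional Kalman filter for a Gaussian random-walk latent score.

    `observations` is a sequence of noisy per-interval brand sentiment
    estimates; the filter smooths them into a latent trajectory, which is
    the basic state-space idea behind evolving a brand score over time.
    """
    mean, var = 0.0, 1.0                        # prior over the latent score
    smoothed = []
    for y in observations:
        var += process_var                      # predict: the score drifts
        gain = var / (var + obs_var)            # update: weigh the new evidence
        mean = mean + gain * (y - mean)
        var = (1.0 - gain) * var
        smoothed.append(mean)
    return smoothed

if __name__ == "__main__":
    noisy = [0.2, 0.4, 0.35, 0.8, 0.7, 0.75]    # toy per-interval sentiment scores
    print([round(s, 3) for s in random_walk_filter(noisy)])
```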
T2-NER: A Two-Stage Span-Based Framework for Unified Named Entity Recognition with Templates
CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-01. DOI: 10.1162/tacl_a_00602
Peixin Huang, Xiang Zhao, Minghao Hu, Zhen Tan, Weidong Xiao
{"title":"<i>T</i> 2 <i>-NER</i>: A <u>T</u>wo-Stage Span-Based Framework for Unified Named Entity Recognition with <u>T</u>emplates","authors":"Peixin Huang, Xiang Zhao, Minghao Hu, Zhen Tan, Weidong Xiao","doi":"10.1162/tacl_a_00602","DOIUrl":"https://doi.org/10.1162/tacl_a_00602","url":null,"abstract":"Abstract Named Entity Recognition (NER) has so far evolved from the traditional flat NER to overlapped and discontinuous NER. They have mostly been solved separately, with only several exceptions that concurrently tackle three tasks with a single model. The current best-performing method formalizes the unified NER as word-word relation classification, which barely focuses on mention content learning and fails to detect entity mentions comprising a single word. In this paper, we propose a two-stage span-based framework with templates, namely, T2-NER, to resolve the unified NER task. The first stage is to extract entity spans, where flat and overlapped entities can be recognized. The second stage is to classify over all entity span pairs, where discontinuous entities can be recognized. Finally, multi-task learning is used to jointly train two stages. To improve the efficiency of span-based model, we design grouped templates and typed templates for two stages to realize batch computations. We also apply an adjacent packing strategy and a latter packing strategy to model discriminative boundary information and learn better span (pair) representation. Moreover, we introduce the syntax information to enhance our span representation. We perform extensive experiments on eight benchmark datasets for flat, overlapped, and discontinuous NER, where our model beats all the current competitive baselines, obtaining the best performance of unified NER.","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135057751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
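The two-stage recipe (span extraction, then span-pair classification for discontinuous mentions) can be illustrated with a small skeleton. The snippet below uses toy rule-based stand-ins for the learned template-based classifiers, so it shows only the control flow of the framework, not the model itself.

```python
from itertools import combinations
from typing import Callable, List, Tuple

Span = Tuple[int, int]  # inclusive token offsets

def extract_spans(tokens: List[str],
                  is_entity_span: Callable[[List[str]], bool],
                  max_len: int = 4) -> List[Span]:
    """Stage 1: enumerate candidate spans and keep those scored as entity
    spans. Flat and overlapped mentions fall out naturally, since two
    overlapping spans can both be kept."""
    spans = []
    for i in range(len(tokens)):
        for j in range(i, min(i + max_len, len(tokens))):
            if is_entity_span(tokens[i:j + 1]):
                spans.append((i, j))
    return spans

def link_spans(spans: List[Span],
               same_entity: Callable[[Span, Span], bool]) -> List[Tuple[Span, Span]]:
    """Stage 2: classify every span pair; linked pairs form discontinuous mentions."""
    return [(a, b) for a, b in combinations(spans, 2) if same_entity(a, b)]

if __name__ == "__main__":
    toks = "severe pain in left shoulder".split()
    # Toy "classifiers" standing in for the learned template-based models.
    is_span = lambda ts: ts[-1] in {"pain", "shoulder"}
    same = lambda a, b: True   # pretend every kept pair belongs to one mention
    spans = extract_spans(toks, is_span)
    print(spans)
    print(link_spans(spans, same))
```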
PASTA: A Dataset for Modeling PArticipant STAtes in Narratives
CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-01. DOI: 10.1162/tacl_a_00600
Sayontan Ghosh, Mahnaz Koupaee, Isabella Chen, Francis Ferraro, Nathanael Chambers, Niranjan Balasubramanian
{"title":"<tt>PASTA</tt>: A Dataset for Modeling PArticipant STAtes in Narratives","authors":"Sayontan Ghosh, Mahnaz Koupaee, Isabella Chen, Francis Ferraro, Nathanael Chambers, Niranjan Balasubramanian","doi":"10.1162/tacl_a_00600","DOIUrl":"https://doi.org/10.1162/tacl_a_00600","url":null,"abstract":"Abstract The events in a narrative are understood as a coherent whole via the underlying states of their participants. Often, these participant states are not explicitly mentioned, instead left to be inferred by the reader. A model that understands narratives should likewise infer these implicit states, and even reason about the impact of changes to these states on the narrative. To facilitate this goal, we introduce a new crowdsourced English-language, Participant States dataset, PASTA. This dataset contains inferable participant states; a counterfactual perturbation to each state; and the changes to the story that would be necessary if the counterfactual were true. We introduce three state-based reasoning tasks that test for the ability to infer when a state is entailed by a story, to revise a story conditioned on a counterfactual state, and to explain the most likely state change given a revised story. Experiments show that today’s LLMs can reason about states to some degree, but there is large room for improvement, especially in problems requiring access and ability to reason with diverse types of knowledge (e.g., physical, numerical, factual).1","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135560520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
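A hypothetical data structure for one PASTA instance and the three task framings is sketched below; the field names and the toy example are illustrative and do not reflect the released dataset's actual schema.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class PastaExample:
    """One annotated instance: a story, an inferable participant state, a
    counterfactual perturbation of that state, and the story revisions the
    counterfactual would require. Field names are illustrative only."""
    story: List[str]               # story sentences
    inferred_state: str            # state entailed but never stated
    counterfactual_state: str      # perturbed version of the state
    revised_story: List[str]       # sentences rewritten under the counterfactual

ex = PastaExample(
    story=["Maya lit the candles on the cake.", "Everyone sang to her."],
    inferred_state="It is Maya's birthday.",
    counterfactual_state="It is Maya's retirement party.",
    revised_story=["Maya lit the candles on the cake.",
                   "Everyone toasted her long career."],
)

# The three tasks, phrased as input -> output mappings:
# 1) state inference:   ex.story                            -> ex.inferred_state
# 2) story revision:    (ex.story, ex.counterfactual_state) -> ex.revised_story
# 3) state explanation: (ex.story, ex.revised_story)        -> ex.counterfactual_state
print(ex.inferred_state)
```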
Calibrated Interpretation: Confidence Estimation in Semantic Parsing
CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-01. DOI: 10.1162/tacl_a_00598
Elias Stengel-Eskin, Benjamin Van Durme
{"title":"Calibrated Interpretation: Confidence Estimation in Semantic Parsing","authors":"Elias Stengel-Eskin, Benjamin Van Durme","doi":"10.1162/tacl_a_00598","DOIUrl":"https://doi.org/10.1162/tacl_a_00598","url":null,"abstract":"Abstract Sequence generation models are increasingly being used to translate natural language into programs, i.e., to perform executable semantic parsing. The fact that semantic parsing aims to predict programs that can lead to executed actions in the real world motivates developing safe systems. This in turn makes measuring calibration—a central component to safety—particularly important. We investigate the calibration of popular generation models across four popular semantic parsing datasets, finding that it varies across models and datasets. We then analyze factors associated with calibration error and release new confidence-based challenge splits of two parsing datasets. To facilitate the inclusion of calibration in semantic parsing evaluations, we release a library for computing calibration metrics.1","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135911382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 6
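Calibration is commonly summarized with expected calibration error (ECE). The sketch below is a generic binned ECE over sequence-level confidences and correctness labels; it illustrates the metric, not necessarily the library released with the paper.

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins: int = 10) -> float:
    """Standard binned ECE: bin predictions by confidence and average the gap
    between mean confidence and accuracy per bin, weighted by bin size."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            gap = abs(confidences[mask].mean() - correct[mask].mean())
            ece += mask.mean() * gap             # weight by fraction of examples
    return float(ece)

if __name__ == "__main__":
    # Toy sequence-level confidences for predicted programs, plus whether
    # each predicted program was exactly correct.
    conf = [0.95, 0.9, 0.8, 0.6, 0.55, 0.3]
    ok = [1, 1, 0, 1, 0, 0]
    print(round(expected_calibration_error(conf, ok, n_bins=5), 4))
```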
Improving Multitask Retrieval by Promoting Task Specialization
CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-01. DOI: 10.1162/tacl_a_00597
Wenzheng Zhang, Chenyan Xiong, Karl Stratos, Arnold Overwijk
{"title":"Improving Multitask Retrieval by Promoting Task Specialization","authors":"Wenzheng Zhang, Chenyan Xiong, Karl Stratos, Arnold Overwijk","doi":"10.1162/tacl_a_00597","DOIUrl":"https://doi.org/10.1162/tacl_a_00597","url":null,"abstract":"Abstract In multitask retrieval, a single retriever is trained to retrieve relevant contexts for multiple tasks. Despite its practical appeal, naive multitask retrieval lags behind task-specific retrieval, in which a separate retriever is trained for each task. We show that it is possible to train a multitask retriever that outperforms task-specific retrievers by promoting task specialization. The main ingredients are: (1) a better choice of pretrained model—one that is explicitly optimized for multitasking—along with compatible prompting, and (2) a novel adaptive learning method that encourages each parameter to specialize in a particular task. The resulting multitask retriever is highly performant on the KILT benchmark. Upon analysis, we find that the model indeed learns parameters that are more task-specialized compared to naive multitasking without prompting or adaptive learning.1","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135699919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
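One ingredient named in the abstract is task-compatible prompting on top of a dense retriever. The sketch below shows a task-prompted dual-encoder retrieval loop, with a toy hash-based encoder standing in for the pretrained multitask LM; the adaptive parameter-specialization method is not reproduced here.

```python
import numpy as np

def encode(text: str, dim: int = 64) -> np.ndarray:
    """Stand-in encoder: a toy hash-based embedding (stable within a run).
    In the actual setting this would be a pretrained multitask LM encoder."""
    rng = np.random.default_rng(abs(hash(text)) % (2 ** 32))
    v = rng.standard_normal(dim)
    return v / np.linalg.norm(v)

def retrieve(query: str, task: str, passages: list, k: int = 2) -> list:
    """Dense retrieval with a task-specific prompt prepended to the query,
    so one retriever can serve several tasks with task-aware behavior."""
    q = encode(f"[{task}] {query}")
    scores = [float(q @ encode(p)) for p in passages]
    order = np.argsort(scores)[::-1][:k]
    return [passages[i] for i in order]

if __name__ == "__main__":
    docs = ["Paris is the capital of France.",
            "The Eiffel Tower was completed in 1889.",
            "Slot filling maps entities to attribute values."]
    print(retrieve("capital of France", task="open-domain QA", passages=docs))
```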
Benchmarking the Generation of Fact Checking Explanations
CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-01. DOI: 10.1162/tacl_a_00601
Daniel Russo, Serra Sinem Tekiroğlu, Marco Guerini
{"title":"Benchmarking the Generation of Fact Checking Explanations","authors":"Daniel Russo, Serra Sinem Tekiroğlu, Marco Guerini","doi":"10.1162/tacl_a_00601","DOIUrl":"https://doi.org/10.1162/tacl_a_00601","url":null,"abstract":"Abstract Fighting misinformation is a challenging, yet crucial, task. Despite the growing number of experts being involved in manual fact-checking, this activity is time-consuming and cannot keep up with the ever-increasing amount of fake news produced daily. Hence, automating this process is necessary to help curb misinformation. Thus far, researchers have mainly focused on claim veracity classification. In this paper, instead, we address the generation of justifications (textual explanation of why a claim is classified as either true or false) and benchmark it with novel datasets and advanced baselines. In particular, we focus on summarization approaches over unstructured knowledge (i.e., news articles) and we experiment with several extractive and abstractive strategies. We employed two datasets with different styles and structures, in order to assess the generalizability of our findings. Results show that in justification production summarization benefits from the claim information, and, in particular, that a claim-driven extractive step improves abstractive summarization performances. Finally, we show that although cross-dataset experiments suffer from performance degradation, a unique model trained on a combination of the two datasets is able to retain style information in an efficient manner.","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135057750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
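The claim-driven extractive step can be illustrated with simple word-overlap ranking: select the article sentences most relevant to the claim, which would then be fed to an abstractive summarizer to produce the justification. The overlap scorer below is an assumed stand-in, not the paper's model.

```python
import re

def claim_driven_extract(claim: str, article: str, k: int = 3) -> list:
    """Rank article sentences by word overlap with the claim and keep the
    top k; the selection would then condition an abstractive summarizer."""
    tokenize = lambda s: set(re.findall(r"[a-z']+", s.lower()))
    claim_toks = tokenize(claim)
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", article) if s.strip()]
    ranked = sorted(sentences,
                    key=lambda s: len(tokenize(s) & claim_toks),
                    reverse=True)
    return ranked[:k]

if __name__ == "__main__":
    claim = "The city banned plastic bags in 2020."
    article = ("The council met on Tuesday. A ban on single-use plastic bags "
               "took effect in 2020. Officials cited litter reduction. "
               "The mayor also discussed transit funding.")
    print(claim_driven_extract(claim, article, k=2))
```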
Evaluating a Century of Progress on the Cognitive Science of Adjective Ordering
CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-01. DOI: 10.1162/tacl_a_00596
William Dyer, Charles Torres, Gregory Scontras, Richard Futrell
{"title":"Evaluating a Century of Progress on the Cognitive Science of Adjective Ordering","authors":"William Dyer, Charles Torres, Gregory Scontras, Richard Futrell","doi":"10.1162/tacl_a_00596","DOIUrl":"https://doi.org/10.1162/tacl_a_00596","url":null,"abstract":"Abstract The literature on adjective ordering abounds with proposals meant to account for why certain adjectives appear before others in multi-adjective strings (e.g., the small brown box). However, these proposals have been developed and tested primarily in isolation and based on English; few researchers have looked at the combined performance of multiple factors in the determination of adjective order, and few have evaluated predictors across multiple languages. The current work approaches both of these objectives by using technologies and datasets from natural language processing to look at the combined performance of existing proposals across 32 languages. Comparing this performance with both random and idealized baselines, we show that the literature on adjective ordering has made significant meaningful progress across its many decades, but there remains quite a gap yet to be explained.","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135596936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
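Evaluating an ordering predictor against attested orders reduces to checking whether its top-ranked permutation matches the observed string, and comparing that accuracy against a random baseline. The sketch below assumes a scalar per-adjective score (e.g., a subjectivity rating) as the predictor; the scores and data are toy values, not the paper's datasets.

```python
import random
from itertools import permutations

def order_accuracy(observed, score) -> float:
    """Fraction of attested multi-adjective strings whose order matches the
    predictor's top-ranked permutation. `score` maps an adjective to a scalar;
    higher-scoring adjectives are placed farther from the noun."""
    hits = 0
    for adjs in observed:
        best = min(permutations(adjs),
                   key=lambda perm: sum(i * score(a) for i, a in enumerate(perm)))
        hits += (list(best) == adjs)
    return hits / len(observed)

if __name__ == "__main__":
    random.seed(0)
    data = [["small", "brown"], ["big", "old", "red"]]        # attested orders
    subjectivity = {"small": 0.6, "brown": 0.2, "big": 0.7, "old": 0.5, "red": 0.3}
    rand_scores = {a: random.random() for row in data for a in row}
    print("subjectivity predictor:", order_accuracy(data, subjectivity.get))
    print("random baseline:       ", order_accuracy(data, rand_scores.get))
```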
Introduction to Mathematical Language Processing: Informal Proofs, Word Problems, and Supporting Tasks
CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-01. DOI: 10.1162/tacl_a_00594
Jordan Meadows, André Freitas
{"title":"Introduction to Mathematical Language Processing: Informal Proofs, Word Problems, and Supporting Tasks","authors":"Jordan Meadows, André Freitas","doi":"10.1162/tacl_a_00594","DOIUrl":"https://doi.org/10.1162/tacl_a_00594","url":null,"abstract":"Abstract Automating discovery in mathematics and science will require sophisticated methods of information extraction and abstract reasoning, including models that can convincingly process relationships between mathematical elements and natural language, to produce problem solutions of real-world value. We analyze mathematical language processing methods across five strategic sub-areas (identifier-definition extraction, formula retrieval, natural language premise selection, math word problem solving, and informal theorem proving) from recent years, highlighting prevailing methodologies, existing limitations, overarching trends, and promising avenues for future research.","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135596959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification
CAS Tier 1, Computer Science
Transactions of the Association for Computational Linguistics. Pub Date: 2023-01-01. DOI: 10.1162/tacl_a_00593
Inigo Jauregi Unanue, Gholamreza Haffari, Massimo Piccardi
{"title":"T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification","authors":"Inigo Jauregi Unanue, Gholamreza Haffari, Massimo Piccardi","doi":"10.1162/tacl_a_00593","DOIUrl":"https://doi.org/10.1162/tacl_a_00593","url":null,"abstract":"Abstract Cross-lingual text classification leverages text classifiers trained in a high-resource language to perform text classification in other languages with no or minimal fine-tuning (zero/ few-shots cross-lingual transfer). Nowadays, cross-lingual text classifiers are typically built on large-scale, multilingual language models (LMs) pretrained on a variety of languages of interest. However, the performance of these models varies significantly across languages and classification tasks, suggesting that the superposition of the language modelling and classification tasks is not always effective. For this reason, in this paper we propose revisiting the classic “translate-and-test” pipeline to neatly separate the translation and classification stages. The proposed approach couples 1) a neural machine translator translating from the targeted language to a high-resource language, with 2) a text classifier trained in the high-resource language, but the neural machine translator generates “soft” translations to permit end-to-end backpropagation during fine-tuning of the pipeline. Extensive experiments have been carried out over three cross-lingual text classification datasets (XNLI, MLDoc, and MultiEURLEX), with the results showing that the proposed approach has significantly improved performance over a competitive baseline.","PeriodicalId":33559,"journal":{"name":"Transactions of the Association for Computational Linguistics","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135596945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
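The key mechanism is the "soft" translation: rather than discretizing the translator's output, its per-step token distributions are mixed with the downstream classifier's embedding table, keeping the whole pipeline differentiable. A minimal numpy sketch of that mixing step follows; the shapes and data are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def soft_translation_embeddings(token_probs: np.ndarray,
                                embedding_table: np.ndarray) -> np.ndarray:
    """Turn the translator's per-step output distributions into "soft" inputs
    for the classifier: each step's embedding is the probability-weighted
    average of the classifier's token embeddings. Because this is a
    differentiable mixture rather than a hard argmax, gradients can flow from
    the classifier back into the translator during fine-tuning."""
    # token_probs: (target_len, vocab_size), rows sum to 1
    # embedding_table: (vocab_size, embed_dim)
    return token_probs @ embedding_table

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    vocab, dim, steps = 8, 4, 3
    logits = rng.standard_normal((steps, vocab))
    probs = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)  # softmax
    table = rng.standard_normal((vocab, dim))
    print(soft_translation_embeddings(probs, table).shape)  # (3, 4)
```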