Annual Meeting of the Association for Computational Linguistics: Latest Papers (Page 5)

Chain of Thought Prompting Elicits Knowledge Augmentation

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-07-04 DOI: 10.48550/arXiv.2307.01640

Di Wu, Jing Zhang, Xinmei Huang

Citations: 2

Mitigating the Learning Bias towards Repetition by Self-Contrastive Training for Open-Ended Generation

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-07-04 DOI: 10.48550/arXiv.2307.01542

Jian Guan, Minlie Huang

Citations: 0

Diverse Retrieval-Augmented In-Context Learning for Dialogue State Tracking

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-07-04 DOI: 10.48550/arXiv.2307.01453

Brendan King, Jeffrey Flanigan

Citations: 0

Transformed Protoform Reconstruction

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-07-04 DOI: 10.48550/arXiv.2307.01896

Young Min Kim, Kalvin Chang, Chenxuan Cui, David R. Mortensen

Citations: 1

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-07-03 DOI: 10.48550/arXiv.2307.00862

Rui Sun, Zhecan Wang, Haoxuan You, N. Codella, Kai-Wei Chang, Shih-Fu Chang

Abstract: Vision-language tasks such as VQA, SNLI-VE, and VCR are challenging because they require the model to reason about the semantics of both the visual world and natural language. Supervised methods for vision-language tasks have been well studied. However, solving these tasks in a zero-shot setting is less explored. Since Contrastive Language-Image Pre-training (CLIP) has shown remarkable zero-shot performance on image-text matching, previous works utilized its strong zero-shot ability by converting vision-language tasks into an image-text matching problem, and they mainly consider global-level matching (e.g., the whole image or sentence). However, we find that visual and textual fine-grained information, e.g., keywords in the sentence and objects in the image, can be fairly informative for semantic understanding. Inspired by this, we propose a unified framework that takes advantage of the fine-grained information for zero-shot vision-language learning, covering multiple tasks such as VQA, SNLI-VE, and VCR. Our experiments show that our framework outperforms former zero-shot methods on VQA and achieves substantial improvement on SNLI-VE and VCR. Furthermore, our ablation studies confirm the effectiveness and generalizability of our proposed method.
Code will be available at https://github.com/ThreeSR/UniFine

Citations: 0

SSP: Self-Supervised Post-training for Conversational Search

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-07-02 DOI: 10.48550/arXiv.2307.00569

Quan Tu, Shen Gao, Xiaolong Wu, Zhao Cao, Jiaxin Wen, Rui Yan

Abstract: Conversational search has been regarded as the next-generation search paradigm. Constrained by data scarcity, most existing methods distill a well-trained ad-hoc retriever into the conversational retriever. However, these methods, which usually initialize parameters by query reformulation to discover contextualized dependency, have trouble understanding the dialogue structure information and struggle with contextual semantic vanishing. In this paper, we propose Self-Supervised Post-training (SSP), a new post-training paradigm with three self-supervised tasks that efficiently initializes the conversational search model to enhance its understanding of dialogue structure and contextual semantics. Furthermore, SSP can be plugged into most existing conversational models to boost their performance. To verify the effectiveness of our proposed method, we apply the conversational encoder post-trained by SSP to the conversational search task using two benchmark datasets: CAsT-19 and CAsT-20. Extensive experiments show that our model can boost the performance of several existing conversational search methods.
Our source code is available at https://github.com/morecry/SSP.

Citations: 0

Revisiting Sample Size Determination in Natural Language Understanding

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-07-01 DOI: 10.48550/arXiv.2307.00374

Ernie Chang, Muhammad Hassan Rashid, Pin-Jie Lin, Changsheng Zhao, Vera Demberg, Yangyang Shi, Vikas Chandra

Citations: 0

A New Task and Dataset on Detecting Attacks on Human Rights Defenders

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-06-30 DOI: 10.48550/arXiv.2306.17695

Shihao Ran, Di Lu, Joel Tetreault, A. Cahill, A. Jaimes

Citations: 0

Should you marginalize over possible tokenizations?

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-06-30 DOI: 10.48550/arXiv.2306.17757

N. Chirkova, Germán Kruszewski, Jos Rozen, Marc Dymetman

Citations: 1

Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation

Annual Meeting of the Association for Computational Linguistics Pub Date : 2023-06-29 DOI: 10.48550/arXiv.2306.16650

Liqiang Jing, Xuemeng Song, Kun Ouyang, Mengzhao Jia, Liqiang Nie

Abstract: Multimodal Sarcasm Explanation (MuSE) is a new yet challenging task, which aims to generate a natural language sentence for a multimodal social post (an image as well as its caption) to explain why it contains sarcasm. Although the existing pioneer study has achieved great success with the BART backbone, it overlooks the gap between the visual feature space and the decoder semantic space, the object-level metadata of the image, and potential external knowledge. To address these limitations, in this work, we propose a novel mulTi-source sEmantic grAph-based Multimodal sarcasm explanation scheme, named TEAM. In particular, TEAM extracts object-level semantic meta-data from the input image instead of the traditional global visual features. Meanwhile, TEAM resorts to ConceptNet to obtain external related knowledge concepts for the input text and the extracted object meta-data. Thereafter, TEAM introduces a multi-source semantic graph that comprehensively characterizes the multi-source (i.e., caption, object meta-data, external knowledge) semantic relations to facilitate sarcasm reasoning.
Extensive experiments on the publicly released dataset MORE verify the superiority of our model over cutting-edge methods.

Citations: 0