First Workshop on Insights from Negative Results in NLP: Latest Publications

Do Transformers Dream of Inference, or Can Pretrained Generative Models Learn Implicit Inferential Rules?
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-11-01 | DOI: 10.18653/v1/2020.insights-1.12
Zhengzhong Liang, M. Surdeanu
Abstract: Large pretrained language models (LMs) have been used successfully for multi-hop question answering. However, most of these approaches are not interpretable, as they do not explicitly produce the inference hops necessary to explain a candidate answer. In this work, we investigate the capability of a state-of-the-art transformer LM to generate explicit inference hops, i.e., to infer a new statement necessary to answer a question given some premise input statements. Our analysis shows that such LMs can generate new statements for some simple inference types, but performance remains poor for complex, real-world inference types such as those that require monotonicity, composition, and commonsense knowledge.
Citations: 1
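The task the abstract describes is to feed premise statements to a generative LM and have it produce the intermediate statement (the inference hop) needed to answer a question. Below is a minimal sketch of that setup, using off-the-shelf GPT-2 as a stand-in; the premises, prompt format, and decoding settings are illustrative assumptions, not the paper's exact configuration.

```python
# Minimal sketch: give a generative LM premise statements and ask it to produce
# the intermediate ("hop") statement needed to answer a question.
# GPT-2 is used off the shelf as a stand-in; the paper's model, prompts, and
# any fine-tuning are not reproduced here.
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

premises = "A metal spoon is a thermal conductor. Thermal conductors transfer heat."
prompt = f"{premises} Therefore,"

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=20,
    do_sample=False,                      # greedy decoding keeps the sketch deterministic
    pad_token_id=tokenizer.eos_token_id,  # silence the missing-pad-token warning
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```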
Domain adaptation challenges of BERT in tokenization and sub-word representations of Out-of-Vocabulary words
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-11-01 | DOI: 10.18653/v1/2020.insights-1.1
Anmol Nayak, Hariprasad Timmapathini, Karthikeyan Ponnalagu, Vijendran Gopalan Venkoparao
Abstract: The BERT model (Devlin et al., 2019) has achieved significant progress in several Natural Language Processing (NLP) tasks by leveraging the multi-head self-attention mechanism (Vaswani et al., 2017) in its architecture. However, it still faces several research challenges that are not tackled well for the domain-specific corpora found in industry. In this paper, we highlight these problems through detailed experiments involving analysis of the attention scores and dynamic word embeddings with the BERT-Base-Uncased model. Our experiments have led to interesting findings: 1) the largest in-vocabulary substring from the left is always chosen at every sub-word unit, which can lead to suboptimal tokenization choices; 2) the semantic meaning of a vocabulary word deteriorates when it is found as a substring inside an Out-Of-Vocabulary (OOV) word; and 3) minor misspellings in words are handled inadequately. We believe that tackling these challenges would significantly help the domain adaptation of BERT.
Citations: 22
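Finding (1) describes WordPiece's greedy longest-prefix-first splitting, which is easy to observe directly with the released tokenizer. The sketch below is a hedged illustration: the example words are invented and the exact splits depend on the bert-base-uncased vocabulary, but it shows how an OOV or misspelled word gets fragmented.

```python
# Illustrate BERT's greedy longest-match-first (WordPiece) sub-word splitting.
# At each step the longest vocabulary prefix is chosen, so domain-specific or
# misspelled words can be fragmented into pieces that lose their original meaning.
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# An everyday word, a domain-specific term, and a misspelling of that term.
for word in ["processing", "hydrolysis", "hydrolyssis"]:
    print(f"{word:12s} -> {tokenizer.tokenize(word)}")
```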
Label Propagation-Based Semi-Supervised Learning for Hate Speech Classification
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-11-01 | DOI: 10.18653/v1/2020.insights-1.8
Ashwin Geet D'Sa, I. Illina, D. Fohr, D. Klakow, Dana Ruiter
Abstract: Research on hate speech classification has received increased attention. In real-life scenarios, only a small amount of labeled hate speech data is available for training a reliable classifier. Semi-supervised learning takes advantage of a small amount of labeled data together with a large amount of unlabeled data. In this paper, label propagation-based semi-supervised learning is explored for the task of hate speech classification. The quality of the labels assigned to the unlabeled set depends on the input representations. In this work, we show that pre-trained representations are label-agnostic and, when used with label propagation, yield poor results. Neural network-based fine-tuning can be adopted to learn task-specific representations using a small amount of labeled data. We show that fully fine-tuned representations may not always be the best representations for label propagation, and that intermediate representations may perform better in a semi-supervised setup.
Citations: 6
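As a concrete picture of the pipeline the abstract describes, the sketch below runs scikit-learn's LabelPropagation over fixed sentence representations, with -1 marking unlabeled examples. The random feature vectors are placeholders for the pre-trained, intermediate, or fully fine-tuned embeddings compared in the paper.

```python
# Sketch of label propagation over fixed sentence representations.
# The random vectors stand in for pre-trained / fine-tuned embeddings; in the
# paper these come from a neural encoder, and the labels are hate / non-hate.
import numpy as np
from sklearn.semi_supervised import LabelPropagation

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 32))         # 100 "sentence embeddings", 32 dimensions
y = np.full(100, -1)                   # -1 marks unlabeled examples
y[:10] = rng.integers(0, 2, size=10)   # a small labeled seed set

lp = LabelPropagation(kernel="rbf", gamma=0.5)
lp.fit(X, y)
pseudo_labels = lp.transduction_       # propagated labels for every example
print(pseudo_labels[:20])
```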
Can Knowledge Graph Embeddings Tell Us What Fact-checked Claims Are About?
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-11-01 | DOI: 10.18653/v1/2020.insights-1.11
Valentina Beretta, S. Harispe, K. Boland, Luke Lo Seen, Konstantin Todorov, Andon Tchechmedjiev
Abstract: The web offers a wealth of discourse data that helps researchers from various fields analyze debates about current societal issues and gauge the effects on society of important phenomena such as the spread of misinformation. Such analyses often revolve around claims made by people about a given topic of interest. Fact-checking portals offer partially structured information that can assist such analysis. However, exploiting the network structure of such online discourse data is as yet under-explored. We study the effectiveness of neural graph embedding features for claim topic prediction and their complementarity with text embeddings. We show that graph embeddings are modestly complementary with text embeddings, but the low performance of graph embedding features alone indicates that the model fails to capture the topological features pertinent to the topic prediction task.
Citations: 1
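The experimental design in the abstract boils down to comparing a topic classifier trained on text features, on graph-embedding features, and on their concatenation. The sketch below shows that comparison in miniature; the random arrays stand in for real sentence and knowledge-graph embeddings, and the classifier choice is an assumption rather than the paper's setup.

```python
# Sketch of the feature-combination comparison: text embeddings alone, graph
# embeddings alone, and their concatenation, each feeding a topic classifier.
# All arrays are random placeholders for the paper's actual embeddings.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
n_claims = 200
text_emb = rng.normal(size=(n_claims, 64))    # e.g. sentence-encoder features
graph_emb = rng.normal(size=(n_claims, 32))   # e.g. knowledge-graph entity features
topics = rng.integers(0, 5, size=n_claims)    # claim topic labels

for name, feats in [("text only", text_emb),
                    ("graph only", graph_emb),
                    ("text + graph", np.hstack([text_emb, graph_emb]))]:
    score = cross_val_score(LogisticRegression(max_iter=1000), feats, topics, cv=5).mean()
    print(f"{name}: mean accuracy {score:.3f}")
```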
How Effectively Can Machines Defend Against Machine-Generated Fake News? An Empirical Study
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-11-01 | DOI: 10.18653/v1/2020.insights-1.7
Meghana Moorthy Bhat, S. Parthasarathy
Abstract: We empirically study the effectiveness of machine-generated fake news detectors by understanding the model's sensitivity to different synthetic perturbations during test time. Current machine-generated fake news detectors rely on provenance to determine the veracity of news. Our experiments find that the success of these detectors can be limited, since they are rarely sensitive to semantic perturbations and are very sensitive to syntactic perturbations. We would also like to open-source our code, which we believe could be a useful diagnostic tool for evaluating models aimed at fighting machine-generated fake news.
Citations: 12
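To make the two perturbation families concrete: a syntactic perturbation alters surface form while keeping the claim's content words, whereas a semantic perturbation alters what the claim asserts. The toy functions below are invented illustrations only, far simpler than the perturbations actually used in the paper.

```python
# Toy illustration of the two perturbation families: a syntactic change that
# scrambles surface form, and a semantic change (negation) that alters what the
# claim asserts. The paper's actual perturbation functions are more elaborate.
import random

def syntactic_perturb(sentence: str, seed: int = 0) -> str:
    """Shuffle word order: the content words survive, the syntax does not."""
    words = sentence.split()
    random.Random(seed).shuffle(words)
    return " ".join(words)

def semantic_perturb(sentence: str) -> str:
    """Crudely negate the claim by flipping its first 'is'."""
    return sentence.replace(" is ", " is not ", 1)

claim = "The new vaccine is effective against the virus"
print(syntactic_perturb(claim))
print(semantic_perturb(claim))
```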
An Analysis of Capsule Networks for Part of Speech Tagging in High- and Low-resource Scenarios
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-11-01 | DOI: 10.18653/v1/2020.insights-1.10
Andrew Zupon, Faiz Rafique, M. Surdeanu
Abstract: Neural networks are a common tool in NLP, but it is not always clear which architecture to use for a given task. Different tasks, different languages, and different training conditions can all affect how a neural network will perform. Capsule Networks (CapsNets) are a relatively new architecture in NLP. Due to their novelty, CapsNets are being used more and more in NLP tasks. However, their usefulness is still mostly untested. In this paper, we compare three neural network architectures—LSTM, CNN, and CapsNet—on a part of speech tagging task. We compare these architectures in both high- and low-resource training conditions and find that no architecture consistently performs the best. Our analysis shows that our CapsNet performs nearly as well as a more complex LSTM under certain training conditions, but not others, and that our CapsNet almost always outperforms our CNN. We also find that our CapsNet implementation shows faster prediction times than the LSTM for Scottish Gaelic but not for Spanish, highlighting the effect that the choice of languages can have on the models.
Citations: 2
Layout-Aware Text Representations Harm Clustering Documents by Type
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-11-01 | DOI: 10.18653/v1/2020.insights-1.9
Catherine Finegan-Dollak, Ashish Verma
Abstract: Clustering documents by type—grouping invoices with invoices and articles with articles—is a desirable first step for organizing large collections of document scans. Humans approaching this task use both the semantics of the text and the document layout to assist in grouping like documents. LayoutLM (Xu et al., 2019), a layout-aware transformer built on top of BERT with state-of-the-art performance on document-type classification, could reasonably be expected to outperform regular BERT (Devlin et al., 2018) for document-type clustering. However, we find experimentally that BERT significantly outperforms LayoutLM on this task (p < 0.001). We analyze clusters to show where layout awareness is an asset and where it is a liability.
Citations: 4
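The clustering setup in the abstract amounts to embedding each document and clustering the resulting vectors by type. The sketch below does this with BERT [CLS] vectors and k-means; the paper's comparison swaps LayoutLM features in for BERT, and the example documents, checkpoint, and clustering algorithm here are illustrative assumptions.

```python
# Sketch of the clustering setup: embed each scanned document's text with BERT
# and cluster the [CLS] vectors by document type. Documents and the clustering
# algorithm below are placeholders, not the paper's data or configuration.
import torch
from sklearn.cluster import KMeans
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

docs = [
    "Invoice No. 1234. Total due: $500. Payment terms: net 30 days.",
    "Quarterly report: revenue grew modestly while operating costs remained flat.",
]
with torch.no_grad():
    enc = tokenizer(docs, padding=True, truncation=True, return_tensors="pt")
    cls_vectors = model(**enc).last_hidden_state[:, 0].numpy()  # one [CLS] vector per document

clusters = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(cls_vectors)
print(clusters)
```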
Q. Can Knowledge Graphs be used to Answer Boolean Questions? A. It’s complicated!
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-11-01 | DOI: 10.18653/v1/2020.insights-1.2
Daria Dzendzik, Carl Vogel, Jennifer Foster
Abstract: In this paper we explore the problem of machine reading comprehension, focusing on the BoolQ dataset of Yes/No questions. We carry out an error analysis of a BERT-based machine reading comprehension model on this dataset, revealing issues such as unstable model behaviour and some noise within the dataset itself. We then experiment with two approaches for integrating information from knowledge graphs: (i) concatenating knowledge graph triples to text passages and (ii) encoding knowledge with a Graph Neural Network. Neither of these approaches shows a clear improvement, and we hypothesize that this may be due to a combination of inaccuracies in the knowledge graph, imprecision in entity linking, and the models’ inability to capture additional information from knowledge graphs.
Citations: 2
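Approach (i) in the abstract verbalises knowledge-graph triples and appends them to the passage before the question/passage pair is encoded for yes/no classification. A minimal sketch of that input construction follows; the question, passage, triples, and untrained classification head are all placeholder assumptions, not the paper's model or data.

```python
# Sketch of approach (i): verbalise knowledge-graph triples and concatenate them
# to the passage before encoding the question/passage pair for yes/no
# classification. Question, passage, and triples are invented; the classifier
# head is untrained here, so the logits are meaningless placeholders.
from transformers import BertForSequenceClassification, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

question = "is france a member of the european union"
passage = "France is a country in Western Europe with a long political history."
triples = [("France", "member of", "European Union")]

kg_text = " ".join(f"{s} {p} {o}." for s, p, o in triples)
inputs = tokenizer(question, passage + " " + kg_text, truncation=True, return_tensors="pt")
logits = model(**inputs).logits  # two class scores (yes/no), untrained in this sketch
print(logits)
```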
The Extraordinary Failure of Complement Coercion Crowdsourcing
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-10-12 | DOI: 10.18653/v1/2020.insights-1.17
Yanai Elazar, Victoria Basmov, Shauli Ravfogel, Yoav Goldberg, Reut Tsarfaty
Abstract: Crowdsourcing has eased and scaled up the collection of linguistic annotation in recent years. In this work, we follow known methodologies for collecting labeled data for the complement coercion phenomenon. These are constructions with an implied action, e.g., “I started a new book I bought last week”, where the implied action is reading. We aim to collect annotated data for this phenomenon by reducing it to either of two known tasks: Explicit Completion and Natural Language Inference. However, in both cases, crowdsourcing resulted in low agreement scores, even though we followed the same methodologies as in previous work. Why does the same process fail to yield high agreement scores? We specify our modeling schemes, highlight the differences from previous work, and provide some insights about the task and possible explanations for the failure. We conclude that specific phenomena require tailored solutions, not only in specialized algorithms but also in data collection methods.
Citations: 6
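One of the two reductions mentioned in the abstract recasts complement coercion as Natural Language Inference: the original sentence serves as the premise, and a hypothesis spells out a candidate implied action, to be judged as entailed or not. The sentences and candidate actions below are invented to show the shape of the data, not items from the paper's collection.

```python
# Sketch of the NLI reduction: the original sentence is the premise, and each
# hypothesis makes one candidate implied action explicit. Annotators (or an NLI
# model) would then decide whether the hypothesis follows from the premise.
premise = "I started a new book I bought last week."
candidate_actions = ["reading", "writing", "selling"]

for action in candidate_actions:
    hypothesis = f"I started {action} a new book I bought last week."
    print(f"premise:    {premise}")
    print(f"hypothesis: {hypothesis}")
    print("label:      ?\n")
```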
On Task-Level Dialogue Composition of Generative Transformer Model
First Workshop on Insights from Negative Results in NLP | Pub Date: 2020-10-09 | DOI: 10.18653/v1/2020.insights-1.6
Prasanna Parthasarathi, Arvind Neelakantan, Sharan Narang
Abstract: Task-oriented dialogue systems help users accomplish tasks such as booking a movie ticket and ordering food via conversation. Generative models parameterized by a deep neural network are widely used for next-turn response generation in such systems. It is natural for users of the system to want to accomplish multiple tasks within the same conversation, but the ability of generative models to compose multiple tasks is not well studied. In this work, we study how to improve the ability of Transformer generative models trained on human-human task-oriented dialogues to compose multiple tasks. To that end, we propose and explore two solutions: (1) creating synthetic multi-task dialogue data for training from human-human single-task dialogues, and (2) forcing the encoder representation to be invariant to single- and multi-task dialogues using an auxiliary loss. The results of our experiments highlight the difficulty that even a sophisticated variant of the Transformer model has in learning to compose multiple tasks from single-task dialogues.
Citations: 1
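Solution (1) in the abstract builds training data in which a single conversation spans two tasks by stitching together single-task dialogues. The sketch below shows one naive way to do that; the turn format, the joining turn, and the example conversations are invented placeholders rather than the paper's data pipeline.

```python
# Sketch of solution (1): build synthetic multi-task dialogues by stitching two
# human-human single-task dialogues together so the model sees task switches at
# training time. Turn format and the joining turn are invented placeholders.
from typing import List

def compose_dialogues(dialogue_a: List[str], dialogue_b: List[str]) -> List[str]:
    """Concatenate two single-task dialogues into one synthetic multi-task dialogue."""
    return dialogue_a + ["User: Also, one more thing."] + dialogue_b

movie_dialogue = [
    "User: Book two tickets for Dune tonight.",
    "System: Done, two tickets for the 8pm showing of Dune.",
]
food_dialogue = [
    "User: Order a margherita pizza to my home address.",
    "System: Your pizza is on its way.",
]

for turn in compose_dialogues(movie_dialogue, food_dialogue):
    print(turn)
```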