Nathaniel R. Robinson, Zachary Brown, Timothy Sitze, Nancy Fulda
{"title":"基于语言模型隐藏层的文本分类","authors":"Nathaniel R. Robinson, Zachary Brown, Timothy Sitze, Nancy Fulda","doi":"10.1109/SAMI50585.2021.9378669","DOIUrl":null,"url":null,"abstract":"Advancements in machine learning methods have yielded powerful natural language generation models. However, in general, these models have drawn concern for being both uninterpretable and uncontrollable. Model interpretability and control have become important topics of interest among researchers. We explore a variety of machine learning methods to classify the hidden states of language models. This classification enables model interpretation at a deep semantic level and is a necessary part of recently proposed model control methods. We show further that the use of language model hidden layers as text representations in classification tasks may be more reliable in some applications than more standard text representations.","PeriodicalId":402414,"journal":{"name":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Text Classifications Learned from Language Model Hidden Layers\",\"authors\":\"Nathaniel R. Robinson, Zachary Brown, Timothy Sitze, Nancy Fulda\",\"doi\":\"10.1109/SAMI50585.2021.9378669\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Advancements in machine learning methods have yielded powerful natural language generation models. However, in general, these models have drawn concern for being both uninterpretable and uncontrollable. Model interpretability and control have become important topics of interest among researchers. We explore a variety of machine learning methods to classify the hidden states of language models. This classification enables model interpretation at a deep semantic level and is a necessary part of recently proposed model control methods. We show further that the use of language model hidden layers as text representations in classification tasks may be more reliable in some applications than more standard text representations.\",\"PeriodicalId\":402414,\"journal\":{\"name\":\"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SAMI50585.2021.9378669\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SAMI50585.2021.9378669","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text Classifications Learned from Language Model Hidden Layers
Advancements in machine learning methods have yielded powerful natural language generation models. However, in general, these models have drawn concern for being both uninterpretable and uncontrollable. Model interpretability and control have become important topics of interest among researchers. We explore a variety of machine learning methods to classify the hidden states of language models. This classification enables model interpretation at a deep semantic level and is a necessary part of recently proposed model control methods. We show further that the use of language model hidden layers as text representations in classification tasks may be more reliable in some applications than more standard text representations.