关联网络:集中分类关注架构的核视角与医学文本和图像的应用

Proceedings of the ACM Conference on Health, Inference, and Learning Pub Date : 2021-04-08 DOI:10.1145/3450439.3451856

D. Dov, Serge Assaad, Shijing Si, Rui Wang, Hongteng Xu, S. Kovalsky, Jonathan Bell, D. Range, Jonathan Cohen, Ricardo Henao, L. Carin

{"title":"关联网络:集中分类关注架构的核视角与医学文本和图像的应用","authors":"D. Dov, Serge Assaad, Shijing Si, Rui Wang, Hongteng Xu, S. Kovalsky, Jonathan Bell, D. Range, Jonathan Cohen, Ricardo Henao, L. Carin","doi":"10.1145/3450439.3451856","DOIUrl":null,"url":null,"abstract":"Set classification is the task of predicting a single label from a set comprising multiple instances. The examples we consider are pathology slides represented by sets of patches and medical text data represented by sets of word embeddings. State-of-the-art methods, such as the transformer network, typically use attention mechanisms to learn representations of set data, by modeling interactions between instances of the set. These methods, however, have complex heuristic architectures comprising multiple heads and layers. The complexity of attention architectures hampers their training when only a small number of labeled sets is available, as is often the case in medical applications. To address this problem, we present a kernel-based representation learning framework that links learning affinity kernels to learning representations from attention architectures. We show that learning a combination of the sum and the product of kernels is equivalent to learning representations from multi-head multi-layer attention architectures. From our framework, we devise a simplified attention architecture which we term affinitention (affinity-attention) nets. We demonstrate the application of affinitention nets to the classification of the Set-Cifar10 dataset, thyroid malignancy prediction from pathology slides, as well as patient text-message triage. We show that affinitention nets provide competitive results compared to heuristic attention architectures and outperform other competing methods.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":"557 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Affinitention nets: kernel perspective on attention architectures for set classification with applications to medical text and images\",\"authors\":\"D. Dov, Serge Assaad, Shijing Si, Rui Wang, Hongteng Xu, S. Kovalsky, Jonathan Bell, D. Range, Jonathan Cohen, Ricardo Henao, L. Carin\",\"doi\":\"10.1145/3450439.3451856\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Set classification is the task of predicting a single label from a set comprising multiple instances. The examples we consider are pathology slides represented by sets of patches and medical text data represented by sets of word embeddings. State-of-the-art methods, such as the transformer network, typically use attention mechanisms to learn representations of set data, by modeling interactions between instances of the set. These methods, however, have complex heuristic architectures comprising multiple heads and layers. The complexity of attention architectures hampers their training when only a small number of labeled sets is available, as is often the case in medical applications. To address this problem, we present a kernel-based representation learning framework that links learning affinity kernels to learning representations from attention architectures. We show that learning a combination of the sum and the product of kernels is equivalent to learning representations from multi-head multi-layer attention architectures. From our framework, we devise a simplified attention architecture which we term affinitention (affinity-attention) nets. We demonstrate the application of affinitention nets to the classification of the Set-Cifar10 dataset, thyroid malignancy prediction from pathology slides, as well as patient text-message triage. We show that affinitention nets provide competitive results compared to heuristic attention architectures and outperform other competing methods.\",\"PeriodicalId\":87342,\"journal\":{\"name\":\"Proceedings of the ACM Conference on Health, Inference, and Learning\",\"volume\":\"557 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the ACM Conference on Health, Inference, and Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3450439.3451856\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM Conference on Health, Inference, and Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3450439.3451856","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

集合分类是从包含多个实例的集合中预测单个标签的任务。我们考虑的例子是由一组补丁表示的病理切片和由一组词嵌入表示的医学文本数据。最先进的方法，如变压器网络，通常使用注意机制来学习集合数据的表示，通过建模集合实例之间的交互。然而，这些方法具有复杂的启发式架构，包括多个头部和层。当只有少量标记集可用时，注意力架构的复杂性阻碍了它们的训练，这在医疗应用中经常出现。为了解决这个问题，我们提出了一个基于核的表示学习框架，该框架将学习亲和核与来自注意架构的学习表示联系起来。我们表明，学习核的和和乘积的组合相当于学习来自多头多层注意力体系结构的表示。从我们的框架中，我们设计了一个简化的注意力架构，我们称之为亲和力(亲和力-注意力)网络。我们演示了亲和性网络在Set-Cifar10数据集分类、病理切片甲状腺恶性预测以及患者短信分类中的应用。我们表明，与启发式注意力架构相比，亲和性网络提供了竞争结果，并且优于其他竞争方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Affinitention nets: kernel perspective on attention architectures for set classification with applications to medical text and images

Set classification is the task of predicting a single label from a set comprising multiple instances. The examples we consider are pathology slides represented by sets of patches and medical text data represented by sets of word embeddings. State-of-the-art methods, such as the transformer network, typically use attention mechanisms to learn representations of set data, by modeling interactions between instances of the set. These methods, however, have complex heuristic architectures comprising multiple heads and layers. The complexity of attention architectures hampers their training when only a small number of labeled sets is available, as is often the case in medical applications. To address this problem, we present a kernel-based representation learning framework that links learning affinity kernels to learning representations from attention architectures. We show that learning a combination of the sum and the product of kernels is equivalent to learning representations from multi-head multi-layer attention architectures. From our framework, we devise a simplified attention architecture which we term affinitention (affinity-attention) nets. We demonstrate the application of affinitention nets to the classification of the Set-Cifar10 dataset, thyroid malignancy prediction from pathology slides, as well as patient text-message triage. We show that affinitention nets provide competitive results compared to heuristic attention architectures and outperform other competing methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the ACM Conference on Health, Inference, and Learning

自引率

0.00%

发文量