How to Leverage a Multi-layered Transformer Language Model for Text Clustering: an Ensemble Approach

Proceedings of the 30th ACM International Conference on Information & Knowledge Management Pub Date : 2021-10-26 DOI:10.1145/3459637.3482121

Mira Ait-Saada, François Role, M. Nadif

引用次数: 4

Abstract

Pre-trained Transformer-based word embeddings are now widely used in text mining where they are known to significantly improve supervised tasks such as text classification, named entity recognition and question answering. Since the Transformer models create several different embeddings for the same input, one at each layer of their architecture, various studies have already tried to identify those of these embeddings that most contribute to the success of the above-mentioned tasks. In contrast the same performance analysis has not yet been carried out in the unsupervised setting. In this paper we evaluate the effectiveness of Transformer models on the important task of text clustering. In particular, we present a clustering ensemble approach that harnesses all the network's layers. Numerical experiments carried out on real datasets with different Transformer models show the effectiveness of the proposed method compared to several baselines.

查看原文本刊更多论文

如何利用多层转换语言模型进行文本聚类:一种集成方法

基于预训练的transformer的词嵌入现在广泛应用于文本挖掘中，已知它们可以显着改善文本分类，命名实体识别和问答等监督任务。由于Transformer模型为相同的输入创建了几个不同的嵌入，在其体系结构的每一层都创建了一个嵌入，因此各种研究已经试图确定这些嵌入中最有助于上述任务成功的那些。相比之下，在无监督设置中尚未进行相同的性能分析。本文评估了Transformer模型在文本聚类这一重要任务上的有效性。特别地，我们提出了一种利用所有网络层的聚类集成方法。在不同Transformer模型的实际数据集上进行的数值实验表明，与几种基线相比，该方法是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 30th ACM International Conference on Information & Knowledge Management

自引率

0.00%

发文量