AMULETY: A Python package to embed adaptive immune receptor sequences

Immunoinformatics (Amsterdam, Netherlands) Pub Date : 2026-03-01 Epub Date: 2026-02-03 DOI:10.1016/j.immuno.2026.100066

Meng Wang , Wengyao Jiang , Yuval Kluger , Steven H. Kleinstein , Gisela Gabernet

{"title":"AMULETY: A Python package to embed adaptive immune receptor sequences","authors":"Meng Wang , Wengyao Jiang , Yuval Kluger , Steven H. Kleinstein , Gisela Gabernet","doi":"10.1016/j.immuno.2026.100066","DOIUrl":null,"url":null,"abstract":"<div><div>Large language models have been developed to capture relevant features of adaptive immune receptors, each with unique potential applications. However, the diversity in available models presents challenges in accessibility and usability for downstream applications. Here we present AMULETY (Adaptive imMUne receptor Language model Embedding Tool), a Python-based software package to generate language model embeddings for adaptive immune receptor sequences, enabling users to leverage the strengths of different models without the need for complex configuration. AMULETY offers functions for embedding adaptive immune receptor amino acid sequences using pre-trained protein or antibody language models for paired B-cell receptor heavy-light, T-cell receptor alpha-beta or gamma-delta chains, or single chain sequences. We showcase the variability on the embedding space for several embeddings on a dataset of antibody binders to several SARS-CoV-2 epitopes as well as T-cell receptors binding to several epitopes and showed that different models may be effective at capturing different aspects of the distinctions between epitope groups. AMULETY is freely available under GPLv3 license from <span><span>https://github.com/immcantation/amulety</span><svg><path></path></svg></span> or via pip from the Python Package Index (PyPI) from <span><span>https://pypi.org/project/amulety/</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"21 ","pages":"Article 100066"},"PeriodicalIF":0.0000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Immunoinformatics (Amsterdam, Netherlands)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667119026000029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/3 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Large language models have been developed to capture relevant features of adaptive immune receptors, each with unique potential applications. However, the diversity in available models presents challenges in accessibility and usability for downstream applications. Here we present AMULETY (Adaptive imMUne receptor Language model Embedding Tool), a Python-based software package to generate language model embeddings for adaptive immune receptor sequences, enabling users to leverage the strengths of different models without the need for complex configuration. AMULETY offers functions for embedding adaptive immune receptor amino acid sequences using pre-trained protein or antibody language models for paired B-cell receptor heavy-light, T-cell receptor alpha-beta or gamma-delta chains, or single chain sequences. We showcase the variability on the embedding space for several embeddings on a dataset of antibody binders to several SARS-CoV-2 epitopes as well as T-cell receptors binding to several epitopes and showed that different models may be effective at capturing different aspects of the distinctions between epitope groups. AMULETY is freely available under GPLv3 license from https://github.com/immcantation/amulety or via pip from the Python Package Index (PyPI) from https://pypi.org/project/amulety/.

Abstract Image

查看原文本刊更多论文

AMULETY：嵌入适应性免疫受体序列的Python包

已经开发出大型语言模型来捕捉适应性免疫受体的相关特征，每个都具有独特的潜在应用。然而，可用模型的多样性对下游应用程序的可访问性和可用性提出了挑战。在这里，我们提出了AMULETY（适应性免疫受体语言模型嵌入工具），这是一个基于python的软件包，用于生成适应性免疫受体序列的语言模型嵌入，使用户能够在不需要复杂配置的情况下利用不同模型的优势。AMULETY提供嵌入适应性免疫受体氨基酸序列的功能，使用预先训练的蛋白质或抗体语言模型，用于配对b细胞受体重-轻，t细胞受体α - β或γ - δ链或单链序列。我们展示了几种嵌入到几种SARS-CoV-2表位的抗体结合物以及结合到几种表位的t细胞受体数据集的嵌入空间的可变性，并表明不同的模型可能有效地捕获表位组之间差异的不同方面。AMULETY在GPLv3许可下可从https://github.com/immcantation/amulety或通过pip从https://pypi.org/project/amulety/获得Python包索引（PyPI）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Immunoinformatics (Amsterdam, Netherlands) Immunology, Computer Science Applications

自引率

0.00%

发文量

审稿时长

60 days