{"title":"吸引-排斥群:通过力归一化和可调相互作用的t-SNE的广义框架。","authors":"Jingcheng Lu, Jeff Calder","doi":"10.1098/rsta.2024.0234","DOIUrl":null,"url":null,"abstract":"<p><p>We propose a new method for data visualization based on attraction-repulsion swarming (ARS) dynamics, which we call ARS visualization. ARS is a generalized framework that is based on viewing the t-distributed stochastic neighbour embedding (t-SNE) visualization technique as a swarm of interacting agents driven by attraction and repulsion. Motivated by recent developments in swarming, we modify the t-SNE dynamics to include a normalization by the <i>total influence</i>, which results in better posed dynamics in which we can use a data size independent time step (of [Formula: see text]) and a simple gradient descent iteration. ARS also includes the ability to separately tune the attraction and repulsion kernels, which gives the user control over the tightness within clusters and the spacing between them in the visualization. In contrast with t-SNE, our proposed ARS data visualization method is not gradient descent on the Kullback-Leibler (KL) divergence, and can be viewed solely as an interacting particle system driven by attraction and repulsion forces, which illustrates that the KL divergence is not an essential part of the t-SNE algorithm. We provide theoretical results illustrating how the choice of interaction kernel affects the dynamics, and experimental results to validate our method and compare to t-SNE on the MNIST, Cifar-10, SVHN and NORB datasets.This article is part of the theme issue 'Partial differential equations in data science'.</p>","PeriodicalId":19879,"journal":{"name":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","volume":"383 2298","pages":"20240234"},"PeriodicalIF":3.7000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attraction-repulsion swarming: a generalized framework of t-SNE via force normalization and tunable interactions.\",\"authors\":\"Jingcheng Lu, Jeff Calder\",\"doi\":\"10.1098/rsta.2024.0234\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>We propose a new method for data visualization based on attraction-repulsion swarming (ARS) dynamics, which we call ARS visualization. ARS is a generalized framework that is based on viewing the t-distributed stochastic neighbour embedding (t-SNE) visualization technique as a swarm of interacting agents driven by attraction and repulsion. Motivated by recent developments in swarming, we modify the t-SNE dynamics to include a normalization by the <i>total influence</i>, which results in better posed dynamics in which we can use a data size independent time step (of [Formula: see text]) and a simple gradient descent iteration. ARS also includes the ability to separately tune the attraction and repulsion kernels, which gives the user control over the tightness within clusters and the spacing between them in the visualization. In contrast with t-SNE, our proposed ARS data visualization method is not gradient descent on the Kullback-Leibler (KL) divergence, and can be viewed solely as an interacting particle system driven by attraction and repulsion forces, which illustrates that the KL divergence is not an essential part of the t-SNE algorithm. We provide theoretical results illustrating how the choice of interaction kernel affects the dynamics, and experimental results to validate our method and compare to t-SNE on the MNIST, Cifar-10, SVHN and NORB datasets.This article is part of the theme issue 'Partial differential equations in data science'.</p>\",\"PeriodicalId\":19879,\"journal\":{\"name\":\"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences\",\"volume\":\"383 2298\",\"pages\":\"20240234\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1098/rsta.2024.0234\",\"RegionNum\":3,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1098/rsta.2024.0234","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Attraction-repulsion swarming: a generalized framework of t-SNE via force normalization and tunable interactions.
We propose a new method for data visualization based on attraction-repulsion swarming (ARS) dynamics, which we call ARS visualization. ARS is a generalized framework that is based on viewing the t-distributed stochastic neighbour embedding (t-SNE) visualization technique as a swarm of interacting agents driven by attraction and repulsion. Motivated by recent developments in swarming, we modify the t-SNE dynamics to include a normalization by the total influence, which results in better posed dynamics in which we can use a data size independent time step (of [Formula: see text]) and a simple gradient descent iteration. ARS also includes the ability to separately tune the attraction and repulsion kernels, which gives the user control over the tightness within clusters and the spacing between them in the visualization. In contrast with t-SNE, our proposed ARS data visualization method is not gradient descent on the Kullback-Leibler (KL) divergence, and can be viewed solely as an interacting particle system driven by attraction and repulsion forces, which illustrates that the KL divergence is not an essential part of the t-SNE algorithm. We provide theoretical results illustrating how the choice of interaction kernel affects the dynamics, and experimental results to validate our method and compare to t-SNE on the MNIST, Cifar-10, SVHN and NORB datasets.This article is part of the theme issue 'Partial differential equations in data science'.
期刊介绍:
Continuing its long history of influential scientific publishing, Philosophical Transactions A publishes high-quality theme issues on topics of current importance and general interest within the physical, mathematical and engineering sciences, guest-edited by leading authorities and comprising new research, reviews and opinions from prominent researchers.