Linear-time approximation scheme for k-means clustering of axis-parallel affine subspaces

IF 0.4 4区 计算机科学 Q4 MATHEMATICS
Kyungjin Cho, Eunjin Oh
{"title":"Linear-time approximation scheme for k-means clustering of axis-parallel affine subspaces","authors":"Kyungjin Cho,&nbsp;Eunjin Oh","doi":"10.1016/j.comgeo.2023.101981","DOIUrl":null,"url":null,"abstract":"<div><p>In this paper, we present a linear-time approximation scheme for <em>k</em>-means clustering of <em>incomplete</em> data points in <em>d</em>-dimensional Euclidean space. An <em>incomplete</em> data point with <span><math><mi>Δ</mi><mo>&gt;</mo><mn>0</mn></math></span><span><span> unspecified entries is represented as an axis-parallel affine subspace of dimension Δ. The distance between two incomplete data points is defined as the </span>Euclidean distance between two closest points in the axis-parallel affine subspaces corresponding to the data points. We present an algorithm for </span><em>k</em>-means clustering of <em>n</em> axis-parallel affine subspaces of dimension Δ that yields an <span><math><mo>(</mo><mn>1</mn><mo>+</mo><mi>ϵ</mi><mo>)</mo></math></span>-approximate solution in <span><math><mi>O</mi><mo>(</mo><mi>n</mi><mi>d</mi><mo>)</mo></math></span> time. The constants hidden behind <span><math><mi>O</mi><mo>(</mo><mo>⋅</mo><mo>)</mo></math></span> depend only on <span><math><mi>Δ</mi><mo>,</mo><mi>ϵ</mi></math></span> and <em>k</em>. This improves the <span><math><mi>O</mi><mo>(</mo><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn></mrow></msup><mi>d</mi><mo>)</mo></math></span>-time algorithm by Eiben et al. (2021) <span>[7]</span> by a factor of <em>n</em>.</p></div>","PeriodicalId":51001,"journal":{"name":"Computational Geometry-Theory and Applications","volume":null,"pages":null},"PeriodicalIF":0.4000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Geometry-Theory and Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925772123000019","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, we present a linear-time approximation scheme for k-means clustering of incomplete data points in d-dimensional Euclidean space. An incomplete data point with Δ>0 unspecified entries is represented as an axis-parallel affine subspace of dimension Δ. The distance between two incomplete data points is defined as the Euclidean distance between two closest points in the axis-parallel affine subspaces corresponding to the data points. We present an algorithm for k-means clustering of n axis-parallel affine subspaces of dimension Δ that yields an (1+ϵ)-approximate solution in O(nd) time. The constants hidden behind O() depend only on Δ,ϵ and k. This improves the O(n2d)-time algorithm by Eiben et al. (2021) [7] by a factor of n.

轴平行仿射子空间k均值聚类的线性时间近似格式
本文给出了d维欧氏空间中不完全数据点的k均值聚类的线性时间近似方案。Δ>;0个未指定条目表示为维度为Δ的轴平行仿射子空间。两个不完全数据点之间的距离被定义为与数据点相对应的轴平行仿射子空间中的两个最近点之间的欧几里得距离。我们提出了一种对维度为Δ的n轴平行仿射子空间进行k均值聚类的算法,该算法在O(nd)时间内产生(1+)-近似解。隐藏在O(‧)后面的常数仅取决于Δ、Ş和k。这改进了Eiben等人的O(n2d)-时间算法。(2021)[7]的因子为n。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.60
自引率
16.70%
发文量
43
审稿时长
>12 weeks
期刊介绍: Computational Geometry is a forum for research in theoretical and applied aspects of computational geometry. The journal publishes fundamental research in all areas of the subject, as well as disseminating information on the applications, techniques, and use of computational geometry. Computational Geometry publishes articles on the design and analysis of geometric algorithms. All aspects of computational geometry are covered, including the numerical, graph theoretical and combinatorial aspects. Also welcomed are computational geometry solutions to fundamental problems arising in computer graphics, pattern recognition, robotics, image processing, CAD-CAM, VLSI design and geographical information systems. Computational Geometry features a special section containing open problems and concise reports on implementations of computational geometry tools.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信