Information Architecture: Using K-Means Clustering and the Best Merge Method for Open Card Sorting Data Analysis

Impact factor: 1.0 | Zone 4 (Computer Science) | JCR Q3 (Computer Science, Cybernetics)
Sione Paea, C. Katsanos, Gabiriele Bulivou
Interacting with Computers, pp. 670-689. Published 2022-07-27. Journal article. DOI: 10.1093/iwc/iwac022 (https://doi.org/10.1093/iwc/iwac022). Open access: no. Record source: Semantic Scholar.
Citations: 1

Abstract

Open card sorting is a well-established method for discovering how people understand and categorize information. This paper addresses the problem of quantitatively analyzing open card sorting data using the K-means algorithm. Although K-means is effective, its results are highly sensitive to the initial category centers. Many approaches in the literature have therefore focused on determining suitable initial centers. However, this is not always possible, especially as the number of categories increases. This paper proposes an approach to improve the quality of the solution produced by K-means for open card sorting data analysis. Results show that the proposed initialization approach for K-means outperforms existing initialization methods such as MaxMin, random initialization and K-means++. The proposed algorithm is applied to a real-world open card sorting dataset and, unlike existing solutions in the literature, it can be used with any number of participants and cards.
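The sensitivity to initial centers mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's proposed initialization, which the abstract does not describe. It assumes each card is represented by its row in a card-by-card co-occurrence matrix (an assumption; the paper's data representation is not stated here), uses a made-up toy dataset, and compares plain random initialization with a common farthest-first reading of MaxMin. All function names are hypothetical.

```python
import random

def cooccurrence(sorts, cards):
    """Row i counts how many participants put card i in the same pile as each card."""
    index = {c: i for i, c in enumerate(cards)}
    m = [[0.0] * len(cards) for _ in cards]
    for groups in sorts:          # one participant's sort
        for group in groups:      # one pile of cards
            for a in group:
                for b in group:
                    m[index[a]][index[b]] += 1.0
    return m

def sqdist(p, q):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(p, q))

def kmeans(points, centers, iters=100):
    """Lloyd's algorithm from the given initial centers; returns (labels, sse)."""
    centers = [list(c) for c in centers]
    labels = None
    for _ in range(iters):
        new_labels = [min(range(len(centers)), key=lambda k: sqdist(p, centers[k]))
                      for p in points]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for k in range(len(centers)):
            members = [p for p, l in zip(points, labels) if l == k]
            if members:           # an empty cluster keeps its old center
                centers[k] = [sum(col) / len(members) for col in zip(*members)]
    sse = sum(sqdist(p, centers[l]) for p, l in zip(points, labels))
    return labels, sse

def random_init(points, k, rng):
    """Pick k distinct data points as initial centers."""
    return rng.sample(points, k)

def maxmin_init(points, k, rng):
    """Farthest-first: each new center is the point farthest from its nearest center."""
    centers = [rng.choice(points)]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(sqdist(p, c) for c in centers)))
    return centers

# Toy data (invented for illustration): 4 participants sorting 6 cards.
cards = ["Login", "Password", "Cart", "Checkout", "FAQ", "Contact"]
sorts = [
    [["Login", "Password"], ["Cart", "Checkout"], ["FAQ", "Contact"]],
    [["Login", "Password"], ["Cart", "Checkout"], ["FAQ", "Contact"]],
    [["Login", "Password", "FAQ"], ["Cart", "Checkout"], ["Contact"]],
    [["Login", "Password"], ["Cart", "Checkout", "Contact"], ["FAQ"]],
]
points = cooccurrence(sorts, cards)

for name, init in [("random", random_init), ("maxmin", maxmin_init)]:
    # Collect the distinct final SSE values reached from 20 different seeds.
    sses = sorted({round(kmeans(points, init(points, 3, random.Random(seed)))[1], 3)
                   for seed in range(20)})
    print(name, "distinct final SSE values:", sses)
```

If an initialization method reaches several distinct final SSE values across seeds, the solution depends on the starting centers, which is the problem the abstract describes. A better initialization would be expected to reach the lowest value more consistently.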
Source journal
Interacting with Computers (Engineering and Technology: Computer Science, Cybernetics)
CiteScore: 2.70
Self-citation rate: 0.00%
Articles published: 12
Review time: >12 weeks
About the journal: Interacting with Computers: The Interdisciplinary Journal of Human-Computer Interaction is an official publication of BCS, The Chartered Institute for IT and the Interaction Specialist Group. Interacting with Computers (IwC) was launched in 1987 by interaction to provide access to the results of research in the field of Human-Computer Interaction (HCI), an increasingly crucial discipline within the Computer, Information, and Design Sciences. Now one of the most highly rated journals in the field, IwC has a strong and growing Impact Factor, a high ranking and excellent indices (h-index, SNIP, SJR).