CompTrails: comparing hypotheses across behavioral networks

IF 2.8 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Data Mining and Knowledge Discovery Pub Date : 2024-01-03 DOI:10.1007/s10618-023-00996-8

Tobias Koopmann, Martin Becker, Florian Lemmerich, Andreas Hotho

{"title":"CompTrails: comparing hypotheses across behavioral networks","authors":"Tobias Koopmann, Martin Becker, Florian Lemmerich, Andreas Hotho","doi":"10.1007/s10618-023-00996-8","DOIUrl":null,"url":null,"abstract":"<p>The term <i>Behavioral Networks</i> describes networks that contain relational information on human behavior. This ranges from social networks that contain friendships or cooperations between individuals, to navigational networks that contain geographical or web navigation, and many more. Understanding the forces driving behavior within these networks can be beneficial to improving the underlying network, for example, by generating new hyperlinks on websites, or by proposing new connections and friends on social networks. Previous approaches considered different hypotheses on a single network and evaluated which hypothesis fits best. These hypotheses can represent human intuition and expert opinions or be based on previous insights. In this work, we extend these approaches to enable the comparison of a single hypothesis between multiple networks. We unveil several issues of naive approaches that potentially impact comparisons and lead to undesired results. Based on these findings, we propose a framework with five flexible components that allow addressing specific analysis goals tailored to the application scenario. We show the benefits and limits of our approach by applying it to synthetic data and several real-world datasets, including web navigation, bibliometric navigation, and geographic navigation. Our work supports practitioners and researchers with the aim of understanding similarities and differences in human behavior between environments.</p>","PeriodicalId":55183,"journal":{"name":"Data Mining and Knowledge Discovery","volume":"28 1","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data Mining and Knowledge Discovery","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10618-023-00996-8","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The term Behavioral Networks describes networks that contain relational information on human behavior. This ranges from social networks that contain friendships or cooperations between individuals, to navigational networks that contain geographical or web navigation, and many more. Understanding the forces driving behavior within these networks can be beneficial to improving the underlying network, for example, by generating new hyperlinks on websites, or by proposing new connections and friends on social networks. Previous approaches considered different hypotheses on a single network and evaluated which hypothesis fits best. These hypotheses can represent human intuition and expert opinions or be based on previous insights. In this work, we extend these approaches to enable the comparison of a single hypothesis between multiple networks. We unveil several issues of naive approaches that potentially impact comparisons and lead to undesired results. Based on these findings, we propose a framework with five flexible components that allow addressing specific analysis goals tailored to the application scenario. We show the benefits and limits of our approach by applying it to synthetic data and several real-world datasets, including web navigation, bibliometric navigation, and geographic navigation. Our work supports practitioners and researchers with the aim of understanding similarities and differences in human behavior between environments.

Abstract Image

查看原文本刊更多论文

CompTrails：跨行为网络的假设比较

行为网络一词描述的是包含人类行为相关信息的网络。其中包括包含个人之间友谊或合作关系的社交网络，以及包含地理或网络导航的导航网络等等。了解这些网络中的行为驱动力有助于改善底层网络，例如，在网站上生成新的超链接，或在社交网络上提出新的连接和朋友。以前的方法考虑了单个网络的不同假设，并评估哪种假设最适合。这些假设可以代表人类的直觉和专家意见，也可以基于以往的见解。在这项工作中，我们对这些方法进行了扩展，以便在多个网络之间对单一假设进行比较。我们揭示了天真方法的几个问题，这些问题可能会影响比较并导致不理想的结果。基于这些发现，我们提出了一个包含五个灵活组件的框架，可以根据应用场景实现特定的分析目标。通过将我们的方法应用于合成数据和几个真实世界的数据集，包括网络导航、文献计量导航和地理导航，我们展示了这种方法的优势和局限性。我们的工作可为从业人员和研究人员提供支持，帮助他们了解不同环境下人类行为的异同。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Data Mining and Knowledge Discovery 工程技术-计算机：人工智能

CiteScore

10.40

自引率

4.20%

发文量

审稿时长

10 months

期刊介绍： Advances in data gathering, storage, and distribution have created a need for computational tools and techniques to aid in data analysis. Data Mining and Knowledge Discovery in Databases (KDD) is a rapidly growing area of research and application that builds on techniques and theories from many fields, including statistics, databases, pattern recognition and learning, data visualization, uncertainty modelling, data warehousing and OLAP, optimization, and high performance computing.