从GitHub数据中导航人工智能技术领域

IF 12.5 1区 社会学 Q1 SOCIAL ISSUES
Jaemyoung Choi , Sungsoo Lee , Hakyeon Lee
{"title":"从GitHub数据中导航人工智能技术领域","authors":"Jaemyoung Choi ,&nbsp;Sungsoo Lee ,&nbsp;Hakyeon Lee","doi":"10.1016/j.techsoc.2025.103090","DOIUrl":null,"url":null,"abstract":"<div><div>As artificial intelligence (AI) is considered a pivotal technology determining competitiveness, understanding the current and future state of AI technology has become crucial. Conventional approaches to mapping the technology landscape have relied heavily on patent data, but patents cannot adequately capture the state of the art in rapidly changing technologies like AI, due to significant time lags from development to registration. Given that much of the AI technology is developed through open source projects on GitHub, the largest and most popular code host and social coding platform, GitHub emerges as a promising data source for navigating the AI technology landscape. This study aims to explore and predict the AI landscape based on GitHub data. We propose a new bibliometric-like measure, called library coupling, which leverages the unique aspect of code reuse in open source software development to capture the relationships between GitHub repositories. A total of 2879 AI-related repositories with Python-based libraries were collected from GitHub. An AI repository network is constructed based on library coupling relationships among these repositories. Using the attributed graph clustering technique, the AI repositories within the network are grouped into 20 AI technology clusters. Subsequently, we employ graph convolutional network-based link prediction to predict the changes in the AI technology landscape. The proposed GitHub-based technology landscaping approach can be effectively utilized to grasp the current state of rapidly evolving <span>AI</span> technologies and predict their future trends, thereby supporting informed decision making in national <span>AI</span> policy formulation and corporate <span>AI</span> strategy.</div></div>","PeriodicalId":47979,"journal":{"name":"Technology in Society","volume":"84 ","pages":"Article 103090"},"PeriodicalIF":12.5000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Navigating the AI technology landscape from GitHub data\",\"authors\":\"Jaemyoung Choi ,&nbsp;Sungsoo Lee ,&nbsp;Hakyeon Lee\",\"doi\":\"10.1016/j.techsoc.2025.103090\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As artificial intelligence (AI) is considered a pivotal technology determining competitiveness, understanding the current and future state of AI technology has become crucial. Conventional approaches to mapping the technology landscape have relied heavily on patent data, but patents cannot adequately capture the state of the art in rapidly changing technologies like AI, due to significant time lags from development to registration. Given that much of the AI technology is developed through open source projects on GitHub, the largest and most popular code host and social coding platform, GitHub emerges as a promising data source for navigating the AI technology landscape. This study aims to explore and predict the AI landscape based on GitHub data. We propose a new bibliometric-like measure, called library coupling, which leverages the unique aspect of code reuse in open source software development to capture the relationships between GitHub repositories. A total of 2879 AI-related repositories with Python-based libraries were collected from GitHub. An AI repository network is constructed based on library coupling relationships among these repositories. Using the attributed graph clustering technique, the AI repositories within the network are grouped into 20 AI technology clusters. Subsequently, we employ graph convolutional network-based link prediction to predict the changes in the AI technology landscape. The proposed GitHub-based technology landscaping approach can be effectively utilized to grasp the current state of rapidly evolving <span>AI</span> technologies and predict their future trends, thereby supporting informed decision making in national <span>AI</span> policy formulation and corporate <span>AI</span> strategy.</div></div>\",\"PeriodicalId\":47979,\"journal\":{\"name\":\"Technology in Society\",\"volume\":\"84 \",\"pages\":\"Article 103090\"},\"PeriodicalIF\":12.5000,\"publicationDate\":\"2025-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Technology in Society\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0160791X25002805\",\"RegionNum\":1,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SOCIAL ISSUES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Technology in Society","FirstCategoryId":"90","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0160791X25002805","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL ISSUES","Score":null,"Total":0}
引用次数: 0

摘要

人工智能(AI)被认为是决定竞争力的关键技术,因此了解人工智能技术的现状和未来状态变得至关重要。绘制技术版图的传统方法严重依赖于专利数据,但由于从开发到注册存在明显的时间滞后,专利无法充分捕捉人工智能等快速变化的技术的最新进展。鉴于大部分人工智能技术是通过GitHub上的开源项目开发的,GitHub是最大和最受欢迎的代码托管和社交编码平台,因此GitHub成为导航人工智能技术领域的有前途的数据源。本研究旨在基于GitHub数据探索和预测AI景观。我们提出了一种新的类似文献计量学的度量方法,称为库耦合,它利用开源软件开发中代码重用的独特方面来捕获GitHub存储库之间的关系。从GitHub中总共收集了2879个与ai相关的基于python的库。基于这些库之间的库耦合关系构建AI库网络。利用属性图聚类技术,将网络内的人工智能库划分为20个人工智能技术集群。随后,我们采用基于图卷积网络的链接预测来预测人工智能技术领域的变化。所提出的基于github的技术景观方法可以有效地用于掌握快速发展的人工智能技术的现状并预测其未来趋势,从而支持国家人工智能政策制定和企业人工智能战略的明智决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Navigating the AI technology landscape from GitHub data
As artificial intelligence (AI) is considered a pivotal technology determining competitiveness, understanding the current and future state of AI technology has become crucial. Conventional approaches to mapping the technology landscape have relied heavily on patent data, but patents cannot adequately capture the state of the art in rapidly changing technologies like AI, due to significant time lags from development to registration. Given that much of the AI technology is developed through open source projects on GitHub, the largest and most popular code host and social coding platform, GitHub emerges as a promising data source for navigating the AI technology landscape. This study aims to explore and predict the AI landscape based on GitHub data. We propose a new bibliometric-like measure, called library coupling, which leverages the unique aspect of code reuse in open source software development to capture the relationships between GitHub repositories. A total of 2879 AI-related repositories with Python-based libraries were collected from GitHub. An AI repository network is constructed based on library coupling relationships among these repositories. Using the attributed graph clustering technique, the AI repositories within the network are grouped into 20 AI technology clusters. Subsequently, we employ graph convolutional network-based link prediction to predict the changes in the AI technology landscape. The proposed GitHub-based technology landscaping approach can be effectively utilized to grasp the current state of rapidly evolving AI technologies and predict their future trends, thereby supporting informed decision making in national AI policy formulation and corporate AI strategy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
17.90
自引率
14.10%
发文量
316
审稿时长
60 days
期刊介绍: Technology in Society is a global journal dedicated to fostering discourse at the crossroads of technological change and the social, economic, business, and philosophical transformation of our world. The journal aims to provide scholarly contributions that empower decision-makers to thoughtfully and intentionally navigate the decisions shaping this dynamic landscape. A common thread across these fields is the role of technology in society, influencing economic, political, and cultural dynamics. Scholarly work in Technology in Society delves into the social forces shaping technological decisions and the societal choices regarding technology use. This encompasses scholarly and theoretical approaches (history and philosophy of science and technology, technology forecasting, economic growth, and policy, ethics), applied approaches (business innovation, technology management, legal and engineering), and developmental perspectives (technology transfer, technology assessment, and economic development). Detailed information about the journal's aims and scope on specific topics can be found in Technology in Society Briefings, accessible via our Special Issues and Article Collections.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信