{"title":"Navigating the AI technology landscape from GitHub data","authors":"Jaemyoung Choi , Sungsoo Lee , Hakyeon Lee","doi":"10.1016/j.techsoc.2025.103090","DOIUrl":null,"url":null,"abstract":"<div><div>As artificial intelligence (AI) is considered a pivotal technology determining competitiveness, understanding the current and future state of AI technology has become crucial. Conventional approaches to mapping the technology landscape have relied heavily on patent data, but patents cannot adequately capture the state of the art in rapidly changing technologies like AI, due to significant time lags from development to registration. Given that much of the AI technology is developed through open source projects on GitHub, the largest and most popular code host and social coding platform, GitHub emerges as a promising data source for navigating the AI technology landscape. This study aims to explore and predict the AI landscape based on GitHub data. We propose a new bibliometric-like measure, called library coupling, which leverages the unique aspect of code reuse in open source software development to capture the relationships between GitHub repositories. A total of 2879 AI-related repositories with Python-based libraries were collected from GitHub. An AI repository network is constructed based on library coupling relationships among these repositories. Using the attributed graph clustering technique, the AI repositories within the network are grouped into 20 AI technology clusters. Subsequently, we employ graph convolutional network-based link prediction to predict the changes in the AI technology landscape. The proposed GitHub-based technology landscaping approach can be effectively utilized to grasp the current state of rapidly evolving <span>AI</span> technologies and predict their future trends, thereby supporting informed decision making in national <span>AI</span> policy formulation and corporate <span>AI</span> strategy.</div></div>","PeriodicalId":47979,"journal":{"name":"Technology in Society","volume":"84 ","pages":"Article 103090"},"PeriodicalIF":12.5000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Technology in Society","FirstCategoryId":"90","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0160791X25002805","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL ISSUES","Score":null,"Total":0}
引用次数: 0
Abstract
As artificial intelligence (AI) is considered a pivotal technology determining competitiveness, understanding the current and future state of AI technology has become crucial. Conventional approaches to mapping the technology landscape have relied heavily on patent data, but patents cannot adequately capture the state of the art in rapidly changing technologies like AI, due to significant time lags from development to registration. Given that much of the AI technology is developed through open source projects on GitHub, the largest and most popular code host and social coding platform, GitHub emerges as a promising data source for navigating the AI technology landscape. This study aims to explore and predict the AI landscape based on GitHub data. We propose a new bibliometric-like measure, called library coupling, which leverages the unique aspect of code reuse in open source software development to capture the relationships between GitHub repositories. A total of 2879 AI-related repositories with Python-based libraries were collected from GitHub. An AI repository network is constructed based on library coupling relationships among these repositories. Using the attributed graph clustering technique, the AI repositories within the network are grouped into 20 AI technology clusters. Subsequently, we employ graph convolutional network-based link prediction to predict the changes in the AI technology landscape. The proposed GitHub-based technology landscaping approach can be effectively utilized to grasp the current state of rapidly evolving AI technologies and predict their future trends, thereby supporting informed decision making in national AI policy formulation and corporate AI strategy.
期刊介绍:
Technology in Society is a global journal dedicated to fostering discourse at the crossroads of technological change and the social, economic, business, and philosophical transformation of our world. The journal aims to provide scholarly contributions that empower decision-makers to thoughtfully and intentionally navigate the decisions shaping this dynamic landscape. A common thread across these fields is the role of technology in society, influencing economic, political, and cultural dynamics. Scholarly work in Technology in Society delves into the social forces shaping technological decisions and the societal choices regarding technology use. This encompasses scholarly and theoretical approaches (history and philosophy of science and technology, technology forecasting, economic growth, and policy, ethics), applied approaches (business innovation, technology management, legal and engineering), and developmental perspectives (technology transfer, technology assessment, and economic development). Detailed information about the journal's aims and scope on specific topics can be found in Technology in Society Briefings, accessible via our Special Issues and Article Collections.