{"title":"从数据流中监督学习:概述和更新","authors":"Jesse Read, Indre Zliobaite","doi":"10.1145/3737279","DOIUrl":null,"url":null,"abstract":"The literature on machine learning in the context of data streams is vast and growing. This indicates not only an ongoing interest, but also an ongoing need for a synthesis of new developments in this area. Here we reformulate the definitions of supervised data-stream learning, alongside consideration of contemporary concept drift and temporal dependence. Equipped with this, carry out a fresh discussion of what constitutes a supervised data-stream learning task; including continual and reinforcement learning; highlighting major assumptions and constraints. We carry out a fresh reconsideration of approaches and methods, with regard to their suitability to modern settings. But more than a categorization of state-of-the-art streaming methods, we provide a re-introduction to what is supervised stream learning, and our emphasis here is a survey of settings, and algorithmic settings. Our main goal is to pull theory and practice of supervised learning over data streams closer together. We conclude that practical stream learning does not mandate an online-learning regime. In the modern context, learning regimes should be selected and developed according to the factual data arrival mode, resource constraints, and maximum robustness and trustworthiness. We finish with a set of recommendations to this effect.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"5 1","pages":""},"PeriodicalIF":23.8000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Supervised Learning from Data Streams: An Overview and Update\",\"authors\":\"Jesse Read, Indre Zliobaite\",\"doi\":\"10.1145/3737279\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The literature on machine learning in the context of data streams is vast and growing. This indicates not only an ongoing interest, but also an ongoing need for a synthesis of new developments in this area. Here we reformulate the definitions of supervised data-stream learning, alongside consideration of contemporary concept drift and temporal dependence. Equipped with this, carry out a fresh discussion of what constitutes a supervised data-stream learning task; including continual and reinforcement learning; highlighting major assumptions and constraints. We carry out a fresh reconsideration of approaches and methods, with regard to their suitability to modern settings. But more than a categorization of state-of-the-art streaming methods, we provide a re-introduction to what is supervised stream learning, and our emphasis here is a survey of settings, and algorithmic settings. Our main goal is to pull theory and practice of supervised learning over data streams closer together. We conclude that practical stream learning does not mandate an online-learning regime. In the modern context, learning regimes should be selected and developed according to the factual data arrival mode, resource constraints, and maximum robustness and trustworthiness. We finish with a set of recommendations to this effect.\",\"PeriodicalId\":50926,\"journal\":{\"name\":\"ACM Computing Surveys\",\"volume\":\"5 1\",\"pages\":\"\"},\"PeriodicalIF\":23.8000,\"publicationDate\":\"2025-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Computing Surveys\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1145/3737279\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Computing Surveys","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3737279","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
Supervised Learning from Data Streams: An Overview and Update
The literature on machine learning in the context of data streams is vast and growing. This indicates not only an ongoing interest, but also an ongoing need for a synthesis of new developments in this area. Here we reformulate the definitions of supervised data-stream learning, alongside consideration of contemporary concept drift and temporal dependence. Equipped with this, carry out a fresh discussion of what constitutes a supervised data-stream learning task; including continual and reinforcement learning; highlighting major assumptions and constraints. We carry out a fresh reconsideration of approaches and methods, with regard to their suitability to modern settings. But more than a categorization of state-of-the-art streaming methods, we provide a re-introduction to what is supervised stream learning, and our emphasis here is a survey of settings, and algorithmic settings. Our main goal is to pull theory and practice of supervised learning over data streams closer together. We conclude that practical stream learning does not mandate an online-learning regime. In the modern context, learning regimes should be selected and developed according to the factual data arrival mode, resource constraints, and maximum robustness and trustworthiness. We finish with a set of recommendations to this effect.
期刊介绍:
ACM Computing Surveys is an academic journal that focuses on publishing surveys and tutorials on various areas of computing research and practice. The journal aims to provide comprehensive and easily understandable articles that guide readers through the literature and help them understand topics outside their specialties. In terms of impact, CSUR has a high reputation with a 2022 Impact Factor of 16.6. It is ranked 3rd out of 111 journals in the field of Computer Science Theory & Methods.
ACM Computing Surveys is indexed and abstracted in various services, including AI2 Semantic Scholar, Baidu, Clarivate/ISI: JCR, CNKI, DeepDyve, DTU, EBSCO: EDS/HOST, and IET Inspec, among others.