{"title":"利用 Collective Mind、虚拟化 MLOps、MLPerf、Collective Knowledge Playground 和可重现的优化锦标赛,打造更高效、更具成本效益的人工智能/人工智能系统","authors":"Grigori Fursin","doi":"arxiv-2406.16791","DOIUrl":null,"url":null,"abstract":"In this white paper, I present my community effort to automatically co-design\ncheaper, faster and more energy-efficient software and hardware for AI, ML and\nother popular workloads with the help of the Collective Mind framework (CM),\nvirtualized MLOps, MLPerf benchmarks and reproducible optimization tournaments.\nI developed CM to modularize, automate and virtualize the tedious process of\nbuilding, running, profiling and optimizing complex applications across rapidly\nevolving open-source and proprietary AI/ML models, datasets, software and\nhardware. I achieved that with the help of portable, reusable and\ntechnology-agnostic automation recipes (ResearchOps) for MLOps and DevOps\n(CM4MLOps) discovered in close collaboration with academia and industry when\nreproducing more than 150 research papers and organizing the 1st mass-scale\ncommunity benchmarking of ML and AI systems using CM and MLPerf. I donated CM and CM4MLOps to MLCommons to help connect academia and industry\nto learn how to build and run AI and other emerging workloads in the most\nefficient and cost-effective way using a common and technology-agnostic\nautomation, virtualization and reproducibility framework while unifying\nknowledge exchange, protecting everyone's intellectual property, enabling\nportable skills, and accelerating transfer of the state-of-the-art research to\nproduction. My long-term vision is to make AI accessible to everyone by making\nit a commodity automatically produced from the most suitable open-source and\nproprietary components from different vendors based on user demand,\nrequirements and constraints such as cost, latency, throughput, accuracy,\nenergy, size and other important characteristics.","PeriodicalId":501291,"journal":{"name":"arXiv - CS - Performance","volume":"6 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournaments\",\"authors\":\"Grigori Fursin\",\"doi\":\"arxiv-2406.16791\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this white paper, I present my community effort to automatically co-design\\ncheaper, faster and more energy-efficient software and hardware for AI, ML and\\nother popular workloads with the help of the Collective Mind framework (CM),\\nvirtualized MLOps, MLPerf benchmarks and reproducible optimization tournaments.\\nI developed CM to modularize, automate and virtualize the tedious process of\\nbuilding, running, profiling and optimizing complex applications across rapidly\\nevolving open-source and proprietary AI/ML models, datasets, software and\\nhardware. I achieved that with the help of portable, reusable and\\ntechnology-agnostic automation recipes (ResearchOps) for MLOps and DevOps\\n(CM4MLOps) discovered in close collaboration with academia and industry when\\nreproducing more than 150 research papers and organizing the 1st mass-scale\\ncommunity benchmarking of ML and AI systems using CM and MLPerf. I donated CM and CM4MLOps to MLCommons to help connect academia and industry\\nto learn how to build and run AI and other emerging workloads in the most\\nefficient and cost-effective way using a common and technology-agnostic\\nautomation, virtualization and reproducibility framework while unifying\\nknowledge exchange, protecting everyone's intellectual property, enabling\\nportable skills, and accelerating transfer of the state-of-the-art research to\\nproduction. My long-term vision is to make AI accessible to everyone by making\\nit a commodity automatically produced from the most suitable open-source and\\nproprietary components from different vendors based on user demand,\\nrequirements and constraints such as cost, latency, throughput, accuracy,\\nenergy, size and other important characteristics.\",\"PeriodicalId\":501291,\"journal\":{\"name\":\"arXiv - CS - Performance\",\"volume\":\"6 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Performance\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2406.16791\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Performance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.16791","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在这篇白皮书中,我介绍了我的社区工作,即借助 Collective Mind 框架 (CM)、虚拟化 MLOps、MLPerf 基准和可重现的优化比赛,为 AI、ML 和其他流行工作负载自动共同设计更便宜、更快速、更节能的软件和硬件。我开发 CM 的目的是将在快速发展的开源和专有 AI/ML 模型、数据集、软件和硬件上构建、运行、剖析和优化复杂应用的繁琐过程模块化、自动化和虚拟化。我与学术界和工业界密切合作,为 MLOps 和 DevOps(CM4MLOps)开发了可移植、可重用和技术无关的自动化配方(ResearchOps),发表了 150 多篇研究论文,并使用 CM 和 MLPerf 组织了第一次大规模 ML 和 AI 系统社区基准测试。我将 CM 和 CM4MLOps 捐赠给了 MLCommons,以帮助连接学术界和产业界,学习如何使用通用的技术自动化、虚拟化和可重现性框架,以最高效、最具成本效益的方式构建和运行人工智能和其他新兴工作负载,同时统一知识交流,保护每个人的知识产权,实现可移植技能,并加速最先进研究成果的转化。我的长期愿景是让每个人都能使用人工智能,根据用户需求、要求和制约因素(如成本、延迟、吞吐量、精度、能量、尺寸和其他重要特征),让人工智能成为一种商品,由不同供应商提供的最合适的开源和专有组件自动生产而成。
Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournaments
In this white paper, I present my community effort to automatically co-design
cheaper, faster and more energy-efficient software and hardware for AI, ML and
other popular workloads with the help of the Collective Mind framework (CM),
virtualized MLOps, MLPerf benchmarks and reproducible optimization tournaments.
I developed CM to modularize, automate and virtualize the tedious process of
building, running, profiling and optimizing complex applications across rapidly
evolving open-source and proprietary AI/ML models, datasets, software and
hardware. I achieved that with the help of portable, reusable and
technology-agnostic automation recipes (ResearchOps) for MLOps and DevOps
(CM4MLOps) discovered in close collaboration with academia and industry when
reproducing more than 150 research papers and organizing the 1st mass-scale
community benchmarking of ML and AI systems using CM and MLPerf. I donated CM and CM4MLOps to MLCommons to help connect academia and industry
to learn how to build and run AI and other emerging workloads in the most
efficient and cost-effective way using a common and technology-agnostic
automation, virtualization and reproducibility framework while unifying
knowledge exchange, protecting everyone's intellectual property, enabling
portable skills, and accelerating transfer of the state-of-the-art research to
production. My long-term vision is to make AI accessible to everyone by making
it a commodity automatically produced from the most suitable open-source and
proprietary components from different vendors based on user demand,
requirements and constraints such as cost, latency, throughput, accuracy,
energy, size and other important characteristics.