用于开发和测试光通信系统 ML 模型的实验数据集

IF 4 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Caio Santos;Abdelrahmane Moawad;Behnam Shariati;Robert Emmerich;Pooyan Safari;Colja Schubert;Johannes K. Fischer
{"title":"用于开发和测试光通信系统 ML 模型的实验数据集","authors":"Caio Santos;Abdelrahmane Moawad;Behnam Shariati;Robert Emmerich;Pooyan Safari;Colja Schubert;Johannes K. Fischer","doi":"10.1364/JOCN.531788","DOIUrl":null,"url":null,"abstract":"Due to the scarcity of diverse and well-organized public datasets, individual research organizations are often forced to develop and utilize their own datasets. However, the utilization of machine learning (ML) models in optical communications and networks heavily depends on the existence of high-quality datasets, especially covering the various parameters to be optimized in wavelength-division multiplexing (WDM) systems. In this work, we present a public dataset for developing and testing ML models. The dataset is developed in a laboratory setting and includes 12,672 samples including data points with different modulation formats, symbol rates, distances, WDM channel allocation profiles, etc. Each data point offers more than 60 features, revealing almost every aspect of the transmission setup. Moreover, we provide optical spectra of the entire C-band as well as a constellation diagram of the channel under test for all the data points. The diversity and extensiveness of the dataset alongside a well-structured document would allow plenty of use-cases and studies to be carried out covering quality of transmission (QoT) studies, optical spectrum analysis, constellation diagram modeling, digital twin evaluation, etc. Similar to our previous efforts, the current dataset aims to facilitate collaboration by offering a way for fair comparison of research outcomes in data analysis within the domain of optical communication systems.","PeriodicalId":50103,"journal":{"name":"Journal of Optical Communications and Networking","volume":"16 11","pages":"G1-G10"},"PeriodicalIF":4.0000,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Experimental dataset for developing and testing ML models in optical communication systems\",\"authors\":\"Caio Santos;Abdelrahmane Moawad;Behnam Shariati;Robert Emmerich;Pooyan Safari;Colja Schubert;Johannes K. Fischer\",\"doi\":\"10.1364/JOCN.531788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to the scarcity of diverse and well-organized public datasets, individual research organizations are often forced to develop and utilize their own datasets. However, the utilization of machine learning (ML) models in optical communications and networks heavily depends on the existence of high-quality datasets, especially covering the various parameters to be optimized in wavelength-division multiplexing (WDM) systems. In this work, we present a public dataset for developing and testing ML models. The dataset is developed in a laboratory setting and includes 12,672 samples including data points with different modulation formats, symbol rates, distances, WDM channel allocation profiles, etc. Each data point offers more than 60 features, revealing almost every aspect of the transmission setup. Moreover, we provide optical spectra of the entire C-band as well as a constellation diagram of the channel under test for all the data points. The diversity and extensiveness of the dataset alongside a well-structured document would allow plenty of use-cases and studies to be carried out covering quality of transmission (QoT) studies, optical spectrum analysis, constellation diagram modeling, digital twin evaluation, etc. Similar to our previous efforts, the current dataset aims to facilitate collaboration by offering a way for fair comparison of research outcomes in data analysis within the domain of optical communication systems.\",\"PeriodicalId\":50103,\"journal\":{\"name\":\"Journal of Optical Communications and Networking\",\"volume\":\"16 11\",\"pages\":\"G1-G10\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Optical Communications and Networking\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10705755/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Optical Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10705755/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

摘要

由于缺乏种类繁多、组织有序的公共数据集,各个研究机构往往不得不开发和利用自己的数据集。然而,机器学习(ML)模型在光通信和网络中的应用在很大程度上依赖于高质量数据集的存在,尤其是涵盖波分复用(WDM)系统中需要优化的各种参数的数据集。在这项工作中,我们提出了一个用于开发和测试 ML 模型的公共数据集。该数据集是在实验室环境中开发的,包含 12672 个样本,包括不同调制格式、符号率、距离、波分复用信道分配情况等数据点。每个数据点提供 60 多个特征,揭示了传输设置的几乎所有方面。此外,我们还为所有数据点提供了整个 C 波段的光学光谱以及被测信道的星座图。数据集的多样性和广泛性以及结构良好的文档将允许开展大量的用例和研究,包括传输质量(QoT)研究、光学频谱分析、星座图建模、数字孪生评估等。与我们之前的工作类似,当前的数据集旨在通过提供一种公平比较光通信系统领域内数据分析研究成果的方法来促进合作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Experimental dataset for developing and testing ML models in optical communication systems
Due to the scarcity of diverse and well-organized public datasets, individual research organizations are often forced to develop and utilize their own datasets. However, the utilization of machine learning (ML) models in optical communications and networks heavily depends on the existence of high-quality datasets, especially covering the various parameters to be optimized in wavelength-division multiplexing (WDM) systems. In this work, we present a public dataset for developing and testing ML models. The dataset is developed in a laboratory setting and includes 12,672 samples including data points with different modulation formats, symbol rates, distances, WDM channel allocation profiles, etc. Each data point offers more than 60 features, revealing almost every aspect of the transmission setup. Moreover, we provide optical spectra of the entire C-band as well as a constellation diagram of the channel under test for all the data points. The diversity and extensiveness of the dataset alongside a well-structured document would allow plenty of use-cases and studies to be carried out covering quality of transmission (QoT) studies, optical spectrum analysis, constellation diagram modeling, digital twin evaluation, etc. Similar to our previous efforts, the current dataset aims to facilitate collaboration by offering a way for fair comparison of research outcomes in data analysis within the domain of optical communication systems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.40
自引率
16.00%
发文量
104
审稿时长
4 months
期刊介绍: The scope of the Journal includes advances in the state-of-the-art of optical networking science, technology, and engineering. Both theoretical contributions (including new techniques, concepts, analyses, and economic studies) and practical contributions (including optical networking experiments, prototypes, and new applications) are encouraged. Subareas of interest include the architecture and design of optical networks, optical network survivability and security, software-defined optical networking, elastic optical networks, data and control plane advances, network management related innovation, and optical access networks. Enabling technologies and their applications are suitable topics only if the results are shown to directly impact optical networking beyond simple point-to-point networks.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信