{"title":"KMSharing:网络边缘高效数据共享的框架和空间抽象","authors":"Yuchen Sun;Lailong Luo;Deke Guo;Li Liu;Junjie Xie","doi":"10.1109/TNET.2024.3465844","DOIUrl":null,"url":null,"abstract":"Edge storage promises to be crucial for edge computing infrastructure, which enables users to access data within a low delay from widespread storage nodes at the network edge. The key challenge is how to integrate massive geographically distributed weak edge nodes to form an efficient storage system, enabling users to launch data operations from any node or retrieve the desired data across the entire distributed system. To address this data-sharing problem, researchers from both the traditional peer-to-peer (P2P) overlay networking and emerging edge computing fields have proposed some decentralized indexing mechanisms. However, existing studies lack insightful descriptions and analyses about the nature of the data-sharing problem at the network edge. It motivates us to rethink the edge data-sharing framework and provide the problem reformulation for analyzing the limitations of existing schemes. We reveal that the existing data-sharing schemes fail in complex network topologies which can be regarded as high-dimensional network spaces beyond the representation of low-dimensional Euclidean spaces or other existing hash spaces. A better space abstraction is an urgent need to alleviate the performance degradation due to the dimensional mismatch between network spaces and virtual spaces. To fill this gap, this paper proposes the Kautz metric space, a novel space abstraction extended from Kautz graphs, where the coordinates and the metric are defined as Kautz strings and Kautz distances (i.e., the shortest distances in undirected Kautz graphs), respectively. We design a dynamic programming algorithm to directly compute the Kautz distances. Then, we propose KMSharing, an efficient edge data-sharing scheme: both nodes and data are represented in a Kautz metric space, where the Kautz distance of any two Kautz strings reflects the network delay of the corresponding nodes. The workflow of KMSharing consists of three core components: the virtual address allocation represents edge nodes in the Kautz metric space; the data-to-node mapping ensures the uniqueness of target nodes; and forwarding table construction ensures the data delivery. Theoretical analyses confirm that KMSharing ideally achieves \n<inline-formula> <tex-math>$\\mathcal {O}\\left ({{ \\tau }}\\right)$ </tex-math></inline-formula>\n network delays, \n<inline-formula> <tex-math>$\\mathcal {O}\\left ({{ \\log N }}\\right)$ </tex-math></inline-formula>\n overlay hops, and \n<inline-formula> <tex-math>$\\mathcal {O}\\left ({{ 1 }}\\right)$ </tex-math></inline-formula>\n forwarding entries in an N-node edge system with the network radius \n<inline-formula> <tex-math>$\\tau $ </tex-math></inline-formula>\n, while the successive ensuring data delivery. Its worst-case network delay \n<inline-formula> <tex-math>$\\mathcal {O}\\left ({{ \\tau \\log N }}\\right)$ </tex-math></inline-formula>\n is also much better than \n<inline-formula> <tex-math>${\\mathcal {O}\\left ({{ \\tau N^{\\alpha } }}\\right)},\\alpha \\mathrm {\\in }(0,1)$ </tex-math></inline-formula>\n, the worst case of the baselines using Euclidean spaces. Evaluation on various network topologies also shows that our KMSharing effectively reduces network delays and indexing costs than existing data-sharing schemes.","PeriodicalId":13443,"journal":{"name":"IEEE/ACM Transactions on Networking","volume":"32 6","pages":"5440-5458"},"PeriodicalIF":3.0000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"KMSharing: The Framework and Space Abstraction for Efficient Data Sharing at the Network Edge\",\"authors\":\"Yuchen Sun;Lailong Luo;Deke Guo;Li Liu;Junjie Xie\",\"doi\":\"10.1109/TNET.2024.3465844\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Edge storage promises to be crucial for edge computing infrastructure, which enables users to access data within a low delay from widespread storage nodes at the network edge. The key challenge is how to integrate massive geographically distributed weak edge nodes to form an efficient storage system, enabling users to launch data operations from any node or retrieve the desired data across the entire distributed system. To address this data-sharing problem, researchers from both the traditional peer-to-peer (P2P) overlay networking and emerging edge computing fields have proposed some decentralized indexing mechanisms. However, existing studies lack insightful descriptions and analyses about the nature of the data-sharing problem at the network edge. It motivates us to rethink the edge data-sharing framework and provide the problem reformulation for analyzing the limitations of existing schemes. We reveal that the existing data-sharing schemes fail in complex network topologies which can be regarded as high-dimensional network spaces beyond the representation of low-dimensional Euclidean spaces or other existing hash spaces. A better space abstraction is an urgent need to alleviate the performance degradation due to the dimensional mismatch between network spaces and virtual spaces. To fill this gap, this paper proposes the Kautz metric space, a novel space abstraction extended from Kautz graphs, where the coordinates and the metric are defined as Kautz strings and Kautz distances (i.e., the shortest distances in undirected Kautz graphs), respectively. We design a dynamic programming algorithm to directly compute the Kautz distances. Then, we propose KMSharing, an efficient edge data-sharing scheme: both nodes and data are represented in a Kautz metric space, where the Kautz distance of any two Kautz strings reflects the network delay of the corresponding nodes. The workflow of KMSharing consists of three core components: the virtual address allocation represents edge nodes in the Kautz metric space; the data-to-node mapping ensures the uniqueness of target nodes; and forwarding table construction ensures the data delivery. Theoretical analyses confirm that KMSharing ideally achieves \\n<inline-formula> <tex-math>$\\\\mathcal {O}\\\\left ({{ \\\\tau }}\\\\right)$ </tex-math></inline-formula>\\n network delays, \\n<inline-formula> <tex-math>$\\\\mathcal {O}\\\\left ({{ \\\\log N }}\\\\right)$ </tex-math></inline-formula>\\n overlay hops, and \\n<inline-formula> <tex-math>$\\\\mathcal {O}\\\\left ({{ 1 }}\\\\right)$ </tex-math></inline-formula>\\n forwarding entries in an N-node edge system with the network radius \\n<inline-formula> <tex-math>$\\\\tau $ </tex-math></inline-formula>\\n, while the successive ensuring data delivery. Its worst-case network delay \\n<inline-formula> <tex-math>$\\\\mathcal {O}\\\\left ({{ \\\\tau \\\\log N }}\\\\right)$ </tex-math></inline-formula>\\n is also much better than \\n<inline-formula> <tex-math>${\\\\mathcal {O}\\\\left ({{ \\\\tau N^{\\\\alpha } }}\\\\right)},\\\\alpha \\\\mathrm {\\\\in }(0,1)$ </tex-math></inline-formula>\\n, the worst case of the baselines using Euclidean spaces. Evaluation on various network topologies also shows that our KMSharing effectively reduces network delays and indexing costs than existing data-sharing schemes.\",\"PeriodicalId\":13443,\"journal\":{\"name\":\"IEEE/ACM Transactions on Networking\",\"volume\":\"32 6\",\"pages\":\"5440-5458\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE/ACM Transactions on Networking\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10713188/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE/ACM Transactions on Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10713188/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
KMSharing: The Framework and Space Abstraction for Efficient Data Sharing at the Network Edge
Edge storage promises to be crucial for edge computing infrastructure, which enables users to access data within a low delay from widespread storage nodes at the network edge. The key challenge is how to integrate massive geographically distributed weak edge nodes to form an efficient storage system, enabling users to launch data operations from any node or retrieve the desired data across the entire distributed system. To address this data-sharing problem, researchers from both the traditional peer-to-peer (P2P) overlay networking and emerging edge computing fields have proposed some decentralized indexing mechanisms. However, existing studies lack insightful descriptions and analyses about the nature of the data-sharing problem at the network edge. It motivates us to rethink the edge data-sharing framework and provide the problem reformulation for analyzing the limitations of existing schemes. We reveal that the existing data-sharing schemes fail in complex network topologies which can be regarded as high-dimensional network spaces beyond the representation of low-dimensional Euclidean spaces or other existing hash spaces. A better space abstraction is an urgent need to alleviate the performance degradation due to the dimensional mismatch between network spaces and virtual spaces. To fill this gap, this paper proposes the Kautz metric space, a novel space abstraction extended from Kautz graphs, where the coordinates and the metric are defined as Kautz strings and Kautz distances (i.e., the shortest distances in undirected Kautz graphs), respectively. We design a dynamic programming algorithm to directly compute the Kautz distances. Then, we propose KMSharing, an efficient edge data-sharing scheme: both nodes and data are represented in a Kautz metric space, where the Kautz distance of any two Kautz strings reflects the network delay of the corresponding nodes. The workflow of KMSharing consists of three core components: the virtual address allocation represents edge nodes in the Kautz metric space; the data-to-node mapping ensures the uniqueness of target nodes; and forwarding table construction ensures the data delivery. Theoretical analyses confirm that KMSharing ideally achieves
$\mathcal {O}\left ({{ \tau }}\right)$
network delays,
$\mathcal {O}\left ({{ \log N }}\right)$
overlay hops, and
$\mathcal {O}\left ({{ 1 }}\right)$
forwarding entries in an N-node edge system with the network radius
$\tau $
, while the successive ensuring data delivery. Its worst-case network delay
$\mathcal {O}\left ({{ \tau \log N }}\right)$
is also much better than
${\mathcal {O}\left ({{ \tau N^{\alpha } }}\right)},\alpha \mathrm {\in }(0,1)$
, the worst case of the baselines using Euclidean spaces. Evaluation on various network topologies also shows that our KMSharing effectively reduces network delays and indexing costs than existing data-sharing schemes.
期刊介绍:
The IEEE/ACM Transactions on Networking’s high-level objective is to publish high-quality, original research results derived from theoretical or experimental exploration of the area of communication/computer networking, covering all sorts of information transport networks over all sorts of physical layer technologies, both wireline (all kinds of guided media: e.g., copper, optical) and wireless (e.g., radio-frequency, acoustic (e.g., underwater), infra-red), or hybrids of these. The journal welcomes applied contributions reporting on novel experiences and experiments with actual systems.