O. Aidel, A. Cavalli, Hélène Cordier, C. L'Orphelin, Gilles Mathieu, A. Pagano, S. Reynaud
{"title":"CIC portal: a collaborative and scalable integration platform for high availability grid operations","authors":"O. Aidel, A. Cavalli, Hélène Cordier, C. L'Orphelin, Gilles Mathieu, A. Pagano, S. Reynaud","doi":"10.1109/GRID.2007.4354124","DOIUrl":null,"url":null,"abstract":"EGEE, along with its sister project LCG, manages the world's largest grid production infrastructure which is spreading nowadays over 260 sites in more than 40 countries. Just as building such a system requires novel approaches; its management also requires innovation. From an operational point of view, the first challenge we face is to provide scalable procedures and tools able to monitor the ever expanding infrastructure and the constant evolution of the needs. The second is to ensure that all these tools strongly interact with one another, even though their development is spread out worldwide. Consequently, our goal is to provide a homogeneous way to access tools and analyze data for daily operational needs. To implement this concept into LCG/EGEE infrastructure management tools, 1N2P3 Computing Centre proposed a web portal, named \"CIC operations portal\", conceived and built as an integration platform for existing features and new requirements. Firstly, we describe the initial needs that led us to the present architecture of this portal. We then emphasize a specific feature for the operations efficiency which is the web interface dedicated to EGEE overall daily monitoring. We also deal with the high availability mechanism put in place by INFN-CNAF to address failover and replication issues. We finally present how the CIC portal has become one of the essential EGEE and LCG core services.","PeriodicalId":304508,"journal":{"name":"2007 8th IEEE/ACM International Conference on Grid Computing","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 8th IEEE/ACM International Conference on Grid Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GRID.2007.4354124","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
EGEE, along with its sister project LCG, manages the world's largest grid production infrastructure which is spreading nowadays over 260 sites in more than 40 countries. Just as building such a system requires novel approaches; its management also requires innovation. From an operational point of view, the first challenge we face is to provide scalable procedures and tools able to monitor the ever expanding infrastructure and the constant evolution of the needs. The second is to ensure that all these tools strongly interact with one another, even though their development is spread out worldwide. Consequently, our goal is to provide a homogeneous way to access tools and analyze data for daily operational needs. To implement this concept into LCG/EGEE infrastructure management tools, 1N2P3 Computing Centre proposed a web portal, named "CIC operations portal", conceived and built as an integration platform for existing features and new requirements. Firstly, we describe the initial needs that led us to the present architecture of this portal. We then emphasize a specific feature for the operations efficiency which is the web interface dedicated to EGEE overall daily monitoring. We also deal with the high availability mechanism put in place by INFN-CNAF to address failover and replication issues. We finally present how the CIC portal has become one of the essential EGEE and LCG core services.