Ian T Foster, J. Gieraltowski, Scott Gose, N. Maltsev, E. May, Alex Rodriguez, Dinanath Sulakhe, A. Vaniachine, J. Shank, S. Youssef, D. Adams, R. Baker, W. Deng, J. Smith, Dantong Yu, I. Legrand, Suresh Singh, C. Steenberg, Yang Xia, M. Afaq, E. Berman, J. Annis, L. Bauerdick, M. Ernst, I. Fisk, L. Giacchetti, G. Graham, A. Heavey, J. Kaiser, N. Kuropatkin, R. Pordes, V. Sekhri, J. Weigand, Yujun Wu, Keith Baker, Lawrence Sorrillo, J. Huth, Matthew Allen, L. Grundhoefer, J. Hicks, F. Luehring, S. Peck, R. Quick, Stephen C. Simms, G. Fekete, Jan vandenBerg, Kihyeon Cho, Kihwan Kwon, Dongchul Son, Hyoungwoo Park, S. Canon, K. Jackson, D. Konerding, Jason R. Lee, D. Olson, I. Sakrejda, B. Tierney, Mark L. Green, Russ Miller, J. Letts, T. Martin, David Bury, C. Dumitrescu, D. Engh, R. Gardner, M. Mambelli, Y. Smirnov, Jens-S. Vöckler, M. Wilde, Yong Zhao, Xin Zhao, P. Avery, R. Cavanaugh, Bockjoo Kim, C. Prescott, J. Rodriguez, A. Zahn, S. McKee, C. Jordan, James E. Prewett, T. Thomas, H. Severini, Ben Cliff
{"title":"The Grid2003 production grid: principles and practice","authors":"Ian T Foster, J. Gieraltowski, Scott Gose, N. Maltsev, E. May, Alex Rodriguez, Dinanath Sulakhe, A. Vaniachine, J. Shank, S. Youssef, D. Adams, R. Baker, W. Deng, J. Smith, Dantong Yu, I. Legrand, Suresh Singh, C. Steenberg, Yang Xia, M. Afaq, E. Berman, J. Annis, L. Bauerdick, M. Ernst, I. Fisk, L. Giacchetti, G. Graham, A. Heavey, J. Kaiser, N. Kuropatkin, R. Pordes, V. Sekhri, J. Weigand, Yujun Wu, Keith Baker, Lawrence Sorrillo, J. Huth, Matthew Allen, L. Grundhoefer, J. Hicks, F. Luehring, S. Peck, R. Quick, Stephen C. Simms, G. Fekete, Jan vandenBerg, Kihyeon Cho, Kihwan Kwon, Dongchul Son, Hyoungwoo Park, S. Canon, K. Jackson, D. Konerding, Jason R. Lee, D. Olson, I. Sakrejda, B. Tierney, Mark L. Green, Russ Miller, J. Letts, T. Martin, David Bury, C. Dumitrescu, D. Engh, R. Gardner, M. Mambelli, Y. Smirnov, Jens-S. Vöckler, M. Wilde, Yong Zhao, Xin Zhao, P. Avery, R. Cavanaugh, Bockjoo Kim, C. Prescott, J. Rodriguez, A. Zahn, S. McKee, C. Jordan, James E. Prewett, T. Thomas, H. Severini, Ben Cliff","doi":"10.1109/HPDC.2004.36","DOIUrl":null,"url":null,"abstract":"The Grid2003 Project has deployed a multivirtual organization, application-driven grid laboratory (\"Grid3\") that has sustained for several months the production-level services required by physics experiments of the Large Hadron Collider at CERN (ATLAS and CMS), the Sloan Digital Sky Survey project, the gravitational wave search experiment LIGO, the BTeV experiment at Fermilab, as well as applications in molecular structure analysis and genome analysis, and computer science research projects in such areas as job and data scheduling. The deployed infrastructure has been operating since November 2003 with 27 sites, a peak of 2800 processors, work loads from 10 different applications exceeding 1300 simultaneous jobs, and data transfers among sites of greater than 2 TB/day. We describe the principles that have guided the development of this unique infrastructure and the practical experiences that have resulted from its creation and use. We discuss application requirements for grid services deployment and configuration, monitoring infrastructure, application performance, metrics, and operational experiences. We also summarize lessons learned.","PeriodicalId":446429,"journal":{"name":"Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"152","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPDC.2004.36","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 152
Abstract
The Grid2003 Project has deployed a multivirtual organization, application-driven grid laboratory ("Grid3") that has sustained for several months the production-level services required by physics experiments of the Large Hadron Collider at CERN (ATLAS and CMS), the Sloan Digital Sky Survey project, the gravitational wave search experiment LIGO, the BTeV experiment at Fermilab, as well as applications in molecular structure analysis and genome analysis, and computer science research projects in such areas as job and data scheduling. The deployed infrastructure has been operating since November 2003 with 27 sites, a peak of 2800 processors, work loads from 10 different applications exceeding 1300 simultaneous jobs, and data transfers among sites of greater than 2 TB/day. We describe the principles that have guided the development of this unique infrastructure and the practical experiences that have resulted from its creation and use. We discuss application requirements for grid services deployment and configuration, monitoring infrastructure, application performance, metrics, and operational experiences. We also summarize lessons learned.