{"title":"Hadoop MapReduce Framework to Implement Molecular Docking of Large-Scale Virtual Screening","authors":"Jing Zhao, Ruisheng Zhang, Zhili Zhao, Dianwei Chen, Lujie Hou","doi":"10.1109/APSCC.2012.67","journal":{"name":"2012 IEEE Asia-Pacific Services Computing Conference","volume":"54 1","pages":"0"},"publicationDate":"2012-12-06","publicationTypes":"Journal Article","isOpenAccess":false,"citationCount":"13","platform":"Semanticscholar","PeriodicalName":"2012 IEEE Asia-Pacific Services Computing Conference","ListUrlMain":"https://doi.org/10.1109/APSCC.2012.67"}
Hadoop MapReduce Framework to Implement Molecular Docking of Large-Scale Virtual Screening
Traditional grid-based virtual screening requires chemists to upload small-molecule files and collect the results manually; docking and result collection cannot be performed automatically, which imposes a heavy workload on chemists. In this paper, we take advantage of the Hadoop platform's capacity for massive data storage: small-molecule files and docking result files are stored and managed in HDFS. In addition, the MapReduce programming framework is used to run molecular docking in parallel and to preliminarily process the result files, thereby automating the molecular docking step of virtual screening. This work will help drug researchers by offering a massive data storage and management system for large-scale virtual screening, and it also provides a reference for drug discovery in cloud environments, promoting the development of computational chemistry e-science.
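The abstract does not give implementation details, so the following is only a minimal sketch of how a map step (dock one ligand) and a reduce step (rank the results) might fit together. It runs locally in plain Python rather than on Hadoop. Everything named here is an assumption made for illustration: `placeholder_dock` is a hypothetical stand-in for a real docking program and is not a scoring function, the `receptor_id` key and the `top_n` parameter are invented, and the convention that a lower score is better is assumed rather than taken from the paper.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

Record = Tuple[str, str]    # (ligand file name, ligand file contents)
Scored = Tuple[str, float]  # (ligand file name, docking score)


def placeholder_dock(ligand_text: str) -> float:
    """Hypothetical stand-in for a docking run (assumption, not real chemistry).

    It returns a deterministic number derived from the text so that the
    data flow can be exercised without a docking program.
    """
    return -float(sum(ord(c) for c in ligand_text) % 100) / 10.0


def map_dock(record: Record, receptor_id: str) -> Iterator[Tuple[str, Scored]]:
    """Map step: score one ligand and emit it under the receptor key."""
    name, text = record
    yield receptor_id, (name, placeholder_dock(text))


def reduce_rank(receptor_id: str, scored: Iterable[Scored],
                top_n: int) -> Tuple[str, List[Scored]]:
    """Reduce step: keep the top_n ligands, assuming lower score is better."""
    ranked = sorted(scored, key=lambda item: item[1])
    return receptor_id, ranked[:top_n]


def run_local(records: Iterable[Record], receptor_id: str,
              top_n: int = 2) -> Dict[str, List[Scored]]:
    """Simulate the shuffle between map and reduce in a single process."""
    groups: Dict[str, List[Scored]] = defaultdict(list)
    for record in records:
        for key, value in map_dock(record, receptor_id):
            groups[key].append(value)
    return dict(reduce_rank(key, values, top_n) for key, values in groups.items())


if __name__ == "__main__":
    ligands = [("a.mol2", "A"), ("b.mol2", "B"), ("c.mol2", "d")]
    print(run_local(ligands, "receptor-1", top_n=2))
```

In a real deployment the ligand files would be read from HDFS and the map step would invoke an actual docking program; the sketch only illustrates how per-ligand work can be keyed and then aggregated.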