Yang Liu, Zina Ben-Miled, O. Bukhres, Michael Bem, Robert Jones, Robert J. Oppelt
{"title":"Efficient schema design for a pharmaceutical data repository","authors":"Yang Liu, Zina Ben-Miled, O. Bukhres, Michael Bem, Robert Jones, Robert J. Oppelt","doi":"10.1109/CBMS.2000.856907","DOIUrl":null,"url":null,"abstract":"Large scientific databases often consist of heterogeneous components that differ in the number of attributes and the number of tuples. In this paper, we show that increased performance is achieved when a complex schema is used to represent different components of a data repository. The driving application used in this study is a pharmaceutical drug repository. The types of access patterns associated with this repository consist of the retrieval of a selected set of attribute values for a consecutive or a random set of tuples. These consecutive and random access patterns, which are typical of a drug discovery experiment, are used to measure the data retrieval throughput for different schemas.","PeriodicalId":189930,"journal":{"name":"Proceedings 13th IEEE Symposium on Computer-Based Medical Systems. CBMS 2000","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 13th IEEE Symposium on Computer-Based Medical Systems. CBMS 2000","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2000.856907","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Large scientific databases often consist of heterogeneous components that differ in the number of attributes and the number of tuples. In this paper, we show that increased performance is achieved when a complex schema is used to represent different components of a data repository. The driving application used in this study is a pharmaceutical drug repository. The types of access patterns associated with this repository consist of the retrieval of a selected set of attribute values for a consecutive or a random set of tuples. These consecutive and random access patterns, which are typical of a drug discovery experiment, are used to measure the data retrieval throughput for different schemas.