{"title":"An architecture for a scalable, high-performance digital library","authors":"R. Grossman, X. Qin, W. Xu, H. Hulen, T. Tyler","doi":"10.1109/MASS.1995.528219","DOIUrl":"https://doi.org/10.1109/MASS.1995.528219","url":null,"abstract":"Requirements for a high-performance, scalable digital library of multimedia data are presented together with a layered architecture for a system that addresses the requirements. The approach is to view digital data as persistent collections of complex objects and to use lightweight object management to manage this data. To scale as the amount of data increases, the object management component is layered over a storage management component. The storage management component supports hierarchical storage, third-party data transfer and parallel input-output. Several issues that arise from the interface between the storage management and object management components are discussed. The authors have developed a prototype of a digital library using this design. Two key components of the prototype are AIM Net and HPSS. AIM Net is a persistent object manager and is a product of Oak Park Research. HPSS is the High Performance Storage System, developed by a collaboration including IBM Government Systems and several national labs.","PeriodicalId":345074,"journal":{"name":"Proceedings of IEEE 14th Symposium on Mass Storage Systems","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121824259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A framework for understanding large scale digital storage systems","authors":"J. Berry","doi":"10.1109/MASS.1995.528239","DOIUrl":"https://doi.org/10.1109/MASS.1995.528239","url":null,"abstract":"The digital revolution is now underway. The use of binary zeros and ones to store data is increasing at a steady rate. They may represent text, images, pictures, sounds, maps, books, music, instructions, programs, or just about anything else which can be represented digitally. As the sizes of the digital data holdings have continued to grow, so too has the need to provide meaningful access to this data. There are a number of efforts now underway to provide such access. In most cases the efforts have been domain specific and progress in one area has been hard to replicate in a different domain. Part of this difficulty has been the lack of a general set of concepts and vocabulary that are sufficiently broad enough to bridge the gaps. The paper presents a general taxonomy of knowledge that is independent of subject matter domain. It begins with knowledge as the most general class and then proceeds to subdivide knowledge into its constituent parts: factual knowledge, procedural knowledge, and judgmental knowledge. Definitions of each type of knowledge are given along with examples sufficient to understand each subclass. A vocabulary is introduced that provides a means to discuss the topic in a manner independent of a specific problem domain. Understanding of the differences between different types or classes of knowledge is necessary if a person or an organization is to begin to build systems that acquire, organize, store, and retrieve various types of knowledge. The paper concludes with a discussion of some tools that are currently available to assist in the building and maintaining of a knowledge resource.","PeriodicalId":345074,"journal":{"name":"Proceedings of IEEE 14th Symposium on Mass Storage Systems","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123746866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The parallel I/O architecture of the high-performance storage system (HPSS)","authors":"R. Watson, R. Coyne","doi":"10.1109/MASS.1995.528214","DOIUrl":"https://doi.org/10.1109/MASS.1995.528214","url":null,"abstract":"Datasets up to terabyte size and petabyte total capacities have created a serious imbalance between I/O and storage-system performance and system functionality. One promising approach is the use of parallel data-transfer techniques for client access to storage, peripheral-to-peripheral transfers, and remote file transfers. This paper describes the parallel I/O architecture and mechanisms, parallel transport protocol (PTP), parallel FTP, and parallel client application programming interface (API) used by the high-performance storage system (HPSS). Parallel storage integration issues with a local parallel file system are also discussed.","PeriodicalId":345074,"journal":{"name":"Proceedings of IEEE 14th Symposium on Mass Storage Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130403697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}