Guoliang Liu, Liuying Ma, Pengfei Yan, Shuai Zhang, L. Liu
2014 9th IEEE International Conference on Networking, Architecture, and Storage. Published 2014-08-06. DOI: 10.1109/NAS.2014.24 (https://doi.org/10.1109/NAS.2014.24)
Design and Implementation of GeoFS: A Wide-Area File System
We propose GeoFS, a POSIX-compliant, wide-area distributed file system for sharing files between sites. GeoFS uses FUSE to provide standard file system interfaces to applications, and it lets users control consistency and replication via extended attributes. In the era of big data, traditional file systems handle updates to large directories (i.e., directories containing a huge number of files) poorly: if even a small fraction of a directory changes, the entire cached copy of the directory metadata must be discarded and a new copy obtained from the remote server, resulting in poor performance. We address this issue by partitioning metadata into blocks and transferring only the modified block(s) over the network. GeoFS also supports client caching, prefetching, parallel reads, and compression, making it suitable for networks with high latency and low bandwidth. Performance tests demonstrate that GeoFS outperforms NFS in a wide-area environment.
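The block-partitioning idea can be illustrated with a minimal Python sketch. The abstract does not say how GeoFS forms its metadata blocks or detects which ones changed, so everything here is an assumption made for illustration: entries are bucketed by a CRC32 hash of the file name, each block is summarized by a SHA-256 digest, and the names `NUM_BLOCKS`, `partition`, `digest`, and `refresh` are hypothetical rather than part of GeoFS.

```python
import hashlib
import zlib

NUM_BLOCKS = 8  # hypothetical block count; the abstract gives no figure


def block_index(name, num_blocks=NUM_BLOCKS):
    """Map a file name to a block with a stable hash (assumed scheme)."""
    return zlib.crc32(name.encode("utf-8")) % num_blocks


def partition(entries, num_blocks=NUM_BLOCKS):
    """Split a {name: size} directory listing into metadata blocks."""
    blocks = [{} for _ in range(num_blocks)]
    for name, size in entries.items():
        blocks[block_index(name, num_blocks)][name] = size
    return blocks


def digest(block):
    """Summarize one block so two copies can be compared cheaply."""
    h = hashlib.sha256()
    for name in sorted(block):
        h.update(f"{name}\0{block[name]}\n".encode("utf-8"))
    return h.hexdigest()


def refresh(cached_blocks, server_blocks):
    """Replace only the stale cached blocks; return their indices."""
    stale = [
        i
        for i, (cached, remote) in enumerate(zip(cached_blocks, server_blocks))
        if digest(cached) != digest(remote)
    ]
    for i in stale:
        cached_blocks[i] = dict(server_blocks[i])  # the only data "transferred"
    return stale


# A directory with 1000 files, cached on the client.
directory = {f"file{i:04d}.dat": i for i in range(1000)}
client_cache = partition(directory)

# One file is added on the server; only its block needs to be re-sent.
directory["new_file.dat"] = 42
stale_blocks = refresh(client_cache, partition(directory))
print(f"blocks transferred: {len(stale_blocks)} of {NUM_BLOCKS}")
```

In this sketch a single change invalidates one block instead of the whole directory cache, which is the effect the abstract describes; in a real wide-area deployment the digests would be compared across the network rather than in one process.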