Globus XIO pipe open driver: enabling GridFTP to leverage standard Unix tools

TG '11: Proceedings of the 2011 TeraGrid Conference: Extreme Digital Discovery(2011)

引用 8|浏览0
暂无评分
摘要
Scientific research creates substantially large volumes of data throughout the processes of discovery and analysis. Given the necessity for data sharing and data relocation, members of the scientific community are often faced with a productivity loss that correlates with the time cost incurred during the data transfer process. The GridFTP protocol was developed to improve this situation by addressing the performance, reliability, and security limitations of standard FTP and other commonly used data movement tools such as SCP. The Globus implementation of GridFTP is widely used to rapidly and reliably move data between geographically distributed systems. Traditionally, GridFTP performs well for datasets containing large files. When the data is partitioned into many small files, however, it suffers from lower transfer rates. Although the pipelining and concurrency solution in GridFTP provides improved transfer rates for datasets using lots-of-small-files, these solutions cannot be applied in environments that have strict firewall rules. In some cases, tarring the files in a dataset on the fly will help; in other cases, a checksum of the files after they are written to disk is desired. In this paper, we present the Globus XIO Pipe Open Driver which enables GridFTP to leverage the standard Unix tools to perform these tasks. We demonstrate the effectiveness of this functionality through several experiments.
更多
查看译文
关键词
large file,standard unix tool,large volume,globus implementation,gridftp protocol,globus xio pipe open,lower transfer rate,open driver,data movement tool,enabling gridftp,data relocation,data transfer process,transfer rate,globus xio pipe,data transfer,pipe,checksum,scientific research
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要