Pairs and Pairix: a file format and a tool for efficient storage and retrieval for Hi-C read pairs
bioRxiv (2021)
Abstract
Summary: As the amount of three-dimensional chromosomal interaction data continues to increase, storing and accessing such data efficiently becomes paramount. We introduce Pairs, a block-compressed text file format for storing paired genomic coordinates from Hi-C data, and Pairix, an open-source C application to index and query Pairs files. Pairix (also available in Python and R) extends the functionality of Tabix to paired-coordinate data. We have also developed PairsQC, a collapsible HTML quality control report generator for Pairs files.
Availability: The format specification and source code are available at , and .
Contact: peter_park{at}hms.harvard.edu or burak_alver{at}hms.harvard.edu
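To make the idea of querying paired coordinates concrete, the following is a minimal Python sketch of a two-dimensional range query over Pairs-style records. It is not Pairix itself: Pairix answers such queries through an index on a block-compressed file, whereas this sketch does a naive linear scan over plain text lines. The column layout (readID, chr1, pos1, chr2, pos2, strand1, strand2), the header lines, and the names `Pair`, `parse_pairs`, and `query_2d` are assumptions made for illustration and are not taken from the abstract.

```python
from typing import Iterable, Iterator, List, NamedTuple


class Pair(NamedTuple):
    """One read pair; the seven-column layout is an assumption for this sketch."""
    read_id: str
    chr1: str
    pos1: int
    chr2: str
    pos2: int
    strand1: str
    strand2: str


def parse_pairs(lines: Iterable[str]) -> Iterator[Pair]:
    """Yield Pair records from tab-separated text, skipping '#' header lines."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        f = line.split("\t")
        yield Pair(f[0], f[1], int(f[2]), f[3], int(f[4]), f[5], f[6])


def query_2d(pairs: Iterable[Pair],
             chr1: str, start1: int, end1: int,
             chr2: str, start2: int, end2: int) -> List[Pair]:
    """Return pairs whose first mate falls in chr1:start1-end1 and whose
    second mate falls in chr2:start2-end2 (inclusive bounds, linear scan)."""
    return [
        p for p in pairs
        if p.chr1 == chr1 and start1 <= p.pos1 <= end1
        and p.chr2 == chr2 and start2 <= p.pos2 <= end2
    ]


# Hypothetical example records, invented for illustration only.
EXAMPLE = [
    "## pairs format v1.0",
    "#columns: readID chr1 pos1 chr2 pos2 strand1 strand2",
    "r1\tchr1\t10000\tchr1\t20000\t+\t-",
    "r2\tchr1\t50000\tchr2\t30000\t+\t+",
    "r3\tchr1\t60000\tchr2\t900000\t-\t+",
]

hits = query_2d(parse_pairs(EXAMPLE), "chr1", 40000, 70000, "chr2", 1, 100000)
for p in hits:
    print(p.read_id, p.chr1, p.pos1, p.chr2, p.pos2)
```

A linear scan like this costs time proportional to the file size for every query, which is the cost that an index over both coordinates is meant to avoid on large Hi-C datasets.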
### Competing Interest Statement
The authors have declared no competing interest.
Keywords
file format,efficient storage,pairix