ObjTables: Structured Spreadsheets That Promote Data Quality, Reuse, and Integration

CoRR (2020)

Abstract
A central challenge in science is to understand how system behaviors emerge from complex networks. This often requires aggregating, reusing, and integrating heterogeneous information. Supplementary spreadsheets to articles are a key data source. Spreadsheets are popular because they are easy to read and write. However, spreadsheets are often difficult to reanalyze because they capture data ad hoc without schemas that define the objects, relationships, and attributes that they represent. To help researchers reuse and compose spreadsheets, we developed ObjTables, a toolkit that makes spreadsheets human- and machine-readable by combining spreadsheets with schemas and an object-relational mapping system. ObjTables includes a format for schemas; markup for indicating the class and attribute represented by each spreadsheet and column; numerous data types for scientific information; and high-level software for using schemas to read, write, validate, compare, merge, revision, and analyze spreadsheets. By making spreadsheets easier to reuse, ObjTables could enable unprecedented secondary meta-analyses. By making it easy to build new formats and associated software for new types of data, ObjTables can also accelerate emerging scientific fields.
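
The idea of pairing a spreadsheet with a schema can be illustrated with a minimal sketch. It does not use the ObjTables software or its actual schema format; the `Attribute` class, the `REACTION_SCHEMA` list, the `read_table` function, and the sample sheet are all hypothetical names and data chosen for illustration. The sketch only shows how a schema that maps each column to a typed attribute lets software parse rows into objects and report validation errors.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class Attribute:
    """One column of a table, as described by a (hypothetical) schema."""
    column: str            # column heading in the spreadsheet
    py_type: type          # Python type used to parse and validate each cell
    required: bool = True  # whether an empty cell counts as an error


# Hypothetical schema for a "Reaction" class; not the ObjTables schema format.
REACTION_SCHEMA = [
    Attribute("Id", str),
    Attribute("Rate", float),
    Attribute("Comment", str, required=False),
]


def read_table(text, schema):
    """Parse CSV text into a list of dicts and collect validation errors."""
    objects, errors = [], []
    reader = csv.DictReader(io.StringIO(text))
    for row_num, row in enumerate(reader, start=2):  # row 1 holds the headings
        obj = {}
        for attr in schema:
            cell = (row.get(attr.column) or "").strip()
            if not cell:
                if attr.required:
                    errors.append(f"row {row_num}: missing {attr.column}")
                obj[attr.column] = None
                continue
            try:
                obj[attr.column] = attr.py_type(cell)
            except ValueError:
                errors.append(
                    f"row {row_num}: {attr.column}={cell!r} "
                    f"is not {attr.py_type.__name__}"
                )
                obj[attr.column] = None
        objects.append(obj)
    return objects, errors


# Made-up sample sheet: one valid row, one bad number, one missing identifier.
SHEET = "Id,Rate,Comment\nr1,0.5,fast\nr2,abc,\n,1.0,no id\n"

objects, errors = read_table(SHEET, REACTION_SCHEMA)
for message in errors:
    print(message)
```

With a schema in hand, the same column definitions could in principle drive writing, comparing, or merging tables as well, which is the kind of reuse the abstract describes.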
Keywords
spreadsheets, data quality