Making Aggregation Work in Uncertain and Probabilistic Databases

IEEE Transactions on Knowledge and Data Engineering(2011)

引用 65|浏览1
暂无评分
摘要
We describe how aggregation is handled in the Trio system for uncertain and probabilistic data. Because “exact” aggregation in uncertain databases can produce exponentially sized results, we provide three alternatives: a low bound on the aggregate value, a high bound on the value, and the expected value. These variants return a single result instead of a set of possible results, and they are generally efficient to compute for both full-table and grouped aggregation queries. We provide formal definitions and semantics and a description of our open source implementation for single-table aggregation queries. We study the performance and scalability of our algorithms through experiments over a large synthetic data set. We also provide some preliminary results on aggregations over joins.
更多
查看译文
关键词
query processing,statistical databases,Trio system,aggregation work,full-table aggregation queries,grouped aggregation queries,probabilistic databases,uncertain databases,Database management,query processing.
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要