Efficient Temporal Counting with Bounded Error

 

Yufei Tao, Xiaokui Xiao

 

To appear in Very Large Data Base Journal (VLDBJ)

 
  Abstract

  
  This paper studies aggregate search in transaction time databases. Specifically, each object in such a database can be modeled as a horizontal segment, whose y-projection is its search key, and its x-projection represents the period when the key was valid in history. Given a query timestamp qt and a key range qk, a count-query retrieves the number of objects that are alive at qt, and their keys fall in qk. We provide a method that accurately answers such queries, with error less than 1/ε + ε×Nalive(qt), where Nalive(qt) is the number of objects alive at time qt, and ε is any constant in (0, 1]. Denoting the disk page size as B, and n = N / B, our technique requires O(n) space, processes any query in O(logBn) time, and supports each update in O(logBn) amortized I/Os. As demonstrated by extensive experiments, the proposed solutions guarantee query results with extremely high precision (median relative error below 5%), while consuming only a fraction of the space occupied by the existing approaches that promise precise results.




 

  Paper download

  
 
  Implementation and datasets
 
  Before you proceed with downloading, please read and agree to the terms of using our implementation.

  The real datasets used to generate data for our experiments: CA and LB. (The generation is described in the paper)
  Codes: Zoning (the pre-processing module for Ancseg), Ancseg, MVB-tree. These implementations are accomplished by Xiaokui Xiao.
   
 
 

Back to Yufei's home, or publication list