Maintaining Sliding Window Skylines on Data Streams

 

Yufei Tao, and Dimitris Papadias

 

IEEE Transactions on Knowledge and Data Engineering (TKDE), 18(3): 377-391, 2006

 
Abstract


The skyline of a multi-dimensional dataset contains the “best” tuples according to any preference function that is monotonic on each dimension. Although skyline computation has received considerable attention in conventional databases, the existing algorithms are inapplicable to stream applications because (i) they assume static data that are stored in the disk (rather than continuously arriving/expiring), (ii) they focus on “one-time” execution that returns a single skyline (in contrast to constantly tracking skyline changes), and (iii) they aim at reducing the I/O overhead (as opposed to minimizing the CPU-cost and main-memory consumption).

This paper studies skyline computation in stream environments, where query processing takes into account only a “sliding window” covering the most recent tuples. We propose algorithms that continuously monitor the incoming data and maintain the skyline incrementally. Our techniques utilize several interesting properties of stream skylines to improve space/time efficiency by expunging data from the system as early as possible (i.e., before their expiration). Furthermore, we analyze the asymptotical performance of the proposed solutions, and evaluate their efficiency with extensive experiments.
 

Paper download

  
 
Implementation and datasets 

Before you proceed with downloading, please read and agree to the terms of using our implementation.

Download our source codes (implemented: by Yufei Tao).
Download our dataset generators (implemented: by Yufei Tao). 
 

 

Back to Yufei's home, or publication list