Distance-based Representative Skyline

 

Yufei Tao, Ling Ding, Xuemin Lin, and Jian Pei

 

In International Conference on Data Engineering (ICDE), 2009

 
Abstract


Given an integer k, a representative skyline contains the k skyline points that best describe the tradeoffs among different dimensions offered by the full skyline. Although this topic has been previously studied, the existing solution may sometimes produce k points that appear in an arbitrarily tiny cluster, and therefore, fail to be representative. Motivated by this, we propose a new definition of representative skyline that minimizes the distance between a non-representative skyline point and its nearest representative. We also study algorithms for computing distance-based representative skylines. In 2D space, there is a dynamic programming algorithm that guarantees the optimal solution. For dimensionality at least 3, we prove that the problem is NP-hard, and give a 2-approximate polynomial time algorithm. Using a multidimensional access method, our algorithm can directly report the representative skyline, without retrieving the full skyline. We show that our representative skyline not only better captures the contour of the entire skyline than the previous method, but also can be computed much faster.




 

Paper download

     
 
Implementation and datasets

Please read the terms of using our implementation.
 
Binary and data.
Download. The package includes the NBA and Island datasets. The anti-correlated datasets were generated following the standard approach described in reference [1]. All executables accept command-line inputs.

Source.
Download 1. Implementation by Yufei Tao, including R-tree bulkloader, I-greedy, and the algorithm of finding 2D optimal max-dominance representative skylines.
Download 2. Implementation by Ling Ding, including the algorithms of computing 2D optimal representative skylines, and greedy max-dominance representative skylines.

 

Supporting grants

CUHK 1202/06 and CUHK 4161/07 from HKRGC.
 
 

Back to Yufei's home, or publication list