CMU-CS-98-137
Computer Science Department
School of Computer Science, Carnegie Mellon University



CMU-CS-98-137

Selectivity Estimation of Window Queries for Line Segment Datasets

Guido Proietti*, Christos Faloutsos

May 1998

CMU-CS-98-137.ps


Keywords: Spatial databases, GIS, line segment data, range queries, selectivity estimation


Despite of the fact that large line segment datasets are becoming more and more popular, most of the analysis for estimating the selectivity of window queries posed spatial data - the most important parameter for query optimization - has focused on point or region data only.

In this paper we move one significant step forward in line segment datasets theoretical analysis. We discovered that real lines closely follow a distribution law, that we named the SLED law (Segment LEngth Distribution). The SLED law can be used for an accurate estimation of the selectivity of window queries. Experiments on a variety of real line segment datasets (hydrographic systems, roadmaps, railroads, utilities networks) show that our law holds and that our formula is extremely accurate, enjoying a maximum relative error of 4% in estimating the selectivity.

15 pages

*On leave from Dipartimento di Matematica Pura ed Applicata, University of L'Aquila, Via Vetoio, I-67010, Italy.


Return to: SCS Technical Report Collection
School of Computer Science homepage

This page maintained by reports@cs.cmu.edu