Hierarchically organized skew-tolerant histograms for geographic data objects

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    18 Citations (Scopus)

    Abstract

    Histograms have been widely used for fast estimation of query result sizes in query optimization. In this paper, we propose a new histogram method, called the Skew-Tolerant Histogram (STHistogram) for two or three dimensional geographic data objects that are used in many real-world applications in practice. The proposed method provides a significantly enhanced accuracy in a robust manner even for the data set that has a highly skewed distribution. Our method detects hotspots present in various parts of a data set and exploits them in organizing histogram buckets. For this purpose, we first define the concept of a hotspot, and provide an algorithm that efficiently extracts hotspots from the given data set. Then, we present our histogram construction method that utilizes hotspot information. We also describe how to estimate query result sizes by using the proposed histogram. We show through extensive performance experiments that the proposed method provides better performance than other existing methods.

    Original languageEnglish
    Title of host publicationProceedings of the 2010 International Conference on Management of Data, SIGMOD '10
    Pages627-638
    Number of pages12
    DOIs
    Publication statusPublished - 2010
    Event2010 International Conference on Management of Data, SIGMOD '10 - Indianapolis, IN, United States
    Duration: 2010 Jun 62010 Jun 11

    Publication series

    NameProceedings of the ACM SIGMOD International Conference on Management of Data
    ISSN (Print)0730-8078

    Other

    Other2010 International Conference on Management of Data, SIGMOD '10
    Country/TerritoryUnited States
    CityIndianapolis, IN
    Period10/6/610/6/11

    Keywords

    • histograms
    • query optimization
    • spatial databases

    ASJC Scopus subject areas

    • Software
    • Information Systems

    Fingerprint

    Dive into the research topics of 'Hierarchically organized skew-tolerant histograms for geographic data objects'. Together they form a unique fingerprint.

    Cite this