Swinburne
Browse
- No file added yet -

Estimating selectivity for joined RDF triple patterns

Download (294.68 kB)
conference contribution
posted on 2024-07-11, 07:14 authored by Hai Huang, Chengfei LiuChengfei Liu
A fundamental problem related to RDF query processing is selectivity estimation, which is crucial to query optimization for determining a join order of RDF triple patterns. In this paper we focus research on selectivity estimation for SPARQL graph patterns. The previous work takes the join uniformity assumption when estimating the joined triple patterns. This assumption would lead to highly inaccurate estimations in the cases where properties in SPARQL graph patterns are correlated. We take into account the dependencies among properties in SPARQL graph patterns and propose a more accurate estimation model. Since star and chain query patterns are common in SPARQL graph patterns, we first focus on these two basic patterns and propose to use Bayesian network and chain histogram respectively for estimating the selectivity of them. Then, for estimating the selectivity of an arbitrary SPARQL graph pattern, we design algorithms for maximally using the precomputed statistics of the star paths and chain paths. The experiments show that our method outperforms existing approaches in accuracy.

Funding

XML Views of Relational Databases: Semantics and Update Problems

Australian Research Council

Find out more...

Effective and efficient keyword search for relevant entities over Extensible Markup Language (XML) data

Australian Research Council

Find out more...

History

Available versions

PDF (Accepted manuscript)

ISBN

9781450307178

Journal title

Proceedings of the 20th ACM international conference on Information and knowledge management - CIKM '11

Conference name

ACM International Conference on Information and Knowledge Management

Location

Glasgow

Start date

2011-10-24

End date

2011-10-28

Pagination

9 pp

Publisher

ACM

Copyright statement

Copyright © 2011 ACM. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Proceedings of CIKM (2011) http://doi.acm.org/10.1145/2063576.2063784

Language

eng

Usage metrics

    Publications

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC