Swinburne
Browse

Semantics based buffer reduction for queries over XML data streams

Download (103.91 kB)
conference contribution
posted on 2024-07-09, 19:15 authored by Chi Yang, Chengfei LiuChengfei Liu, Jianxin Li, Jeffrey Xu Yu, Junhu Wang
With respect to current methods for query evaluation over XML data streams, adoption of certain types of buffering techniques is unavoidable. Under lots of circumstances, the buffer scale may increase exponentially, which can cause memory bottleneck. Some optimization techniques have been proposed to solve the problem. However, the limit of these techniques has been defined by a concurrency lower bound and been theoretically proved. In this paper, we show through an empirical study that this lower bound can be broken by taking semantic information into account for buffer reduction. To demonstrate this, we build a SAX-based XML stream query evaluation system and design an algorithm that consumes buffers in line with the concurrency lower bound. After a further analysis of the lower bound, we design several semantic rules for the purpose of breaking the lower bound and incorporate these rules in the lower bound algorithm. Experiments are conducted to show that the algorithms deploying semantic rules individually and collectively all significantly outperform the lower bound algorithm that does not consider semantic information.

History

Available versions

PDF (Published version)

ISSN

1445-1336

Conference name

Conferences in Research and Practice in Information Technology Series

Location

Wollongong

Start date

2008-01-22

End date

2008-01-25

Volume

75

Issue

PART 1

Pagination

8 pp

Publisher

Australian Computer Society

Copyright statement

Copyright © 2008 Australian Computer Society, Inc. This paper appeared at the Nineteenth Australasian Database Conference (ADC2008), Wollongong, Australia, January 2008. Conferences in Research and Practice in Information Technology, Vol. 75. Alan Fekete and Xuemin Lin, Eds. Reproduction for academic, notfor- profit purposes permitted provided this text is included.

Notes

This paper was named Best Student Paper at the 19th Australasian Database Conference (ADC 2008).

Language

eng

Usage metrics

    Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC