Swinburne
Browse

Efficient batch processing for multiple keyword queries on graph data

Download (472.35 kB)
conference contribution
posted on 2024-07-11, 08:06 authored by Lu ChenLu Chen, Chengfei LiuChengfei Liu, Xiaochun Yang, Bin Wang, Jianxin Li, Rui ZhouRui Zhou
Recently, answering keyword queries on graph data has drawn a great deal of attention from database communities. However, most graph keyword search solutions proposed so far primarily focus on a single query setting. We observe that for a popular keyword query system, the number of keyword queries received could be substantially large even in a short time interval, and the chance that these queries share common keywords is quite high. Therefore, answering keyword queries in batches would significantly enhance the performance of the system. Motivated by this, this paper studies efficient batch processing for multiple keyword queries on graph data. Realized that finding both the optimal query plan for multiple queries and the optimal query plan for a single keyword query on graph data are computationally hard, we first propose two heuristic approaches which target maximizing keyword overlap and give preferences for processing keywords with short sizes. Then we devise a cardinality based cost estimation model that takes both graph data statistics and search semantics into account. Based on the model, we design an A* based algorithm to find the global optimal execution plan for multiple queries. We evaluate the proposed model and algorithms on two real datasets and the experimental results demonstrate their efficacy.

Funding

ARC | DP160102412

ARC | DP140103499

ARC | DP160102114

History

Available versions

PDF (Accepted manuscript)

ISBN

9781450340731

Journal title

International Conference on Information and Knowledge Management, Proceedings

Conference name

ACM International Conference on Information and Knowledge Management

Location

Indianapolis, Indiana

Start date

2016-10-24

End date

2016-10-28

Volume

24-28-October-2016

Pagination

1261-1270

Publisher

ACM

Copyright statement

Copyright © ACM 2016. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in CIKM '16 Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, Indianapolis, Indiana, USA — October 24 - 28, 2016, http://doi.acm.org/10.1145/2983323.2983806.

Language

eng

Usage metrics

    Publications

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC