Swinburne
Browse

XClean: Providing valid spelling suggestions for XML keyword queries

Download (420.21 kB)
conference contribution
posted on 2024-07-11, 07:14 authored by Yifei Lu, Wei Wang, Jianxin Li, Chengfei LiuChengfei Liu
An important facility to aid keyword search on XML data is suggesting alternative queries when user queries contain typographical errors. Query suggestion thus can improve users’ search experience by avoiding returning empty result or results of poor qualities. In this paper, we study the problem of effectively and efficiently providing quality query suggestions for keyword queries on an XML document. We illustrate certain biases in previous work and propose a principled and general framework, XClean, based on the state-of-the-art language model. Compared with previous methods, XClean can accommodate different error models and XML keyword query semantics without losing rigor. Algorithms have been developed that compute the top-k suggestions efficiently. We performed an extensive experiment study using two large-scale real datasets. The experiment results demonstrate the effectiveness and efficiency of the proposed methods.

Funding

XML Views of Relational Databases: Semantics and Update Problems

Australian Research Council

Find out more...

Effective and efficient keyword search for relevant entities over Extensible Markup Language (XML) data

Australian Research Council

Find out more...

Effective and Efficient Video Search

Australian Research Council

Find out more...

Effective and Efficient Keyword Search in Relational Databases

Australian Research Council

Find out more...

History

Available versions

PDF (Accepted manuscript)

ISBN

9781424489596

ISSN

1084-4627

Conference name

IEEE International Conference on Data Engineering

Location

Hannover

Start date

2011-04-11

End date

2011-04-16

Pagination

11 pp

Publisher

IEEE

Copyright statement

Copyright © 2011 IEEE. The accepted manuscript is reproduced in accordance with the copyright policy of the publisher. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Language

eng

Usage metrics

    Publications

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC