Swinburne
Browse

Building digital ink recognizers using data mining: Distinguishing between text and shapes in hand drawn diagrams

Download (371.55 kB)
conference contribution
posted on 2024-07-09, 14:20 authored by Rachel Blagojevic, Beryl Plimmer, John Grundy, Yong Wang
The low accuracy rates of text-shape dividers for digital ink diagrams are hindering their use in real world applications. While recognition of handwriting is well advanced and there have been many recognition approaches proposed for hand drawn sketches, there has been less attention on the division of text and drawing. The choice of features and algorithms is critical to the success of the recognition, yet heuristics currently form the basis of selection. We propose the use of data mining techniques to automate the process of building text-shape recognizers. This systematic approach identifies the algorithms best suited to the specific problem and generates the trained recognizer. We have generated dividers using data mining and training with diagrams from three domains. The evaluation of our new recognizer on realistic diagrams from two different domains, against two other recognizers shows it to be more successful at dividing shapes and text with 95.2% of strokes correctly classified compared with 86.9% and 83.3% for the two others.

History

Available versions

PDF (Accepted manuscript)

ISBN

9783642130212

ISSN

0302-9743

Journal title

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume

6096 LNAI

Issue

PART 1

Pagination

9 pp

Publisher

Springer

Copyright statement

Copyright © 2010 Springer. The accepted manuscript is reproduced in accordance with the copyright policy of the publisher. The original publication is available at www.springerlink.com.

Language

eng

Usage metrics

    Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC