Swinburne
Browse

Mining dates from historical documents

Download (189.24 kB)
conference contribution
posted on 2024-07-13, 03:34 authored by Dana McKay, Sally Jo Cunningham
The essential quality of information in a digital library is accessibility. Full text search is not enough for some collections, more can be done. Historical collections, for example, contain dates and it would be useful to historians to be able to search by them. However, these dates may occur anywhere within the text of historical documents, and to be searchd they must be extracted from the documents and integrated into the collection index. Doing this manually is very expensive, and described here is a system to do it automatically. This system was implemented within the Greenstone framework used by the New Zealand Digital Library, and involved the use of some carefully designed heuristics.

History

Available versions

PDF (Published version)

Conference name

Computing Arts 2001: Digital Resources for Research in the Humanities, Sydney, Australia, 26-28 September 2001

Publisher

University of Sydney

Copyright statement

Copyright © 2001 Dana McKay and Sally Jo Cunningham. The paper is reproduced with the permission of the publisher.

Language

eng

Usage metrics

    Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC