Swinburne
Browse

Application of metamorphic testing to supervised classifiers

Download (216.12 kB)
conference contribution
posted on 2024-07-26, 14:15 authored by XiaoYuan Xie, Joshua W K Ho, Christian Murphy, Gail Kaiser, Bao Wen Xu, Tsong ChenTsong Chen
Many applications in the field of scientific computing - such as computational biology, computational linguistics, and others - depend on Machine Learning algorithms to provide important core functionality to support solutions in the particular problem domains. However, it is difficult to test such applications because often there is no 'test oracle' to indicate what the correct output should be for arbitrary input. To help address the quality of such software, in this paper we present a technique for testing the implementations of supervised machine learning classification algorithms on which such scientific computing software depends. Our technique is based on an approach called 'metamorphic testing', which has been shown to be effective in such cases. More importantly, we demonstrate that our technique not only serves the purpose of verification, but also can be applied in validation. In addition to presenting our technique, we describe a case study we performed on a real-world machine learning application framework, and discuss how programmers implementing machine learning algorithms can avoid the common pitfalls discovered in our study. We also discuss how our findings can be of use to other areas outside scientific computing, as well.

Funding

Directorate for Computer & Information Science & Engineering

National Cancer Institute

National Natural Science Foundation of China

Australian Research Council

History

Available versions

PDF (Published version)

ISBN

9780769538280

ISSN

1550-6002

Journal title

Proceedings - International Conference on Quality Software

Conference name

International Conference on Quality Software

Volume

2009

Issue

2009

Pagination

9 pp

Publisher

IEEE

Copyright statement

Copyright © 2009 IEEE. The published version is reproduced in accordance with the copyright policy of the publisher. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Language

eng

Usage metrics

    Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC