On the statistical properties of the F-measure

conference contribution
posted on 2024-07-11, 10:55 authored by Tsong Chen, Fei-Ching Kuo, Robert Merkel
The F-measure, the number of distinct test cases needed to detect the first program failure, is an effectiveness measure for debug testing strategies. We show that for random testing with replacement, the F-measure follows the geometric distribution. A simulation study examines the sampling distributions of the F-measure for two adaptive random testing methods to determine how closely they approximate the geometric distribution. It reveals that, in the worst case, the sampling distribution for adaptive random testing is very similar to that for random testing. Our results provide an answer to a conjecture that adaptive random testing is always a more effective alternative to random testing, with reference to the F-measure. We consider the implications of our findings for previous studies in the area and make recommendations for future studies.
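
The geometric-distribution claim for random testing with replacement can be illustrated with a small simulation. The sketch below is not taken from the paper: it assumes each test case fails independently with a fixed probability (called `failure_rate` here, a hypothetical parameter name), and the function names are illustrative only. Under that assumption the F-measure should have mean 1 / failure_rate.

```python
import random


def f_measure(failure_rate, rng):
    """Number of test cases executed up to and including the first failure.

    Assumption: every test case is drawn with replacement and fails
    independently with probability `failure_rate`.
    """
    count = 1
    while rng.random() >= failure_rate:  # this test passed, so draw another
        count += 1
    return count


def simulate(failure_rate, trials, seed=0):
    """Return the sample mean of the F-measure and the raw samples."""
    rng = random.Random(seed)
    samples = [f_measure(failure_rate, rng) for _ in range(trials)]
    return sum(samples) / trials, samples


def geometric_pmf(k, failure_rate):
    """P(F = k) for a geometric distribution supported on k = 1, 2, ..."""
    return (1 - failure_rate) ** (k - 1) * failure_rate


if __name__ == "__main__":
    rate = 0.05
    mean, samples = simulate(rate, trials=20000, seed=1)
    print(f"sample mean F-measure: {mean:.2f} (theoretical mean: {1 / rate:.2f})")
    observed = sum(1 for s in samples if s == 1) / len(samples)
    print(f"P(F = 1): observed {observed:.3f}, geometric {geometric_pmf(1, rate):.3f}")
```

Comparing the empirical frequencies of the samples with `geometric_pmf` gives a rough check of the distributional claim. An adaptive random testing method would replace the independent draw inside `f_measure`, which this sketch does not attempt.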

History

Available versions

PDF (Published version)

ISBN

0769522076

Journal title

Proceedings - Fourth International Conference on Quality Software, QSIC 2004

Conference name

Fourth International Conference on Quality Software, QSIC 2004

Pagination

7 pp

Publisher

IEEE

Copyright statement

Copyright © 2004 IEEE. The published version is reproduced in accordance with the copyright policy of the publisher. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Language

eng
