Swinburne
Browse

Comparative analysis of evolving software systems using the gini coefficient

Download (433.57 kB)
conference contribution
posted on 2024-07-09, 18:17 authored by Rajesh Vasa, Markus LumpeMarkus Lumpe, Philip BranchPhilip Branch, Oscar Nierstrasz
Software metrics offer us the promise of distilling useful information from vast amounts of software in order to track development progress, to gain insights into the nature of the software, and to identify potential problems. Unfortunately, however, many software metrics exhibit highly skewed, non-Gaussian distributions. As a consequence, usual ways of interpreting these metrics---for example, in terms of “average” values---can be highly misleading. Many metrics, it turns out, are distributed like wealth---with high concentrations of values in selected locations. We propose to analyze software metrics using the Gini coefficient, a higherorder statistic widely used in economics to study the distribution of wealth. Our approach allows us not only to observe changes in software systems efficiently, but also to assess project risks and monitor the development process itself. We apply the Gini coefficient to numerous metrics over a range of software projects, and we show that many metrics not only display remarkably high Gini values, but that these values are remarkably consistent as a project evolves over time.

Funding

Swiss National Science Foundation

History

Available versions

PDF (Published version)

ISBN

9781424448289

Journal title

IEEE International Conference on Software Maintenance, ICSM

Conference name

IEEE International Conference on Software Maintenance, ICSM

Volume

1

Pagination

179-188

Publisher

IEEE

Copyright statement

Copyright © 2009 IEEE. The published version is reproduced in accordance with the copyright policy of the publisher. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Language

eng

Usage metrics

    Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC