Swinburne
Browse
- No file added yet -

Exploiting per user information for supercomputing workload prediction requires care

Download (216.36 kB)
conference contribution
posted on 2024-07-26, 13:51 authored by T. V. Dinh, L. L. H. Andrew, Philip BranchPhilip Branch
Efficient management of supercomputing facilities requires estimates of future workload based on past user behaviour. For supercomputers with large numbers of users, aggregate user behaviour is commonly assumed to be best in prediction of future workloads, however for systems with smaller numbers of users the question arises as to whether it is still suitable or if benefits can be derived from monitoring individual user behaviour to predict future workload. We compare using individual user behaviour, aggregate user behaviour and a hybrid approach where we track heavy users individually and cluster aggregate light users into a small number of clusters. We find that the hybrid approach produces the best results in both mean absolute error and mean squared error. However, treating all users separately provides slightly worse predictions. We also introduce a new approach to prediction based on the hazard function which is a significant improvement on previously used schemes based on autoregressive models. The schemes are investigated numerically using a two-year workload trace from a supercomputer with a population of 136 users.

Funding

Increasing internet energy and cost efficiency by improving higher-layer protocols

Australian Research Council

Find out more...

History

Available versions

PDF (Accepted manuscript)

ISBN

9780769549965

Journal title

Proceedings - 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2013

Conference name

IEEE International Symposium on Cluster, Cloud and Grid Computing

Location

Delft

Start date

2013-05-13

End date

2013-05-16

Pagination

7 pp

Publisher

IEEE

Copyright statement

Copyright © 2013 IEEE. The accepted manuscript is reproduced in accordance with the copyright policy of the publisher. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Language

eng

Usage metrics

    Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC