
A Local-Optimisation Based Strategy for Cost-Effective Datasets Storage of Scientific Applications in the Cloud

conference contribution
posted on 2024-07-09, 17:00 authored by Dong Yuan, Yun Yang, Xiao Liu, Jinjun Chen
The massive computation power and storage capacity of cloud computing systems allow scientists to deploy computation- and data-intensive applications without infrastructure investment, and to store large application datasets in the cloud. However, under the pay-as-you-go model, datasets must be stored strategically to reduce the overall application cost. In this paper, we use a Data Dependency Graph (DDG) derived from data provenance in scientific applications, with which deleted datasets can be regenerated, to develop a novel cost-effective datasets storage strategy that automatically stores appropriate datasets in the cloud. The strategy achieves a localised optimal trade-off between computation and storage while also taking users' tolerance of data access delay into consideration. Simulations conducted on general (random) datasets and a specific astrophysics pulsar searching application with Amazon's cost model show that our strategy can reduce the application cost significantly.
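
The store-or-regenerate trade-off described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a linear dependency chain, a simple greedy rule, and placeholder prices, and the names `Dataset`, `greedy_store` and `cost_rate` are hypothetical. A deleted dataset is assumed to be rebuilt from its nearest stored predecessor, so its regeneration cost includes the generation time of every deleted dataset in between. Users' delay tolerance is not modelled here.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    name: str
    size_gb: float         # size if kept in cloud storage
    gen_hours: float       # compute hours to derive it from its direct predecessor
    uses_per_month: float  # expected number of accesses per month


def greedy_store(chain, storage_price, compute_price):
    """Walk a linear chain and keep a dataset only when storing it is
    no more expensive per month than regenerating it on every access."""
    stored = set()
    regen_hours = 0.0  # hours accumulated by deleted predecessors
    for d in chain:
        keep_cost = d.size_gb * storage_price
        regen_cost = (regen_hours + d.gen_hours) * compute_price * d.uses_per_month
        if keep_cost <= regen_cost:
            stored.add(d.name)
            regen_hours = 0.0
        else:
            regen_hours += d.gen_hours
    return stored


def cost_rate(chain, stored, storage_price, compute_price):
    """Monthly cost of the chain for a given set of stored dataset names."""
    total = 0.0
    regen_hours = 0.0
    for d in chain:
        if d.name in stored:
            total += d.size_gb * storage_price
            regen_hours = 0.0
        else:
            regen_hours += d.gen_hours
            total += regen_hours * compute_price * d.uses_per_month
    return total


# Placeholder prices (assumptions): $ per GB-month and $ per compute hour.
STORAGE_PRICE = 0.15
COMPUTE_PRICE = 0.10

chain = [
    Dataset("d1", size_gb=100.0, gen_hours=10.0, uses_per_month=0.5),
    Dataset("d2", size_gb=1.0, gen_hours=5.0, uses_per_month=4.0),
    Dataset("d3", size_gb=50.0, gen_hours=1.0, uses_per_month=1.0),
]

stored = greedy_store(chain, STORAGE_PRICE, COMPUTE_PRICE)
monthly = cost_rate(chain, stored, STORAGE_PRICE, COMPUTE_PRICE)
store_all = cost_rate(chain, {d.name for d in chain}, STORAGE_PRICE, COMPUTE_PRICE)
```

In this toy chain only the small, frequently used `d2` is kept, because the large datasets are cheaper to regenerate than to store. A single greedy pass like this is only a local heuristic and does not guarantee the lowest total cost in general.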

Funding

An Integrated Geophysical Study of the Southern Oklahoma Aulacogen

Directorate for Geosciences

Available versions

PDF (Accepted manuscript)

ISBN

9781457708367

Conference name

IEEE International Conference on Cloud Computing

Location

Washington, DC

Start date

2011-07-04

End date

2011-07-09

Pagination

7 pp

Publisher

IEEE

Copyright statement

Copyright © 2011 IEEE. The accepted manuscript is reproduced in accordance with the copyright policy of the publisher. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Language

eng
