Swinburne
Browse
- No file added yet -

An algorithm for cost-effectively storing scientific datasets with multiple service providers in the cloud

Download (344.47 kB)
conference contribution
posted on 2024-07-11, 07:21 authored by Dong Yuan, Xiao Liu, Lizhen Cui, Tiantian Zhang, Wenhao Li, Dahai Cao, Yun YangYun Yang
The proliferation of cloud computing allows scientists to deploy computation and data intensive applications without infrastructure investment, where large generated datasets can be flexibly stored with multiple cloud service providers. Due to the pay-as-you-go model, the total application cost largely depends on the usage of computation, storage and bandwidth resources, and cutting the cost of cloud-based data storage becomes a big concern for deploying scientific applications in the cloud. In this paper, we propose a novel algorithm that can automatically decide whether a generated dataset should be 1) stored in the current cloud, 2) deleted and re-generated whenever reused or 3) transferred to cheaper cloud service for storage. The algorithm finds the trade-off among computation, storage and bandwidth costs in the cloud, which are three key factors for the cost of storing generated application datasets with multiple cloud service providers. Simulations conducted with popular cloud service providers' pricing models show that the proposed algorithm is highly cost-effective to be utilised in the cloud.

Funding

ARC | DP110101340

ARC | LP130100324

History

Available versions

PDF (Accepted manuscript)

ISBN

9780769550831

Journal title

IEEE 9th International Conference on eScience (eScience), Beijing, China, 22-25 October 2013

Conference name

9th IEEE International Conference on e-Science, e-Science 2013

Location

Beijing

Start date

2013-10-22

End date

2013-10-25

Pagination

285-292

Publisher

IEEE

Copyright statement

Copyright © 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Language

eng

Usage metrics

    Publications

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC