Scalable repositories for virtual clusters

Paolo Anedda, Simone Leo, Massimo Gaggero, Gianluigi Zanetti

Euro-Par 2009 -- Parallel Processing Workshops, Volume 6043/2010, page 414--423 - june 2010

For a large class of scientific data analysis applications it is becoming important, due to the sheer size of datasets, to have the option to perform the analysis directly where the data are stored, rather than on remote computational clusters. A possible strategy is the use of virtual clusters, thus guaranteeing a high degree of isolation from the underlying physical computational structure, and a very compact initial description. Deploying, saving and restoring HPC dedicated virtual clusters introduces, however, a different class of requirements on the virtual machines managing infrastructure, in particular for what concerns storage I/O requirements, whose scalability boundaries are easily reached. Here we discuss an alternative approach based on a storage model that leverages the WORM (write once, read many) character of the data used by VM management to increase, in a scalable way, the aggregate data bandwidth available to virtual cluster level operations and provide preliminary results indicating that it is a viable solution.

Références BibTex

@InProceedings{ALGZ10a,
  author       = {Anedda, P. and Leo, S. and Gaggero, M. and Zanetti, G.},
  title        = {Scalable repositories for virtual clusters},
  booktitle    = {Euro-Par 2009 -- Parallel Processing Workshops},
  series       = {Lecture Notes in Computer Science},
  volume       = {6043/2010},
  pages        = {414--423},
  month        = {june},
  year         = {2010},
  editor       = {H. X. Lin et al.},
  publisher    = {Springer},
  note         = {isbn: 978-3-642-14121-8idxproject: CYBERSAR},
  keywords     = {Virtual cluster,Data-driven application,HPC},
  url          = {http://www.springerlink.com/content/np5u8k1x9l6u755g
}

Autres publications dans la base

» Paolo Anedda
» Simone Leo
» Massimo Gaggero
» Gianluigi Zanetti