Campus Units

Mechanical Engineering, Electrical and Computer Engineering

Document Type

Article

Publication Version

Accepted Manuscript

Publication Date

2014

Journal or Book Title

Computing in Science & Engineering

Volume

16

Issue

4

First Page

62

Last Page

72

DOI

10.1109/MCSE.2013.134

Abstract

The complexity of many problems in science and engineering requires computational capacity exceeding what average user can expect from a single computational center. While many of these problems can be viewed as a set of independent tasks, their collective complexity easily requires millions core-hours on any state-of-the-art HPC resource, and throughput that cannot be sustained by a single multi-user queuing system. In this paper we explore the use of aggregated HPC resources to solve large-scale engineering problems. We show it is possible to build a computational federation that is easy to use by end-users, and is elastic, resilient and scalable. We argue that the fusion of federated computing and real-life engineering problems can be brought to average user if relevant middleware is provided. We report on the use of federation of 10 distributed heterogeneous HPC resources to perform a large-scale interrogation of the parameter space in the microscale fluid flow problem.

Comments

This is a manuscript of an article published as Diaz-Montes, Javier, Yu Xie, Ivan Rodero, Jaroslaw Zola, Baskar Ganapathysubramanian, and Manish Parashar. "Federated Computing for the Masses--Aggregating Resources to Tackle Large-Scale Engineering Problems." Computing in Science & Engineering 16, no. 4 (2014): 62-72. DOI:10.1109/MCSE.2013.134. Posted with permission.

Rights

“© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.”

Copyright Owner

IEEE

Language

en

File Format

application/pdf

Published Version

Share

COinS