Scalable Middleware and Tools for High-Performance Computing

Speaker:  Dorian C Arnold – Orange County, CA, United States
Topic(s):  Applied Computing

Abstract

In the pursuit of unprecedented high-performance computing (HPC) capabilities, and the scientific and economic advances such capabilities will bring, governments in the U.S., Europe, and Asia, including India, have established initiatives to build and deploy extreme-scale systems with exaflop (one quintillion, or 10^18, floating-point operations per second) computational power. In terms of component counts, such systems are expected to be orders of magnitude larger than current systems. As a result, we face heightened challenges in designing and deploying scalable software services for managing such systems, as well as tools for finding and fixing functional and performance problems in large-scale applications.

In this talk, I describe our research on the design of scalable, robust middleware services and tools. Specifically, I discuss our work on tree-based overlay networks (TBONs), which have become the foundation of several important tools at U.S. Department of Energy national laboratories. One such tool is the Stack Trace Analysis Tool, a TBON-based tool for debugging extreme-scale applications. Lastly, I give an overview of our work on the responsive instantiation of large-scale applications and tools, a common problem on extreme-scale HPC systems.

About this Lecture

Number of Slides:  45
Duration:  50 minutes
Languages Available:  English
Last Updated: 
