sponsored byACMIEEE The International Conference for High Performance 
Computing, Networking, Storage and Analysis
FacebookTwitterGoogle PlusLinkedInYouTubeFlickr

SCHEDULE: NOV 15-20, 2015

When viewing the Technical Program schedule, on the far righthand side is a column labeled "PLANNER." Use this planner to build your own schedule. Once you select an event and want to add it to your personal schedule, just click on the calendar icon of your choice (outlook calendar, ical calendar or google calendar) and that event will be stored there. As you select events in this manner, you will have your own schedule to guide you through the week.

Performance Comparison of the Multi-Zone Scalar Pentadiagonal (SP-MZ) NAS Parallel Benchmark on Many-Core Parallel Platforms

SESSION: Regular & ACM Student Research Competition Poster Reception

EVENT TYPE: Posters, Receptions, ACM Student Research Competition

EVENT TAG(S): HPC Beginner Friendly, Regular Poster

TIME: 5:15PM - 7:00PM

SESSION CHAIR(S): Michela Becchi, Manish Parashar, Dorian C. Arnold

AUTHOR(S):Christopher P. Stone, Bracy Elton

ROOM:Level 4 - Lobby

ABSTRACT:

The NAS multi-zone scalar-pentadiagonal (SP-MZ) benchmark is representative of many CFD applications. Offloading this class of algorithm to many-core accelerator devices should boost application throughput and reduce time-to-solution. OpenACC and OpenMP compiler directives provide platform portability, hierarchical thread and vector parallelism, and simplified development for legacy applications. We examine the performance of the SP-MZ benchmark on clusters comprised of NVIDIA GPU and Intel Xeon Phi accelerators. We found that offloading the SP-MZ application to the accelerators was straightforward using the compiler directives. However, significant code restructuring was required to attain acceptable performance on the many-core accelerator devices. We implemented similar optimizations for the Intel Xeon Phi, via OpenMP, and the NVIDIA Kepler GPU, with OpenACC, in order to increase both thread and vector parallelism. We observed comparable performance between the two many-core accelerator devices and to HPC-grade multi-core host CPUs.

Chair/Author Details:

Michela Becchi, Manish Parashar, Dorian C. Arnold (Chair) - University of Missouri|Rutgers University|University of New Mexico|

Christopher P. Stone - Computational Science and Engineering, LLC

Bracy Elton - Engility Corporation

Add to iCal  Click here to download .ics calendar file

Add to Outlook  Click here to download .vcs calendar file

Add to Google Calendarss  Click here to add event to your Google Calendar