BEGIN:VCALENDAR PRODID:-//Microsoft Corporation//Outlook MIMEDIR//EN VERSION:2.0 BEGIN:VEVENT DTSTART:20151117T231500Z DTEND:20151118T010000Z LOCATION:Level 4 - Lobby DESCRIPTION;ENCODING=QUOTED-PRINTABLE:ABSTRACT: Heterogeneity continues to increase in computing applications, with the rise of accelerators such as GPUs, FPGAs, APUs, and other co-processors. They have also become common in state-of-the-art supercomputers on the TOP500 list. Programming models, such as CUDA, OpenMP, OpenACC and OpenCL are designed to offload compute intensive workloads to co-processors efficiently. However, the naive offload model, synchronously copying and executing, in sequence is inefficient. However, pipelining these activities reduces programmability.=0A=0AWe propose an easy-to-use directive-based pipelining extension for OpenMP. Our extension offers a simple interface to overlap data transfer and kernel computation with an auto-tuning scheduler. We achieve performance improvements between 40% and 60% for a Lattice QCD application. SUMMARY:Directive-Based Pipelining Extension for OpenMP PRIORITY:3 END:VEVENT END:VCALENDAR