BEGIN:VCALENDAR PRODID:-//Microsoft Corporation//Outlook MIMEDIR//EN VERSION:2.0 BEGIN:VEVENT DTSTART:20151117T193000Z DTEND:20151117T200000Z LOCATION:19AB DESCRIPTION;ENCODING=QUOTED-PRINTABLE:ABSTRACT: GPUs have recently emerged to be widely adopted for gener-al-purpose applications. To capture on-chip locality for these applications, modern GPUs have integrated multi-level cache hierarchy, in an attempt to reduce the amount and latency of the massive and sometimes irregular memory accesses. However, inferior performance are frequently attained due to the serious congestion in the caches results from the huge volume of concurrent threads. In this paper, we propose a novel compiler and runtime framework for adaptive and transparent cache bypassing on GPUs. It uses a simple yet effective approach to control the bypass degree so as to match the size of applications' runtime footprints. We validate the design on several GPUs from different generations using 16 cache sensitive applications. Experiment measures show that our design can significantly improve overall performance (up to 2.16x on average). Besides, we summarize some optimization guidelines regarding GPU caches based on the experiment figures. SUMMARY:Adaptive and Transparent Cache Bypassing for GPUs PRIORITY:3 END:VEVENT END:VCALENDAR