TIME
| PRESENTATION
| SPEAKER
| LOCATION
| PLANNER
|
8:30AM - 5:00PM |
Exhibition of Regular & ACM Student Research Competition Posters |
|
Level 4 - Lobby |
|
5:15PM - 7:00PM |
An Analysis of Network Congestion in the Titan Supercomputer's Interconnect |
Jonathan Freed |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Lessons from Post-Processing Climate Data on Modern Flash-Based HPC Systems |
Adnan Haider |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
UtiliStation: Increasing the Utilization of the International Space Station with Big Data Analytics for Stowage |
Ellis Giles |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Rapid Replication of Multi-Petabyte File Systems |
Justin G. Sybrandt |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Integrating STELLA & MODESTO: Definition and Optimization of Complex Stencil Programs |
Tobias Gysi |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
High Performance Model Based Image Reconstruction |
Xiao Wang |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Forecasting Storms in Parallel File Systems |
Ryan McKenna |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
PEAK: Parallel EM Algorithm using Kd-tree |
Laleh Aghababaie Beni |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
A High-Performance Preconditioned SVD Solver for Accurately Computing Large-Scale Singular Value Problems in PRIMME |
Lingfei Wu |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Improving Application Concurrency on GPUs by Managing Implicit and Explicit Synchronizations |
Michael C. Butler |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Performance Analysis and Optimization of the Weather Research and Forecasting Model (WRF) on Intel Multicore and Manycore Architectures |
samuel J. Elliott |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Performance Analysis and Optimization of a Hybrid Distributed Reverse Time Migration Application |
Sri Raj Paul |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
AccFFT: A New Parallel FFT Library for CPU and GPU Architectures |
Amir Gholami |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Process Variation-Aware Power Scheduling for HPC Applications |
Neha Gholkar |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Portable Performance of Large-Scale Physics Applications: Toward Targeting Heterogeneous Exascale Architectures Through Application Fitting |
William Killian |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
IGen: The Illinois Genomics Execution Environment |
Subho Sankar Banerjee |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Hilbert Curve Based Flexible Dynamic Partitioning Scheme for Adaptive Scientific Computations |
Milinda Fernando |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Practical Floating-Point Divergence Detection |
Wei-Fan Chiang |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Non-Blocking Preconditioned Conjugate Gradient Methods for Extreme-Scale Computing |
Paul Eller |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
I/O Performance Analysis Framework on Measurement Data from Scientific Clusters |
Michelle V. Koo |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Modeling the Impact of Thread Configuration on Power and Performance of GPUs |
Tiffany A. Connors |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Efficient Multiscale Platelets Modeling Using Supercomputers |
Na Zhang |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Optimization Strategies for Materials Science Applications on Cori: An Intel Knights Landing, Many Integrated Core Architecture |
Luther D. Martin |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Exploring the Trade-Off Space of Hierarchical Scheduling for Very Large HPC Centers |
Stephen Herbein |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
High Order Automatic Differentiation with MPI |
Mu Wang |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Resource Usage Characterization for Social Network Analytics on Spark |
Irene Manotas, Rui Zhang, Min Li, Renu Tewari, Dean Hildebrand |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
A Coding Based Optimization for Hadoop |
Zakia Asad, Mohammad Asad Rehman Chaudhry, David Malone |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
BLAST Motivated Small Dense Linear Algebra Library Comparison |
Pate Motter, Ian Karlin, Christopher Earl |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Numerical Tools for Multiscale Wave Propagation in Geophysics |
Jose Camata, Lucio de Abreu Correa, Luciano de Carvalho Paludo, Regis Cottereau, Alvaro Coutinho |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Scalable Mesh Generation for HPC Applications |
Rajeev Jain, Navamita Ray, Iulian Grindeanu, Danqing Wu, Vijay Mahadevan |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Neuroscience Gateway - Enabling HPC for Computational Neuroscience |
Subhashini Sivagnanam, Amitava Majumdar, Pramod Kumbhar, Michael Hines, Kenneth Yoshimoto, Ted Carnevale |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Scaling Smart Appliances for Spatial Data Synthesis |
Luis Pineda-Morales, Balaji Subramaniam, Kate Keahey, Gabriel Antoniu, Alexandru Costan, Shaowen Wang, Anand Padmanabhan, Aiman Soliman |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Utility-Based Data Transfers Scheduling Between Distributed Computing Facilities |
Xin Wang, Wei Tang, Rajkumar Kettimuttu, Zhiling Lan |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
FPGA Based OpenCL Acceleration of Genome Sequencing Software |
Ashish Sirasao, Elliott Delaye, Ravi Sunkavalli, Stephen Neuendorffer |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Benchmarking High Performance Graph Analysis Systems with Graph Mining and Pattern Matching Workloads |
Seokyong Hong, Seung-Hwan Lim, Sangkeun Lee, Sreenivas R. Sukumar, Ranga R. Vatsavai |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Large-Scale MO Calculation with GPU-accelerated FMO Program |
Hiroaki Umeda, Toshihiro Hanawa, Mitsuo Shoji, Taisuke Boku, Yasuteru Shigeta |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Fast Classification of MPI Applications Using Lamport's Logical Clocks |
Zhou Tong |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
HPX Applications and Performance Adaptation |
Alice Koniges, Jayashree Ajay Candadai, Hartmut Kaiser, Kevin Huck, Jeremy Kemp, Thomas Heller, Matthew Anderson, Andrew Lumsdaine, Adrian Serio, Michael Wolf, Bryce Lelbach, Ron Brightwell, Thomas Sterling |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Virtualizing File Transfer Agents for Increased Throughput on a Single Host |
Thomas Stitt, Amanda Bonnie, Zach Fuerst |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Mitos: A Simple Interface for Complex Hardware Sampling and Attribution |
Alfredo Gimenez, Benafsh Husain, David Boehme, Todd Gamblin, Martin Schulz |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
HPC Enabled Real-Time Remote Processing of Laparoscopic Surgery |
Karan Sapra, Zahra Ronaghi, Ryan Izard, Edward Duffy, Melissa C. Smith, Kuang-Ching Wang, David M. Kwartowitz |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Consistent Hashing Distance Metrics for Large-Scale Object Storage |
Philip Carns, Kevin Harms, John Jenkins, Misbah Mubarak, Robert B. Ross, Christopher Carothers |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Dynamic Adaptively Refined Mesh Simulations on 1M+ Cores |
Brian T. N. Gunney |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Scalable and Highly SIMD-Vectorized Molecular Dynamics Simulation Involving Multiple Bubble Nuclei |
Hiroshi Watanabe, Satoshi Morita, Hajime Inaoka, Haruhiko Matsuo, Synge Todo, Nobuyasu Ito |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Transition to Trinity: Preparing a Next-Generation Network |
Kathryn S. Protin, Susan K. Coulter, Jesse E. Martinez, Alex F. Montaño, Charles D. Wilder |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Comparison of Machine-Learning Techniques for Handling Multicollinearity in Big Data Analytics and High-Performance Data Mining |
Gerard Dumancas, Ghalib Bello |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Beating cuBLAS: Automatically Generating Bespoke Matrix Multiplication Kernels Using GiMMiK |
Freddie D. Witherden, Bartosz D. Wozniak, Francis P. Russell, Peter E. Vincent, Paul H. J. Kelly |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
GPU-Accelerated VLSI Routing Using Group Steiner Trees |
Basileal Imana, Venkata Suhas Maringanti, Peter Yoon |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Improving Throughput by Dynamically Adapting Concurrency of Data Transfer |
Prasanna Balaprakash, Vitali Morozov, Rajkumar Kettimuthu |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
PERMON Toolbox Combining Discretization, Domain Decomposition, and Quadratic Programming |
Vaclav Hapla, David Horak, Lukas Pospisil |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Scaling Uncertainty Quantification Studies to Millions of Jobs |
Tamara L. Dahlgren, David Domyancic, Scott Brandon, Todd Gamblin, John Gyllenhaal, Rao Nimmakayala, Richard Klein |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Overcoming Distributed Debugging Challenges in the MPI+OpenMP Programming Model |
Lai Wei, Ignacio Laguna, Dong H. Ahn, Matthew P. LeGendre, Gregory L. Lee |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Emulating In-Memory Data Rearrangement for HPC Applications |
Christopher W. Hajas, G. Scott Lloyd, Maya B. Gokhale |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
C++ Abstraction Layers – Performance, Portability and Productivity |
Dennis C. Dinge, Simon D. Hammond, Christian R. Trott, Harold C. Edwards |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
STATuner: Efficient Tuning of CUDA Kernels Parameters |
Ravi Gupta, Ignacio Laguna, Dong H. Ahn, Todd Gamblin, Saurabh Bagchi, Felix Xiaozhu Lin |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
An Approach to the Highest Efficiency of the HPCG Benchmark on the SX-ACE Supercomputer |
Kazuhiko Komatsu, Ryusuke Egawa, Yoko Isobe, Ryusei Ogata, Hiroyuki Takizawa, Hiroaki Kobayashi |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Performance, Power, and Energy of In-Situ and Post-Processing Visualization: A Case Study in Climate Simulation |
Vignesh Adhinarayanan, Scott Pakin, David Rogers, Wu-chun Feng, James Ahrens |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Large Scale Artificial Neural Network Training Using MultiGPUs |
Linnan Wang, Wei Wu, Alex Zhang, Jianxiong Xiao, Yi Yang |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Simulating and Visualizing Traffic on the Dragonfly Network |
Abhinav Bhatele, Nikhil Jain, Yarden Livnat, Valerio Pascucci, Peer-Timo Bremer |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Development of Explicit Moving Particle Simulation Framework and Zoom-Up Tsunami Analysis System |
Kohei Murotani, Seiichi Koshizuka, Masao Ogino, Ryuji Shioya |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Reliable Performance Auto-Tuning in Presence of DVFS |
Md Rakib Hasan, Eric Van Hensbergen, Wade Walker |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
LIBXSMM: A High Performance Library for Small Matrix Multiplications |
Alexander Heinecke, Hans Pabst, Greg Henry |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
A High-Performance Approach for Solution Space Traversal in Combinatorial Optimization |
Wendy K. Tam Cho, Yan Y. Liu |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
A Performance Evaluation of Kokkos and RAJA using the TeaLeaf Mini-App |
Matt Martineau, Simon McIntosh-Smith, Wayne Gaudin, Mike Boulton, David Beckingsale |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Efficient Large-Scale Sparse Eigenvalue Computations on Heterogeneous Hardware |
Moritz Kreutzer, Andreas Pieper, Andreas Alvermann, Holger Fehske, Georg Hager, Gerhard Wellein, Alan R. Bishop |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Integrated Co-Design of Future Exascale Software |
Bjoern Gmeiner, Markus Huber, Lorenz John, Ulrich Ruede, Christian Waluga, Barbara Wohlmuth, Holger Stengel, Martin Bauer |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
mrCUDA: Low-Overhead Middleware for Transparently Migrating CUDA Execution from Remote to Local GPUs |
Pak Markthub, Akihiro Nomura, Satoshi Matsuoka |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Adapting Genome-Wide Association Workflows for HPC Processing at Pawsey |
Jindan (Charlene) Yang, Christopher Harris, Sylvia Young, Grant Morahan |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Out-of-Core Sorting Acceleration using GPU and Flash NVM |
Hitoshi Sato, Ryo Mizote, Satoshi Matsuoka |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Performance Comparison of the Multi-Zone Scalar Pentadiagonal (SP-MZ) NAS Parallel Benchmark on Many-Core Parallel Platforms |
Christopher P. Stone, Bracy Elton |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
OPESCI: Open Performance portablE Seismic Imaging |
Marcos de Aguiar, Gerard Gorman, Renato Miceli, Christian Jacobs, Michael Lange, Tianjiao Sun, Felippe Zacarias |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Parallel Execution of Workflows Driven by a Distributed Database Management System |
Renan Souza, Vítor Silva, Daniel de Oliveira, Patrick Valduriez, Alexandre A. B. Lima, Marta Mattoso |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Using MuMMI to Model and Optimize Energy and Performance of HPC Applications on Power-Aware Supercomputers |
Xingfu Wu, Valerie Taylor |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
High Level Synthesis of SPARQL Queries |
Marco Minutoli, Vito Giovanni Castellana, Antonino Tumeo |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
User Environment Tracking and Problem Detection with XALT |
Kapil Agrawal, Gregory Peterson, Mark Fahey, Robert McLay, Reuben Budiardja |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Task-Based Parallel Computation of the Density Matrix in Quantum-Based Molecular Dynamics Using Graph Partitioning |
Sergio Pino, Matthew Kroonblawd, Purnima Ghale, Georg Hahn, Vivek Sardeshmukh, Guangjie Shi, Hristo Djidjev, Christian Negre, Robert Pavel, Benjamin Bergen, Susan Mniszewski, Christoph Junghans |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Investigating Prefetch Potential on the Xeon Phi with Autotuning |
Saami Rahman, Ziliang Zong, Apan Qasem |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Automating Sparse Linear Solver Selection with Lighthouse |
Kanika Sood, Pate Motter, Elizabeth Jessup, Boyana Norris |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Geometric-Aware Partitioning on Large-Scale Data for Parallel Quad Meshing |
Wuyi Yu, Qin Chen, Jian Tao, Xin Li |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Analysis of Node Failures in High Performance Computers Based on System Logs |
Siavash Ghiasvand, Florina M. Ciorba, Ronny tschueter, Wolfgang E. Nagel |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Directive-Based Pipelining Extension for OpenMP |
Xuewen Cui, Thomas R. W. Scogland, Bronis R. de Supinski, Wu-chun Feng |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Matrices Over Runtime Systems @ Exascale |
Emmanuel Agullo, Olivier Aumage, George Bosilca, Bérenger Bramas, Alfredo Buttari, Olivier Coulaud, Eric Darve, Jack Dongarra, Mathieu Faverge, Nathalie Furmento, Luc Giraud, Abdou Guermouche, Julien Langou, Florent Lopez, Hatem Ltaief, Samuel Pitoiset, Florent Pruvost, Marc Sergent, Samuel Thibault, Stanimire Tomov |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Parallelization, Acceleration, and Advancement of Dissipative Particle Dynamics (DPD) Methods |
Timothy I. Mattox, James P. Larentzos, Christopher P. Stone, Sean Ziegeler, John K. Brennan, Martin Lísal |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Reduced-Precision Floating-Point Analysis |
Michael O. Lam, Jeffrey K. Hollingsworth |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
GPU Acceleration of a Non-Hydrostatic Ocean Model Using a Mixed Precision Multigrid Preconditioned Conjugate Gradient Method |
Takateru Yamagishi, Yoshimasa Matsumura |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Parallelization of Tsunami Simulation on CPU, GPU and FPGAs |
Fumiya Kono, Naohito Nakasato, Kensaku Hayashi, Alexander Vazhenin, Stanislav Sedukhin, Kohei Nagasu, Kentaro Sano, Vasily Titov |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Large-Scale Ultrasound Simulations with Local Fourier Basis Decomposition |
Jiri Jaros, Matej Dohnal, Bradley E. Treeby |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
libSkylark: A Framework for High-Performance Matrix Sketching for Statistical Computing |
Georgios Kollias, Yves Ineichen, Haim Avron, Vikas Sindhwani, Ken Clarkson, Costas Bekas, Alessandro Curioni |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Heuristic Dynamic Load Balancing Algorithm Applied to the Fragment Molecular Orbital Method |
Yuri Alexeev, Prasanna Balaprakash |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
SLAP: Making a Case for the Low-Powered Cluster by Leveraging Mobile Processors |
Dukyun Nam, Jik-Soo Kim, Hoon Ryu, Gibeom Gu, Chan Yeol Park |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Characterizing Memory Throttling Using Processor and Memory Performance |
Bo Li, Edgar A. Leon, Kirk W. Cameron |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Argo: An Exascale Operating System and Runtime |
Swann Perarnau, Rinku Gupta, Pete Beckman |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Cost Effective Programmable H/W Based Data Plane Acceleration: Linking PCI-Express Commodity I/O H/W with FPGAs |
Woong Shin, Heon Y. Yeom |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Verification of Resilient Communication Models for the Simulation of a Highly Adaptive Energy-Efficient Computer |
Mario Bielert, Kim Feldhoff, Florina M. Ciorba, Stefan Pfennig, Elke Franz, Thomas Ilsche, Wolfgang E. Nagel |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Network-Attached Accelerators: Host-Independent Accelerators for Future HPC Systems |
Sarah Marie Neuwirth, Dirk Frey, Ulrich Bruening |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Exploring Asynchronous Many-Task Runtime Systems Toward Extreme Scales |
Samuel Knight, Marc Gamell, Gavin Baker, David Hollman, Gregory Sjaadema, Hemanth Kolla, Keita Teranishi, Jeremiah Wilke, Nicole Slattengren, Janine C. Bennett |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Memory Hotplug for Energy Savings of HPC systems |
Shinobu Miwa, Hiroki Honda |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Inverse Modeling Nanostructures from X-Ray Scattering Data through Massive Parallelism |
Abhinav Sarje, Dinesh Kumar, Singanallur Venkatakrishnan, Slim Chourou, Xiaoye S. Li, Alexander Hexemer |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Accelerating Tridiagonal Matrix Inversion on the GPU |
Bemnet Demere, Peter Yoon, Ebenezer Hormenou |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Quantifying Productivity - Towards Development Effort Estimation in HPC |
Sandra Wienke, Tim Cramer, Matthias S. Müller, Martin Schulz |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Optimizing CUDA Shared Memory Usage |
Shuang Gao, Gregory D. Peterson |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Optimization of Stencil-Based Fusion Kernels on Tera-Flops Many-Core Architectures |
Yuuichi Asahi, Guillaume Latu, Takuya Ina, Yasuhiro Idomura, Virginie Grandgirard, Xavier Garbet |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
GPU-STREAM: Benchmarking the Achievable Memory Bandwidth of Graphics Processing Units |
Tom Deakin, Simon McIntosh-Smith |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Optimization of an Ocean Model Using Performance Tools |
Oriol Tintó Prims, Miguel Castrillo, Harald Servat, German Llort, Kim Serradell, Oriol Mula-Valls, Francisco Doblas-Reyes |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Parallel Cardiac Electrophysiology Modeling Framework |
Jacob Pollack, Xiaopeng Zhao, Kwai Wong |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
A Standard Debug Interface for OpenMP Target Regions |
Andreas Erik Hindborg, Ignacio Laguna, Sven Karlsson, Dong H. Ahn |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Advanced Tiling Techniques for Memory-Starved Streaming Numerical Kernels |
Tareq Malas, Georg Hager, Hatem Ltaief, David Keyes |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
MINIO: An I/O Benchmark for Investigating High Level Parallel Libraries |
James Dickson, Satheesh Maheswaran, Steven Wright, Andy Herdman, Stephen Jarvis |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Energy-Efficient Graph Traversal on Integrated CPU-GPU Architecture |
Heng Lin, Jidong Zhai, Wenguang Chen |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Comparison of Virtualization and Containerization Techniques for High-Performance Computing |
Yuyu Zhou, Balaji Subramaniam, Kate Keahey, John Lange |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
RendezView: An Interactive Visual Mining Tool for Discerning Flock Relationships in Social Media Data |
Melissa J. Bica, Kyoung-Sook Kim |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
A Deadlock Detection Concept for OpenMP Tasks and Fully Hybrid MPI-OpenMP Applications |
Tobias Hilbrich, Bronis R. de Supinski, Andreas Knuepfer, Robert Dietrich, Christian Terboven, Felix Muenchhalfen, Wolfgang E. Nagel |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Design and Modelling of Cloud-Based Burst Buffers |
Tianqi Xu, Kento Sato, Satoshi Matsuoka |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Towards Scalable Graph Analytics on Time Dependent Graphs |
Suraj Poudel, Roger Pearce, Maya Gokhale |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Bellerophon: A Computational Workflow Environment for Real-time Analysis, Artifact Management, and Regression Testing of Core-Collapse Supernova Simulations |
Eric Lingerfelt, Bronson Messer |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Caliper: Composite Performance Data Collection in HPC Codes |
David Boehme, Todd Gamblin, Peer-Timo Bremer, Olga T. Pearce, Martin Schulz |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
A Real-Time Tsunami Inundation Forecast System for Tsunami Disaster Prevention and Mitigation |
Akihiro Musa, Hiroshi Matsuoka, Osamu Watanabe, Yoichi Murashima, Shunichi Koshimura, Ryota Hino, Yusaku Ohta, Hiroaki Kobayashi |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Multi-GPU Graph Analytics |
Yuechao Pan, Yangzihao Wang, Yuduo Wu, Carl Yang, John D. Owens |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Multi-Level Blocking Optimization for Fast Sparse Matrix Vector Multiplication on GPUs |
Yusuke Nagasaka, Akira Nukada, Satoshi Matsuoka |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
GLOVE: An Interactive Visualization Service Framework with Multi-Dimensional Indexing on the GPU |
Jinwoong Kim, Sehoon Lee, Joong-Youn Lee, Beomseok Nam, Min Ah Kim |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Molecular Electrostatic Potential Evaluation with the Fragment Molecular Orbital Method |
Yuri Alexeev, Dmitri Fedorov |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Benchmark Simulation and Experimental Testbed Studies of AWGR-Based, Multi-Layer Photonic Interconnects for Low- Latency, Energy-Efficient Computing Architectures |
Paolo Grani, Roberto Proietti, Zheng Cao, S. J. Ben Yoo |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Large-Scale and Massively Parallel Phase-Field Simulations of Pattern Formation in Ternary Eutectic Alloys |
Johannes Hötzer, Martin Bauer, Marcus Jainta, Philipp Steinmetz, Marco Berghoff, Florian Schornbaum, Christian Godenschwager, Harald Köstler, Ulrich Rüde, Britta Nestler |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Accurate and Efficient QM/MM Molecular Dynamics on 86,016 Cores of SuperMUC Phase 2 |
Magnus Schwörer, Konstantin Lorenzen, Momme Allalen, Ferdinand Jamitzky, Helmut Satzger, Gerald Mathias |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
A Splitting Approach for the Parallel Solution of Large Linear Systems on GPU Cards |
Ang Li, Radu Serban, Dan Negrut |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
MLTUNE: A Tool-Chain for Automating the Workflow of Machine-Learning Based Performance Tuning |
Biplab Kumar Saha, Saami Rahman, Apan Qasem |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
High Performance Data Structures for Multicore Environments |
Giuliano Laccetti, Marco Lapegna |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
BurstFS: A Distributed Burst Buffer File System for Scientific Applications |
Teng Wang, Kathryn Mohror, Adam Moody, Weikuan Yu, Kento Sato |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Increasing Fabric Utilization with Job-Aware Routing |
Jens Domke |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Exploiting Domain Knowledge to Optimize Mesh Partitioning for Multi-Scale Methods |
Muhammad Hasan Jamal, Milind Kulkarni, Arun Prakash |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Efficient GPU Techniques for Processing Temporally Correlated Satellite Image Data |
Tahsin A. Reza, Dipayan Mukherjee, Tanuj Kr Aasawat, Matei Ripeanu |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
PDE Preconditioner Resilient to Soft and Hard Faults |
Francesco Rizzi, Karla Morris, Kathryn Dahlgren, Khachik Sargsyan, Paul Mycek, Cosmin Safta, Olivier LeMaitre, Omar Knio, Bert Debusschere |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Evaluating DVFS and Concurrency Throttling on IBM's Power8 Architecture |
Wei Wang, Edgar A. Leon |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Analyzing the Performance of a Sparse Matrix Vector Multiply for Extreme Scale Computers |
Amanda Bienz, Jon Calhoun, Luke Olson, Marc Snir, William D. Gropp |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
PPP: Parallel Programming with Pictures |
Annette C. Feng, Wu Feng, Eli Tilevich |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Accelerating the B-Spline Evaluation in Quantum Monte Carlo |
Ye Luo, Anouar Benali, Vitali Morozov |
Level 4 - Lobby |
|
5:15PM - 7:00PM |
Design of a NVRAM Specialized Degree Aware Dynamic Graph Data Structure |
Keita Iwabuchi, Roger A. Pearce, Brian Van Essen, Maya Gokhale, Satoshi Matsuoka |
Level 4 - Lobby |
|