SCHEDULE: NOV 15-20, 2015
A Parallel Connectivity Algorithm for de Bruijn Graphs in Metagenomic Applications
SESSION: Applications: Biophysics and Genomics
EVENT TYPE: Papers
TIME: 2:00PM - 2:30PM
SESSION CHAIR(S): Amanda Randles
AUTHOR(S): Patrick Flick, Chirag Jain, Tony Pan, Srinivas Aluru
ROOM: 18AB
ABSTRACT:
Dramatic advances in DNA sequencing technology have made it possible to study microbial environments by directly sequencing environmental DNA samples. Yet, because of the huge volume and high complexity of the data, current de novo assemblers either cannot handle large metagenomic datasets or fail to assemble them with acceptable quality. This paper presents the first parallel solution for decomposing the metagenomic assembly problem without compromising post-assembly quality. We transform this problem into that of finding weakly connected components in the de Bruijn graph. We propose a novel distributed memory algorithm to identify the connected subgraphs, and present strategies to minimize the communication volume. We demonstrate the scalability of our algorithm on a soil metagenome dataset with 1.8 billion reads. Our approach achieves a runtime of 22 minutes using 1280 Intel Xeon cores for a 421 GB uncompressed FASTQ dataset. Moreover, our solution is generalizable to finding connected components in arbitrary undirected graphs.
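To make the problem transformation concrete, the following is a minimal sequential sketch, not the distributed memory algorithm presented in the paper. It assumes that graph nodes are the k-mers of each read, that consecutive k-mers within a read are joined by an edge, and that edge direction and reverse complements can be ignored. The names `kmers`, `UnionFind`, and `partition_reads` are hypothetical and chosen only for illustration. Reads that fall into the same weakly connected component are grouped together, so that each group could in principle be assembled independently.

```python
def kmers(read, k):
    """Yield every length-k substring of the read, left to right."""
    for i in range(len(read) - k + 1):
        yield read[i:i + k]


class UnionFind:
    """Disjoint-set structure over hashable items (here: k-mer strings)."""

    def __init__(self):
        self.parent = {}

    def find(self, x):
        # Register unseen items as their own root.
        self.parent.setdefault(x, x)
        root = x
        while self.parent[root] != root:
            root = self.parent[root]
        # Path compression: point every node on the path at the root.
        while self.parent[x] != root:
            self.parent[x], x = root, self.parent[x]
        return root

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            self.parent[rb] = ra


def partition_reads(reads, k):
    """Group read indices by weakly connected component of the k-mer graph.

    Assumption (illustrative only): edges are treated as undirected and
    reverse complements are ignored.
    """
    uf = UnionFind()
    for read in reads:
        prev = None
        for km in kmers(read, k):
            uf.find(km)  # make sure the node exists
            if prev is not None:
                uf.union(prev, km)  # edge between consecutive k-mers
            prev = km

    groups = {}
    for idx, read in enumerate(reads):
        if len(read) < k:
            continue  # a read shorter than k contributes no k-mers
        # All k-mers of one read share a component, so the first suffices.
        groups.setdefault(uf.find(read[:k]), []).append(idx)
    return sorted(groups.values())


reads = ["ACGTAC", "GTACGG", "TTTTTA", "CCCCC"]
print(partition_reads(reads, 3))  # prints [[0, 1], [2], [3]]
```

In this toy input, the first two reads share the k-mers `GTA` and `TAC` and therefore land in one component, while the other two reads form components of their own. Because the union-find structure only needs hashable node labels, the same sketch applies to connected components in any undirected graph given as an edge list.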
Chair/Author Details:
Amanda Randles (Chair) - Duke University
Patrick Flick - Georgia Institute of Technology
Chirag Jain - Georgia Institute of Technology
Tony Pan - Georgia Institute of Technology
Srinivas Aluru - Georgia Institute of Technology