SC is the International Conference for
 High Performnance Computing, Networking, Storage and Analysis

SCHEDULE: NOV 13-19, 2010

DASH: a Recipe for a Flash-based Data Intensive Supercomputer

SESSION: Storage Technologies


TIME: 2:00PM - 2:30PM

SESSION CHAIR: Ethan L. Miller

AUTHOR(S):Jiahua He, Arun Jagatheesan, Sandeep Gupta, Jeffrey Bennett, Allan Snavely


Data intensive computing can be defined as computation involving large datasets and complicated I/O patterns. Data intensive computing is challenging because there is a five-orders-of-magnitude latency gap between main memory DRAM and spinning hard disks. To address this problem we designed and built a prototype data intensive supercomputer named DASH that exploits flash-based Solid State Drive (SSD) technology and also virtually aggregated DRAM to fill the “latency gap”. DASH uses commodity parts including Intel® X25-E flash drives and distributed shared memory (DSM) software from ScaleMP®. We present here an overview of the design of DASH, an analysis of its cost efficiency, then a detailed recipe for how we designed and tuned it for high data-performance, lastly show that running data-intensive scientific applications from graph theory, biology, and astronomy, we achieved as much as two orders-of-magnitude speedup compared to the same applications run on traditional architectures.

Chair/Author Details:

Ethan L. Miller (Chair) - University of California, Santa Cruz

Jiahua He - University of California, San Diego

Arun Jagatheesan - University of California, San Diego

Sandeep Gupta - University of California, San Diego

Jeffrey Bennett - University of California, San Diego

Allan Snavely - University of California, San Diego

Add to iCal  Click here to download .ics calendar file

Add to Outlook  Click here to download .vcs calendar file

Add to Google Calendarss  Click here to add event to your Google Calendar

The full paper can be found in the ACM Digital Library and IEEE Computer Society

   Sponsors    IEEE    ACM