SC is the International Conference for
 High Performnance Computing, Networking, Storage and Analysis

SCHEDULE: NOV 13-19, 2010

PLFS: A Fast Checkpoint Filesystem

SESSION: Research Poster Reception


TIME: 5:15PM - 7:00PM

AUTHOR(S):Milo Polte, John Bent, Garth Gibson, Gary Grider, Ben McClelland, James Nunez, Meghan Wingate

ROOM:Main Lobby

Parallel applications running on high performance computing clusters across thousands of processors rely on checkpointing to protect themselves from failures. The process of writing a checkpoint must be completed quickly so that applications may return to useful work. We present the Parallel Log-structured Filesystem (PLFS), a middleware layer for accelerating application checkpoints. PLFS transparently decouples concurrent checkpoints into a filesystem-efficient access pattern of independent writes to individual log-files. By decoupling writes in this manner PLFS dramatically decreases the time required to perform the checkpoint. Our evaluation demonstrates that PLFS provides 2x-150x speedups for application checkpointing, with greater greater benefits at larger scale. PLFS has been implemented as a FUSE-based filesystem requiring no changes to either application code or the underlying parallel filesystem and is being put into production at Los Alamos National Laboratory. This poster also presents new performance numbers for accessing PLFS directly as a library or through MPI-IO.

Chair/Author Details:

Milo Polte - Carnegie Mellon University

John Bent - Los Alamos National Laboratory

Garth Gibson - Carnegie Mellon University / Panasas Inc.

Gary Grider - Los Alamos National Laboratory

Ben McClelland - Los Alamos National Laboratory

James Nunez - Los Alamos National Laboratory

Meghan Wingate - Los Alamos National Laboratory

Add to iCal  Click here to download .ics calendar file

Add to Outlook  Click here to download .vcs calendar file

Add to Google Calendarss  Click here to add event to your Google Calendar

   Sponsors    IEEE    ACM