BEGIN:VCALENDAR
PRODID:-//Microsoft Corporation//Outlook MIMEDIR//EN
VERSION:1.0
BEGIN:VEVENT
DTSTART:20101118T193000Z
DTEND:20101118T200000Z
LOCATION:391-392
DESCRIPTION;ENCODING=QUOTED-PRINTABLE:ABSTRACT: This paper focuses on the use of GPGPU-based clusters for hierarchical N-body simulations. Whereas the behavior of these hierarchical methods has been studied in the past on CPU-based architectures, we investigate key performance issues in the context of clusters of GPUs. These include kernel organization and efficiency, the balance between tree traversal and force computation work, grain size selection through the tuning of offloaded work request sizes, and the reduction of sequential bottlenecks. The effect of various application parameters is modeled and experiments are carried out to quantify gains in performance. Our studies are carried out in the context of a production-quality parallel cosmological simulator called ChaNGa. We highlight the re-engineering of the application to make it more suitable for GPU-based environments. Finally, we present scaling performance results from experiments on the NCSA Lincoln GPU cluster.
SUMMARY:Scaling Hierarchical N-Body Simulations on GPU Clusters
PRIORITY:3
END:VEVENT
END:VCALENDAR
