Conferences in DBLP
Entering the petaflop era: the architecture and performance of Roadrunner. [Citation Graph (, )][DBLP ] High performance discrete Fourier transforms on graphics processors. [Citation Graph (, )][DBLP ] Dynamically adapting file domain partitioning methods for collective I/O based on underlying parallel file system locking protocols. [Citation Graph (, )][DBLP ] Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures. [Citation Graph (, )][DBLP ] Bandwidth intensive 3-D FFT kernel for GPUs using CUDA. [Citation Graph (, )][DBLP ] Using server-to-server communication in parallel file systems to simplify consistency and improve performance. [Citation Graph (, )][DBLP ] Scientific application-based performance comparison of SGI Altix 4700, IBM POWER5+, and SGI ICE 8200 supercomputers. [Citation Graph (, )][DBLP ] Adapting a message-driven parallel application to GPU-accelerated clusters. [Citation Graph (, )][DBLP ] Scaling parallel I/O performance through I/O delegate and caching system. [Citation Graph (, )][DBLP ] Efficient management of data center resources for massively multiplayer online games. [Citation Graph (, )][DBLP ] Performance optimization of TCP/IP over 10 gigabit ethernet by precise instrumentation. [Citation Graph (, )][DBLP ] A multi-level parallel simulation approach to electron transport in nano-scale transistors. [Citation Graph (, )][DBLP ] Feedback-controlled resource sharing for predictable eScience. [Citation Graph (, )][DBLP ] Wide-area performance profiling of 10GigE and InfiniBand technologies. [Citation Graph (, )][DBLP ] Accelerating configuration interaction calculations for nuclear structure. [Citation Graph (, )][DBLP ] Efficient auction-based grid reservations using dynamic programming. [Citation Graph (, )][DBLP ] Asymmetric interactions in symmetric multi-core systems: analysis, enhancements and evaluation. [Citation Graph (, )][DBLP ] Dendro: parallel algorithms for multigrid and AMR methods on 2: 1 balanced octrees. [Citation Graph (, )][DBLP ] Characterizing application sensitivity to OS interference using kernel-level noise injection. [Citation Graph (, )][DBLP ] Performance prediction of large-scale parallell system and application using macro-level simulation. [Citation Graph (, )][DBLP ] A novel domain oriented approach for scientific grid workflow composition. [Citation Graph (, )][DBLP ] Toward loosely coupled programming on petascale systems. [Citation Graph (, )][DBLP ] Early evaluation of IBM BlueGene/P. [Citation Graph (, )][DBLP ] Nimrod/K: towards massively parallel dynamic grid workflows. [Citation Graph (, )][DBLP ] SMARTMAP: operating system support for efficient data sharing among processes on a multi-core processor. [Citation Graph (, )][DBLP ] Lessons learned at 208K: towards debugging millions of cores. [Citation Graph (, )][DBLP ] Applying double auctions for scheduling of workflows on the Grid. [Citation Graph (, )][DBLP ] A novel migration-based NUCA design for chip multiprocessors. [Citation Graph (, )][DBLP ] Communication avoiding Gaussian elimination. [Citation Graph (, )][DBLP ] Extending CC-NUMA systems to support write update optimizations. [Citation Graph (, )][DBLP ] Benchmarking GPUs to tune dense linear algebra. [Citation Graph (, )][DBLP ] High-radix crossbar switches enabled by proximity communication. [Citation Graph (, )][DBLP ] Massively parallel genomic sequence search on the Blue Gene/P architecture. [Citation Graph (, )][DBLP ] The role of MPI in development time: a case study. [Citation Graph (, )][DBLP ] An efficient parallel approach for identifying protein families in large-scale metagenomic data sets. [Citation Graph (, )][DBLP ] An adaptive cut-off for task parallelism. [Citation Graph (, )][DBLP ] EpiSimdemics: an efficient algorithm for simulating the spread of infectious disease over large realistic social networks. [Citation Graph (, )][DBLP ] Programming the Intel 80-core network-on-a-chip terascale processor. [Citation Graph (, )][DBLP ] PAM: a novel performance/power aware meta-scheduler for multi-core systems. [Citation Graph (, )][DBLP ] Hiding I/O latency with pre-execution prefetching for parallel applications. [Citation Graph (, )][DBLP ] A dynamic scheduler for balancing HPC applications. [Citation Graph (, )][DBLP ] Characterizing and predicting the I/O performance of HPC applications using a parameterized synthetic benchmark. [Citation Graph (, )][DBLP ] Proactive process-level live migration in HPC environments. [Citation Graph (, )][DBLP ] Parallel I/O prefetching using MPI file caching and I/O signatures. [Citation Graph (, )][DBLP ] BitDew: a programmable environment for large-scale data management and distribution. [Citation Graph (, )][DBLP ] Scalable load-balance measurement for SPMD codes. [Citation Graph (, )][DBLP ] Using overlays for efficient data transfer over shared wide-area networks. [Citation Graph (, )][DBLP ] Massively parallel volume rendering using 2-3 swap image compositing. [Citation Graph (, )][DBLP ] Capturing performance knowledge for automated analysis. [Citation Graph (, )][DBLP ] The cost of doing science on the cloud: the Montage example. [Citation Graph (, )][DBLP ] High performance multivariate visual data exploration for extremely large data. [Citation Graph (, )][DBLP ] Analysis of application heartbeats: learning structural and temporal features in time series data for identification of performance problems. [Citation Graph (, )][DBLP ] Server-storage virtualization: integration and load balancing in data centers. [Citation Graph (, )][DBLP ] Materialized community ground models for large-scale earthquake simulation. [Citation Graph (, )][DBLP ] Positivity, posynomials and tile size selection. [Citation Graph (, )][DBLP ] A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories. [Citation Graph (, )][DBLP ] Global trees: a framework for linked data structures on distributed memory parallel systems. [Citation Graph (, )][DBLP ] Parallel exact inference on the cell broadband engine processor. [Citation Graph (, )][DBLP ] Prefetch throttling and data pinning for improving performance of shared caches. [Citation Graph (, )][DBLP ] High-frequency simulations of global seismic wave propagation using SPECFEM3D_GLOBE on 62K processors. [Citation Graph (, )][DBLP ] New algorithm to enable 400+ TFlop/s sustained performance in simulations of disorder effects in high-T c superconductors. [Citation Graph (, )][DBLP ] Scalable adaptive mantle convection simulation on petascale supercomputers. [Citation Graph (, )][DBLP ] 0.374 Pflop/s trillion-particle kinetic modeling of laser plasma interaction on Roadrunner. [Citation Graph (, )][DBLP ] 369 Tflop/s molecular dynamics simulations on the Roadrunner general-purpose heterogeneous supercomputer. [Citation Graph (, )][DBLP ] Linearly scaling 3D fragment method for large-scale electronic structure calculations. [Citation Graph (, )][DBLP ]