P. Sadayappan :
Publications of Author
Qingda Lu , Sriram Krishnamoorthy , P. Sadayappan Combining analytical and empirical approaches in tuning matrix transposition. [Citation Graph (0, 0)][DBLP ] PACT, 2006, pp:233-242 [Conf ] V. Prasad Krothapalli , P. Sadayappan Exploiting Parallelism Through Run-Time Analysis on a Vector Processor (Abstract). [Citation Graph (0, 0)][DBLP ] ACM Conference on Computer Science, 1990, pp:434- [Conf ] Darius Buntinas , Dhabaleswar K. Panda , José Duato , P. Sadayappan Broadcast/Multicast over Myrinet Using NIC-Assisted Multidestination Messages. [Citation Graph (0, 0)][DBLP ] CANPC, 2000, pp:115-129 [Conf ] Matthew G. Jacunski , Vijay Moorthy , Peter P. Ware , Manoj Pillai , Dhabaleswar K. Panda , P. Sadayappan Low Latency Message-Passing for Reflective Memory Networks. [Citation Graph (0, 0)][DBLP ] CANPC, 1999, pp:211-224 [Conf ] Vijay Moorthy , Dhabaleswar K. Panda , P. Sadayappan Fast Collective Communication Algorithms for Reflective Memory Network Clusters. [Citation Graph (0, 0)][DBLP ] CANPC, 2000, pp:100-114 [Conf ] Gaurav Khanna 0002 , Nagavijayalakshmi Vydyanathan , Tahsin M. Kurç , Ümit V. Çatalyürek , Pete Wyckoff , Joel H. Saltz , P. Sadayappan A hypergraph partitioning based approach for scheduling of tasks with batch-shared I/O. [Citation Graph (0, 0)][DBLP ] CCGRID, 2005, pp:792-799 [Conf ] Mohammad Islam , Pavan Balaji , P. Sadayappan , Dhabaleswar K. Panda Towards provision of quality of service guarantees in job scheduling. [Citation Graph (0, 0)][DBLP ] CLUSTER, 2004, pp:245-254 [Conf ] Sriram Krishnamoorthy , Gerald Baumgartner , Daniel Cociorva , Chi-Chung Lam , P. Sadayappan Efficient Parallel Out-of-Core Matrix Transposition. [Citation Graph (0, 0)][DBLP ] CLUSTER, 2003, pp:300-307 [Conf ] Sudha Srinivasan , Sriram Krishnamoorthy , P. Sadayappan A Robust Scheduling Strategy for Moldable Scheduling of Parallel Jobs. 
[Citation Graph (0, 0)][DBLP ] CLUSTER, 2003, pp:92-99 [Conf ] Vijay Subramani , Rajkumar Kettimuthu , Srividya Srinivasan , Jeanette Johnston , P. Sadayappan Selective Buddy Allocation for Scheduling Parallel Jobs on Clusters. [Citation Graph (0, 0)][DBLP ] CLUSTER, 2002, pp:107-0 [Conf ] Gerald Sabin , V. Sahasrabudhe , P. Sadayappan On fairness in distributed job scheduling across multiple sites. [Citation Graph (0, 0)][DBLP ] CLUSTER, 2004, pp:35-44 [Conf ] Debabrata Ghosh , S. K. Nandy , P. Sadayappan , K. Parthasarathy Architectural Synthesis of Performance-Driven Multipliers with Accumulator Interleaving. [Citation Graph (0, 0)][DBLP ] DAC, 1993, pp:303-307 [Conf ] P. Sadayappan , V. Visvanathan Efficient Sparse Matrix Factorization for Circuit Simulation on Vector Supercomputers. [Citation Graph (0, 0)][DBLP ] DAC, 1989, pp:13-18 [Conf ] V. Ashok , Roger L. Costello , P. Sadayappan Modeling switch-level simulation using data flow. [Citation Graph (0, 0)][DBLP ] DAC, 1985, pp:637-644 [Conf ] Uday Bondhugula , Ananth Devulapalli , James Dinan , Joseph Fernando , Pete Wyckoff , Eric Stahlberg , P. Sadayappan Hardware/Software Integration for FPGA-based All-Pairs Shortest-Paths. [Citation Graph (0, 0)][DBLP ] FCCM, 2006, pp:152-164 [Conf ] Nagavijayalakshmi Vydyanathan , Gaurav Khanna 0002 , Tahsin M. Kurç , Ümit V. Çatalyürek , Pete Wyckoff , Joel H. Saltz , P. Sadayappan Use of PVFS for Efficient Execution of Jobs with Pipeline-Shared I/O. [Citation Graph (0, 0)][DBLP ] GRID, 2004, pp:235-242 [Conf ] Mohammad Banikazemi , Jayanthi Sampathkumar , Sandeep Prabhu , Dhabaleswar K. Panda , P. Sadayappan Communication Modeling of Heterogeneous Networks of Workstations for Performance Characterization of Collective Operations. [Citation Graph (0, 0)][DBLP ] Heterogeneous Computing Workshop, 1999, pp:125-0 [Conf ] Daniel Cociorva , J. W. Wilkins , Gerald Baumgartner , P. Sadayappan , J. Ramanujam , Marcel Nooijen , David E. Bernholdt , Robert J. 
Harrison Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization. [Citation Graph (0, 0)][DBLP ] HiPC, 2001, pp:237-248 [Conf ] Praveen Holenarsipur , Vladimir Yarmolenko , José Duato , Dhabaleswar K. Panda , P. Sadayappan Characterization and enhancement of Static Mapping Heuristics for Heterogeneous Systems. [Citation Graph (0, 0)][DBLP ] HiPC, 2000, pp:37-48 [Conf ] Sriram Krishnamoorthy , Gerald Baumgartner , Chi-Chung Lam , Jarek Nieplocha , P. Sadayappan Efficient Layout Transformation for Disk-Based Multidimensional Arrays. [Citation Graph (0, 0)][DBLP ] HiPC, 2004, pp:386-398 [Conf ] Sandhya Krishnan , Sriram Krishnamoorthy , Gerald Baumgartner , Daniel Cociorva , Chi-Chung Lam , P. Sadayappan , J. Ramanujam , David E. Bernholdt , Venkatesh Choppella Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. [Citation Graph (0, 0)][DBLP ] HiPC, 2003, pp:406-417 [Conf ] Chi-Chung Lam , Daniel Cociorva , Gerald Baumgartner , P. Sadayappan Memory-Optimal Evaluation of Expression Trees Involving Large Objects. [Citation Graph (0, 0)][DBLP ] HiPC, 1999, pp:103-110 [Conf ] Srividya Srinivasan , Vijay Subramani , Rajkumar Kettimuthu , Praveen Holenarsipur , P. Sadayappan Effective Selection of Partition Sizes for Moldable Scheduling of Parallel Jobs. [Citation Graph (0, 0)][DBLP ] HiPC, 2002, pp:174-183 [Conf ] Sriram Krishnamoorthy , Jarek Nieplocha , P. Sadayappan Data and Computation Abstractions for Dynamic and Irregular Computations. [Citation Graph (0, 0)][DBLP ] HiPC, 2005, pp:258-269 [Conf ] Vijay Subramani , Rajkumar Kettimuthu , Srividya Srinivasan , P. Sadayappan Distributed Job Scheduling on Computational Grids Using Multiple Simultaneous Requests. [Citation Graph (0, 0)][DBLP ] HPDC, 2002, pp:359-0 [Conf ] Albert Hartono , Qingda Lu , Xiaoyang Gao , Sriram Krishnamoorthy , Marcel Nooijen , Gerald Baumgartner , David E. Bernholdt , Venkatesh Choppella , Russell M. 
Pitzer , J. Ramanujam , Atanas Rountev , P. Sadayappan Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations. [Citation Graph (0, 0)][DBLP ] International Conference on Computational Science (1), 2006, pp:267-275 [Conf ] Albert Hartono , Alexander Sibiryakov , Marcel Nooijen , Gerald Baumgartner , David E. Bernholdt , So Hirata , Chi-Chung Lam , Russell M. Pitzer , J. Ramanujam , P. Sadayappan Automated Operation Minimization of Tensor Contraction Expressions in Electronic Structure Calculations. [Citation Graph (0, 0)][DBLP ] International Conference on Computational Science (1), 2005, pp:155-164 [Conf ] Thiagaraja Gopalsamy , Mukesh Singhal , Dhabaleswar K. Panda , P. Sadayappan A Reliable Multicast Algorithm for Mobile Ad Hoc Networks. [Citation Graph (0, 0)][DBLP ] ICDCS, 2002, pp:563-570 [Conf ] Sandeep K. S. Gupta , Chua-Huang Huang , Rodney W. Johnson , P. Sadayappan Communication-Efficient Implementation of Block Recursive Algorithms on Distributed-Memory Machines. [Citation Graph (0, 0)][DBLP ] ICPADS, 1994, pp:113-119 [Conf ] Mohammad Banikazemi , Jiuxing Liu , Dhabaleswar K. Panda , P. Sadayappan Implementing TreadMarks over VIA on Myrinet and Gigabit Ethernet: Challenges, Design Experience, and Performance Evaluation. [Citation Graph (0, 0)][DBLP ] ICPP, 2001, pp:167-174 [Conf ] Kalluri Eswar , P. Sadayappan , Chua-Huang Huang Compile-Time Characterization of Recurrent Patterns in Irregular Computations. [Citation Graph (0, 0)][DBLP ] ICPP, 1993, pp:148-155 [Conf ] Kalluri Eswar , P. Sadayappan , Chua-Huang Huang , V. Visvanathan Supernodal Sparse Cholesky Factorization on Distributed-Memory Multiprocessors. [Citation Graph (0, 0)][DBLP ] ICPP, 1993, pp:18-22 [Conf ] Kalluri Eswar , P. Sadayappan , V. Visvanathan Multifrontal Factorization of Sparse Matrices on Shared-Memory Multiprocessors. [Citation Graph (0, 0)][DBLP ] ICPP (3), 1991, pp:159-166 [Conf ] Abhishek Gulati , Dhabaleswar K. Panda , P. 
Sadayappan , Pete Wyckoff NIC-Based Rate Control for Proportional Bandwidth Allocation in Myrinet Clusters. [Citation Graph (0, 0)][DBLP ] ICPP, 2001, pp:305-312 [Conf ] Sandeep K. S. Gupta , S. D. Kaushik , S. Mufti , Sanjay Sharma , Chua-Huang Huang , P. Sadayappan On Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines. [Citation Graph (0, 0)][DBLP ] ICPP, 1993, pp:301-305 [Conf ] S. K. Nandy , Ranjani Narayan , V. Visvanathan , P. Sadayappan , Prashant S. Chauhan A Parallel Progressive Refinement Image Rendering Algorithm on a Scalable Multithreaded VLSI Processor Array. [Citation Graph (0, 0)][DBLP ] ICPP, 1993, pp:94-97 [Conf ] J. Ramanujam , P. Sadayappan Tiling of Iteration Spaces for Multicomputers. [Citation Graph (0, 0)][DBLP ] ICPP (2), 1990, pp:179-186 [Conf ] Gerald Sabin , Garima Kochhar , P. Sadayappan Job Fairness in Non-Preemptive Job Scheduling. [Citation Graph (0, 0)][DBLP ] ICPP, 2004, pp:186-194 [Conf ] P. Sadayappan , Fikret Erçal , Steven Martin Mapping Finite Element Graphs onto Processor Meshes. [Citation Graph (0, 0)][DBLP ] ICPP, 1987, pp:192-195 [Conf ] N. S. Sundar , S. Jayanthi , P. Sadayappan , Miguel Visbal An Incremental Methodology for Parallelizing Legacy Stencil Codes on Message-Passing Computers. [Citation Graph (0, 0)][DBLP ] ICPP, 1999, pp:302-310 [Conf ] Nagavijayalakshmi Vydyanathan , Sriram Krishnamoorthy , Gerald Sabin , Ümit V. Çatalyürek , Tahsin M. Kurç , P. Sadayappan , Joel H. Saltz An Integrated Approach for Processor Allocation and Scheduling of Mixed-Parallel Applications. [Citation Graph (0, 0)][DBLP ] ICPP, 2006, pp:443-450 [Conf ] Scott Whitman , P. Sadayappan Computer Graphics Rendering on a Shared Memory Multiprocessor. [Citation Graph (0, 0)][DBLP ] ICPP (3), 1991, pp:197-200 [Conf ] Amr Zaky , P. Sadayappan Optimal Static Scheduling of Sequential Loops on Multiprocessors. [Citation Graph (0, 0)][DBLP ] ICPP (3), 1989, pp:130-137 [Conf ] Vipin Chaudhary , P. 
Sadayappan Message from the Co-Chairs. [Citation Graph (0, 0)][DBLP ] ICPP Workshops, 2002, pp:547-550 [Conf ] Vipin Chaudhary , P. Sadayappan Message from the Chairs: International Workshop on Compile and Run Time Techniques for Parallel Computing. [Citation Graph (0, 0)][DBLP ] ICPP Workshops, 2004, pp:497- [Conf ] Vipin Chaudhary , P. Sadayappan Message from the Chairs. [Citation Graph (0, 0)][DBLP ] ICPP Workshops, 2005, pp:282- [Conf ] Qingda Lu , Jiesheng Wu , Dhabaleswar K. Panda , P. Sadayappan Applying MPI Derived Datatypes to the NAS Benchmarks: A Case Study. [Citation Graph (0, 0)][DBLP ] ICPP Workshops, 2004, pp:538-545 [Conf ] Arindam Paul , Wu-chi Feng , Dhabaleswar K. Panda , P. Sadayappan Balancing Web Server Load for Adaptable Video Distribution. [Citation Graph (0, 0)][DBLP ] ICPP Workshops, 2000, pp:469-0 [Conf ] P. Sadayappan Message from the Chair. [Citation Graph (0, 0)][DBLP ] ICPP Workshops, 2000, pp:391-0 [Conf ] P. Sadayappan Message from the Chair. [Citation Graph (0, 0)][DBLP ] ICPP Workshops, 2002, pp:495-498 [Conf ] Srividya Srinivasan , Rajkumar Kettimuthu , Vijay Subramani , P. Sadayappan Characterization of Backfilling Strategies for Parallel Job Scheduling. [Citation Graph (0, 0)][DBLP ] ICPP Workshops, 2002, pp:514-522 [Conf ] Vladimir Yarmolenko , José Duato , Dhabaleswar K. Panda , P. Sadayappan Characterization and Enhancement of Dynamic Mapping Heuristics for Heterogeneous Systems. [Citation Graph (0, 0)][DBLP ] ICPP Workshops, 2000, pp:437-0 [Conf ] Daniel Cociorva , J. W. Wilkins , Chi-Chung Lam , Gerald Baumgartner , J. Ramanujam , P. Sadayappan Loop optimization for a class of memory-constrained computations. [Citation Graph (0, 0)][DBLP ] ICS, 2001, pp:103-113 [Conf ] Fikret Erçal , P. Sadayappan One-to-one mapping of process graphs onto a hypercube. [Citation Graph (0, 0)][DBLP ] ICS, 1989, pp:91-98 [Conf ] S. D. Kaushik , Chua-Huang Huang , Rodney W. Johnson , P. 
Sadayappan An approach to communication-efficient data redistribution. [Citation Graph (0, 0)][DBLP ] International Conference on Supercomputing, 1994, pp:364-373 [Conf ] V. Prasad Krothapalli , P. Sadayappan An approach to synchronization for parallel computing. [Citation Graph (0, 0)][DBLP ] ICS, 1988, pp:573-581 [Conf ] Bharat Kumar , P. Sadayappan , Chua-Huang Huang On sparse matrix reordering for parallel factorization. [Citation Graph (0, 0)][DBLP ] International Conference on Supercomputing, 1994, pp:431-438 [Conf ] P. Sadayappan , Fikret Erçal Cluster-Partitioning Approaches to Mapping Parallel Programs onto a Hypercube. [Citation Graph (0, 0)][DBLP ] ICS, 1987, pp:475-497 [Conf ] P. Sadayappan , V. Visvanathan Parallelization and performance evaluation of circuit simulation on a shared-memory multiprocessor. [Citation Graph (0, 0)][DBLP ] ICS, 1988, pp:254-265 [Conf ] N. S. Sundar , D. N. Jayasimha , Dhabaleswar K. Panda , P. Sadayappan Hybrid Algorithms for Complete Exchange in 2D Meshes. [Citation Graph (0, 0)][DBLP ] International Conference on Supercomputing, 1996, pp:181-188 [Conf ] Alpesh Amin , P. Sadayappan , Murali Gudavalli A Clustered Reduced Communication Element by Element Preconditioned Conjugate Gradient Algorithm for Finite Element Computations. [Citation Graph (0, 0)][DBLP ] IPPS, 1994, pp:509-516 [Conf ] Mohammad Banikazemi , Jiuxing Liu , S. Kutlug , P. Sadayappan , H. Shah , Dhabaleswar K. Panda VIBe: A Micro-benchmark Suite for Evaluating Virtual Interface Architecture (VIA) Implementations. [Citation Graph (0, 0)][DBLP ] IPDPS, 2001, pp:24- [Conf ] Gerald Baumgartner , David E. Bernholdt , Daniel Cociorva , Chi-Chung Lam , J. Ramanujam , Robert J. Harrison , Marcel Nooijen , P. Sadayappan A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs. [Citation Graph (0, 0)][DBLP ] IPDPS, 2002, pp:- [Conf ] Darius Buntinas , Dhabaleswar K. Panda , P. 
Sadayappan Fast NIC-Based Barrier over Myrinet/GM. [Citation Graph (0, 0)][DBLP ] IPDPS, 2001, pp:52- [Conf ] Darius Buntinas , Dhabaleswar K. Panda , P. Sadayappan Performance Benefits of NIC-Based Barrier on Myrinet/GM. [Citation Graph (0, 0)][DBLP ] IPDPS, 2001, pp:166- [Conf ] Daniel Cociorva , Xiaoyang Gao , Sandhya Krishnan , Gerald Baumgartner , Chi-Chung Lam , P. Sadayappan , J. Ramanujam Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints. [Citation Graph (0, 0)][DBLP ] IPDPS, 2003, pp:37- [Conf ] Matthew G. Jacunski , P. Sadayappan , Dhabaleswar K. Panda All-to-All Broadcast on Switch-Based Clusters of Workstations. [Citation Graph (0, 0)][DBLP ] IPPS/SPDP, 1999, pp:325-329 [Conf ] S. D. Kaushik , Chua-Huang Huang , J. Ramanujam , P. Sadayappan Multi-phase array redistribution: modeling and evaluation. [Citation Graph (0, 0)][DBLP ] IPPS, 1995, pp:441-445 [Conf ] Sandhya Krishnan , Sriram Krishnamoorthy , Gerald Baumgartner , Chi-Chung Lam , J. Ramanujam , P. Sadayappan , Venkatesh Choppella Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver. [Citation Graph (0, 0)][DBLP ] IPDPS, 2004, pp:- [Conf ] Bharat Kumar , Chua-Huang Huang , Rodney W. Johnson , P. Sadayappan A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction. [Citation Graph (0, 0)][DBLP ] IPPS, 1993, pp:582-588 [Conf ] Vijay Moorthy , Matthew G. Jacunski , Manoj Pillai , Peter P. Ware , Dhabaleswar K. Panda , Thomas W. Page Jr. , P. Sadayappan , V. Nagarajan , Johns Daniel Low-Latency Message Passing on Workstation Clusters using SCRAMNet. [Citation Graph (0, 0)][DBLP ] IPPS/SPDP, 1999, pp:148-152 [Conf ] Swarup Kumar Sahoo , Rajkiran Panuganti , Sriram Krishnamoorthy , P. Sadayappan Cache Miss Characterization and Data Locality Optimization for Imperfectly Nested Loops on Shared Memory Multiprocessors. 
[Citation Graph (0, 0)][DBLP ] IPDPS, 2005, pp:- [Conf ] Amit Singhal , Mohammad Banikazemi , P. Sadayappan , Dhabaleswar K. Panda Efficient Multicast Algorithms for Heterogeneous Switch-based Irregular Networks of Workstations. [Citation Graph (0, 0)][DBLP ] IPDPS, 2001, pp:71- [Conf ] Uday Bondhugula , Ananth Devulapalli , Joseph Fernando , Pete Wyckoff , P. Sadayappan Parallel FPGA-based all-pairs shortest-paths in a directed graph. [Citation Graph (0, 0)][DBLP ] IPDPS, 2006, pp:- [Conf ] Sriram Krishnamoorthy , Ümit V. Çatalyürek , Jarek Nieplocha , Atanas Rountev , P. Sadayappan An extensible global address space framework with decoupled task and data abstractions. [Citation Graph (0, 0)][DBLP ] IPDPS, 2006, pp:- [Conf ] A. Allam , J. Ramanujam , Gerald Baumgartner , P. Sadayappan Memory minimization for tensor contractions using integer linear programming. [Citation Graph (0, 0)][DBLP ] IPDPS, 2006, pp:- [Conf ] Sriram Krishnamoorthy , Ümit V. Çatalyürek , Jarek Nieplocha , P. Sadayappan An approach to locality-conscious load balancing and transparent memory hierarchy management with a global-address-space parallel programming model. [Citation Graph (0, 0)][DBLP ] IPDPS, 2006, pp:- [Conf ] Gerald Sabin , Rajkumar Kettimuthu , Arun Rajan , P. Sadayappan Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environment. [Citation Graph (0, 0)][DBLP ] JSSPP, 2003, pp:87-104 [Conf ] Gerald Sabin , P. Sadayappan Unfairness Metrics for Space-Sharing Parallel Job Schedulers. [Citation Graph (0, 0)][DBLP ] JSSPP, 2005, pp:238-256 [Conf ] Srividya Srinivasan , Rajkumar Kettimuthu , Vijay Subramani , P. Sadayappan Selective Reservation Strategies for Backfill Job Scheduling. [Citation Graph (0, 0)][DBLP ] JSSPP, 2002, pp:55-71 [Conf ] Mohammad Islam , Pavan Balaji , P. Sadayappan , Dhabaleswar K. Panda QoPS: A QoS Based Scheme for Parallel Job Scheduling. [Citation Graph (0, 0)][DBLP ] JSSPP, 2003, pp:252-268 [Conf ] Gaurav Khanna 0002 , Ümit V. 
Çatalyürek , Tahsin M. Kurç , P. Sadayappan , Joel H. Saltz A Data Locality Aware Online Scheduling Approach for I/O-Intensive Jobs with File Sharing. [Citation Graph (0, 0)][DBLP ] JSSPP, 2006, pp:141-160 [Conf ] Gerald Sabin , Matthew Lang , P. Sadayappan Moldable Parallel Job Scheduling Using Job Efficiency: An Iterative Approach. [Citation Graph (0, 0)][DBLP ] JSSPP, 2006, pp:94-114 [Conf ] Daniel Cociorva , Gerald Baumgartner , Chi-Chung Lam , P. Sadayappan , J. Ramanujam Memory-Constrained Communication Minimization for a Class of Array Computations. [Citation Graph (0, 0)][DBLP ] LCPC, 2002, pp:1-15 [Conf ] Alina Bibireata , Sandhya Krishnan , Gerald Baumgartner , Daniel Cociorva , Chi-Chung Lam , P. Sadayappan , J. Ramanujam , David E. Bernholdt , Venkatesh Choppella Memory-Constrained Data Locality Optimization for Tensor Contractions. [Citation Graph (0, 0)][DBLP ] LCPC, 2003, pp:93-108 [Conf ] Chua-Huang Huang , P. Sadayappan Communication-Free Hyperplane Partitioning of Nested Loops. [Citation Graph (0, 0)][DBLP ] LCPC, 1991, pp:186-200 [Conf ] S. D. Kaushik , Chua-Huang Huang , Rodney W. Johnson , P. Sadayappan A Methodology for Generating Efficient Disk-Based Algorithms from Tensor Product Formulas. [Citation Graph (0, 0)][DBLP ] LCPC, 1993, pp:358-373 [Conf ] S. D. Kaushik , Chua-Huang Huang , P. Sadayappan Incremental Generation of Index Sets for Array Statement Execution on Distributed-Memory Machines. [Citation Graph (0, 0)][DBLP ] LCPC, 1994, pp:251-265 [Conf ] S. D. Kaushik , Chua-Huang Huang , P. Sadayappan Compiling Array Statements for Efficient Execution on Distributed-Memory Machines: Two-Level Mappings. [Citation Graph (0, 0)][DBLP ] LCPC, 1995, pp:209-223 [Conf ] Konstantin Berlin , Jun Huan , Mary Jacob , Garima Kochhar , Jan Prins , William Pugh , P. Sadayappan , Jaime Spacco , Chau-Wen Tseng Evaluating the Impact of Programming Language Features on the Performance of Parallel Applications on Cluster Architectures. 
[Citation Graph (0, 0)][DBLP ] LCPC, 2003, pp:194-208 [Conf ] Sandeep K. S. Gupta , Chua-Huang Huang , P. Sadayappan , Rodney W. Johnson On the Synthesis of Parallel Programs from Tensor Product Formulas for Block Recursive Algorithms. [Citation Graph (0, 0)][DBLP ] LCPC, 1992, pp:264-280 [Conf ] Chi-Chung Lam , Daniel Cociorva , Gerald Baumgartner , P. Sadayappan Optimization of Memory Usage Requirement for a Class of Loops Implementing Multi-dimensional Integrals. [Citation Graph (0, 0)][DBLP ] LCPC, 1999, pp:350-364 [Conf ] Chi-Chung Lam , P. Sadayappan , Rephael Wenger Optimal Reordering and Mapping of a Class of Nested-Loops for Parallel Execution. [Citation Graph (0, 0)][DBLP ] LCPC, 1996, pp:315-329 [Conf ] Qingda Lu , Xiaoyang Gao , Sriram Krishnamoorthy , Gerald Baumgartner , J. Ramanujam , P. Sadayappan Empirical Performance-Model Driven Data Layout Optimization. [Citation Graph (0, 0)][DBLP ] LCPC, 2004, pp:72-86 [Conf ] Xiaoyang Gao , Sriram Krishnamoorthy , Swarup Kumar Sahoo , Chi-Chung Lam , Gerald Baumgartner , J. Ramanujam , P. Sadayappan Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations. [Citation Graph (0, 0)][DBLP ] LCPC, 2005, pp:215-229 [Conf ] V. Prasad Krothapalli , P. Sadayappan Dynamic Scheduling of DOACROSS Loops for Multiprocessors. [Citation Graph (0, 0)][DBLP ] PARBASE / Architectures, 1990, pp:141-160 [Conf ] Daniel Cociorva , Gerald Baumgartner , Chi-Chung Lam , P. Sadayappan , J. Ramanujam , Marcel Nooijen , David E. Bernholdt , Robert J. Harrison Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. [Citation Graph (0, 0)][DBLP ] PLDI, 2002, pp:177-186 [Conf ] Xiaoyang Gao , Swarup Kumar Sahoo , Chi-Chung Lam , J. Ramanujam , Qingda Lu , Gerald Baumgartner , P. Sadayappan Performance modeling and optimization of parallel out-of-core tensor contractions. [Citation Graph (0, 0)][DBLP ] PPOPP, 2005, pp:266-276 [Conf ] V. Prasad Krothapalli , P. 
Sadayappan Removal of Redundant Dependences in DOACROSS Loops with Constant Dependences. [Citation Graph (0, 0)][DBLP ] PPOPP, 1991, pp:51-60 [Conf ] Uday Bondhugula , J. Ramanujam , P. Sadayappan Automatic mapping of nested loops to FPGAS. [Citation Graph (0, 0)][DBLP ] PPOPP, 2007, pp:101-111 [Conf ] Chi-Chung Lam , P. Sadayappan , Daniel Cociorva , Mebarek Alouani , John Wilkins Performance Optimization of a Class of Loops Involving Sums of Products of Sparse Arrays. [Citation Graph (0, 0)][DBLP ] PPSC, 1999, pp:- [Conf ] Chi-Chung Lam , P. Sadayappan , Rephael Wenger Optimization of a Class of Multi-Dimensional Integrals on Parallel Machines. [Citation Graph (0, 0)][DBLP ] PPSC, 1997, pp:- [Conf ] Gerald Baumgartner , David E. Bernholdt , Daniel Cociorva , Robert J. Harrison , So Hirata , Chi-Chung Lam , Marcel Nooijen , Russell M. Pitzer , J. Ramanujam , P. Sadayappan A high-level approach to synthesis of high-performance codes for quantum chemistry. [Citation Graph (0, 0)][DBLP ] SC, 2002, pp:1-10 [Conf ] D. L. Dai , Sandeep K. S. Gupta , S. D. Kaushik , J. H. Lu , R. V. Singh , Chua-Huang Huang , P. Sadayappan , Rodney W. Johnson EXTENT: a portable programming environment for designing and implementing high-performance block recursive algorithms. [Citation Graph (0, 0)][DBLP ] SC, 1994, pp:49-58 [Conf ] S. D. Kaushik , Chua-Huang Huang , John R. Johnson , Rodney W. Johnson , P. Sadayappan Efficient transposition algorithms for large matrices. [Citation Graph (0, 0)][DBLP ] SC, 1993, pp:656-665 [Conf ] S. D. Kaushik , Sanjay Sharma , Chua-Huang Huang , Jeremy R. Johnson , Rodney W. Johnson , P. Sadayappan An Algebraic Theory for Modeling Direct Interconnection Networks. [Citation Graph (0, 0)][DBLP ] SC, 1992, pp:488-497 [Conf ] J. Ramanujam , P. Sadayappan Tiling multidimensional iteration spaces for nonshared memory machines. [Citation Graph (0, 0)][DBLP ] SC, 1991, pp:111-120 [Conf ] Swarup Kumar Sahoo , Sriram Krishnamoorthy , Rajkiran Panuganti , P. 
Sadayappan Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions. [Citation Graph (0, 0)][DBLP ] SC, 2005, pp:13- [Conf ] Sriram Krishnamoorthy , Ümit V. Çatalyürek , Jarek Nieplocha , Atanas Rountev , P. Sadayappan Data management and query - Hypergraph partitioning for automatic memory hierarchy management. [Citation Graph (0, 0)][DBLP ] SC, 2006, pp:98- [Conf ] Jarek Nieplocha , Bruce Palmer , Manojkumar Krishnan , P. Sadayappan M12 - Overview of the global arrays parallel software development toolkit. [Citation Graph (0, 0)][DBLP ] SC, 2006, pp:226- [Conf ] Sandeep K. S. Gupta , S. D. Kaushik , Chua-Huang Huang , John R. Johnson , Rodney W. Johnson , P. Sadayappan On the Automatic Generation of Data Distributions. [Citation Graph (0, 0)][DBLP ] SIGPLAN Workshop, 1992, pp:82- [Conf ] Sanjay Sharma , Chua-Huang Huang , P. Sadayappan On Data Dependence Analysis for Compiling Programs on Distributed-Memory Machines (Extended Abstract). [Citation Graph (0, 0)][DBLP ] SIGPLAN Workshop, 1992, pp:13-16 [Conf ] Himanshu Gupta , P. Sadayappan Communication Efficient Matrix Multiplication on Hypercubes. [Citation Graph (0, 0)][DBLP ] SPAA, 1994, pp:320-329 [Conf ] Sandeep K. S. Gupta , S. D. Kaushik , Chua-Huang Huang , John R. Johnson , Rodney W. Johnson , P. Sadayappan A Methodology for Generating Data Distributions to Optimize Communication. [Citation Graph (0, 0)][DBLP ] SPDP, 1992, pp:436-441 [Conf ] Sailesh K. Rao , P. Sadayappan , Frank K. Hwang , Peter W. Shor The Rectilinear Steiner Arborescence Problem. [Citation Graph (0, 0)][DBLP ] Algorithmica, 1992, v:7, n:2&3, pp:277-288 [Journal ] Sandeep K. S. Gupta , Chua-Huang Huang , P. Sadayappan , Rodney W. Johnson A technique for overlapping computation and communication for block recursive algorithms. [Citation Graph (0, 0)][DBLP ] Concurrency - Practice and Experience, 1998, v:10, n:2, pp:73-90 [Journal ] P. Sadayappan , Fikret Erçal , J. 
Ramanujam Partitioning Graphs on Message-Passing Machines by Pairwise Mincut. [Citation Graph (0, 0)][DBLP ] Inf. Sci., 1998, v:111, n:1-4, pp:223-237 [Journal ] S. D. Kaushik , Sanjay Sharma , Chua-Huang Huang , John R. Johnson , Rodney W. Johnson , P. Sadayappan An Algebraic Theory for Modeling Direct Interconnection Networks. [Citation Graph (0, 0)][DBLP ] J. Inf. Sci. Eng., 1996, v:12, n:1, pp:25-49 [Journal ] Fikret Erçal , J. Ramanujam , P. Sadayappan Task Allocation onto a Hypercube by Recursive Mincut Bipartitioning. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1990, v:10, n:1, pp:35-44 [Journal ] Sandeep K. S. Gupta , Chua-Huang Huang , P. Sadayappan , Rodney W. Johnson A Framework for Generating Distributed-Memory Parallel Programs for Block Recursive Algorithms. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1996, v:34, n:2, pp:137-153 [Journal ] Sandeep K. S. Gupta , S. D. Kaushik , Chua-Huang Huang , P. Sadayappan Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1996, v:32, n:2, pp:155-172 [Journal ] Chua-Huang Huang , P. Sadayappan Communication-Free Hyperplane Partitioning of Nested Loops. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1993, v:19, n:2, pp:90-102 [Journal ] S. D. Kaushik , Chua-Huang Huang , P. Sadayappan Efficient Index Set Generation for Compiling HPF Array Statements on Distributed-Memory Machines. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1996, v:38, n:2, pp:237-247 [Journal ] Chi-Chung Lam , Chua-Huang Huang , P. Sadayappan Optimal Algorithms for All-to-All Personalized Communication on Rings and Two Dimensional Tori. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1997, v:43, n:1, pp:3-13 [Journal ] J. Ramanujam , P. Sadayappan Tiling Multidimensional Iteration Spaces for Multicomputers. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. 
Comput., 1992, v:16, n:2, pp:108-120 [Journal ] Sandhya Krishnan , Sriram Krishnamoorthy , Gerald Baumgartner , Chi-Chung Lam , J. Ramanujam , P. Sadayappan , Venkatesh Choppella Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 2006, v:66, n:5, pp:659-673 [Journal ] Himanshu Gupta , P. Sadayappan Communication-Efficient Matrix Multiplication on Hypercubes. [Citation Graph (0, 0)][DBLP ] Parallel Computing, 1996, v:22, n:1, pp:75-99 [Journal ] P. Sadayappan , Fikret Erçal , J. Ramanujam Cluster partitioning approaches to mapping parallel programs onto a hypercube. [Citation Graph (0, 0)][DBLP ] Parallel Computing, 1990, v:13, n:1, pp:1-16 [Journal ] Sandeep K. S. Gupta , Chua-Huang Huang , P. Sadayappan , Rodney W. Johnson Implementing Fast Fourier Transforms on Distributed-Memory Multiprocessors Using Data Redistributions. [Citation Graph (0, 0)][DBLP ] Parallel Processing Letters, 1994, v:4, n:, pp:477-488 [Journal ] Bharat Kumar , Kalluri Eswar , P. Sadayappan , Chua-Huang Huang A Clustering Algorithm for Parallel Sparse Cholesky Factorization. [Citation Graph (0, 0)][DBLP ] Parallel Processing Letters, 1995, v:5, n:, pp:685-696 [Journal ] Chi-Chung Lam , P. Sadayappan , Rephael Wenger On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution. [Citation Graph (0, 0)][DBLP ] Parallel Processing Letters, 1997, v:7, n:2, pp:157-168 [Journal ] Christian Engelmann , Stephen L. Scott , David E. Bernholdt , Narasimha R. Gottumukkala , Chokchai Leangsuksun , Jyothish Varma , Chao Wang , Frank Mueller , Aniruddha G. Shet , P. Sadayappan MOLAR: adaptive runtime support for high-end computing operating and runtime systems. [Citation Graph (0, 0)][DBLP ] Operating Systems Review, 2006, v:40, n:2, pp:63-72 [Journal ] Cevdet Aykanat , Füsun Özgüner , Fikret Erçal , P. 
Sadayappan Iterative Algorithms for Solution of Large Sparse Systems of Linear Equations on Hypercubes. [Citation Graph (0, 0)][DBLP ] IEEE Trans. Computers, 1988, v:37, n:12, pp:1554-1568 [Journal ] P. Sadayappan , Fikret Erçal Nearest-Neighbor Mapping of Finite Element Graphs onto Processor Meshes. [Citation Graph (0, 0)][DBLP ] IEEE Trans. Computers, 1987, v:36, n:12, pp:1408-1424 [Journal ] P. Sadayappan , V. Visvanathan Circuit Simulation on Shared-Memory Multiprocessors. [Citation Graph (0, 0)][DBLP ] IEEE Trans. Computers, 1988, v:37, n:12, pp:1634-1642 [Journal ] P. Sadayappan , V. Visvanathan Efficient sparse matrix factorization for circuit simulation on vector supercomputers. [Citation Graph (0, 0)][DBLP ] IEEE Trans. on CAD of Integrated Circuits and Systems, 1989, v:8, n:12, pp:1276-1285 [Journal ] Sriram Krishnamoorthy , Gerald Baumgartner , Chi-Chung Lam , Jarek Nieplocha , P. Sadayappan Layout transformation support for the disk resident arrays framework. [Citation Graph (0, 0)][DBLP ] The Journal of Supercomputing, 2006, v:36, n:2, pp:153-170 [Journal ] V. Prasad Krothapalli , P. Sadayappan Removal of Redundant Dependences in DOACROSS Loops with Constant Dependences. [Citation Graph (0, 0)][DBLP ] IEEE Trans. Parallel Distrib. Syst., 1991, v:2, n:3, pp:281-289 [Journal ] J. Ramanujam , P. Sadayappan Compile-Time Techniques for Data Distribution in Distributed Memory Machines. [Citation Graph (0, 0)][DBLP ] IEEE Trans. Parallel Distrib. Syst., 1991, v:2, n:4, pp:472-482 [Journal ] Nagavijayalakshmi Vydyanathan , Sriram Krishnamoorthy , Gerald Sabin , Ümit V. Çatalyürek , Tahsin M. Kurç , P. Sadayappan , Joel H. Saltz Locality Conscious Processor Allocation and Scheduling for Mixed Parallel Applications. [Citation Graph (0, 0)][DBLP ] CLUSTER, 2006, pp:- [Conf ] Aniruddha G. Shet , P. Sadayappan , David E. 
Bernholdt , Jarek Nieplocha , Vinod Tipparaju A Performance Instrumentation Framework to Characterize Computation-Communication Overlap in Message-Passing Systems. [Citation Graph (0, 0)][DBLP ] CLUSTER, 2006, pp:- [Conf ] Nagavijayalakshmi Vydyanathan , Ümit V. Çatalyürek , Tahsin M. Kurç , P. Sadayappan , Joel H. Saltz Toward Optimizing Latency Under Throughput Constraints for Application Workflows on Clusters. [Citation Graph (0, 0)][DBLP ] Euro-Par, 2007, pp:173-183 [Conf ] Gaurav Khanna 0002 , Ümit V. Çatalyürek , Tahsin M. Kurç , P. Sadayappan , Joel H. Saltz Scheduling File Transfers for Data-Intensive Jobs on Heterogeneous Clusters. [Citation Graph (0, 0)][DBLP ] Euro-Par, 2007, pp:214-223 [Conf ] Mohammad Islam , Pavan Balaji , Gerald Sabin , P. Sadayappan Analyzing and Minimizing the Impact of Opportunity Cost in QoS-aware Job Scheduling. [Citation Graph (0, 0)][DBLP ] ICPP, 2007, pp:42- [Conf ] Sriram Krishnamoorthy , Ümit V. Çatalyürek , Jarek Nieplocha , Atanas Rountev , P. Sadayappan A global address space framework for locality aware scheduling of block-sparse computations. [Citation Graph (0, 0)][DBLP ] IPDPS, 2007, pp:1-8 [Conf ] James Dinan , Stephen Olivier , Gerald Sabin , Jan Prins , P. Sadayappan , Chau-Wen Tseng Dynamic Load Balancing of Unbalanced Computations Using Message Passing. [Citation Graph (0, 0)][DBLP ] IPDPS, 2007, pp:1-8 [Conf ] Stephen Olivier , Jun Huan , Jinze Liu , Jan Prins , James Dinan , P. Sadayappan , Chau-Wen Tseng UTS: An Unbalanced Tree Search Benchmark. [Citation Graph (0, 0)][DBLP ] LCPC, 2006, pp:235-250 [Conf ] Sriram Krishnamoorthy , Muthu Baskaran , Uday Bondhugula , J. Ramanujam , Atanas Rountev , P. Sadayappan Effective automatic parallelization of stencil computations. [Citation Graph (0, 0)][DBLP ] PLDI, 2007, pp:235-244 [Conf ] Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors. 
[Citation Graph (, )][DBLP ] Soft-OLP: Improving Hardware Cache Performance through Software-Controlled Object-Level Partitioning. [Citation Graph (, )][DBLP ] Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model. [Citation Graph (, )][DBLP ] Automatic C-to-CUDA Code Generation for Affine Programs. [Citation Graph (, )][DBLP ] Selective Recovery from Failures in a Task Parallel Programming Model. [Citation Graph (, )][DBLP ] Hybrid parallel programming with MPI and unified parallel C. [Citation Graph (, )][DBLP ] Parameterized tiling revisited. [Citation Graph (, )][DBLP ] An OSD-based approach to managing directory operations in parallel file systems. [Citation Graph (, )][DBLP ] Are nonblocking networks really needed for high-end-computing workloads? [Citation Graph (, )][DBLP ] Non-collective parallel I/O for global address space programming models. [Citation Graph (, )][DBLP ] Scalable I/O forwarding framework for high-performance computing systems. [Citation Graph (, )][DBLP ] Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. [Citation Graph (, )][DBLP ] Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O. [Citation Graph (, )][DBLP ] Multi-hop path splitting and multi-pathing optimizations for data transfers over shared wide-area networks using gridFTP. [Citation Graph (, )][DBLP ] An integrated framework for performance-based optimization of scientific workflows. [Citation Graph (, )][DBLP ] Assessment and enhancement of meta-schedulers for multi-site job sharing. [Citation Graph (, )][DBLP ] Integrated Data and Task Management for Scientific Applications. [Citation Graph (, )][DBLP ] A Duplication Based Algorithm for Optimizing Latency Under Throughput Constraints for Streaming Workflows. [Citation Graph (, )][DBLP ] Scioto: A Framework for Global-View Task Parallelism. 
[Citation Graph (, )][DBLP ] Parametric multi-level tiling of imperfectly nested loops. [Citation Graph (, )][DBLP ] A compiler framework for optimization of affine loop nests for gpgpus. [Citation Graph (, )][DBLP ] A dynamic scheduling approach for coordinated wide-area data transfers using GridFTP. [Citation Graph (, )][DBLP ] Towards effective automatic parallelization for multicore systems. [Citation Graph (, )][DBLP ] A practical automatic polyhedral parallelizer and locality optimizer. [Citation Graph (, )][DBLP ] Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. [Citation Graph (, )][DBLP ] Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors. [Citation Graph (, )][DBLP ] Integrating parallel file systems with object-based storage devices. [Citation Graph (, )][DBLP ] Global trees: a framework for linked data structures on distributed memory parallel systems. [Citation Graph (, )][DBLP ] Scalable work stealing. [Citation Graph (, )][DBLP ] Using overlays for efficient data transfer over shared wide-area networks. [Citation Graph (, )][DBLP ] Enabling software management for multicore caches with a lightweight hardware support. [Citation Graph (, )][DBLP ] A framework for characterizing overlap of communication and computation in parallel applications. [Citation Graph (, )][DBLP ] Efficient search-space pruning for integrated fusion and tiling transformations. [Citation Graph (, )][DBLP ]