Search the dblp DataBase
Guang R. Gao :
[Publications ]
[Author Rank by year ]
[Co-authors ]
[Prefers ]
[Cites ]
[Cited by ]
Publications of Author
Rad Silvera , Jian Wang , Ramaswamy Govindarajan , Guang R. Gao A Register Pressure Sensitive Instruction Scheduler for Dynamic Issue Processors. [Citation Graph (0, 0)][DBLP ] IEEE PACT, 1997, pp:78-89 [Conf ] Xinan Tang , Rakesh Ghiya , Laurie J. Hendren , Guang R. Gao Heap Analysis and Optimizations for Threaded Programs. [Citation Graph (0, 0)][DBLP ] IEEE PACT, 1997, pp:14-25 [Conf ] Guang R. Gao The Era of Multi-core Chips -A Fresh Look on Software Challenges. [Citation Graph (0, 0)][DBLP ] Asia-Pacific Computer Systems Architecture Conference, 2006, pp:1- [Conf ] Ramaswamy Govindarajan , Erik R. Altman , Guang R. Gao A Theory for Software-Hardware Co-Scheduling for ASIPs and Embedded Processors. [Citation Graph (0, 0)][DBLP ] ASAP, 2000, pp:329-338 [Conf ] Vincent Van Dongen , Christophe Bonello , Guang R. Gao Data parallelism with high performance C. [Citation Graph (0, 0)][DBLP ] CASCON, 1994, pp:69- [Conf ] Guang R. Gao , Yue-Bong Wong , Qi Ning A timed Petri-net model for fine-grain loop scheduling. [Citation Graph (0, 0)][DBLP ] CASCON, 1991, pp:395-415 [Conf ] Gilles Hurteau , Vincent Van Dongen , Guang R. Gao EPPP - an integrated environment for portable parallel programming. [Citation Graph (0, 0)][DBLP ] CASCON, 1994, pp:31- [Conf ] Ivan Kalas , Eshrat Arjomandi , Guang R. Gao , William G. O'Farrell FTL: a multithreaded environment for parallel computation. [Citation Graph (0, 0)][DBLP ] CASCON, 1994, pp:33- [Conf ] Qi Ning , Vincent Van Dongen , Guang R. Gao Automatic decomposition in EPPP compiler. [Citation Graph (0, 0)][DBLP ] CASCON, 1994, pp:49- [Conf ] Xinmin Tian , Shashank S. Nemawarkar , Guang R. Gao , Herbert H. J. Hum Data locality sensitivity of multithreaded computations on a distributed-memory multiprocessor. [Citation Graph (0, 0)][DBLP ] CASCON, 1996, pp:37- [Conf ] Hongbo Yang , Guang R. Gao , Clement Leung On achieving balanced power consumption in software pipelined loops. [Citation Graph (0, 0)][DBLP ] CASES, 2002, pp:210-217 [Conf ] Laurie J. Hendren , Guang R. Gao , Erik R. Altman , Chandrika Mukerji A Register Allocation Framework Based on Hierarchical Cyclic Interval Graphs. [Citation Graph (0, 0)][DBLP ] CC, 1992, pp:176-191 [Conf ] Sylvain Lelait , Guang R. Gao , Christine Eisenbeis A New Fast Algorithm for Optimal Register Allocation in Modulo Scheduled Loops. [Citation Graph (0, 0)][DBLP ] CC, 1998, pp:204-218 [Conf ] Artour Stoutchinin , José Nelson Amaral , Guang R. Gao , James C. Dehnert , Suneel Jain , Alban Douillet Speculative Prefetching of Induction Pointers. [Citation Graph (0, 0)][DBLP ] CC, 2001, pp:289-303 [Conf ] Jian Wang , Guang R. Gao Pipelining-Dovetailing: A Transformation to Enhance Software Pipelining for Nested Loops. [Citation Graph (0, 0)][DBLP ] CC, 1996, pp:1-17 [Conf ] Chihong Zhang , Ramaswamy Govindarajan , Sean Ryan , Guang R. Gao Efficient State-Diagram Construction Methods for Software Pipelining. [Citation Graph (0, 0)][DBLP ] CC, 1999, pp:153-167 [Conf ] Juan del Cuvillo , Weirong Zhu , Guang R. Gao Landing openMP on cyclops-64: an efficient mapping of openMP to a many-core system-on-a-chip. [Citation Graph (0, 0)][DBLP ] Conf. Computing Frontiers, 2006, pp:41-50 [Conf ] Hongbo Rong , Alban Douillet , Ramaswamy Govindarajan , Guang R. Gao Code Generation for Single-Dimension Software Pipelining of Multi-Dimensional Loops. [Citation Graph (0, 0)][DBLP ] CGO, 2004, pp:175-188 [Conf ] Hongbo Rong , Zhizhong Tang , Ramaswamy Govindarajan , Alban Douillet , Guang R. Gao Single-Dimension Software Pipelining for Multi-Dimensional Loops. [Citation Graph (0, 0)][DBLP ] CGO, 2004, pp:163-174 [Conf ] Fei Chen , Kevin B. Theobald , Guang R. Gao Implementing parallel conjugate gradient on the EARTH multithreaded architecture. [Citation Graph (0, 0)][DBLP ] CLUSTER, 2004, pp:459-469 [Conf ] Weirong Zhu , Yanwei Niu , Jizhu Lu , Chuan Shen , Guang R. Gao A Cluster-Based Solution for High Performance Hmmpfam Using EARTH Execution Model. [Citation Graph (0, 0)][DBLP ] CLUSTER, 2003, pp:30-37 [Conf ] Vincent Van Dongen , Guang R. Gao , Qi Ning A Polynomial Time Method for Optimal Software Pipelining. [Citation Graph (0, 0)][DBLP ] CONPAR, 1992, pp:613-624 [Conf ] Guang R. Gao , Herbert H. J. Hum , Yue-Bong Wong An Efficient Scheme for Fine-Grain Software Pipelining. [Citation Graph (0, 0)][DBLP ] CONPAR, 1990, pp:709-720 [Conf ] Ramaswamy Govindarajan , Erik R. Altman , Guang R. Gao A Framework for Resource-Constrained Rate-Optimal Software Pipelining. [Citation Graph (0, 0)][DBLP ] CONPAR, 1994, pp:640-651 [Conf ] Rishi Khan , Yujing Zeng , Javier Garcia-Frias , Guang R. Gao A Bayesian Modeling Framework for Genetic Regulation. [Citation Graph (0, 0)][DBLP ] CSB, 2002, pp:330-332 [Conf ] Yujing Zeng , Jianshan Tang , Javier Garcia-Frias , Guang R. Gao An Adaptive Meta-Clustering Approach: Combining the Information from Different Clustering Results. [Citation Graph (0, 0)][DBLP ] CSB, 2002, pp:276-0 [Conf ] Weirong Zhu , Yanwei Niu , Jizhu Lu , Guang R. Gao Implementing Parallel Hmm-pfam on the EARTH Multithreaded Architecture. [Citation Graph (0, 0)][DBLP ] CSB, 2003, pp:549-550 [Conf ] Haiping Wu , Ziang Hu , Joseph Manzano , Yingping Zhang , Guang R. Gao Identifying Multiply-Add Operations in Kylin Compiler. [Citation Graph (0, 0)][DBLP ] ESA, 2005, pp:81-87 [Conf ] Erik R. Altman , Guang R. Gao Optimal Software Pipelining Through Enumeration of Schedules. [Citation Graph (0, 0)][DBLP ] Euro-Par, Vol. II, 1996, pp:833-840 [Conf ] Eduard Ayguadé , Fredrik Dahlgren , Christine Eisenbeis , Roger Espasa , Guang R. Gao , Henk L. Muller , Rizos Sakellariou , André Seznec Topic 08+13: Instruction-Level Parallelism and Computer Architecture. [Citation Graph (0, 0)][DBLP ] Euro-Par, 2001, pp:385- [Conf ] Alban Douillet , Hongbo Rong , Guang R. Gao Multi-dimensional Kernel Generation for Loop Nest Software Pipelining. [Citation Graph (0, 0)][DBLP ] Euro-Par, 2006, pp:311-322 [Conf ] Ziang Hu , Juan del Cuvillo , Weirong Zhu , Guang R. Gao Optimization of Dense Matrix Multiplication on IBM Cyclops-64: Challenges and Experiences. [Citation Graph (0, 0)][DBLP ] Euro-Par, 2006, pp:134-144 [Conf ] Olivier Maquelin , Herbert H. J. Hum , Guang R. Gao Costs and Benefits of Multithreading with Off-the-Shelf RISC Processors. [Citation Graph (0, 0)][DBLP ] Euro-Par, 1995, pp:117-128 [Conf ] Arthur Stoutchinin , Guang R. Gao If-Conversion in SSA Form. [Citation Graph (0, 0)][DBLP ] Euro-Par, 2004, pp:336-345 [Conf ] Kevin B. Theobald , Rishi Kumar , Gagan Agrawal , Gerd Heber , Ruppa K. Thulasiram , Guang R. Gao Developing a Communication Intensive Application on the EARTH Multithreaded Architecture (Distinguished Paper). [Citation Graph (0, 0)][DBLP ] Euro-Par, 2000, pp:625-637 [Conf ] Guang R. Gao , René Tio , Herbert H. J. Hum Design of an Efficient Dataflow Architecture without Data Flow. [Citation Graph (0, 0)][DBLP ] FGCS, 1988, pp:861-868 [Conf ] Guoning Liao , Erik R. Altman , Vinod K. Agarwal , Guang R. Gao A Comparative Study of Multiprocessor List Scheduling Heuristics. [Citation Graph (0, 0)][DBLP ] HICSS (1), 1994, pp:68-77 [Conf ] Qi Ning , Vincent Van Dongen , Guang R. Gao Automatic data and computation decomposition for distributed memory machines. [Citation Graph (0, 0)][DBLP ] HICSS (2), 1995, pp:103-112 [Conf ] Ramaswamy Govindarajan , Erik R. Altman , Guang R. Gao Co-Scheduling Hardware and Software Pipelines. [Citation Graph (0, 0)][DBLP ] HPCA, 1996, pp:52-61 [Conf ] Kevin B. Theobald , Herbert H. J. Hum , Guang R. Gao A Design Frame for Hybrid Access Caches. [Citation Graph (0, 0)][DBLP ] HPCA, 1995, pp:144-153 [Conf ] Dean M. Tullsen , Guang R. Gao Multithreaded Execution Architecture and Compilation. [Citation Graph (0, 0)][DBLP ] HPCA, 1999, pp:321- [Conf ] Darren Erik Vengroff , Guang R. Gao Partial Sampling with Reverse State Reconstruction: A New Technique for Branch Predictor Performance Estimation. [Citation Graph (0, 0)][DBLP ] HPCA, 1998, pp:342-351 [Conf ] Juan del Cuvillo , Weirong Zhu , Ziang Hu , Guang R. Gao Toward a Software Infrastructure for the Cyclops-64 Cellular Architecture. [Citation Graph (0, 0)][DBLP ] HPCS, 2006, pp:9- [Conf ] Maria-Dana Tarlescu , Kevin B. Theobald , Guang R. Gao Elastic History Buffer: A Low-Cost Method to Improve Branch Prediction Accuracy. [Citation Graph (0, 0)][DBLP ] ICCD, 1997, pp:82-87 [Conf ] Hongbo Yang , Ramaswamy Govindarajan , Guang R. Gao , Kevin B. Theobald Power-Performance Trade-Offs for Energy-Efficient Architectures: A Quantitative Study. [Citation Graph (0, 0)][DBLP ] ICCD, 2002, pp:174-179 [Conf ] Shashank S. Nemawarkar , Ramaswamy Govindarajan , Guang R. Gao , Vinod K. Agarwal Performance Evaluation of Latency Tolerant Architectures. [Citation Graph (0, 0)][DBLP ] ICCI, 1992, pp:183-186 [Conf ] Laurie J. Hendren , Guang R. Gao Designing programming languages for analyzability: a fresh look at pointer data structures. [Citation Graph (0, 0)][DBLP ] ICCL, 1992, pp:242-251 [Conf ] Erik R. Altman , Vinod K. Agarwal , Guang R. Gao A Novel Methodology Using Genetic Algorithms for the Design of Caches and Cache Replacement Policy. [Citation Graph (0, 0)][DBLP ] ICGA, 1993, pp:392-399 [Conf ] Xinan Tang , Guang R. Gao Automatically Partitioning Threads Based on Remote Paths. [Citation Graph (0, 0)][DBLP ] ICPADS, 1998, pp:632-639 [Conf ] Jack B. Dennis , Guang R. Gao Maximum Pipelining of Array Operations on Static Data Flow Machine. [Citation Graph (0, 0)][DBLP ] ICPP, 1983, pp:331-334 [Conf ] Guang R. Gao A Pipelined Solution Method of Tridiagonal Linear Equation Systems. [Citation Graph (0, 0)][DBLP ] ICPP, 1986, pp:84-91 [Conf ] Guang R. Gao , Vivek Sarkar Location Consistency: Stepping Beyond the Memory Coherence Barrier. [Citation Graph (0, 0)][DBLP ] ICPP (2), 1995, pp:73-76 [Conf ] Jean-Marc Monti , Guang R. Gao Efficient Interprocessor Synchronization/Communication on a Dataflow Multiprocessor Architecture. [Citation Graph (0, 0)][DBLP ] ICPP (1), 1992, pp:220-223 [Conf ] Nasser Elmasri , Herbert H. J. Hum , Guang R. Gao The Threaded Communication Library: Preliminary Experiences on a Multiprocessor with Dual-Processor Nodes. [Citation Graph (0, 0)][DBLP ] International Conference on Supercomputing, 1995, pp:195-199 [Conf ] Guang R. Gao , Herbert H. J. Hum , Yue-Bong Wong Towards efficient fine-grain software pipelining. [Citation Graph (0, 0)][DBLP ] ICS, 1990, pp:369-379 [Conf ] Vivek Sarkar , Guang R. Gao Optimization of array accesses by collective loop transformations. [Citation Graph (0, 0)][DBLP ] ICS, 1991, pp:194-205 [Conf ] Kevin B. Theobald , Guang R. Gao , Laurie J. Hendren Speculative Execution and Branch Prediction on Parallel Machines. [Citation Graph (0, 0)][DBLP ] International Conference on Supercomputing, 1993, pp:77-86 [Conf ] Liu Yang , Sun Chan , Guang R. Gao , Roy Ju , Guei-Yuan Lueh , Zhaoqing Zhang Inter-procedural stacked register allocation for itanium® like architecture. [Citation Graph (0, 0)][DBLP ] ICS, 2003, pp:215-225 [Conf ] Gary M. Zoppetti , Gagan Agrawal , Lori L. Pollock , José Nelson Amaral , Xinan Tang , Guang R. Gao Automatic compiler techniques for thread coarsening for multithreaded architectures. [Citation Graph (0, 0)][DBLP ] ICS, 2000, pp:306-315 [Conf ] Robel Y. Kahsay , Li Liao , Guang R. Gao An Improved Hidden Markov Model for Transmembrane Topology Prediction. [Citation Graph (0, 0)][DBLP ] ICTAI, 2004, pp:634-639 [Conf ] Praveen R. Thiagarajan , Guang R. Gao Visualizing Biosequence Data Using Texture Mapping. [Citation Graph (0, 0)][DBLP ] INFOVIS, 2002, pp:103-109 [Conf ] Guang R. Gao Sustained Petaflop and Beyond: Can Parallel Computing Systems Meet The Challenges? [Citation Graph (0, 0)][DBLP ] IPDPS, 2005, pp:- [Conf ] Guang R. Gao From EARTH to HTMT: An Evolution of a Multiheaded Architecture Model (Abstract). [Citation Graph (0, 0)][DBLP ] IPPS/SPDP Workshops, 1999, pp:1025- [Conf ] Bruce Carter , Chuin-Shan Chen , L. Paul Chew , Nikos Chrisochoides , Guang R. Gao , Gerd Heber , Anthony R. Ingraffea , Roland Krause , Chris Myers , Démian Nave , Keshav Pingali , Paul Stodghill , Stephen A. Vavasis , Paul A. Wawrzynek Parallel FEM Simulation of Crack Propagation - Challenges, Status, and Perspectives. [Citation Graph (0, 0)][DBLP ] IPDPS Workshops, 2000, pp:443-449 [Conf ] Juan del Cuvillo , Weirong Zhu , Ziang Hu , Guang R. Gao TiNy Threads: A Thread Virtual Machine for the Cyclops64 Cellular Architecture. [Citation Graph (0, 0)][DBLP ] IPDPS, 2005, pp:- [Conf ] Guang R. Gao , Kevin B. Theobald , Ramaswamy Govindarajan , Clement Leung , Ziang Hu , Haiping Wu , Jizhu Lu , Juan del Cuvillo , Adeline Jacquet , Vincent Janot , Thomas L. Sterling Programming Models and System Software for Future High-End Computing Systems: Work-in-Progress. [Citation Graph (0, 0)][DBLP ] IPDPS, 2003, pp:206- [Conf ] Guang R. Gao , Kevin B. Theobald , Ziang Hu , Haiping Wu , Jizhu Lu , Keshav Pingali , Paul Stodghill , Thomas L. Sterling , Rick Stevens , Mark Hereld Next Generation System Software for Future High-End Computing Systems. [Citation Graph (0, 0)][DBLP ] IPDPS, 2002, pp:- [Conf ] Ramaswamy Govindarajan , N. S. S. Narasimha Rao , Erik R. Altman , Guang R. Gao An Enhanced Co-Scheduling Method Using Reduced MS-State Diagrams. [Citation Graph (0, 0)][DBLP ] IPPS/SPDP, 1998, pp:168-175 [Conf ] Ramaswamy Govindarajan , Hongbo Yang , Chihong Zhang , José Nelson Amaral , Guang R. Gao Minimum Register Instruction Sequence Problem: Revisiting Optimal Code Generation for DAGs. [Citation Graph (0, 0)][DBLP ] IPDPS, 2001, pp:26- [Conf ] Gerd Heber , Rupak Biswas , Guang R. Gao Self-Avoiding Walks over Adaptive Unstructured Grids. [Citation Graph (0, 0)][DBLP ] IPPS/SPDP Workshops, 1999, pp:968-977 [Conf ] Gerd Heber , Guang R. Gao , Rupak Biswas A New Approach to Parallel Dynamic Partitioning for Adaptive Unstructured Meshes. [Citation Graph (0, 0)][DBLP ] IPPS/SPDP, 1999, pp:360-364 [Conf ] Herbert H. J. Hum , Kevin B. Theobald , Guang R. Gao Building Multithreaded Architectures with Off-the-Shelf Microprocessors. [Citation Graph (0, 0)][DBLP ] IPPS, 1994, pp:288-294 [Conf ] Adeline Jacquet , Vincent Janot , Clement Leung , Guang R. Gao , Ramaswamy Govindarajan , Thomas L. Sterling An Executable Analytical Performance Evaluation Approach for Early Performance Prediction. [Citation Graph (0, 0)][DBLP ] IPDPS, 2003, pp:268- [Conf ] Ashfaq A. Khokhar , Gerd Heber , Parimala Thulasiraman , Guang R. Gao Load Adaptive Algorithms and Implementations for the 2D Discrete Wavelet Transform on Fine-Grain Multithreaded Architectures. [Citation Graph (0, 0)][DBLP ] IPPS/SPDP, 1999, pp:458-462 [Conf ] Rishi Kumar , Gagan Agrawal , Guang R. Gao Compiling Several Classes of Communication Patterns on a Multithreaded Architecture. [Citation Graph (0, 0)][DBLP ] IPDPS, 2002, pp:- [Conf ] Shigeru Kusakabe , Kentaro Inenaga , Makoto Amamiya , Xinan Tang , Andrés Márquez , Guang R. Gao Implementing a Non-Strict Functional Programming Language on a Threaded Architecture. [Citation Graph (0, 0)][DBLP ] IPPS/SPDP Workshops, 1999, pp:138-152 [Conf ] Wen-Yen Lin , Jean-Luc Gaudiot , José Nelson Amaral , Guang R. Gao Caching Single-Assignment Structures to Build a Robust Fine-Grain Multi-Threading System. [Citation Graph (0, 0)][DBLP ] IPDPS, 2000, pp:589-594 [Conf ] Shashank S. Nemawarkar , Guang R. Gao Latency Tolerance: A Metric for Performance Analysis of Multithreaded Architectures. [Citation Graph (0, 0)][DBLP ] IPPS, 1997, pp:227-232 [Conf ] Ruppa K. Thulasiram , Lubomir Litov , Hassan Nojumi , Christopher T. Downing , Guang R. Gao Multithreaded Algorithms for Pricing a Class of Complex Options. [Citation Graph (0, 0)][DBLP ] IPDPS, 2001, pp:18- [Conf ] Weirong Zhu , Yanwei Niu , Guang R. Gao Performance Portability on EARTH: A Case Study across Several Parallel Architectures. [Citation Graph (0, 0)][DBLP ] IPDPS, 2005, pp:- [Conf ] Yingping Zhang , Taikyeong Jeong , Fei Chen , Haiping Wu , R. Nitzsche , Guang R. Gao A study of the on-chip interconnection network for the IBM Cyclops64 multi-core architecture. [Citation Graph (0, 0)][DBLP ] IPDPS, 2006, pp:- [Conf ] Guang R. Gao , Thomas L. Sterling , Rick L. Stevens , Mark Hereld , Weirong Zhu Hierarchical multithreading: programming model and system software. [Citation Graph (0, 0)][DBLP ] IPDPS, 2006, pp:- [Conf ] Vugranam C. Sreedhar , Guang R. Gao , Yong-Fong Lee Incremental Computation of Dominator Trees. [Citation Graph (0, 0)][DBLP ] Intermediate Representations Workshop, 1995, pp:1-12 [Conf ] Gerd Heber , Rupak Biswas , Parimala Thulasiraman , Guang R. Gao Using Multithreading for the Automatic Load Balancing of Adaptive Finite Element Meshes. [Citation Graph (0, 0)][DBLP ] IRREGULAR, 1998, pp:132-143 [Conf ] Olivier Maquelin , Guang R. Gao , Herbert H. J. Hum , Kevin B. Theobald , Xinmin Tian Polling Watchdog: Combining Polling and Interrupts for Efficient Message Handling. [Citation Graph (0, 0)][DBLP ] ISCA, 1996, pp:179-188 [Conf ] José Nelson Amaral , Guang R. Gao , Erturk Dogan Kocalar , Patrick O'Neill , Xinan Tang Design and Implementation of an Efficient Thread Partitioning Algorithm. [Citation Graph (0, 0)][DBLP ] ISHPC, 2000, pp:252-259 [Conf ] Guang R. Gao , Vivek Sarkar On the Importance of an End-To-End View of Memory Consistency in Future Computer Systems. [Citation Graph (0, 0)][DBLP ] ISHPC, 1997, pp:30-41 [Conf ] Juan del Cuvillo , Xinmin Tian , Guang R. Gao , Milind Girkar Performance Study of a Whole Genome Comparison Tool on a Hyper-Threading Multiprocessor. [Citation Graph (0, 0)][DBLP ] ISHPC, 2003, pp:450-457 [Conf ] Andrés Márquez , Guang R. Gao CARE: Overview of an Adaptive Multithreaded Architecture. [Citation Graph (0, 0)][DBLP ] ISHPC, 2003, pp:26-38 [Conf ] Sean Ryan , José Nelson Amaral , Guang R. Gao , Zachary Ruiz , Andrés Márquez , Kevin B. Theobald Coping with very High Latencies in Petaflop Computer Systems. [Citation Graph (0, 0)][DBLP ] ISHPC, 1999, pp:71-82 [Conf ] Dongrui Fan , Zhimin Tang , Hailin Huang , Guang R. Gao An energy efficient TLB design methodology. [Citation Graph (0, 0)][DBLP ] ISLPED, 2005, pp:351-356 [Conf ] Weirong Zhu , Parimala Thulasiraman , Ruppa K. Thulasiram , Guang R. Gao Exploring Financial Applications on Many-Core-on-a-Chip Architecture: A First Experiment. [Citation Graph (0, 0)][DBLP ] ISPA Workshops, 2006, pp:221-230 [Conf ] Guang R. Gao Bridging the gap between ISA compilers and silicon compilers a challenge for future SoC design. [Citation Graph (0, 0)][DBLP ] ISSS, 2001, pp:93- [Conf ] Wolfgang Rosenstiel , Brian Bailey , Masahiro Fujita , Guang R. Gao , Rajesh K. Gupta , Preeti Ranjan Panda New design paradigms. [Citation Graph (0, 0)][DBLP ] ISSS, 2001, pp:94- [Conf ] Erik R. Altman , Guang R. Gao , Ramaswamy Govindarajan An Experimental Study of an ILP-based Exact Solution Method for Software Pipelining. [Citation Graph (0, 0)][DBLP ] LCPC, 1995, pp:16-30 [Conf ] Alban Douillet , José Nelson Amaral , Guang R. Gao Fine-Grain Stacked Register Allocation for the Itanium Architecture. [Citation Graph (0, 0)][DBLP ] LCPC, 2002, pp:344-361 [Conf ] Guang R. Gao , Qi Ning Loop Storage Optimization for Dataflow Machines. [Citation Graph (0, 0)][DBLP ] LCPC, 1991, pp:359-373 [Conf ] Guang R. Gao , Qi Ning , Vincent Van Dongen Extending Software Pipelining Techniques for Scheduling Nested Loops. [Citation Graph (0, 0)][DBLP ] LCPC, 1993, pp:340-357 [Conf ] Guang R. Gao , R. Olsen , Vivek Sarkar , Radhika Thekkath Collective Loop Fusion for Array Contraction. [Citation Graph (0, 0)][DBLP ] LCPC, 1992, pp:281-295 [Conf ] Ramaswamy Govindarajan , Chihong Zhang , Guang R. Gao Minimum Register Instruction Scheduling: A New Approach for Dynamic Instruction Issue Processors. [Citation Graph (0, 0)][DBLP ] LCPC, 1999, pp:70-84 [Conf ] Laurie J. Hendren , C. Donawa , Maryam Emami , Guang R. Gao , Justiani , B. Sridharan Designing the McCAT Compiler Based on a Family of Structured Intermediate Representations. [Citation Graph (0, 0)][DBLP ] LCPC, 1992, pp:406-420 [Conf ] Vivek Sarkar , Guang R. Gao , Shaohua Han Locality Analysis for Distributed Shared-Memory Multiprocessors. [Citation Graph (0, 0)][DBLP ] LCPC, 1996, pp:20-40 [Conf ] Hongbo Yang , Ramaswamy Govindarajan , Guang R. Gao , Ziang Hu Compiler-Assisted Cache Replacement: Problem Formulation and Performance Evaluation. [Citation Graph (0, 0)][DBLP ] LCPC, 2003, pp:77-92 [Conf ] Alban Douillet , Guang R. Gao Register Pressure in Software-Pipelined Loop Nests: Fast Computation and Impact on Architecture Design. [Citation Graph (0, 0)][DBLP ] LCPC, 2005, pp:17-31 [Conf ] Shashank S. Nemawarkar , Guang R. Gao Measurement and Modeling of EARTH-MANNA Multithreaded Architecture. [Citation Graph (0, 0)][DBLP ] MASCOTS, 1996, pp:109-114 [Conf ] Ramaswamy Govindarajan , Erik R. Altman , Guang R. Gao Minimizing register requirements under resource-constrained rate-optimal software pipelining. [Citation Graph (0, 0)][DBLP ] MICRO, 1994, pp:85-94 [Conf ] Luis A. Lozano , Guang R. Gao Exploiting short-lived variables in superscalar processors. [Citation Graph (0, 0)][DBLP ] MICRO, 1995, pp:292-302 [Conf ] Kevin B. Theobald , Guang R. Gao , Laurie J. Hendren On the limits of program parallelism and its smoothability. [Citation Graph (0, 0)][DBLP ] MICRO, 1992, pp:10-19 [Conf ] Yanwei Niu , Ziang Hu , Kenneth E. Barner , Guang R. Gao Performance Modelling and Optimization of Memory Access on Cellular Computer Architecture Cyclops64. [Citation Graph (0, 0)][DBLP ] NPC, 2005, pp:132-143 [Conf ] Guang R. Gao , Herbert H. J. Hum , Jean-Marc Monti Towards an Efficient Hybrid Dataflow Architecture Model. [Citation Graph (0, 0)][DBLP ] PARLE (1), 1991, pp:355-371 [Conf ] Herbert H. J. Hum , Guang R. Gao A Novel High-Speed Memory Organization for Fine-Grain Multi-Thread Computing. [Citation Graph (0, 0)][DBLP ] PARLE (1), 1991, pp:34-51 [Conf ] Shashank S. Nemawarkar , Ramaswamy Govindarajan , Guang R. Gao , Vinod K. Agarwal Performance of Interconnection Network in Multithreaded Architectures. [Citation Graph (0, 0)][DBLP ] PARLE, 1994, pp:823-826 [Conf ] Qi Ning , Guang R. Gao Minimizing Loop Storage Allocation for An Argument-Fetching Dataflow Architecture Model. [Citation Graph (0, 0)][DBLP ] PARLE, 1992, pp:585-600 [Conf ] Robert Kim Yates , Guang R. Gao A Kahn Principle for Networks of Nonmonotonic Real-time Processes. [Citation Graph (0, 0)][DBLP ] PARLE, 1993, pp:209-227 [Conf ] Yuan Zhang , Weirong Zhu , Fei Chen , Ziang Hu , Guang R. Gao Sequential Consistency Revisit: The Sufficient Condition and Method to Reason the Consistency Model of a Multiprocessor-on-a-Chip Architecture. [Citation Graph (0, 0)][DBLP ] Parallel and Distributed Computing and Networks, 2005, pp:13-19 [Conf ] Ruppa K. Thulasiram , Christopher T. Downing , Guang R. Gao Recursive and Iterative Multithreaded Algorithms for Pricing American Securities. [Citation Graph (0, 0)][DBLP ] PDPTA, 2000, pp:- [Conf ] Erik R. Altman , Ramaswamy Govindarajan , Guang R. Gao Scheduling and Mapping: Software Pipelining in the Presence of Structural Hazards. [Citation Graph (0, 0)][DBLP ] PLDI, 1995, pp:139-150 [Conf ] Guang R. Gao , Yue-Bong Wong , Qi Ning A Timed Petri-Net Model for Fine-Grain Loop Scheduling. [Citation Graph (0, 0)][DBLP ] PLDI, 1991, pp:204-218 [Conf ] Hongbo Rong , Alban Douillet , Guang R. Gao Register allocation for software pipelined multi-dimensional loops. [Citation Graph (0, 0)][DBLP ] PLDI, 2005, pp:154-167 [Conf ] John C. Ruttenberg , Guang R. Gao , Woody Lichtenstein , Artour Stoutchinin Software Pipelining Showdown: Optimal vs. Heuristic Methods in a Production Compiler. [Citation Graph (0, 0)][DBLP ] PLDI, 1996, pp:1-11 [Conf ] Vugranam C. Sreedhar , Guang R. Gao , Yong-Fong Lee A New Framework for Exhaustive and Incremental Data Flow Analysis Using DJ Graphs. [Citation Graph (0, 0)][DBLP ] PLDI, 1996, pp:278-290 [Conf ] Qi Ning , Guang R. Gao A Novel Framework of Register Allocation for Software Pipelining. [Citation Graph (0, 0)][DBLP ] POPL, 1993, pp:29-42 [Conf ] Vugranam C. Sreedhar , Guang R. Gao A Linear Time Algorithm for Placing phi-nodes. [Citation Graph (0, 0)][DBLP ] POPL, 1995, pp:62-73 [Conf ] Angela Sodan , Guang R. Gao , Olivier Maquelin , Jens-Uwe Schultz , Xinmin Tian Experiences with Non-numeric Applications on Multithreaded Architectures. [Citation Graph (0, 0)][DBLP ] PPOPP, 1997, pp:124-135 [Conf ] Yuan Zhang , Vugranam C. Sreedhar , Weirong Zhu , Vivek Sarkar , Guang R. Gao Optimized lock assignment and allocation: a method for exploiting concurrency among critical sections. [Citation Graph (0, 0)][DBLP ] PPOPP, 2007, pp:146-147 [Conf ] Gerd Heber , Rupak Biswas , Guang R. Gao Self-Avoiding Walks Over Adaptive Triangular Grids. [Citation Graph (0, 0)][DBLP ] PPSC, 1999, pp:- [Conf ] W. S. Martins , Juan del Cuvillo , F. J. Useche , Kevin B. Theobald , Guang R. Gao A Multithreaded Parallel Implementation of a Dynamic Programming Algorithm for Sequence Comparison. [Citation Graph (0, 0)][DBLP ] Pacific Symposium on Biocomputing, 2001, pp:311-322 [Conf ] Jack B. Dennis , Guang R. Gao An efficient pipelined dataflow processor architecture. [Citation Graph (0, 0)][DBLP ] SC, 1988, pp:368-373 [Conf ] Kevin B. Theobald , Gagan Agrawal , Rishi Kumar , Gerd Heber , Guang R. Gao , Paul Stodghill , Keshav Pingali Landing CG on EARTH: A Case Study of Fine-Grained Multithreading on an Evolutionary Path. [Citation Graph (0, 0)][DBLP ] SC, 2000, pp:- [Conf ] Kevin B. Theobald , Guang R. Gao An efficient parallel algorithm for all pairs examination. [Citation Graph (0, 0)][DBLP ] SC, 1991, pp:742-753 [Conf ] Haiping Wu , Long Chen , Joseph Manzano , Guang R. Gao A User-Friendly Methodology for Automatic Exploration of Compiler Options. [Citation Graph (0, 0)][DBLP ] Software Engineering Research and Practice, 2006, pp:873-882 [Conf ] Haiping Wu , Eunjung Park , Long Chen , Juan del Cuvillo , Guang R. Gao User-Friendly Methodology for Automatic Exploration of Compiler Options: A Case Study on the Intel XScale Microarchitecture. [Citation Graph (0, 0)][DBLP ] Software Engineering Research and Practice, 2006, pp:866-872 [Conf ] Xinan Tang , Guang R. Gao How "Hard" is Thread Partitioning and How "Bad" is a List Scheduling Based Partitioning Algorithm? [Citation Graph (0, 0)][DBLP ] SPAA, 1998, pp:130-139 [Conf ] Parimala Thulasiraman , Kevin B. Theobald , Ashfaq A. Khokhar , Guang R. Gao Multithreaded algorithms for the fast Fourier transform. [Citation Graph (0, 0)][DBLP ] SPAA, 2000, pp:176-185 [Conf ] Xinan Tang , Jing Wang , Kevin B. Theobald , Guang R. Gao Thread Partitioning and Scheduling Based on Cost Model. [Citation Graph (0, 0)][DBLP ] SPAA, 1997, pp:272-281 [Conf ] Shashank S. Nemawarkar , Ramaswamy Govindarajan , Guang R. Gao , Vinod K. Agarwal Analysis of Multithreaded Multiprocessors with Distributed Shared Memory. [Citation Graph (0, 0)][DBLP ] SPDP, 1993, pp:114-121 [Conf ] Guang R. Gao , Robert Kim Yates , Jack B. Dennis , Lenor M. R. Mullin A strict monolithic array constructor. [Citation Graph (0, 0)][DBLP ] SPDP, 1990, pp:596-603 [Conf ] Adalberto T. Castelo , Wellington Martins , Guang R. Gao TROLL-Tandem Repeat Occurrence Locator. [Citation Graph (0, 0)][DBLP ] Bioinformatics, 2002, v:18, n:4, pp:634-636 [Journal ] Robel Y. Kahsay , Guang R. Gao , Li Liao An improved hidden Markov model for transmembrane protein detection and topology prediction and its applications to complete genomes. [Citation Graph (0, 0)][DBLP ] Bioinformatics, 2005, v:21, n:9, pp:1853-1858 [Journal ] Robel Y. Kahsay , Guoli Wang , Nataraj Dongre , Guang R. Gao , Roland L. Dunbrack Jr. CASA: a server for the critical assessment of protein sequence alignment accuracy. [Citation Graph (0, 0)][DBLP ] Bioinformatics, 2002, v:18, n:3, pp:496-497 [Journal ] Robel Y. Kahsay , Guoli Wang , Guang R. Gao , Li Liao , Roland L. Dunbrack Jr. Quasi-consensus-based comparison of profile hidden Markov models for protein sequences. [Citation Graph (0, 0)][DBLP ] Bioinformatics, 2005, v:21, n:10, pp:2287-2293 [Journal ] Laurie J. Hendren , Guang R. Gao Designing Programming Languages for the Analyzability of Pointer Data Structures. [Citation Graph (0, 0)][DBLP ] Comput. Lang., 1993, v:19, n:2, pp:119-134 [Journal ] José Nelson Amaral , Wen-Yen Lin , Jean-Luc Gaudiot , Guang R. Gao Exploiting Locality in Single Assignment Data Structures Updated Through Split-Phase Transactions. [Citation Graph (0, 0)][DBLP ] Cluster Computing, 2001, v:4, n:4, pp:281-293 [Journal ] Gerd Heber , Rupak Biswas , Guang R. Gao Self-Avoiding Walks over Adaptive Unstructured Grids. [Citation Graph (0, 0)][DBLP ] Concurrency - Practice and Experience, 2000, v:12, n:2-3, pp:85-109 [Journal ] Kevin B. Theobald , Rishi Kumar , Gagan Agrawal , Gerd Heber , Ruppa K. Thulasiram , Guang R. Gao Implementation and evaluation of a communication intensive application on the EARTH multithreaded system. [Citation Graph (0, 0)][DBLP ] Concurrency and Computation: Practice and Experience, 2002, v:14, n:3, pp:183-201 [Journal ] Guy Tremblay , C. J. Morrone , José Nelson Amaral , Guang R. Gao Implementation of the EARTH programming model on SMP clusters: a multi-threaded language and runtime system. [Citation Graph (0, 0)][DBLP ] Concurrency and Computation: Practice and Experience, 2003, v:15, n:9, pp:821-844 [Journal ] Eshrat Arjomandi , William G. O'Farrell , Ivan Kalas , Gita Koblents , Frank Ch. Eigler , Guang R. Gao ABC++: Concurrency by Inheritance in C++. [Citation Graph (0, 0)][DBLP ] IBM Systems Journal, 1995, v:34, n:1, pp:120-137 [Journal ] Erik R. Altman , Guang R. Gao Optimal Modulo Scheduling Through Enumeration. [Citation Graph (0, 0)][DBLP ] International Journal of Parallel Programming, 1998, v:26, n:2, pp:313-344 [Journal ] Ramaswamy Govindarajan , N. S. S. Narasimha Rao , Erik R. Altman , Guang R. Gao Enhanced Co-Scheduling: A Software Pipelining Method Using Modulo-Scheduled Pipeline Theory. [Citation Graph (0, 0)][DBLP ] International Journal of Parallel Programming, 2000, v:28, n:1, pp:1-46 [Journal ] Haiping Wu , Ziang Hu , Joseph Manzano , Guang R. Gao Madd Operation Aware Redundancy Elimination. [Citation Graph (0, 0)][DBLP ] International Journal of Software Engineering and Knowledge Engineering, 2005, v:15, n:2, pp:357-362 [Journal ] Guang R. Gao A Maximally Pipelined Tridiagonal Linear Equation Solver. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1986, v:3, n:2, pp:215-235 [Journal ] Guang R. Gao Algorithmic Aspects of Balancing Techniques for Pipelined Data Flow Code Generation. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1989, v:6, n:1, pp:39-61 [Journal ] Erik R. Altman , Ramaswamy Govindarajan , Guang R. Gao A Unified Framework for Instruction Scheduling and Mapping for Function Units with Structural Hazards. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1998, v:49, n:2, pp:259-293 [Journal ] Guang R. Gao An Efficient Hybrid Dataflow Architecture Modle. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1993, v:19, n:4, pp:293-307 [Journal ] Guang R. Gao , Jean-Luc Gaudiot , Lubomir Bic Special Issue on DataFlow and Multithreaded Architectures - Guest Editors' Introduction. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1993, v:18, n:3, pp:271-272 [Journal ] Xinan Tang , Guang R. Gao Automatically Partitioning Threads for Multithreaded Architectures. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 1999, v:58, n:2, pp:159-189 [Journal ] Parimala Thulasiraman , Ashfaq A. Khokhar , Gerd Heber , Guang R. Gao A fine-grain load-adaptive algorithm of the 2D discrete wavelet transform for multithreaded architectures. [Citation Graph (0, 0)][DBLP ] J. Parallel Distrib. Comput., 2004, v:64, n:1, pp:68-78 [Journal ] Vugranam C. Sreedhar , Guang R. Gao Computing phi-nodes in linear time using DJ graphs. [Citation Graph (0, 0)][DBLP ] J. Prog. Lang., 1995, v:3, n:4, pp:- [Journal ] Guang R. Gao A stability classification method and its application to pipelined solution of linear recurrences. [Citation Graph (0, 0)][DBLP ] Parallel Computing, 1987, v:4, n:3, pp:305-321 [Journal ] Guang R. Gao Exploiting fine-grain parallelism on dataflow architectures. [Citation Graph (0, 0)][DBLP ] Parallel Computing, 1990, v:13, n:3, pp:309-320 [Journal ] Walid A. Najjar , Edward A. Lee , Guang R. Gao Advances in the dataflow computational model. [Citation Graph (0, 0)][DBLP ] Parallel Computing, 1999, v:25, n:13-14, pp:1907-1929 [Journal ] Prasad Kakulavarapu , Olivier Maquelin , José Nelson Amaral , Guang R. Gao Dynamic Load Balancers for a Multithreaded Multiprocessor System. [Citation Graph (0, 0)][DBLP ] Parallel Processing Letters, 2001, v:11, n:1, pp:169-184 [Journal ] Qi Ning , Guang R. Gao Automatic Data and Computation Decomposition for Distributed-Memory Machines. [Citation Graph (0, 0)][DBLP ] Parallel Processing Letters, 1995, v:5, n:, pp:539-550 [Journal ] Jack B. Dennis , Guang R. Gao , Kenneth W. Todd Modeling the Weather with a Data Flow Supercomputer. [Citation Graph (0, 0)][DBLP ] IEEE Trans. Computers, 1984, v:33, n:7, pp:592-603 [Journal ] Guang R. Gao , Vivek Sarkar Location Consistency-A New Memory Model and Cache Consistency Protocol. [Citation Graph (0, 0)][DBLP ] IEEE Trans. Computers, 2000, v:49, n:8, pp:798-813 [Journal ] Ramaswamy Govindarajan , Hongbo Yang , José Nelson Amaral , Chihong Zhang , Guang R. Gao Minimum Register Instruction Sequencing to Reduce Register Spills in Out-of-Order Issue Superscalar Architectures. [Citation Graph (0, 0)][DBLP ] IEEE Trans. Computers, 2003, v:52, n:1, pp:4-20 [Journal ] Guang R. Gao , Trevor N. Mudge Special issue on compilers, architecture, and synthesis for embedded systems. [Citation Graph (0, 0)][DBLP ] ACM Trans. Embedded Comput. Syst., 2003, v:2, n:2, pp:131- [Journal ] Vugranam C. Sreedhar , Guang R. Gao , Yong-Fong Lee Identifying Loops Using DJ Graphs. [Citation Graph (0, 0)][DBLP ] ACM Trans. Program. Lang. Syst., 1996, v:18, n:6, pp:649-658 [Journal ] Vugranam C. Sreedhar , Guang R. Gao , Yong-Fong Lee Incremental Computation of Dominator Trees. [Citation Graph (0, 0)][DBLP ] ACM Trans. Program. Lang. Syst., 1997, v:19, n:2, pp:239-252 [Journal ] Vugranam C. Sreedhar , Guang R. Gao , Yong-Fong Lee A New Framework for Elimination-Based Data Flow Analysis Using DJ Graphs. [Citation Graph (0, 0)][DBLP ] ACM Trans. Program. Lang. Syst., 1998, v:20, n:2, pp:388-435 [Journal ] Ramaswamy Govindarajan , Erik R. Altman , Guang R. Gao A Framework for Resource-Constrained Rate-Optimal Software Pipelining. [Citation Graph (0, 0)][DBLP ] IEEE Trans. Parallel Distrib. Syst., 1996, v:7, n:11, pp:1133-1149 [Journal ] Daniel Orozco , Liping Xue , Murat Bolat , Xiaoming Li , Guang R. Gao Experience of Optimizing FFT on Intel Architectures. [Citation Graph (0, 0)][DBLP ] IPDPS, 2007, pp:1-8 [Conf ] Ge Gan , Ziang Hu , Juan del Cuvillo , Guang R. Gao Exploring a Multithreaded Methodology to Implement a Network Communication Protocol on the Cyclops-64 Multithreaded Architecture. [Citation Graph (0, 0)][DBLP ] IPDPS, 2007, pp:1-8 [Conf ] Weirong Zhu , Ziang Hu , Guang R. Gao On the Role of Deterministic Fine-Grain Data Synchronization for Scientific Applications: A Revisit in the Emerging Many-Core Era. [Citation Graph (0, 0)][DBLP ] IPDPS, 2007, pp:1-8 [Conf ] Long Chen , Ziang Hu , Junmin Lin , Guang R. Gao Optimizing the Fast Fourier Transform on a Multi-core Architecture. [Citation Graph (0, 0)][DBLP ] IPDPS, 2007, pp:1-8 [Conf ] Weirong Zhu , Ziang Hu , Guang R. Gao On the Role of Deterministic Fine-Grain Data Synchronization for Scientific Applications: A Revisit in the Emerging Many-Core Era. [Citation Graph (0, 0)][DBLP ] IPDPS, 2007, pp:1-8 [Conf ] Haiping Wu , Eunjung Park , Mihailo Kaplarevic , Yingping Zhang , Murat Bolat , Xiaoming Li , Guang R. Gao Automatic Program Segment Similarity Detection in Targeted Program Performance Improvement. [Citation Graph (0, 0)][DBLP ] IPDPS, 2007, pp:1-8 [Conf ] Guang R. Gao , Thomas L. Sterling , Rick Stevens , Mark Hereld , Weirong Zhu ParalleX: A Study of A New Parallel Computation Model. [Citation Graph (0, 0)][DBLP ] IPDPS, 2007, pp:1-6 [Conf ] Weirong Zhu , Vugranam C. Sreedhar , Ziang Hu , Guang R. Gao Synchronization state buffer: supporting efficient fine-grain synchronization on many-core architectures. [Citation Graph (0, 0)][DBLP ] ISCA, 2007, pp:35-45 [Conf ] Guang R. Gao On Parallel Models of Computation. [Citation Graph (0, 0)][DBLP ] NPC, 2007, pp:541- [Conf ] Guangming Tan , Ninghui Sun , Guang R. Gao A parallel dynamic programming algorithm on a multi-core architecture. [Citation Graph (0, 0)][DBLP ] SPAA, 2007, pp:135-144 [Conf ] Weirong Zhu , Yanwei Niu , Guang R. Gao Performance portability on EARTH: a case study across several parallel architectures. [Citation Graph (0, 0)][DBLP ] Cluster Computing, 2007, v:10, n:2, pp:115-126 [Journal ] Hongbo Rong , Zhizhong Tang , Ramaswamy Govindarajan , Alban Douillet , Guang R. Gao Single-dimension software pipelining for multidimensional loops. [Citation Graph (0, 0)][DBLP ] TACO, 2007, v:4, n:1, pp:- [Journal ] Software-Pipelining on Multi-Core Architectures. [Citation Graph (, )][DBLP ] Mapping the LU decomposition on a many-core architecture: challenges and solutions. [Citation Graph (, )][DBLP ] Minimizing communication in rate-optimal software pipelining for stream programs. [Citation Graph (, )][DBLP ] Tile Percolation: An OpenMP Tile Aware Parallelization Technique for the Cyclops-64 Multicore Processor. [Citation Graph (, )][DBLP ] A Study of a Software Cache Implementation of the OpenMP Memory Model for Multicore and Manycore Architectures. [Citation Graph (, )][DBLP ] Optimized Dense Matrix Multiplication on a Many-Core Architecture. [Citation Graph (, )][DBLP ] Discriminating transmembrane proteins from signal peptides using SVM-Fisher approach. [Citation Graph (, )][DBLP ] Mapping the FDTD Application to Many-Core Chip Architectures. [Citation Graph (, )][DBLP ] Just-In-Time Locality and Percolation for Optimizing Irregular Applications on a Manycore Architecture. [Citation Graph (, )][DBLP ] Concurrency Analysis for Shared Memory Programs with Textually Unaligned Barriers. [Citation Graph (, )][DBLP ] Minimum Lock Assignment: A Method for Exploiting Concurrency among Critical Sections. [Citation Graph (, )][DBLP ] Experience on optimizing irregular computation for memory hierarchy in manycore architecture. [Citation Graph (, )][DBLP ] Implementation of the Smith-Waterman algorithm on a reconfigurable supercomputing platform. [Citation Graph (, )][DBLP ] Efficient support of concurrent threads in a hybrid dataflow/von Neumann architecture. [Citation Graph (, )][DBLP ] Iterative layer-based raytracing on CUDA. [Citation Graph (, )][DBLP ] Tile Reduction: The First Step towards Tile Aware Parallelization in OpenMP. [Citation Graph (, )][DBLP ] Search in 0.012secs, Finished in 0.021secs