SCEAS

The SCEAS System

Search the dblp DataBase

Title:
Author:

Wen-mei W. Hwu: [Publications] [Author Rank by year] [Co-authors] [Prefers] [Cites] [Cited by]

Publications of Author

Brian L. Deitrich, Ben-Chung Cheng, Wen-mei W. Hwu
Improving Static Branch Prediction in a Compiler. [Citation Graph (0, 0)][DBLP]
IEEE PACT, 1998, pp:214-221 [Conf]
Erik M. Nystrom, Ronald D. Barnes, Matthew C. Merten, Wen-mei W. Hwu
Code Reordering and Speculation Support for Dynamic Optimization System. [Citation Graph (0, 0)][DBLP]
IEEE PACT, 2001, pp:163-174 [Conf]
David M. Gallagher, William Y. Chen, Scott A. Mahlke, John C. Gyllenhaal, Wen-mei W. Hwu
Dynamic Memory Disambiguation Using the Memory Conflict Buffer. [Citation Graph (0, 0)][DBLP]
ASPLOS, 1994, pp:183-193 [Conf]
Daniel A. Connors, Hillery C. Hunter, Ben-Chung Cheng, Wen-mei W. Hwu
Hardware Support for Dynamic Management of Compiler-Directed Computation Reuse. [Citation Graph (0, 0)][DBLP]
ASPLOS, 2000, pp:222-233 [Conf]
Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu, B. Ramakrishna Rau, Michael S. Schlansker
Sentinel Scheduling for VLIW and Superscalar Processors. [Citation Graph (0, 0)][DBLP]
ASPLOS, 1992, pp:238-247 [Conf]
Hillery C. Hunter, Wen-mei W. Hwu
Code coverage and input variability: effects on architecture and compiler research. [Citation Graph (0, 0)][DBLP]
CASES, 2002, pp:79-87 [Conf]
Yale N. Patt, Wen-mei W. Hwu, Stephen W. Melvin, Michael Shebanow, Chein Chen, Jiajuin Wei
Experiments with HPS, a Restricted Data Flow Microarchitecture for High Performance Computers. [Citation Graph (0, 0)][DBLP]
COMPCON, 1986, pp:254-258 [Conf]
Daniel A. Connors, Jean-Michel Puiatti, David I. August, Kevin M. Crozier, Wen-mei W. Hwu
An Architecture Framework for Introducing Predicated Execution into Embedded Microprocessors. [Citation Graph (0, 0)][DBLP]
Euro-Par, 1999, pp:1301-1311 [Conf]
Neal J. Alewine, Shyh-Kwei Chen, Chung-Chi Jim Li, W. Kent Fuchs, Wen-mei W. Hwu
Branch Recovery with Compiler-Assisted Multiple Instruction Retry. [Citation Graph (0, 0)][DBLP]
FTCS, 1992, pp:66-73 [Conf]
Roger A. Bringmann, Scott A. Mahlke, Wen-mei W. Hwu
A study of the effects of compiler-controlled speculation on instruction and data caches. [Citation Graph (0, 0)][DBLP]
HICSS (1), 1995, pp:211-220 [Conf]
David I. August, Daniel A. Connors, John C. Gyllenhaal, Wen-mei W. Hwu
Architectural Support for Compiler-Synthesized Dynamic Branch Prediction Strategies: Rationale and Initial Results. [Citation Graph (0, 0)][DBLP]
HPCA, 1997, pp:84-93 [Conf]
Wen-mei W. Hwu, Sanjay J. Patel
The Future of Computer Architecture Research: An Industrial Perspective. [Citation Graph (0, 0)][DBLP]
HPCA, 2005, pp:264- [Conf]
W. Kent Fuchs, Wen-mei W. Hwu, Neal J. Alewine
Application of Compiler-Assisted Rollback Recovery to Speculative Execution Repair. [Citation Graph (0, 0)][DBLP]
Hardware and Software Architectures for Fault Tolerance, 1993, pp:45-65 [Conf]
Sadun Anik, Wen-mei W. Hwu
Executing Nested Parallel Loops on Shared-Memory Multiprocessors. [Citation Graph (0, 0)][DBLP]
ICPP (3), 1992, pp:241-244 [Conf]
Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu
An Analytical Approach to Scheduling Code for Superscalar and VLIW Architectures. [Citation Graph (0, 0)][DBLP]
ICPP (1), 1994, pp:285-292 [Conf]
William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu
Tolerating First Level Memory Access Latency in High-Performance Systems. [Citation Graph (0, 0)][DBLP]
ICPP (1), 1992, pp:36-43 [Conf]
Scott A. Mahlke, Nancy J. Warter, William Y. Chen, Pohua P. Chang, Wen-mei W. Hwu
The Effect of Compiler Optimizations on Available Parallelism in Scalar Programs. [Citation Graph (0, 0)][DBLP]
ICPP (2), 1991, pp:142-145 [Conf]
William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu, Tokuzo Kiyohara, Pohua P. Chang
Tolerating data access latency with register preloading. [Citation Graph (0, 0)][DBLP]
ICS, 1992, pp:104-113 [Conf]
Pohua P. Chang, Wen-mei W. Hwu
Control flow optimization for supercomputer scalar processing. [Citation Graph (0, 0)][DBLP]
ICS, 1989, pp:145-153 [Conf]
Jeffrey P. Monks, Vaduvur Bharghavan, Wen-mei W. Hwu
A Power Controlled Multiple Access Protocol for Wireless Packet Networks. [Citation Graph (0, 0)][DBLP]
INFOCOM, 2001, pp:219-228 [Conf]
David I. August, Daniel A. Connors, Scott A. Mahlke, John W. Sias, Kevin M. Crozier, Ben-Chung Cheng, Patrick R. Eaton, Qudus B. Olaniran, Wen-mei W. Hwu
Integrated Predicated and Speculative Execution in the IMPACT EPIC Architecture. [Citation Graph (0, 0)][DBLP]
ISCA, 1998, pp:227-237 [Conf]
David I. August, John W. Sias, Jean-Michel Puiatti, Scott A. Mahlke, Daniel A. Connors, Kevin M. Crozier, Wen-mei W. Hwu
The Program Decision Logic Approach to Predicated Execution. [Citation Graph (0, 0)][DBLP]
ISCA, 1999, pp:208-219 [Conf]
Wen-mei W. Hwu
Retrospective: IMPACT: An Architectural Framework for Multiple-Instruction Issue. [Citation Graph (0, 0)][DBLP]
25 Years ISCA: Retrospectives and Reprints, 1998, pp:77-79 [Conf]
Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Warter, Wen-mei W. Hwu
IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors. [Citation Graph (0, 0)][DBLP]
ISCA, 1991, pp:266-275 [Conf]
Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Warter, Wen-mei W. Hwu
IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors. [Citation Graph (0, 0)][DBLP]
25 Years ISCA: Retrospectives and Reprints, 1998, pp:408-417 [Conf]
Wen-mei W. Hwu, Pohua P. Chang
Exploiting Parallel Microprocessor Microarchitectures With a Compiler Code Generator. [Citation Graph (0, 0)][DBLP]
ISCA, 1988, pp:45-53 [Conf]
Wen-mei W. Hwu, Pohua P. Chang
Achieving High Instruction Cache Performance with an Optimizing Compiler. [Citation Graph (0, 0)][DBLP]
ISCA, 1989, pp:242-251 [Conf]
Wen-mei W. Hwu, Thomas M. Conte, Pohua P. Chang
Comparing Software and Hardware Schemes For Reducing the Cost of Branches. [Citation Graph (0, 0)][DBLP]
ISCA, 1989, pp:224-233 [Conf]
Wen-mei W. Hwu, Yale N. Patt
HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality. [Citation Graph (0, 0)][DBLP]
ISCA, 1986, pp:297-306 [Conf]
Wen-mei W. Hwu, Yale N. Patt
Checkpoint Repair for Out-of-order Execution Machines. [Citation Graph (0, 0)][DBLP]
ISCA, 1987, pp:18-26 [Conf]
Wen-mei W. Hwu, Yale N. Patt
Retrospective: HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality. [Citation Graph (0, 0)][DBLP]
25 Years ISCA: Retrospectives and Reprints, 1998, pp:43-44 [Conf]
Wen-mei W. Hwu, Yale N. Patt
HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality. [Citation Graph (0, 0)][DBLP]
25 Years ISCA: Retrospectives and Reprints, 1998, pp:300-308 [Conf]
Teresa L. Johnson, Wen-mei W. Hwu
Run-Time Adaptive Cache Hierarchy Management via Reference Analysis. [Citation Graph (0, 0)][DBLP]
ISCA, 1997, pp:315-326 [Conf]
Tokuzo Kiyohara, Scott A. Mahlke, William Y. Chen, Roger A. Bringmann, Richard E. Hank, Sadun Anik, Wen-mei W. Hwu
Register Connection: A New Approach to Adding Registers into Instruction Set Architectures. [Citation Graph (0, 0)][DBLP]
ISCA, 1993, pp:247-256 [Conf]
Scott A. Mahlke, Richard E. Hank, James E. McCormick, David I. August, Wen-mei W. Hwu
A Comparison of Full and Partial Predicated Execution Support for ILP Processors. [Citation Graph (0, 0)][DBLP]
ISCA, 1995, pp:138-150 [Conf]
Matthew C. Merten, Andrew R. Trick, Christopher N. George, John C. Gyllenhaal, Wen-mei W. Hwu
A Hardware-Driven Profiling Scheme for Identifying Program Hot Spots to Support Runtime Optimization. [Citation Graph (0, 0)][DBLP]
ISCA, 1999, pp:136-147 [Conf]
Matthew C. Merten, Andrew R. Trick, Erik M. Nystrom, Ronald D. Barnes, Wen-mei W. Hwu
A hardware mechanism for dynamic extraction and relayout of program hot spots. [Citation Graph (0, 0)][DBLP]
ISCA, 2000, pp:59-70 [Conf]
John W. Sias, Sain-zee Ueng, Geoff A. Kent, Ian M. Steiner, Erik M. Nystrom, Wen-mei W. Hwu
Field-testing IMPACT EPIC research results in Itanium 2. [Citation Graph (0, 0)][DBLP]
ISCA, 2004, pp:26-39 [Conf]
Jeffrey P. Monks, Vaduvur Bharghavan, Wen-mei W. Hwu
Transmission Power Control for Multiple Access Wireless Packet Networks. [Citation Graph (0, 0)][DBLP]
LCN, 2000, pp:12-21 [Conf]
Jeffrey P. Monks, Jean-Pierre Ebert, Adam Wolisz, Wen-mei W. Hwu
A Study of the Energy Saving and Capacity Improvement Potential of Power Control in Multi-Hop Wireless Networks. [Citation Graph (0, 0)][DBLP]
LCN, 2001, pp:550-559 [Conf]
Lakshmi N. Chakrapani, John C. Gyllenhaal, Wen-mei W. Hwu, Scott A. Mahlke, Krishna V. Palem, Rodric M. Rabbah
Trimaran: An Infrastructure for Research in Instruction-Level Parallelism. [Citation Graph (0, 0)][DBLP]
LCPC, 2004, pp:32-41 [Conf]
William Y. Chen, Roger A. Bringmann, Scott A. Mahlke, Sadun Anik, Tokuzo Kiyohara, Nancy J. Warter, Daniel M. Lavery, Wen-mei W. Hwu, Richard E. Hank, John C. Gyllenhaal
Using Profile Information to Assist Advaced Compiler Optimization and Scheduling. [Citation Graph (0, 0)][DBLP]
LCPC, 1992, pp:31-48 [Conf]
Ben-Chung Cheng, Wen-mei W. Hwu
An Empirical Study of Function Pointers Using SPEC Benchmarks. [Citation Graph (0, 0)][DBLP]
LCPC, 1999, pp:490-493 [Conf]
Daniel A. Connors, Wen-mei W. Hwu
Compiler-Directed Dynamic Computation Reuse: Rationale and Initial Results. [Citation Graph (0, 0)][DBLP]
MICRO, 1999, pp:158-169 [Conf]
Roger A. Bringmann, Scott A. Mahlke, Richard E. Hank, John C. Gyllenhaal, Wen-mei W. Hwu
Speculative execution exception recovery using write-back suppression. [Citation Graph (0, 0)][DBLP]
MICRO, 1993, pp:214-223 [Conf]
David I. August, Wen-mei W. Hwu, Scott A. Mahlke
A Framework for Balancing Control Flow and Predication. [Citation Graph (0, 0)][DBLP]
MICRO, 1997, pp:92-103 [Conf]
Pohua P. Chang, William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu
Comparing Static and Dynamic Code Scheduling for Multiple-Instruction-Issue Processors. [Citation Graph (0, 0)][DBLP]
MICRO, 1991, pp:25-33 [Conf]
Pohua P. Chang, Wen-mei W. Hwu
Trace selection for compiling large C application programs to microcode. [Citation Graph (0, 0)][DBLP]
MICRO, 1988, pp:21-29 [Conf]
P.-H. Chang, Wen-mei W. Hwu
Forward semantic: a compiler-assisted instruction fetch method for heavily pipelined processors. [Citation Graph (0, 0)][DBLP]
MICRO, 1989, pp:188-198 [Conf]
Ronald D. Barnes, Erik M. Nystrom, Matthew C. Merten, Wen-mei W. Hwu
Vacuum packing: extracting hardware-detected program phases for post-link optimization. [Citation Graph (0, 0)][DBLP]
MICRO, 2002, pp:233-244 [Conf]
Ronald D. Barnes, Erik M. Nystrom, John W. Sias, Sanjay J. Patel, Nacho Navarro, Wen-mei W. Hwu
Beating in-order stalls with "flea-flicker" two-pass pipelining. [Citation Graph (0, 0)][DBLP]
MICRO, 2003, pp:387-398 [Conf]
Ronald D. Barnes, Shane Ryoo, Wen-mei W. Hwu
"Flea-flicker" Multipass Pipelining: An Alternative to the High-Power Out-of-Order Offense. [Citation Graph (0, 0)][DBLP]
MICRO, 2005, pp:319-330 [Conf]
William Y. Chen, Scott A. Mahlke, Pohua P. Chang, Wen-mei W. Hwu
Data Access Microarchitectures for Superscalar Processors with Compiler-Assisted Data Prefetching. [Citation Graph (0, 0)][DBLP]
MICRO, 1991, pp:69-73 [Conf]
Ben-Chung Cheng, Daniel A. Connors, Wen-mei W. Hwu
Compiler-Directed Early Load-Address Generation. [Citation Graph (0, 0)][DBLP]
MICRO, 1998, pp:138-147 [Conf]
Brian L. Deitrich, Wen-mei W. Hwu
Speculative Hedge: Regulating Compile-time Speculation Against Profile Variations. [Citation Graph (0, 0)][DBLP]
MICRO, 1996, pp:70-79 [Conf]
Wen-mei W. Hwu, Yale N. Patt
Exploiting horizontal and vertical concurrency via the HPSm microprocessor. [Citation Graph (0, 0)][DBLP]
MICRO, 1987, pp:154-161 [Conf]
Teresa L. Johnson, Matthew C. Merten, Wen-mei W. Hwu
Run-Time Spatial Locality Detection and Optimization. [Citation Graph (0, 0)][DBLP]
MICRO, 1997, pp:57-64 [Conf]
John C. Gyllenhaal, Wen-mei W. Hwu, B. Ramakrishna Rau
Optimization of Machine Descriptions for Efficient Use. [Citation Graph (0, 0)][DBLP]
MICRO, 1996, pp:349-358 [Conf]
Richard E. Hank, Wen-mei W. Hwu, B. Ramakrishna Rau
Region-based compilation: an introduction and motivation. [Citation Graph (0, 0)][DBLP]
MICRO, 1995, pp:158-168 [Conf]
Richard E. Hank, Scott A. Mahlke, Roger A. Bringmann, John C. Gyllenhaal, Wen-mei W. Hwu
Superblock formation using static program analysis. [Citation Graph (0, 0)][DBLP]
MICRO, 1993, pp:247-255 [Conf]
Cheng-Hsueh A. Hsieh, John C. Gyllenhaal, Wen-mei W. Hwu
Java Bytecode to Native Code Translation: The Caffeine Prototype and Preliminary Results. [Citation Graph (0, 0)][DBLP]
MICRO, 1996, pp:90-99 [Conf]
Daniel M. Lavery, Wen-mei W. Hwu
Unrolling-based optimizations for modulo scheduling. [Citation Graph (0, 0)][DBLP]
MICRO, 1995, pp:327-337 [Conf]
Daniel M. Lavery, Wen-mei W. Hwu
Modulo Scheduling of Loops in Control-intensive Non-numeric Programs. [Citation Graph (0, 0)][DBLP]
MICRO, 1996, pp:126-137 [Conf]
Scott A. Mahlke, Richard E. Hank, Roger A. Bringmann, John C. Gyllenhaal, David M. Gallagher, Wen-mei W. Hwu
Characterizing the impact of predicated execution on branch prediction. [Citation Graph (0, 0)][DBLP]
MICRO, 1994, pp:217-227 [Conf]
Matthew C. Merten, Wen-mei W. Hwu
Modulo schedule buffers. [Citation Graph (0, 0)][DBLP]
MICRO, 2001, pp:138-149 [Conf]
John W. Sias, Wen-mei W. Hwu, David I. August
Accurate and efficient predicate analysis with binary decision diagrams. [Citation Graph (0, 0)][DBLP]
MICRO, 2000, pp:112-123 [Conf]
John W. Sias, Hillery C. Hunter, Wen-mei W. Hwu
Enhancing loop buffering of media and telecommunications applications using low-overhead predication. [Citation Graph (0, 0)][DBLP]
MICRO, 2001, pp:262-273 [Conf]
Yoji Yamada, John Gyllenhall, Grant Haab, Wen-mei W. Hwu
Data relocation and prefetching for programs with large data sets. [Citation Graph (0, 0)][DBLP]
MICRO, 1994, pp:118-127 [Conf]
James E. Wilson, Stephen W. Melvin, Michael Shebanow, Wen-mei W. Hwu, Yale N. Patt
On tuning the microarchitecture of an HPS implementation of the VAX. [Citation Graph (0, 0)][DBLP]
MICRO, 1987, pp:162-167 [Conf]
Erik M. Nystrom, Hong-Seok Kim, Wen-mei W. Hwu
Importance of heap specialization in pointer analysis. [Citation Graph (0, 0)][DBLP]
PASTE, 2004, pp:43-48 [Conf]
Ben-Chung Cheng, Wen-mei W. Hwu
Modular interprocedural pointer analysis using access paths: design, implementation, and evaluation. [Citation Graph (0, 0)][DBLP]
PLDI, 2000, pp:57-69 [Conf]
Wen-mei W. Hwu, Pohua P. Chang
Inline Function Expansion for Compiling C Programs. [Citation Graph (0, 0)][DBLP]
PLDI, 1989, pp:246-257 [Conf]
Nancy J. Warter, Scott A. Mahlke, Wen-mei W. Hwu, B. Ramakrishna Rau
Reverse If-Conversion. [Citation Graph (0, 0)][DBLP]
PLDI, 1993, pp:290-299 [Conf]
Le-Chun Wu, Rajiv Mirani, Harish Patil, Bruce Olsen, Wen-mei W. Hwu
A New Framework for Debugging Globally Optimized Code. [Citation Graph (0, 0)][DBLP]
PLDI, 1999, pp:181-191 [Conf]
Erik M. Nystrom, Hong-Seok Kim, Wen-mei W. Hwu
Bottom-Up and Top-Down Context-Sensitive Summary-Based Pointer Analysis. [Citation Graph (0, 0)][DBLP]
SAS, 2004, pp:165-180 [Conf]
Scott A. Mahlke, William Y. Chen, John C. Gyllenhaal, Wen-mei W. Hwu
Compiler Code Transformations for Superscalar-Based High Performance Systems. [Citation Graph (0, 0)][DBLP]
SC, 1992, pp:808-817 [Conf]
Aloke Gupta, Wen-mei W. Hwu
Xprof: Profiling the Execution of X Window Programs. [Citation Graph (0, 0)][DBLP]
SIGMETRICS, 1992, pp:253-254 [Conf]
Wen-mei W. Hwu, Thomas M. Conte
A Simulation Study of Simultaneous Vector Prefetch Performance in Multiprocessor Memory Subsystems (Extended Abstract). [Citation Graph (0, 0)][DBLP]
SIGMETRICS, 1989, pp:227- [Conf]
Thomas M. Conte, Wen-mei W. Hwu
Advances in Benchmarking Techniques: New Standards and Quantitative Metrics. [Citation Graph (0, 0)][DBLP]
Advances in Computers, 1995, v:41, n:, pp:231-253 [Journal]
Jeffrey P. Monks, Jean-Pierre Ebert, Wen-mei W. Hwu, Adam Wolisz
Energy saving and capacity improvement potential of power control in multi-hop wireless networks. [Citation Graph (0, 0)][DBLP]
Computer Networks, 2003, v:41, n:3, pp:313-330 [Journal]
Wen-mei W. Hwu
Introduction to Predicate Execution. [Citation Graph (0, 0)][DBLP]
IEEE Computer, 1998, v:31, n:, pp:49-50 [Journal]
Thomas M. Conte, Wen-mei W. Hwu
Benchmark Characterization. [Citation Graph (0, 0)][DBLP]
IEEE Computer, 1991, v:24, n:1, pp:48-56 [Journal]
Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C. Gyllenhaal, Wen-mei W. Hwu
Optimizing NET Compilers for Improved Java Performance. [Citation Graph (0, 0)][DBLP]
IEEE Computer, 1997, v:30, n:6, pp:67-75 [Journal]
John C. Gyllenhaal, Wen-mei W. Hwu, B. Ramakrishna Rau
Optimization of Machine Descriptions for Efficient Use. [Citation Graph (0, 0)][DBLP]
International Journal of Parallel Programming, 1998, v:26, n:4, pp:417-447 [Journal]
David I. August, Wen-mei W. Hwu, Scott A. Mahlke
The Partial Reverse If-Conversion Framework for Balancing Control Flow and Predication. [Citation Graph (0, 0)][DBLP]
International Journal of Parallel Programming, 1999, v:27, n:5, pp:381-423 [Journal]
Steve Beaty, Wen-mei W. Hwu
Foreword to the Special Issue. [Citation Graph (0, 0)][DBLP]
International Journal of Parallel Programming, 1998, v:26, n:4, pp:345-347 [Journal]
Thomas M. Conte, Wen-mei W. Hwu, Mark Smotherman
Editor's Introduction. [Citation Graph (0, 0)][DBLP]
International Journal of Parallel Programming, 1999, v:27, n:5, pp:325-326 [Journal]
Thomas M. Conte, Wen-mei W. Hwu, Mark Smotherman
Editors' Introduction. [Citation Graph (0, 0)][DBLP]
International Journal of Parallel Programming, 1999, v:27, n:6, pp:425-426 [Journal]
Sadun Anik, Wen-mei W. Hwu
Performance Implications of Synchronization Support for Parallel Fortran Programs. [Citation Graph (0, 0)][DBLP]
J. Parallel Distrib. Comput., 1994, v:22, n:2, pp:202-215 [Journal]
Ronald D. Barnes, Shane Ryoo, Wen-mei W. Hwu
Tolerating Cache-Miss Latency with Multipass Pipelines. [Citation Graph (0, 0)][DBLP]
IEEE Micro, 2006, v:26, n:1, pp:40-47 [Journal]
Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu
Profile-guided Automatic Inline Expansion for C Programs. [Citation Graph (0, 0)][DBLP]
Softw., Pract. Exper., 1992, v:22, n:5, pp:349-369 [Journal]
Pohua P. Chang, Scott A. Mahlke, Wen-mei W. Hwu
Using Profile Information to Assist Classic Code Optimizations. [Citation Graph (0, 0)][DBLP]
Softw., Pract. Exper., 1991, v:21, n:12, pp:1301-1321 [Journal]
Shyh-Kwei Chen, Neal J. Alewine, W. Kent Fuchs, Wen-mei W. Hwu
Incremental Compiler Transformations for Multiple Instruction Retry. [Citation Graph (0, 0)][DBLP]
Softw., Pract. Exper., 1994, v:24, n:12, pp:1179-1198 [Journal]
Aloke Gupta, Wen-mei W. Hwu
An execution Profiler for Window-oriented Applications. [Citation Graph (0, 0)][DBLP]
Softw., Pract. Exper., 1993, v:23, n:5, pp:487-510 [Journal]
Neal J. Alewine, Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu
Compiler-Assisted Multiple Instruction Rollback Recovery Using a Read Buffer. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1995, v:44, n:9, pp:1096-1107 [Journal]
Ronald D. Barnes, John W. Sias, Erik M. Nystrom, Sanjay J. Patel, Jose (Nacho) Navarro, Wen-mei W. Hwu
Beating In-Order Stalls with "Flea-Flicker" Two-Pass Pipelining. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 2006, v:55, n:1, pp:18-33 [Journal]
Pohua P. Chang, Daniel M. Lavery, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu
The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1995, v:44, n:3, pp:353-370 [Journal]
Pohua P. Chang, Nancy J. Warter, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu
Three Architecutral Models for Compiler-Controlled Speculative Execution. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1995, v:44, n:4, pp:481-494 [Journal]
William Y. Chen, Pohua P. Chang, Thomas M. Conte, Wen-mei W. Hwu
The Effect of Code Expanding Optimizations on Instruction Cache Design. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1993, v:42, n:9, pp:1045-1057 [Journal]
Thomas M. Conte, Mary Ann Hirsch, Wen-mei W. Hwu
Combining Trace Sampling with Single Pass Methods for Efficient Cache Simulation. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1998, v:47, n:6, pp:714-720 [Journal]
Wen-mei W. Hwu, Pohua P. Chang
Efficient Instruction Sequencing with Inline Target Insertion. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1992, v:41, n:12, pp:1537-1551 [Journal]
Wen-mei W. Hwu, Thomas M. Conte
The Susceptibility of Programs to Context Switching. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1994, v:43, n:9, pp:994-1003 [Journal]
Wen-mei W. Hwu, Krishna V. Palem
Guest Editors' Introduction. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 2005, v:54, n:10, pp:1185-1187 [Journal]
Wen-mei W. Hwu, Yale N. Patt
Checkpoint Repair for High-Performance Out-of-Order Execution Machines. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1987, v:36, n:12, pp:1496-1514 [Journal]
Teresa L. Johnson, Daniel A. Connors, Matthew C. Merten, Wen-mei W. Hwu
Run-Time Cache Bypassing. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1999, v:48, n:12, pp:1338-1354 [Journal]
Chung-Chi Jim Li, Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu
Compiler-Based Multiple Instruction Retry. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 1995, v:44, n:1, pp:35-46 [Journal]
Matthew C. Merten, Andrew R. Trick, Ronald D. Barnes, Erik M. Nystrom, Christopher N. George, John C. Gyllenhaal, Wen-mei W. Hwu
An Architectural Framework for Runtime Optimization. [Citation Graph (0, 0)][DBLP]
IEEE Trans. Computers, 2001, v:50, n:6, pp:567-589 [Journal]
Scott A. Mahlke, William Y. Chen, Roger A. Bringmann, Richard E. Hank, Wen-mei W. Hwu, B. Ramakrishna Rau, Michael S. Schlansker
Sentinel Scheduling for VLIW and Superscalar Processors. [Citation Graph (0, 0)][DBLP]
ACM Trans. Comput. Syst., 1993, v:11, n:4, pp:376-408 [Journal]
Wen-mei W. Hwu, Shane Ryoo, Sain-zee Ueng, John H. Kelm, Isaac Gelado, Sam S. Stone, Robert E. Kidd, Sara S. Baghsorkhi, Aqeel Mahesri, Stephanie C. Tsao, Nacho Navarro, Steven S. Lumetta, Matthew I. Frank, Sanjay J. Patel
Implicitly Parallel Programming Models for Thousand-Core Microprocessors. [Citation Graph (0, 0)][DBLP]
DAC, 2007, pp:754-759 [Conf]
Lauren Sarno, Wen-mei W. Hwu, Craig Lund, Markus Levy, James R. Larus, James Reinders, Gordon Cameron, Chris Lennard, Takashi Yoshimori
Corezilla: Build and Tame the Multicore Beast? [Citation Graph (0, 0)][DBLP]
DAC, 2007, pp:632-633 [Conf]
XMalloc: A Scalable Lock-free Dynamic Memory Allocator for Many-core Machines. [Citation Graph (, )][DBLP]
CIGAR: Application Partitioning for a CPU/Coprocessor Architecture. [Citation Graph (, )][DBLP]
Optimization of tele-immersion codes. [Citation Graph (, )][DBLP]
High performance computation and interactive display of molecular orbitals on GPUs and multi-core CPUs. [Citation Graph (, )][DBLP]
An asymmetric distributed shared memory model for heterogeneous parallel systems. [Citation Graph (, )][DBLP]
Accelerating advanced mri reconstructions on gpus. [Citation Graph (, )][DBLP]
GPU acceleration of cutoff pair potentials for molecular modeling applications. [Citation Graph (, )][DBLP]
Program optimization space pruning for a multithreaded gpu. [Citation Graph (, )][DBLP]
Efficient compilation of fine-grained SPMD-threaded programs for multicore CPUs. [Citation Graph (, )][DBLP]
GPU clusters for high-performance computing. [Citation Graph (, )][DBLP]
CUBA: an architecture for efficient CPU/co-processor data communication. [Citation Graph (, )][DBLP]
High-performance CUDA kernel execution on FPGAs. [Citation Graph (, )][DBLP]
Many-core parallel computing - Can compilers and tools do the heavy lifting? [Citation Graph (, )][DBLP]
Long time-scale simulations of in vivo diffusion using GPU hardware. [Citation Graph (, )][DBLP]
Accelerating MR Image Reconstruction on GPUS. [Citation Graph (, )][DBLP]
Iteration Disambiguation for Parallelism Identification in Time-Sliced Applications. [Citation Graph (, )][DBLP]
CUDA-Lite: Reducing GPU Programming Complexity. [Citation Graph (, )][DBLP]
MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs. [Citation Graph (, )][DBLP]
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. [Citation Graph (, )][DBLP]
An adaptive performance modeling tool for GPU architectures. [Citation Graph (, )][DBLP]
Application Acceleration with the Explicitly Parallel Operations System - the EPOS Processor. [Citation Graph (, )][DBLP]
FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs. [Citation Graph (, )][DBLP]
Compute Unified Device Architecture Application Suitability. [Citation Graph (, )][DBLP]

Search in 0.095secs, Finished in 0.100secs

NOTICE1
System may not be available sometimes or not working properly, since it is still in development with continuous upgrades

NOTICE2
The rankings that are presented on this page should NOT be considered as formal since the citation info is incomplete in DBLP