Wen-mei W. Hwu
Senior Distinguished Research Scientist, NVIDIA; Professor and Sanders-AMD Chair of Electrical and
Verified email at illinois.edu
Title / Cited by / Year
Programming massively parallel processors: a hands-on approach
DB Kirk, WW Hwu
Morgan kaufmann, 2016
Cited by: 4170; Year: 2016
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
S Ryoo, CI Rodrigues, SS Baghsorkhi, SS Stone, DB Kirk, WW Hwu
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008
Cited by: 1322; Year: 2008
A power controlled multiple access protocol for wireless packet networks
JP Monks, V Bharghavan, WMW Hwu
Proceedings IEEE INFOCOM 2001. Conference on Computer Communications …, 2001
Cited by: 996; Year: 2001
Parboil: A revised benchmark suite for scientific and commercial throughput computing
JA Stratton, C Rodrigues, IJ Sung, N Obeid, LW Chang, N Anssari, GD Liu, ...
Center for Reliable and High-Performance Computing 127 (7.2), 2012
Cited by: 974; Year: 2012
The superblock: An effective technique for VLIW and superscalar compilation
WMW Hwu, SA Mahlke, WY Chen, PP Chang, NJ Warter, RA Bringmann, ...
Instruction-Level Parallelism: A Special Issue of The Journal of …, 2011
Cited by: 884; Year: 2011
GPU computing gems jade edition
W Hwu
Elsevier, 2011
Cited by: 507*; Year: 2011
IMPACT: An architectural framework for multiple-instruction-issue processors
PP Chang, SA Mahlke, WY Chen, NJ Warter, WW Hwu
ACM SIGARCH Computer Architecture News 19 (3), 266-275, 1991
Cited by: 505; Year: 1991
PUMA: A programmable ultra-efficient memristor-based accelerator for machine learning inference
A Ankit, IE Hajj, SR Chalamalasetti, G Ndu, M Foltin, RS Williams, ...
Proceedings of the twenty-fourth international conference on architectural …, 2019
Cited by: 471; Year: 2019
An adaptive performance modeling tool for GPU architectures
SS Baghsorkhi, M Delahaye, SJ Patel, WD Gropp, WW Hwu
Proceedings of the 15th ACM SIGPLAN symposium on Principles and practice of …, 2010
Cited by: 431; Year: 2010
Accelerating advanced MRI reconstructions on GPUs
SS Stone, JP Haldar, SC Tsao, WW Hwu, ZP Liang, BP Sutton
Proceedings of the 5th conference on Computing frontiers, 261-272, 2008
Cited by: 422; Year: 2008
DNNBuilder: An automated tool for building high-performance DNN hardware accelerators for FPGAs
X Zhang, J Wang, C Zhu, Y Lin, J Xiong, W Hwu, D Chen
2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2018
Cited by: 401; Year: 2018
Program optimization space pruning for a multithreaded GPU
S Ryoo, CI Rodrigues, SS Stone, SS Baghsorkhi, SZ Ueng, JA Stratton, ...
Proceedings of the 6th annual IEEE/ACM international symposium on Code …, 2008
Cited by: 390; Year: 2008
MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs
JA Stratton, SS Stone, WMW Hwu
Languages and Compilers for Parallel Computing: 21th International Workshop …, 2008
Cited by: 368; Year: 2008
Checkpoint repair for out-of-order execution machines
WW Hwu, YN Patt
Proceedings of the 14th annual international symposium on Computer …, 1987
Cited by: 358; Year: 1987
Using profile information to assist classic code optimizations
PP Chang, SA Mahlke, WMW Hwu
Software: Practice and Experience 21 (12), 1301-1321, 1991
Cited by: 353; Year: 1991
An effective GPU implementation of breadth-first search
L Luo, M Wong, W Hwu
Proceedings of the 47th design automation conference, 52-55, 2010
Cited by: 343; Year: 2010
CUDA-lite: Reducing GPU programming complexity
SZ Ueng, M Lathara, SS Baghsorkhi, WMW Hwu
Languages and Compilers for Parallel Computing: 21th International Workshop …, 2008
Cited by: 335; Year: 2008
GPU clusters for high-performance computing
VV Kindratenko, JJ Enos, G Shi, MT Showerman, GW Arnold, JE Stone, ...
2009 IEEE International Conference on Cluster Computing and Workshops, 1-8, 2009
Cited by: 331; Year: 2009
Achieving high instruction cache performance with an optimizing compiler
WW Hwu, PP Chang
Proceedings of the 16th Annual International Symposium on Computer …, 1989
Cited by: 304; Year: 1989
Differential treatment for stuff and things: A simple unsupervised domain adaptation method for semantic segmentation
Z Wang, M Yu, Y Wei, R Feris, J Xiong, W Hwu, TS Huang, H Shi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by: 275; Year: 2020