Gennady Pekhimenko

Cited by

	All	Since 2020
Citations	6801	4659
h-index	42	37
i10-index	65	60

Citations per year
Year	Citations
2013	24
2014	58
2015	153
2016	346
2017	339
2018	605
2019	534
2020	660
2021	816
2022	905
2023	957
2024	1090
2025	221

Public access

44 articles available
1 article not available
Based on funding mandates

Co-authors

Onur Mutlu - ETH Zürich and Carnegie Mellon University - Verified email at inf.ethz.ch
Donghyuk Lee - NVIDIA - Verified email at nvidia.com
Todd C. Mowry - Professor of Computer Science, Carnegie Mellon University - Verified email at cs.cmu.edu
Samira Khan - University of Virginia - Verified email at virginia.edu
Vivek Seshadri - Student, CS, CMU - Verified email at cs.cmu.edu
Saugata Ghose - University of Illinois Urbana-Champaign - Verified email at illinois.edu
Phillip Gibbons - Carnegie Mellon University - Verified email at cs.cmu.edu
Michael A. Kozuch - Intel - Verified email at intel.com
Bojian Zheng - University of Toronto - Verified email at cs.toronto.edu
Nandita Vijaykumar - Assistant Professor, University of Toronto - Verified email at cs.toronto.edu
Yoongu Kim - Graduate Student, Carnegie Mellon University - Verified email at cmu.edu
Hasan Hassan - ETH Zurich - Verified email at inf.ethz.ch
Anand Jayarajan - University of Toronto - Verified email at cs.toronto.edu
Rachata Ausavarungnirun - MangoBoost - Verified email at mangoboost.io
Kevin Hsieh - Principal Researcher at Microsoft - Verified email at microsoft.com
Oğuz Ergin - Professor, University of Sharjah, TOBB ETÜ (on leave), Sharjah, UAE - Verified email at etu.edu.tr
Amar Phanishayee - Microsoft Research - Verified email at cs.cmu.edu
Hadi Esmaeilzadeh - Associate Professor; Computer Science and Engineering; University of California, San Diego - Verified email at eng.ucsd.edu
Yixin Luo - Carnegie Mellon University - Verified email at cs.cmu.edu
Chris Fallin - Graduate Student, Carnegie Mellon University - Verified email at cmu.edu

Gennady Pekhimenko

University of Toronto

Verified email at cs.toronto.edu

Computer Architecture, Systems, Systems for ML, Machine Learning


Title	Authors	Venue	Cited by	Year
MLPerf inference benchmark	VJ Reddi, C Cheng, D Kanter, P Mattson, G Schmuelling, CJ Wu, ...	2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020	585	2020
RowClone: fast and energy-efficient in-DRAM bulk data copy and initialization	V Seshadri, Y Kim, C Fallin, D Lee, R Ausavarungnirun, G Pekhimenko, ...	Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013	552	2013
Base-delta-immediate compression: practical data compression for on-chip caches	G Pekhimenko, V Seshadri, O Mutlu, PB Gibbons, MA Kozuch, TC Mowry	Proceedings of the 21st international conference on Parallel architectures …, 2012	519	2012
MLPerf Training Benchmark	P Mattson, C Cheng, C Coleman, G Diamos, P Micikevicius, D Patterson, ...	Proceedings of Machine Learning and Systems 2020, 336-349, 2020	365	2020
Adaptive-Latency DRAM: Optimizing DRAM Timing for the Common-Case	D Lee, Y Kim, G Pekhimenko, S Khan, V Seshadri, K Chang, O Mutlu	High Performance Computer Architecture (HPCA), 2015 IEEE 21st International …, 2015	278	2015
Understanding latency variation in modern DRAM chips: Experimental characterization, analysis, and optimization	KK Chang, A Kashyap, H Hassan, S Ghose, K Hsieh, D Lee, T Li, ...	Proceedings of the 2016 ACM SIGMETRICS International Conference on …, 2016	252	2016
Benchmarking and analyzing deep neural network training	H Zhu, M Akrout, B Zheng, A Pelegris, A Jayarajan, A Phanishayee, ...	2018 IEEE International Symposium on Workload Characterization (IISWC), 88-100, 2018	242	2018
Linearly compressed pages: a low-complexity, low-latency main memory compression framework	G Pekhimenko, V Seshadri, Y Kim, H Xin, O Mutlu, PB Gibbons, ...	Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013	208	2013
Priority-based parameter propagation for distributed DNN training	A Jayarajan, J Wei, G Gibson, A Fedorova, G Pekhimenko	Proceedings of Machine Learning and Systems 2019, 2019	197	2019
ChargeCache: Reducing DRAM latency by exploiting row access locality	H Hassan, G Pekhimenko, N Vijaykumar, V Seshadri, D Lee, O Ergin, ...	2016 IEEE International Symposium on High Performance Computer Architecture …, 2016	190	2016
Simultaneous multi-layer access: Improving 3D-stacked memory bandwidth at low cost	D Lee, S Ghose, G Pekhimenko, S Khan, O Mutlu	ACM Transactions on Architecture and Code Optimization (TACO) 12 (4), 1-29, 2016	184	2016
Gist: Efficient data encoding for deep neural network training	A Jain, A Phanishayee, J Mars, L Tang, G Pekhimenko	2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018	182	2018
Design-induced latency variation in modern DRAM chips: Characterization, analysis, and latency reduction mechanisms	D Lee, S Khan, L Subramanian, S Ghose, R Ausavarungnirun, ...	Proceedings of the ACM on Measurement and Analysis of Computing Systems 1 (1 …, 2017	160	2017
SoftMC: A flexible and practical open-source infrastructure for enabling experimental DRAM studies	H Hassan, N Vijaykumar, S Khan, S Ghose, K Chang, G Pekhimenko, ...	2017 IEEE International Symposium on High Performance Computer Architecture …, 2017	155	2017
A case for core-assisted bottleneck acceleration in GPUs: enabling flexible data compression with assist warps	N Vijaykumar, G Pekhimenko, A Jog, A Bhowmick, R Ausavarungnirun, ...	ACM SIGARCH Computer Architecture News 43 (3S), 41-53, 2015	145	2015
MLPerf inference benchmark. In 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA)	VJ Reddi, C Cheng, D Kanter, P Mattson, G Schmuelling, CJ Wu, ...	IEEE, 2020	111	2020
Shifted Hamming distance: a fast and accurate SIMD-friendly filter to accelerate alignment verification in read mapping	H Xin, J Greth, J Emmons, G Pekhimenko, C Kingsford, C Alkan, O Mutlu	Bioinformatics 31 (10), 1553-1560, 2015	107*	2015
Federated benchmarking of medical artificial intelligence with MedPerf	A Karargyris, R Umeton, MJ Sheller, A Aristizabal, J George, A Wuest, ...	Nature Machine Intelligence 5 (7), 799-810, 2023	103	2023
RFVP: Rollback-free value prediction with safe-to-approximate loads	A Yazdanbakhsh, G Pekhimenko, B Thwaites, H Esmaeilzadeh, O Mutlu, ...	ACM Transactions on Architecture and Code Optimization (TACO) 12 (4), 1-26, 2016	101	2016
StreamBox: Modern Stream Processing on a Multicore Machine	H Miao, H Park, M Jeon, G Pekhimenko, KS McKinley, FX Lin	2017 USENIX Annual Technical Conference (USENIX ATC 17), 617-629, 2017	98	2017
