Marek Petrik

Cited by

	All	Since 2019
Citations	2080	1327
h-index	23	19
i10-index	46	36

380

190

285

20062007200820092010201120122013201420152016201720182019202020212022202320246 14 20 46 50 45 45 48 53 52 57 71 107 170 188 223 264 361 120

Public access

View all

23 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shlomo ZilbersteinProfessor of Computer Science, University of Massachusetts AmherstVerified email at cs.umass.edu
Mohammad GhavamzadehAmazonVerified email at amazon.com
Dharmashankar SubramanianPrincipal Research Staff Member/Manager, IBM ResearchVerified email at us.ibm.com
Reazul Hasan RusselResearch Scientist at MetaVerified email at wildcats.unh.edu
Chin Pang HoCity University of Hong KongVerified email at cityu.edu.hk
Bahram BehzadianMetaVerified email at meta.com
Sridhar MahadevanDirector, Data Science Lab, Adobe Research & Professor, University of Massachusetts, AmherstVerified email at cs.umass.edu
Sechan OhMoloco, Previously at IBM, StanfordVerified email at molocoads.com
Ji LiuMetaVerified email at meta.com
Bo LiuAAAI SM, IEEE SMVerified email at cs.umass.edu
Wolfram WiesemannProfessor of Analytics and Operations, Imperial College Business SchoolVerified email at imperial.ac.uk
Yinlam ChowResearch Scientist, Google ResearchVerified email at google.com
Daniel S. BrownAssistant Professor, Robotics Center and School of Computing, University of UtahVerified email at cs.utah.edu
Amit DhurandharPrincipal Research Scientist, IBMVerified email at us.ibm.com
Elita LoboUniversity of Massachusetts AmherstVerified email at cs.umass.edu
Adam N. ElmachtoubColumbia University, Dept. of Industrial Engineering and Operations ResearchVerified email at ieor.columbia.edu
Wheeler RumlUniversity of New HampshireVerified email at cs.unh.edu
Dan A. IancuAssociate Professor of Operations, Information and Technology, Stanford UniversityVerified email at stanford.edu
Prateek JainThe TradedeskVerified email at prateekjain.name
Gavin TaylorUS Naval AcademyVerified email at usna.edu

Marek Petrik

University of New Hampshire

Verified email at cs.unh.edu - Homepage

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
An approximate solution method for large risk-averse Markov decision processes M Petrik, D Subramanian arXiv preprint arXiv:1210.4901, 2012	229	2012
Finite-sample analysis of proximal gradient td algorithms B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik arXiv preprint arXiv:2006.14364, 2020	172	2020
Safe policy improvement by minimizing robust baseline regret M Ghavamzadeh, M Petrik, Y Chow Advances in Neural Information Processing Systems 29, 2016	146	2016
Feature selection using regularization in approximate linear programs for Markov decision processes M Petrik, G Taylor, R Parr, S Zilberstein arXiv preprint arXiv:1005.1860, 2010	89	2010
An Analysis of Laplacian Methods for Value Function Approximation in MDPs. M Petrik IJCAI, 2574-2579, 2007	87	2007
Biasing approximate dynamic programming with a lower discount factor M Petrik, B Scherrer Advances in neural information processing systems 21, 2008	70	2008
Beyond confidence regions: Tight bayesian ambiguity sets for robust mdps M Petrik, RH Russel Advances in neural information processing systems 32, 2019	61	2019
Fast Bellman updates for robust MDPs CP Ho, M Petrik, W Wiesemann International Conference on Machine Learning, 1979-1988, 2018	61	2018
Learning parallel portfolios of algorithms M Petrik, S Zilberstein Annals of Mathematics and Artificial Intelligence 48, 85-106, 2006	61	2006
A practical method for solving contextual bandit problems using decision trees AN Elmachtoub, R McNellis, S Oh, M Petrik arXiv preprint arXiv:1706.04687, 2017	60	2017
Partial policy iteration for l1-robust markov decision processes CP Ho, M Petrik, W Wiesemann Journal of Machine Learning Research 22 (275), 1-46, 2021	47	2021
Tight approximations of dynamic risk measures DA Iancu, M Petrik, D Subramanian Mathematics of Operations Research 40 (3), 655-682, 2015	46	2015
A bilinear programming approach for multiagent planning M Petrik, S Zilberstein Journal of Artificial Intelligence Research 35, 235-274, 2009	46	2009
Constraint relaxation in approximate linear programs M Petrik, S Zilberstein Proceedings of the 26th Annual International Conference on Machine Learning …, 2009	46	2009
RAAM: The benefits of robustness in approximating aggregated MDPs in reinforcement learning M Petrik, D Subramanian Advances in Neural Information Processing Systems 27, 2014	43	2014
Average-Reward Decentralized Markov Decision Processes. M Petrik, S Zilberstein IJCAI, 1997-2002, 2007	40	2007
Bayesian robust optimization for imitation learning D Brown, S Niekum, M Petrik Advances in Neural Information Processing Systems 33, 2479-2491, 2020	36	2020
Anytime coordination using separable bilinear programs M Petrik, S Zilberstein PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE 22 (1), 750, 2007	33	2007
Proximal Gradient Temporal Difference Learning Algorithms. B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik IJCAI, 4195-4199, 2016	30	2016
Social media and customer behavior analytics for personalized customer engagements S Buckley, M Ettl, P Jain, R Luss, M Petrik, RK Ravi, C Venkatramani IBM Journal of Research and Development 58 (5/6), 7: 1-7: 12, 2014	29	2014

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors