3385 - 3396 of 18052 Results
Format results
- R. Srikant (University of Illinois at Urbana-Champaign)
Zap Q-learning with Nonlinear Function Approximation
Sean Meyn (University of Florida)Special Topics in Astrophysics - Numerical Hydrodynamics - Lecture 22
Daniel Siegel University of Greifswald
Uniform Offline Policy Evaluation and Offline Learning in Tabular RL
Yu-Xiang Wang (UC Santa Barbara)Testing Gauge Gravity duality with Matrix models
Denjoe O'Connor Dublin Institute for Advanced Studies
Batch Value-function Approximation with Only Realizability
Nan Jiang (University of Illinois at Urbana-Champaign)Thermal Dark Sectors in the Early and Late Universe
Linda Xu Harvard University
Monte Carlo Sampling Approach to Solving Stochastic Multistage Programs
Alex Shapiro (Georgia Tech)Confident Off-policy Evaluation and Selection through Self-Normalized Importance Weighting
Ilja Kuzborskij (DeepMind)