5617 - 5628 of 20071 Results
Format results
Zap Q-learning with Nonlinear Function Approximation
Sean Meyn (University of Florida)Special Topics in Astrophysics - Numerical Hydrodynamics - Lecture 22
Daniel Siegel University of Greifswald
Uniform Offline Policy Evaluation and Offline Learning in Tabular RL
Yu-Xiang Wang (UC Santa Barbara)Testing Gauge Gravity duality with Matrix models
Denjoe O'Connor Dublin Institute For Advanced Studies
Batch Value-function Approximation with Only Realizability
Nan Jiang (University of Illinois at Urbana-Champaign)Thermal Dark Sectors in the Early and Late Universe
Linda Xu Harvard University
Monte Carlo Sampling Approach to Solving Stochastic Multistage Programs
Alex Shapiro (Georgia Tech)Confident Off-policy Evaluation and Selection through Self-Normalized Importance Weighting
Ilja Kuzborskij (DeepMind)Decoherent Quench Across Quantum Phase Transitions
Yi-Zhuang You University of California, San Diego