Batch Policy Learning in Average Reward Markov Decision Processes
APA
(2020). Batch Policy Learning in Average Reward Markov Decision Processes. The Simons Institute for the Theory of Computing. https://simons.berkeley.edu/talks/tbd-247
MLA
Batch Policy Learning in Average Reward Markov Decision Processes. The Simons Institute for the Theory of Computing, Dec. 03, 2020, https://simons.berkeley.edu/talks/tbd-247
BibTex
@misc{ scivideos_16828, doi = {}, url = {https://simons.berkeley.edu/talks/tbd-247}, author = {}, keywords = {}, language = {en}, title = {Batch Policy Learning in Average Reward Markov Decision Processes}, publisher = {The Simons Institute for the Theory of Computing}, year = {2020}, month = {dec}, note = {16828 see, \url{https://scivideos.org/index.php/Simons-Institute/16828}} }
Peng Liao (Harvard)
Talk number16828
Source RepositorySimons Institute
Subject