Niladri Chatterji
Home
Publications
Contact
On the theory of reinforcement learning with once-per-episode feedback
Niladri S. Chatterji*
,
Aldo Pacchiano*
,
Peter L. Bartlett
,
Michael I. Jordan
May 2021
PDF
Type
Preprint
Cite
×