News & Events

Subscribe to email list

Please select the email list(s) to which you wish to subscribe.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Image CAPTCHA

Enter the characters shown in the image.

User menu

You are here

CANCELLED: Instance-dependent Reinforcement Learning: A statistical viewpoint

Wednesday, March 2, 2022 - 11:00 to 12:00
Koulik Khamaru, PhD student, Department of Statistics, University of California Berkeley
Statistics Seminar
Zoom

To join via Zoom: To join this seminar, please request Zoom connection details from headsec [at] stat.ubc.ca

Title: Instance-dependent Reinforcement Learning: A statistical viewpoint

Abstract: In recent years, there has been tremendous progress in the field of reinforcement learning (RL), especially on the empirical side. But it is fair to say that there is a considerable gap between theory and practice: many RL methods behave far better than existing worst-case theory would suggest, and often they work in settings where the current worst-case guarantees are completely prohibitive. In this talk, we will discuss why worst-case guarantees can severely overestimate the difficulty of reinforcement learning problems in presence of favorable structure. This motivates us to consider an instance-dependent difficulty measure that is responsive to the problem structure. Next, we discuss how we can construct estimators that adapt to this instance-dependent difficulty. We show that for problems with favorable structures our proposed estimators and associated confidence regions are significantly better than those obtained from the worst-case theory. Finally, we show that the techniques that we developed for constructing instance-dependent estimators are not specific to RL problems, and can be applied to a broad class of other problems.

***

Dear STAT news subscribers:

Please be advised that this seminar has been cancelled. We will reschedule the seminar in the near future. We sincerely apologize for any inconvenience.

Best wishes,

UBC Statistics Department