This event has passed.

Spring 2025 GRASP SFI: Qinghua Liu, Microsoft Research, “When Is Partially Observable Reinforcement Learning Not Scary?”

Name: Spring 2025 GRASP SFI: Qinghua Liu, Microsoft Research, “When Is Partially Observable Reinforcement Learning Not Scary?”
Start: 2025-02-19T15:00:00-05:00
End: 2025-02-19T16:00:00-05:00
Location: Levine 307

February 19 @ 3:00 pm - 4:00 pm

This was a hybrid event with in-person attendance in Levine 307 and virtual attendance…

ABSTRACT

Partial observability is ubiquitous in Reinforcement Learning (RL) applications, where agents must make sequential decisions despite lacking complete information about the latent states of the controlled system. Partially observable RL is notoriously challenging in theory—well-known information-theoretic results show that learning partially observable Markov decision processes (POMDPs) requires an exponential number of samples in the worst case. However, this does not rule out the existence of interesting subclasses of POMDPs that encompass a diverse set of practical applications while remaining tractable.

In this talk, we identify a rich family of tractable POMDPs, which we call weakly revealing POMDPs. This family excludes pathological cases where observations provide so little information that learning becomes infeasible. We prove that for weakly revealing POMDPs, a simple algorithm combining optimism and Maximum Likelihood Estimation (MLE) is sufficient to guarantee polynomial sample complexity. Finally, we discuss the practical implications of this theory, including strategies for collecting samples in partially observable tasks and the limitations of purely model-free algorithms.

Presenter

Qinghua Liu

Qinghua Liu is a postdoctoral researcher at Microsoft Research in New York City. He earned his Ph.D. from Princeton University advised by Chi Jin. His previous research primarily focused on RL theory, while his recent works explore improving the reasoning capabilities of language models.

Details

Date:: February 19
Time:: 3:00 pm - 4:00 pm
Event Category:: Seminars

Venue

Levine 307

3330 Walnut St
Philadelphia, PA 19104 United States + Google Map

Spring 2025 GRASP SFI: Qinghua Liu, Microsoft Research, “When Is Partially Observable Reinforcement Learning Not Scary?”

February 19 @ 3:00 pm - 4:00 pm

ABSTRACT

Presenter

Details

Venue

Related Events

Spring 2025 GRASP SFI: Haimin Hu, Princeton University, “From Gambits to Assurances: Game-Theoretic Integration of Safety and Learning for Human-Centered Robotics”

Spring 2025 GRASP on Robotics: Phillip Isola, Massachusetts Institute of Technology, “Robots and Artificial Life from Visual Foundation Models”

Spring 2025 Robotics MSE Thesis and Capstone Lightning Talks and Poster Session