Parham Mohammad Panahi

I am a machine learning and reinforcement learning researcher and incoming PhD student at the University of Alberta, working with Adam White. I am also affiliated with the RLAI lab and the Alberta Machine Intelligence Institute (amii). Before that, I graduated with a M.Sc. in Computing Science from the University of Alberta in 2024.

Email  /  CV  /  Scholar  /  Github  /  M.Sc. Thesis

profile photo

Research


Keywords: Machine Learning, Model-Based Reinforcement Learning, Representation Learning, Continual Learning, and Generative Models.

I am broadly interested in machine learning and reinforcement learning. My research goal is to create efficient and reliable learning systems capable of continual interaction and decision-making in a changing world.

Position: Lifetime tuning is incompatible with continual reinforcement learning
Golnaz Mesbahi, Parham Mohammad Panahi, Olya Mastikhina, Steven Tang, Martha White, Adam White
Forty-second International Conference on Machine Learning (ICML), 2025

We argue and demonstrate the pitfalls of lifetime tuning in continual reinforcement learning, explain why recent progress in continual RL has been mixed, and motivate the development of empirical practices that better match the goals of continual RL.

Investigating the Interplay of Prioritized Replay and Generalization
Parham Mohammad Panahi, Andrew Patterson, Martha White, Adam White
Reinforcement Learning Conference (RLC), 2024

We present insight into the interaction between prioritization, bootstrapping, and neural networks and propose several improvements for prioritized replay in tabular settings and noisy domains.

Goal-Space Planning with Subgoal Models
*Chunlok Lo, *Kevin Roice, *Parham Mohammad Panahi, Scott M. Jordan, Adam White, Gabor Mihucz, Farzane Aminmansour, Martha White
Journal of Machine Learning Research (JMLR), 2024

We constrain background planning to a given set of (abstract) subgoals and learning only local, subgoal-conditioned models to avoid compounding model error. Also appeared as an oral presentation at Planning and Reinforcement Learning Workshop at ICAPS 2024.

* indicates equal contribution

Recorded Talks

Experience Bottleneck and how it shapes our Reinforcement Learning algorithms

Tea Time Talks 2025, University of Alberta. (Recording available soon!)

Experience Selection in Deep RL

Tea Time Talks 2024, University of Alberta.