Radman Rakhshandehroo

Reinforcement Learning & AI

I am a Master of Management student at UBC, after a BSc in Biology and Computer Science (Combined Major).

I currently work on humanoid locomotion, training high-level controllers to take robust steps in challenging environments, with Dr. Michiel Van de Panne at UBC and Nick Ioannidis (PhD, SFU). Accepted to Canadian AI/CRV 2026 (Nectar track, Exploration Edge).

My undergraduate thesis, supervised by Dr. Daniel Coombs, used reinforcement learning to model human behaviour during epidemics. Published at TMLR 2026 and a spotlight at Canadian AI/CRV 2026 (paper).

I am interested in reinforcement learning, vision-action-language models and computer vision: building systems that tightly couple perception and action.

Radman Rakhshandehroo

Research

Exploring the intersection of artificial intelligence, robotics, and human behavior through computational approaches. My research focuses on developing adaptive systems that can learn, reason, and interact effectively in complex environments.

Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness

Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness

under review

Supervised Program for Alignment Research (SPAR)

An infra-Bayesian reinforcement learning agent that tracks a set of plausible world models instead of a single posterior, and selects actions by their worst-case expected value across that set.

Imitation-Free Diffusion Policy Training for Humanoid Footstep Planning

Imitation-Free Diffusion Policy Training for Humanoid Footstep Planning

accepted to Canadian AI/CRV 2026Nectar, Exploration Edge

MOCCA Lab, UBC Computer Science · Dr. Michiel Van de Panne

Developing robust control policies for humanoid robots to navigate challenging terrains using deep reinforcement learning techniques.

ContagionRL: A Flexible Platform for Learning in Different Spatial Epidemic Environments

ContagionRL: A Flexible Platform for Learning in Different Spatial Epidemic Environments

published at TMLR 2026spotlight at Canadian AI/CRV 2026

UBC Mathematics & Computer Science · Dr. Daniel Coombs

ContagionRL simulates human behavioral responses during epidemics using reinforcement learning, combining a spatial SIRS disease model with single-agent RL.

Entrepreneurship Education Research

Entrepreneurship Education Research

under review

UBC Computer Science and Sauder School of Business · Dr. Angele Beausoleil

An NLP-based system that automatically analyzes and maps entrepreneurship education programs and course syllabi against defined competency frameworks using zero-shot classification.

PILOT: Platformed Inteins: A linked orthogonal toolkit

PILOT: Platformed Inteins: A linked orthogonal toolkit

🥈 iGEM silver medal

UBC Life Sciences Institute (LSI) · Dr. Steven Halem

A modular intein-mediated cell-free protein synthesis platform with a self-aggregating solubility tag, enabling multisubunit peptide assembly and traceless purification in a Vibrio natriegens lysate.

Featured Projects

A selection of projects combining AI, software engineering, and data-driven systems: from interactive dashboards and research prototypes to robust web and mobile applications.

Blog

GitHub Activity

Connect

Feel free to contact me at rdmnr@protonmail.com