Ray Luo

Ray Luo

Ph.D. Candidate

Mila, McGill University

Biography

Hi there! I’m Ziyan “Ray” Luo, a Ph.D. candidate at Mila, McGill, fortunate to work with (and learn from) Dr. Xujie Si, Dr. Doina Precup, and my talented colleagues, who continuously inspire me.

🌟 Research Interest: RL & Abstraction

I am broadly interested in Reinforcement Learning (RL), currently with a focus on representation learning—developing compact, structured, and agent-centric encodings that support sample-efficient learning, generalization, and planning. As RL scales to complex domains like robotics, such representations become essential for tractable and robust decision-making.

A central theme in my research is understanding how abstraction can serve as a foundation for representation learning. Inspired by early work in formal verification, which emphasized abstraction as a tool for managing complexity through rigorous equivalence and refinement, I explore principles such as behavioral metrics for state abstraction and temporal abstractions in hierarchical RL as tools to integrate formal structure with learning-based flexibility.

I am also inspired by the ethos of the formal methods community: their emphasis on precision, scientific rigor, and long-term research impact shapes the way I approach problems.

🎼🏸 Beyond the Lab: Music, Sports & More

When I’m not exploring algorithms, you may find me composing music that tells stories. Here’s my portfolio (some tracks even made it into popular video games!). I love blending electronic sounds with acoustic instruments to create immersive, theme-driven pieces.

Ball sports keep me energized: badminton, tennis, table tennis, billiards—you name it! And if you’re an animal lover, check out my Instagram for some furry friends.

Interests

  • Reinforcement Learning
  • Abstraction
  • Formal Verification

Education

  • Ph.D. in Computer Science, 2021

    Mila, McGill University

Experience

 
 
 
 
 

Ph.D. Candidate

Mila, McGill University

Sep 2021 – Present Montreal, Quebec
 
 
 
 
 

Research Intern

Microsoft Research Asia

Nov 2019 – Mar 2020 SRG, MSRA, Beijing, China

Match Plan Generation in Web Search with Parameterized Action Reinforcement Learning

  • Design DRL algorithms for Inverted Index Match Plan Generation, replacing current hand-crafted heuristics in the query process of Bing
 
 
 
 
 

Research Assistant

Tsinghua University

Mar 2019 – Jan 2021 Beijing

Spatio-temporal Trajectory Prediction

  • Supervised by Jilong Wang (chair of APAN) and Congcong Miao
  • ACN: accepted by WSDM'20 (top 7%, oral)
  • RLMob: accepted by WSDM'22

Selected Publications

Click a title to read the story behind the work; find more on Google Scholar.