RL

Understanding Behavioral Metric Learning

A large-scale study of the impact of behavioral metric learning in deep RL, conceptually unifying recent methods and evaluating them across 370 tasks with diverse noise settings.

Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning

An 80+ page comprehensive overview of HRL, focusing on methods for discovering temporal structure and the key benefits it provides.

RLMob - Deep Reinforcement Learning for Successive Mobility Prediction

Introduces RL to human mobility prediction to address the key challenges of the task.

Match Plan Generation in Web Search with Parameterized Action Reinforcement Learning

During my Microsoft internship, we explored an RL-based prototype for match plan generation, which outperformed hand-crafted match plans tuned by experts for years, and was later integrated into Bing.