A large-scale study on the impact of behavioral metric learning in deep RL, conceptually unifying and evaluating recent methods spanning 370 tasks with diverse noise settings.
A large-scale study on the impact of behavioral metric learning in deep RL, conceptually unifying and evaluating recent methods spanning 370 tasks with diverse noise settings.
Click to view the blog. An 80+ page comprehensive overview of HRL, focusing on methods for discovering temporal structure and the key benefits it provides.
Click to view the blog. An 80+ page comprehensive overview of HRL, focusing on methods for discovering temporal structure and the key benefits it provides.
Introduce RL to human mobility prediction to address the key challenges in this task.
During my Microsoft internship, we explored an RL-based prototype for match plan generation, which outperformed hand-crafted match plans tuned by experts for years, and was later integrated into Bing.