Parameterized Action Soft Actor-Critic

Match Plan Generation in Web Search with Parameterized Action Reinforcement Learning

To achieve good result quality and short query response time, search engines use specific match plans on Inverted Index to help retrieve a small set of relevant documents from billions of web pages. A match plan is composed of a sequence of match …