Policy Optimization with Learning and Planning in Continuous Spaces

Planning algorithms have shown impressive performance in many domains, such as chess and Go. In particular, Monte Carlo ...