Tom Vodopivec (2017) Monte Carlo tree search strategies. EngD thesis.
Tom Vodopivec (2011) Reinforcement learning on the cart-pole problem. EngD thesis.