Reinforcement learning on the cart-pole problem

Tom Vodopivec (2011) Reinforcement learning on the cart-pole problem. EngD thesis.

Preview

Abstract

The main goal of this thesis was the evaluation and implementation of two types of reinforcement learning algorithms on a computer-simulated control problem. Reinforcement learning is a branch of machine learning which combines principles of dynamic programming and supervised learning for problem solving. For the benchmark system we chose the cart-pole control problem as it is widely used in this field for testing the efficiency of learning algorithms. Out of the reinforcement learning methods we chose two algorithms for temporal difference learning. This type of learning uses methods of dynamic programming and Monte Carlo methods. The first chosen algorithm is Q-learning, the second is an actor-critic algorithm which is called learning by associative search element and adaptive critic element. In the purpose of achieving our goal, we developed a computer application for the experimental testing of the simulation of learning on a benchmark system. Our aim was to make this tool as modular and reusable as possible. We defined a different method of performance evaluation which was used to evaluate both learning algorithms on a wide set of simulation parameters. We also measured the computational performance of both algorithms.

Item Type:

Thesis (EngD thesis)

Keywords:

reinforcement learning, cart-pole control problem, algorithm evaluation, performance evaluation method

Number of Pages:

Language of Content:

Slovenian

Mentor / Comentors:

Name and Surname	ID	Function
prof. dr. Branko Šter	283	Mentor

Link to COBISS:

http://www.cobiss.si/scripts/cobiss?command=search&base=50070&select=(ID=00008630356)

Institution:

University of Ljubljana

Department:

Faculty of Computer and Information Science

Item ID:

1517

Date Deposited:

20 Sep 2011 12:59

Last Modified:

26 Sep 2011 17:47

URI:

http://eprints.fri.uni-lj.si/id/eprint/1517

Actions (login required)

View Item