ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Reinforcement learning on the cart-pole problem

Tom Vodopivec (2011) Reinforcement learning on the cart-pole problem. EngD thesis.

    Abstract

    The main goal of this thesis was to implement and evaluate two types of reinforcement learning algorithms on a computer-simulated control problem. Reinforcement learning is a branch of machine learning that combines principles of dynamic programming and supervised learning to solve problems. As the benchmark system we chose the cart-pole control problem, which is widely used in this field for testing the efficiency of learning algorithms. From the reinforcement learning methods we chose two temporal difference learning algorithms; this type of learning combines methods of dynamic programming with Monte Carlo methods. The first algorithm is Q-learning; the second is an actor-critic algorithm known as learning by associative search element and adaptive critic element. To achieve this goal, we developed a computer application for experimentally testing simulated learning on a benchmark system, aiming to make the tool as modular and reusable as possible. We defined a different method of performance evaluation and used it to evaluate both learning algorithms over a wide range of simulation parameters. We also measured the computational performance of both algorithms.
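
To illustrate how a temporal difference method such as Q-learning learns from sampled transitions while bootstrapping from its own value estimates, here is a minimal sketch of one tabular update. It is not taken from the thesis: the function name `q_learning_update`, the state and action labels, and the values of `alpha` (learning rate) and `gamma` (discount factor) are assumptions chosen for illustration, and a cart-pole state would first have to be discretized to be used as a table key.

```python
from collections import defaultdict


def q_learning_update(q, state, action, reward, next_state, actions,
                      alpha=0.5, gamma=0.9):
    """Apply one tabular Q-learning step in place and return the TD error.

    q maps (state, action) pairs to value estimates. alpha and gamma are
    illustrative defaults, not values from the thesis.
    """
    # Bootstrapped estimate of the best value reachable from next_state.
    best_next = max(q[(next_state, a)] for a in actions)
    # Temporal difference error: sampled target minus current estimate.
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return td_error


# Hypothetical two-action problem with integer state labels.
q = defaultdict(float)
actions = ("left", "right")
td_first = q_learning_update(q, state=0, action="right", reward=1.0,
                             next_state=1, actions=actions)
td_second = q_learning_update(q, state=1, action="left", reward=0.0,
                              next_state=0, actions=actions)
```

The second call shows the bootstrapping step: its target uses the estimate for state 0 that the first call has just updated, rather than waiting for the end of an episode.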

    Item Type: Thesis (EngD thesis)
    Keywords: reinforcement learning, cart-pole control problem, algorithm evaluation, performance evaluation method
    Number of Pages: 75
    Language of Content: Slovenian
    Mentor / Comentors:
    Name and Surname         ID     Function
    prof. dr. Branko Šter    283    Mentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=50070&select=(ID=00008630356)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 1517
    Date Deposited: 20 Sep 2011 12:59
    Last Modified: 26 Sep 2011 17:47
    URI: http://eprints.fri.uni-lj.si/id/eprint/1517
