ePrints.FRI - University of Ljubljana, Faculty of Computer and Information Science

Monte Carlo Tree Search with Thompson sampling in The Settlers of Catan

Katja Tuma (2016) Monte Carlo Tree Search with Thompson sampling in The Settlers of Catan. MSc thesis.

    Abstract

    Monte Carlo Tree Search (MCTS) is a popular method for building strong computer game-playing agents in Artificial Intelligence without any prior domain knowledge. A literature review identifies and presents the strongest and most popular algorithms for handling the so-called exploration vs. exploitation dilemma in Multi-armed Bandit (MAB) problems. Empirical studies comparing the performance of Thompson sampling (TS) with the state-of-the-art Upper Confidence Bound (UCB) approach in the classical MAB problem support our modified tree policy in MCTS. The domain of application is the board game The Settlers of Catan (SoC), implemented as a multi-agent environment in the programming language C, along with an MCTS-UCT agent, an MCTS-TS agent and two strategy-playing agents, namely the ore-grain agent and the wood-clay agent. Performance measurements of these agents, presented and discussed in this work, show that the agent with the modified tree policy performs better than the state-of-the-art approach (UCT).

    Item Type: Thesis (MSc thesis)
    Keywords: Monte Carlo Tree Search (MCTS), Multi-armed Bandits (MAB), Upper Confidence Bound for Trees (UCT), Thompson sampling (TS), Artificial Intelligence (AI), the Settlers of Catan (SoC)
    Number of Pages: 44
    Language of Content: English
    Mentor / Comentors:
    Name and Surname              ID     Function
    prof. dr. Branko Šter         283    Mentor
    prof. dr. Bengt J. Nilsson           Comentor
    Link to COBISS: http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1537159107)
    Institution: University of Ljubljana
    Department: Faculty of Computer and Information Science
    Item ID: 3504
    Date Deposited: 05 Sep 2016 17:03
    Last Modified: 30 Sep 2016 13:18
    URI: http://eprints.fri.uni-lj.si/id/eprint/3504
