Karin Frlic (2019) Using Monte Carlo tree search and machine learning to learn a heuristic function. EngD thesis.
Abstract
Minimax algorithm is one of the most widely used algorithms for playing two-player games. It uses a heuristic function that estimates the benefits of reaching a given game state for both players. In this bachelor thesis we attempt to automatically construct that kind of a function for the game of Hex. Different models of supervised machine learning are trained on learning samples, generated by simulations of MCTS. As a result, the player that uses minimax with α-β and the learnt function performs worse than the player that uses pure MCTS. However, the player combining advantages of both players achieves better results than MCTS.
Item Type: | Thesis (EngD thesis) |
Keywords: | Monte Carlo tree search, supervised machine learning, minimax algorithm, heuristic evaluation function, alpha-beta pruning, the game of Hex |
Number of Pages: | 59 |
Language of Content: | Slovenian |
Mentor / Comentors: | Name and Surname | ID | Function |
---|
doc. dr. Aleksander Sadikov | 934 | Mentor |
|
Link to COBISS: | http://www.cobiss.si/scripts/cobiss?command=search&base=51012&select=(ID=1538106051) |
Institution: | University of Ljubljana |
Department: | Faculty of Computer and Information Science |
Item ID: | 4337 |
Date Deposited: | 15 Jan 2019 11:26 |
Last Modified: | 24 Jan 2019 11:08 |
URI: | http://eprints.fri.uni-lj.si/id/eprint/4337 |
---|
Actions (login required)