Monte Carlo Tree Search (MCTS) in Python

This project offers a Python module with an extensible implementation of MCTS (Monte Carlo Tree Search).

Description

This MCTS implementation can be used to find the best action for a single-player problem or a multi-player competitive turn-based game with discrete actions.

Typical examples are two-player board games such as: checkers or chess. Ot other 2-player games such as connect-4 or tic-tac-toe. Games with more than 2 players are also supported.

Features

Supports from 1 to multiple players
Automatic random rollout policy
Rollout policy can be replaced by custom implementation

Using

To use this module:

Copy "py_mcts" folder to your project's folder.
In your code, import py_mcts. You will use ProblemState and MCTS classes.
Define a new class to represent your problem state, as a subclass of ProblemState. Implement these methods:
- get_player(self) -> in non-terminal states, returns the next player to move (use any data type to identify the players, players will be compared with==); in terminal states, should return None
- get_valid_actions(self) -> return a list or any iterable object of actions (each represented by any type you want)
- is_terminal(self) -> returns boolean to indicate if it is a terminal state
- game_result(self, player) -> used in terminal states, indicates the score for any given player
- move(self, action) -> call this method to indicate that get_player() has performed the given action; should return a new "state" (instance of the same class)
Instantiate you MCTS solver. In the constructor:
- you may set a custom rollout_policy if you want (default: random policy)
- you may also set the c parameter of the tree policy (default: 1.41)
Then, whenever you need to find a "good" (quasi-optimal) action in a given state of your game:
- instantiate your class with the current state information
- in your MCTS solver call choose_action(state, duration) with the current state and with the desired duration in seconds
- it will run for the desired time, then it will return the recommended action!

Example

This project includes an example of the implementation of an AI for tic-tac-toe, which you can play against.

pablo-sampaio / python_mcts Goto Github PK

python_mcts's Introduction

Monte Carlo Tree Search (MCTS) in Python

Description

Features

Using

Example

python_mcts's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent