youjiazhang / alphago-zero-gobang Goto Github PK

View Code? Open in Web Editor NEW

69.0 2.0 8.0 13.55 MB

Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型，主要用以了解AlphaGo Zero的运行原理的Demo，即神经网络是如何指导MCTS做出决策的，以及如何自我对弈学习。源码+教程

License: MIT License

Python 100.00%

gobang gui mcts deep-learning residual-networks alphazero alphago tensorflow ai gomuku

alphago-zero-gobang's Introduction

AlphaGo-Zero-Gobang

Do you like to play Gobang ?
Do you want to know how AlphaGo Zero works ?
Check it out!

You can also read my Blog :)

View a Demo

这是一个基于强化学习的自我博弈模型，运行后的程序如下所示。

Quick Start

python3 MetaZeta.py

Train

我们构建了一个基于MCTS进行决策的 AI玩家，由残差神经网络辅助预测落子。

操作：点击 AI 自我对弈，在右上角点击 开始

Test

我们可以和训练有素的 AI玩家 对弈，以测试 AI 的下棋水平。

操作：点击 与 AI对战，在右上角点击 开始

Environment

Ubuntu 18.04.6 LTS
tensorflow-gpu==2.6.2

File Structure

filename	type	description
`TreeNode.py`	MCTS	nodes of the MCTS decision tree
`MCTS.py`	MCTS	Build MCTS decision tree
`AIplayer.py`	MCTS	Build an AI based on MCTS+NN
`Board.py`	Board	store board information
`Game.py`	Board	defines the game process for selfPlay and play-with-Human
`PolicyNN.py`	NN	constructs a residual neural network
`MetaZeta.py`	Main	GUI synthesis for all parties All in one

How it works (with code explanation)

3. MCTS ✨✨✨

然后，我们需要了解 AI 是如何做出决策的。他是如何积累下棋的知识，并利用学到的知识进行下棋的

alphago-zero-gobang's People

Contributors

Stargazers

Watchers

Forkers

licq328 xinglin-yu sakurai-nokiryuu jiwoongim gwstudy jaapin tudohuang wannasleep61c

alphago-zero-gobang's Issues

想问下作者windows环境中的配置是怎样的呢

我尝试了1.15.0版本的tensorflow和2.0.0版本的tensorflow，都会出现相应的问题，1.x版本不兼容2.0版本，2.0版本会出现一些1.x的属性，请问作者是怎么解决这个问题的呢？
我需要对代码做出哪些修改呢？
感谢作者！

Training & Performance Question and GPU requirement

Hi,

Thanks for sharing your implementation of AlphaGo Zero.
I was wondering how long does it take to train AlphaGo-zero (RL based) on 9X9? And how long does it get?
Could you share which GPUs did you train on?

Thanks

[Google translation version]
"你好，

感谢分享您对 AlphaGo Zero 的实现。
我想知道在 9X9 上训练 AlphaGo-zero（基于 RL）需要多长时间？它需要多长时间？
你能分享一下你在哪些 GPU 上进行训练吗？

谢谢"

How to enter your blog? That link has expired

怎样进入你的博客，那个链接已经失效了
How to enter your blog? That link has expired

用的tf版本也太老了。。。

ERROR: Could not find a version that satisfies the requirement tensorflow-gpu==2.6.2 (from versions: 2.12.0)
ERROR: No matching distribution found for tensorflow-gpu==2.6.2

Attempting to perform BLAS operation using StreamExecutor without BLAS support

需要自行配置tensorflow config

import tensorflow as tf
physical_devices = tf.config.list_physical_devices('GPU') 
tf.config.experimental.set_memory_growth(physical_devices[0], True)