Comments (6)
After a quick look through the AlphaZero code, it wasn't clear to me whether this is supported. Maybe?
@tewalds, do you remember if you supported this case?
from open_spiel.
The MCTS implementation does, but I'm pretty sure you can't in AlphaZero, since it adds noise to the first layer of nodes (the root's children), which distorts the shape of the tree. If you were to try to reuse the tree, you'd need to either not add that noise or add it at all layers. You're welcome to try that, but it's different from the canonical AlphaZero implementation.
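To make the noise point concrete, here is a minimal sketch in plain Python. It is not OpenSpiel's code: `Node`, `expand`, `add_root_noise`, the prior values, and the `epsilon` and `alpha` settings are all hypothetical names and numbers chosen for illustration. It only shows that a sub-tree promoted to root carries priors that never had noise mixed in, whereas a freshly built root would.

```python
import random


def dirichlet(alpha, n, rng):
    """Sample an n-dimensional symmetric Dirichlet(alpha) vector (stdlib only)."""
    draws = [rng.gammavariate(alpha, 1.0) for _ in range(n)]
    total = sum(draws)
    return [d / total for d in draws]


def add_root_noise(priors, epsilon, alpha, rng):
    """Mix Dirichlet noise into the priors of the root's children."""
    noise = dirichlet(alpha, len(priors), rng)
    return [(1 - epsilon) * p + epsilon * n for p, n in zip(priors, noise)]


class Node:
    """Hypothetical search node: a prior plus children keyed by action."""

    def __init__(self, prior):
        self.prior = prior
        self.children = {}


def expand(node, priors):
    """Create one child per action with the given prior."""
    node.children = {action: Node(p) for action, p in enumerate(priors)}


rng = random.Random(0)
network_priors = [0.7, 0.2, 0.1]  # assumed network output, illustration only

# First search: only the root's children get noisy priors.
root = Node(prior=1.0)
expand(root, add_root_noise(network_priors, epsilon=0.25, alpha=0.3, rng=rng))

# A deeper node is expanded with the clean network priors.
child = root.children[0]
expand(child, network_priors)

# Tree reuse: after action 0 is played, `child` would become the new root,
# but its children were expanded without noise.
new_root = child
reused_priors = [c.prior for c in new_root.children.values()]

# A search started from scratch would have mixed noise in at this layer.
fresh_priors = add_root_noise(network_priors, epsilon=0.25, alpha=0.3, rng=rng)

print("reused:", reused_priors)
print("fresh: ", fresh_priors)
```

The reused priors are exactly the network's output, while the fresh ones are perturbed, so visit counts gathered under one set would not match a search run under the other. That mismatch is why reuse would mean either dropping the noise or applying it at every layer.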
Is the Python version of AlphaZero as fast as the C++ version?
How can I reuse the sub-tree during evaluation or when playing against a human?
The Python AlphaZero is not as fast as the C++ version.
Yes, you can play against AlphaZero checkpoints from the keyboard; see here: https://github.com/google-deepmind/open_spiel/blob/master/open_spiel/python/examples/alpha_zero.py
(Apologies for the late replies -- I missed your follow-up questions.)