Comments (11)
Hello, @jack86666 May I know what ‘last game’ refers to? Is it the previous game data or the last state of a game?
from rlcard.
from rlcard.
from rlcard.
@jack86666 To start a new game, you can call env.init_game()
. Then call env.run()
, you will obtain the game data (trajectories) of one game and payoffs. Unfortunately, RLCard will not save the previous game data. If you want to access previous game data, you can simply store the trajectories into a list and access later. Thanks!
from rlcard.
from rlcard.
@jack86666 In this case, you may not use env.run
. Instead, you may use env.step
which will take one step, and return the next state and the next player. Then you can do whatever you want when the game is in progress.
See the definition of step:
Line 46 in b526e0e
You can also refer to the document to get a sense of the interface design:
https://github.com/datamllab/rlcard/blob/master/docs/high-level-design.md
from rlcard.
from rlcard.
from rlcard.
@jack86666 The action space of Dou Dizhu is too large, thus we have to make abstractions to make it feasible for RL algorithms. Although we have added some rules for kicker, the agent needs to learn how to make 309 decisions. This is still challenging.
from rlcard.
from rlcard.
from rlcard.
Related Issues (20)
- 牌谱 HOT 2
- Array has inhomogeneous shape when trying to run the pettingzoo example HOT 7
- Bridge hidden card rep error HOT 2
- About Dou Dizhu game state HOT 1
- Should PyPi be updating rlcard package? HOT 2
- Missing fields for checkpoint in DQNAgent ? HOT 9
- 如何让自己训练的AI和已有的AI较量呢 HOT 1
- How to replace the DMC operating device? HOT 5
- Doc search is broken HOT 1
- Env 中的多牌桌流程优化 数倍以上效率提升 HOT 3
- `last_landlord_action` and `last_teammate_action` in `DoudizhuEnv` are wrong HOT 1
- 执行run_rl.py报错 HOT 2
- Leakage of actions to the other agent
- how to train mahjong game? HOT 5
- nolimitholdem Flop下注順序錯誤? HOT 1
- Question about doudizhu action space. HOT 1
- calc reward? HOT 2
- How do I run it on a Mac?
- 训练只能占一个gpu吗
- How to save agent? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rlcard.