This repository contains code for the paper "Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning". https://arxiv.org/abs/2210.03022
Add support for a CNN architecture for all implemented algorithms in the code base. Appropriate reshaping might be needed; in that case, make sure runner.py is compatible with it as well.
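A minimal sketch of what such an encoder could look like, assuming PyTorch; the class name, layer sizes, and the channels-last heuristic are illustrative, not the final design:

```python
import torch
import torch.nn as nn

class CNNEncoder(nn.Module):
    """Illustrative CNN encoder; layer sizes are placeholders."""

    def __init__(self, in_channels: int, hidden_dim: int = 64):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=3, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2), nn.ReLU(),
            nn.Flatten(),
        )
        self.proj = nn.LazyLinear(hidden_dim)  # infers the flattened size on first call

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        # runner.py may deliver observations as (batch, H, W, C);
        # this heuristic reshapes channels-last inputs to NCHW.
        if obs.dim() == 4 and obs.shape[-1] in (1, 3):
            obs = obs.permute(0, 3, 1, 2)
        return self.proj(self.conv(obs))
```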
We still need to implement additional PPO training details to get the full performance out of IPPO and MAPPO [1,2]. The following should be implemented:
Feature Pruning: Form a state by concatenating the environment-provided global state with the agent's local observation, then prune out redundant information. This is highly environment-specific, so we might need to change the obs_to_state_wrapper to account for it. No change is needed elsewhere.
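A rough sketch of the kind of pruning the wrapper could do; the function name, the index table, and the environment key are hypothetical placeholders, since which dimensions are redundant depends entirely on the environment:

```python
import numpy as np

# Hypothetical per-environment index sets of redundant state dimensions.
REDUNDANT_STATE_DIMS = {"some_env": np.array([0, 1])}  # placeholder

def obs_to_state(global_state: np.ndarray, local_obs: np.ndarray, env_name: str) -> np.ndarray:
    """Concatenate the global state with the agent's local observation,
    then drop dimensions that duplicate information already present."""
    state = np.concatenate([global_state, local_obs], axis=-1)
    drop = REDUNDANT_STATE_DIMS.get(env_name)
    if drop is not None:
        state = np.delete(state, drop, axis=-1)
    return state
```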
Value Normalization: Regress the value network output toward normalized value targets. This was found to help training significantly for MAPPO.
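A simplified sketch of a running-statistics value normalizer (the MAPPO paper uses a PopArt-style normalizer; this version only tracks a running mean and variance, and all names are illustrative). The critic is trained against normalize(targets), and its output is passed through denormalize before computing advantages:

```python
import torch

class RunningValueNorm:
    """Track running mean/var of value targets; regress the critic to
    normalized targets and denormalize its output for advantage estimation."""

    def __init__(self, eps: float = 1e-5):
        self.mean, self.var, self.count = 0.0, 1.0, eps

    def update(self, targets: torch.Tensor) -> None:
        batch_mean = targets.mean().item()
        batch_var = targets.var(unbiased=False).item()
        batch_count = targets.numel()
        delta = batch_mean - self.mean
        tot = self.count + batch_count
        # Chan et al. parallel update of running mean and variance.
        self.mean += delta * batch_count / tot
        m_a = self.var * self.count
        m_b = batch_var * batch_count
        self.var = (m_a + m_b + delta ** 2 * self.count * batch_count / tot) / tot
        self.count = tot

    def normalize(self, x: torch.Tensor) -> torch.Tensor:
        return (x - self.mean) / (self.var ** 0.5 + 1e-8)

    def denormalize(self, x: torch.Tensor) -> torch.Tensor:
        return x * (self.var ** 0.5 + 1e-8) + self.mean
```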
Recurrent-MAPPO: MAPPO that operates with RNNs (e.g., a GRU) instead of simple MLPs.
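An illustrative GRU-based actor trunk, assuming PyTorch; the class name and layer sizes are placeholders, and the hidden state would need to be carried through rollouts and reset at episode boundaries:

```python
import torch
import torch.nn as nn

class RecurrentActor(nn.Module):
    """Illustrative GRU-based actor; replaces the MLP trunk with a recurrent core."""

    def __init__(self, obs_dim: int, n_actions: int, hidden_dim: int = 64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(obs_dim, hidden_dim), nn.ReLU())
        self.gru = nn.GRU(hidden_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, n_actions)

    def forward(self, obs_seq: torch.Tensor, h0=None):
        # obs_seq: (batch, time, obs_dim); h0: (1, batch, hidden_dim) or None
        x = self.encoder(obs_seq)
        x, h_n = self.gru(x, h0)
        return self.head(x), h_n  # per-step action logits plus final hidden state
```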
Frame stacking: Provide the agent with a stack of recent observations instead of only the current one.
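A minimal frame-stacking helper, assuming NumPy observations stacked along the last (channel) axis; the class name and padding strategy are illustrative:

```python
from collections import deque
import numpy as np

class FrameStack:
    """Keep the last k observations and expose them as one stacked array."""

    def __init__(self, k: int):
        self.k = k
        self.frames = deque(maxlen=k)

    def reset(self, obs: np.ndarray) -> np.ndarray:
        self.frames.clear()
        for _ in range(self.k):
            self.frames.append(obs)  # pad the stack with the first frame
        return np.concatenate(self.frames, axis=-1)

    def step(self, obs: np.ndarray) -> np.ndarray:
        self.frames.append(obs)
        return np.concatenate(self.frames, axis=-1)
```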
Integrate the MARLGrid environment into the codebase, including its coordination and heterogeneity levels.
Put all relevant files under the folder src/envs/marlgrid/. Once the environment code is ready, make sure it's callable from src/envs/__init__.py using the get_env function.
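A sketch of how the get_env dispatch could look; the MarlGridEnv class name, its constructor arguments, and the surrounding registry are assumptions about code that doesn't exist yet:

```python
# src/envs/__init__.py (sketch)
def get_env(name: str, **kwargs):
    if name == "marlgrid":
        # MarlGridEnv is a hypothetical entry point under src/envs/marlgrid/;
        # coordination/heterogeneity levels would be passed through kwargs.
        from src.envs.marlgrid import MarlGridEnv
        return MarlGridEnv(**kwargs)
    raise ValueError(f"Unknown environment: {name}")
```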