Partially observable multi-agent pathfinding (PO-MAPF) is a challenging problem which fundamentally differs from regular MAPF, in which a central controller is assumed to construct a joint plan for all agents before they start execution. PO-MAPF is intrisically decentralized and decision making (e.g. planning) here is interleaved with the execution. At each time step an agent receives a (local) observation of the environment and decides which action to take. The ultimate goal for the agents is to reach their goals while avoiding collisions with each other and the static obstacles.
POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can be tailored to a variety of PO-MAPF settings. Currently the (somewhat) standard setting is supported: agents can move between the cardinally-adjacent cells of the grid, each action (move or wait) takes one time step. No information sharing between the agents is happening.
POGEMA can generate random maps and start/goals locations for the agents. It also can take custom maps as the input.
Just install from PyPI:
pip install pogema
import gym
import pogema
env = gym.make("Pogema-8x8-hard-v0")
obs = env.reset()
done = [False, ...]
while not all(done):
# Use random policy to make actions
obs, reward, done, info = env.step([env.action_space.sample() for _ in range(len(obs))])
Environment | agents density | num agents | horizon |
---|---|---|---|
Pogema-8x8-easy-v0 | 2.2% | 1 | 64 |
Pogema-8x8-normal-v0 | 4.5% | 2 | 64 |
Pogema-8x8-hard-v0 | 8.9% | 4 | 64 |
Pogema-8x8-extra-hard-v0 | 17.8% | 8 | 64 |
Pogema-16x16-easy-v0 | 2.2% | 4 | 128 |
Pogema-16x16-normal-v0 | 4.5% | 8 | 128 |
Pogema-16x16-hard-v0 | 8.9% | 16 | 128 |
Pogema-16x16-extra-hard-v0 | 17.8% | 32 | 128 |
Pogema-32x32-easy-v0 | 2.2% | 16 | 256 |
Pogema-32x32-normal-v0 | 4.5% | 32 | 256 |
Pogema-32x32-hard-v0 | 8.9% | 64 | 256 |
Pogema-32x32-extra-hard-v0 | 17.8% | 128 | 256 |
Pogema-64x64-easy-v0 | 2.2% | 64 | 512 |
Pogema-64x64-normal-v0 | 4.5% | 128 | 512 |
Pogema-64x64-hard-v0 | 8.9% | 256 | 512 |
Pogema-64x64-extra-hard-v0 | 17.8% | 512 | 512 |
Pogema provides integrations with a range of MARL frameworks: PettingZoo, PyMARL and SampleFactory.
import gym
import pogema
# Create Pogema environment with PettingZoo interface
env = gym.make("Pogema-8x8-hard-v0", integration="PettingZoo")
import gym
import pogema
env = gym.make("Pogema-8x8-hard-v0", integration="PyMARL")
import gym
import pogema
env = gym.make("Pogema-8x8-hard-v0", integration="SampleFactory")
import gym
import pogema
# This interface is suitable only for
# single-agent partially observable pathfinding tasks
env = gym.make("Pogema-8x8-easy-v0", integration="SampleFactory")
import gym
from pogema import GridConfig
# Define random configuration
grid_config = GridConfig(num_agents=4, # number of agents
size=8, # size of the grid
density=0.4, # obstacle density
seed=1, # set to None for random
# obstacles, agents and targets
# positions at each reset
max_episode_steps=128, # horizon
obs_radius=3, # defines field of view
)
env = gym.make('Pogema-v0', grid_config=grid_config)
env.reset()
env.render()
import gym
from pogema import GridConfig
grid = """
.....#.....
.....#.....
...........
.....#.....
.....#.....
#.####.....
.....###.##
.....#.....
.....#.....
...........
.....#.....
"""
# Define new configuration with 8 randomly placed agents
grid_config = GridConfig(map=grid, num_agents=8)
# Create custom Pogema environment
env = gym.make('Pogema-v0', grid_config=grid_config)
If you use this repository in your research or wish to cite it, please make a reference to our paper:
@misc{https://doi.org/10.48550/arxiv.2206.10944,
doi = {10.48550/ARXIV.2206.10944},
url = {https://arxiv.org/abs/2206.10944},
author = {Skrynnik, Alexey and Andreychuk, Anton and Yakovlev, Konstantin and Panov, Aleksandr I.},
keywords = {Machine Learning (cs.LG), Artificial Intelligence (cs.AI), Multiagent Systems (cs.MA), FOS: Computer and information sciences, FOS: Computer and information sciences},
title = {POGEMA: Partially Observable Grid Environment for Multiple Agents},
publisher = {arXiv},
year = {2022},
copyright = {arXiv.org perpetual, non-exclusive license}
}