Comments (2)
It seems that when the Gym environment is initialized, the following is passed as the winning reward:

winning_reward=5000.0

(line 396 in cyberbattle_env.py). If the agent reaches the goal, this is added as a reward, overriding the node's value. This does not happen when you play the interactive game, which returns the string "FLAG: flag discovered!" and just adds the node value, which is 1000.

Can someone please confirm that this explanation is correct, and that comparing the results is a matter of just adding 4000 to the final reward?
from cyberbattlesim.
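The arithmetic in the question above can be sketched as follows (a minimal sketch; the 5000.0 and 1000 figures are the ones quoted in the question, and whether the winning reward replaces or adds to the node value should be verified against cyberbattle_env.py):

```python
# Sketch: put an interactive-game episode total on the same scale as a Gym run.
# Assumes (per the discussion) that the Gym environment grants a
# winning_reward of 5000.0 at the goal, while the interactive game
# only adds the goal node's value of 1000.
GYM_WINNING_REWARD = 5000.0
NODE_VALUE = 1000.0


def comparable_total(interactive_total: float) -> float:
    """Add the 4000-point gap so interactive totals match Gym totals."""
    return interactive_total + (GYM_WINNING_REWARD - NODE_VALUE)
```

Under that assumption, an interactive episode ending on 1000 corresponds to a Gym episode ending on 5000.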
@MariaRigaki The explanation is correct. The attacker gets the final winning reward if one is specified when the environment is created. The reason this does not happen in the interactive game is that the environment instantiated in notebook_benchmark-chain.ipynb is CyberBattleToyCtf-v0. This environment is defined in __init__.py as:

register(
    id='CyberBattleToyCtf-v0',
    cyberbattle_env_identifiers=toy_ctf.ENV_IDENTIFIERS,
    entry_point='cyberbattle._env.cyberbattle_toyctf:CyberBattleToyCtf',
    kwargs={'defender_agent': None,
            'attacker_goal': AttackerGoal(own_atleast=6),
            'defender_goal': DefenderGoal(eviction=True)
            },
)

which does not have a winning_reward parameter.
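Conversely, a variant that grants the winning reward could presumably be registered by adding the value to kwargs. This is a sketch only: it assumes the CyberBattleToyCtf constructor forwards a winning_reward argument to the underlying environment, and the id 'CyberBattleToyCtf-winning-v0' is hypothetical.

```python
register(
    id='CyberBattleToyCtf-winning-v0',  # hypothetical id, for illustration only
    cyberbattle_env_identifiers=toy_ctf.ENV_IDENTIFIERS,
    entry_point='cyberbattle._env.cyberbattle_toyctf:CyberBattleToyCtf',
    kwargs={'defender_agent': None,
            'attacker_goal': AttackerGoal(own_atleast=6),
            'defender_goal': DefenderGoal(eviction=True),
            'winning_reward': 5000.0,  # assumption: accepted as a constructor kwarg
            },
)
```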