Git Product home page Git Product logo

gym_tool_use's Introduction

Gym Tool Use

gym tool use environments.


$ pip install gym-tool-use

Usage

import gym_tool_use  # import to register gym envs
env = gym.make("TrapTube-v0")
observation = env.reset()
action = env.action_space.sample()
observation_next, reward, done, info = env.step(action)
image = env.render(mode="rgb_array")  # also supports mode="human"

Environments

The following environments are registered:

  • "TrapTube-v0" (base task)
  • "PerceptualTrapTube-v0"
  • "StructuralTrapTube-v0"
  • "SymbolicTrapTube-v0"
  • "PerceptualSymbolicTrapTube-v0"
  • "StructuralSymbolicTrapTube-v0"
  • "PerceptualStructuralTrapTube-v0"
  • "PerceptualStructuralSymbolicTrapTube-v0"

Baselines

Baseline implementations here: https://github.com/fomorians/tool-use

Development

Development is started with pipenv.

$ pipenv install
$ pipenv shell

Citation

If you use this code in your work, please cite the following:

@ARTICLE{2019arXiv190702050W,
      author = {{Wenke}, Sam and {Saunders}, Dan and {Qiu}, Mike and {Fleming}, Jim},
       title = "{Reasoning and Generalization in RL: A Tool Use Perspective}",
     journal = {arXiv e-prints},
    keywords = {Computer Science - Neural and Evolutionary Computing, Computer Science - Artificial Intelligence, Computer Science - Machine Learning},
        year = "2019",
       month = "Jul",
         eid = {arXiv:1907.02050},
       pages = {arXiv:1907.02050},
archivePrefix = {arXiv},
      eprint = {1907.02050},
primaryClass = {cs.NE},
      adsurl = {https://ui.adsabs.harvard.edu/abs/2019arXiv190702050W},
     adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}

gym_tool_use's People

Contributors

wenkesj avatar jimfleming avatar djsaunde avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.