I am David Yu-Tung Hui / 許宇同, a 2023 MSc graduate from Mila, University of Montreal.
I'm interested in creating algorithms that learn through interactions with an environment. I hope that these algorithms will eventually be used to discover new knowledge about our world.
To achieve this dream, I research how to train deep neural networks with reinforcement learning (RL). RL algorithms formalize learning through interaction as an optimization problem, and over the past decade deep neural networks have proven highly effective at large-scale numerical optimization. My research uses linear algebra and probability theory to design principled loss functions that keep optimization stable across a variety of settings.
I've written two works toward this goal:
- Stabilizing Q-Learning for Continuous Control (MSc thesis) showed that critic networks with LayerNorm have convergent semi-gradient updates of the mean-squared temporal-difference error. This enabled learning high-dimensional continuous-control tasks such as dog-run in the DeepMind Control Suite.
- Double Gumbel Q-Learning (Spotlight @ NeurIPS 2023) showed that Maximum-Entropy RL contains two heteroscedastic Gumbel noise sources. Accounting for them improved the aggregate performance of SAC by 2× at 1M training timesteps.
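To give a flavor of the first result, here is a minimal sketch of a critic network with LayerNorm applied to its hidden layer. The single-hidden-layer architecture, layer sizes, and random weights below are illustrative assumptions for exposition, not the exact network from the thesis.

```python
import numpy as np

def layer_norm(h, eps=1e-5):
    """Normalize activations to zero mean and unit variance per example."""
    mean = h.mean(axis=-1, keepdims=True)
    var = h.var(axis=-1, keepdims=True)
    return (h - mean) / np.sqrt(var + eps)

def critic(state, action, W1, W2):
    """Q(s, a): LayerNorm is applied to the hidden pre-activations,
    bounding the scale of the hidden features."""
    x = np.concatenate([state, action], axis=-1)
    h = layer_norm(x @ W1)
    h = np.maximum(h, 0.0)  # ReLU
    return h @ W2           # one scalar Q-value per input row

rng = np.random.default_rng(0)
W1 = rng.normal(size=(6, 32)) * 0.1  # toy dimensions: 4-dim state, 2-dim action
W2 = rng.normal(size=(32, 1)) * 0.1
q = critic(rng.normal(size=(4, 4)), rng.normal(size=(4, 2)), W1, W2)
print(q.shape)  # (4, 1)
```

The normalization keeps the hidden activations at a fixed scale regardless of how large the incoming weights grow, which is the property the thesis connects to convergence of the semi-gradient TD update.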
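For the second result, the sketch below shows what "heteroscedastic Gumbel noise" means operationally: Gumbel samples whose scale depends on the state, drawn by inverse-CDF sampling. The two scale functions are made-up placeholders for illustration, not the learned scales from the paper.

```python
import numpy as np

def sample_gumbel(rng, scale, size):
    """Inverse-CDF sampling: G = -scale * log(-log(U)), U ~ Uniform(0, 1)."""
    u = rng.uniform(low=1e-12, high=1.0, size=size)
    return -scale * np.log(-np.log(u))

rng = np.random.default_rng(0)
states = rng.normal(size=(1000, 3))

# Heteroscedastic: each noise source's scale varies with the state
# (placeholder functional forms).
scale_a = 0.5 + 0.1 * np.abs(states[:, :1])
scale_b = 1.0 + 0.2 * np.abs(states[:, 1:2])

# Two independent Gumbel sources entering with opposite signs.
noise = sample_gumbel(rng, scale_a, (1000, 1)) - sample_gumbel(rng, scale_b, (1000, 1))
print(noise.shape)  # (1000, 1)
```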
I'm currently looking for opportunities where I can continue my research.
For more information about me, please see: