Lorenz Wolf's Projects
Code used for the AAMAS-17 paper "Simultaneously Learning and Advising in Multiagent Reinforcement Learning"
Autoregressive Convolutional RNN for univariate and multivariate time series forecasting, implemented with Keras and TensorFlow.
This repo contains a collection of papers on ML for functional data.
Bayesian Bandits
Knowledge-Aware RL agents with Commonsense Reasoning
First one-day project for the computational statistics module.
Implementing the Denoising Diffusion Probabilistic Model in Flax
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
Example implementation of DQN for Lunar Lander, resolving some incompatibilities
An extension of the PyMARL codebase that includes additional algorithms and environment support
A collection of Google research projects related to Federated Learning and Federated Analytics.
MSc Thesis on Deep Learning methodology for functional data
The Unified Machine Learning Framework
Some small ML projects
Simple and easily configurable grid world environments for reinforcement learning
Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"
Multi-armed bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset
Portfolio with applied ML/AI projects
Recursive Bayesian Estimation (Sequential / Online Inference)
Minimalist version of probml/rebayes
Soft Actor-Critic for unsupervised RL
Code and documentation to train Stanford's Alpaca models, and generate the data.