shitianyu-hue
Name: SHITIANYU
Type: User
Company: University of Toronto
Bio: Ph.D. student @ University of Toronto. Reinforcement learning.
Location: Toronto, Canada
process for age bias dataset
[ICML 2022] RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression
AgentTuning: Enabling Generalized Agent Abilities for LLMs
A weekly roundup of the latest papers on multimodal models, LLMs, and embodied AI
Part of the Fundamentals of Computing Specialization, Rice, Coursera
Official GitHub repository for Argoverse dataset
Attention Based Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting (ASTGCN) AAAI 2019
Code and links for trained Atari agents
A curated list of awesome Python frameworks, libraries, software and resources
Author's PyTorch implementation of BCQ for continuous and discrete actions
Reinforcement Learning + Imitation Learning based approach to AI Driving Olympics
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Official reinforcement learning environment for demand response and load shaping
Training framework for conditional imitation learning
Source code for the COMP 767 group project by Tianyu Shi & Jiawei Wang.
📖Coursera Princeton Algorithms Part 1
Data Mining - University of Illinois at Urbana-Champaign
A reliable controller is critical for the execution of safe and smooth maneuvers of an autonomous vehicle. The controller must be robust to external disturbances, such as road surface, weather, and wind conditions. It also needs to deal with internal variations of vehicle sub-systems, including powertrain inefficiency, measurement errors, and time delay. These factors degrade controller performance. In this paper, a feedforward compensator is designed via a data-driven method to model and optimize the controller’s performance. Principal Component Analysis (PCA) is applied to extract influential features, after which a Time Delay Neural Network is adopted to predict control errors over a future time horizon. Based on the predicted error, a feedforward compensator is then designed to improve control performance. Simulation results in different scenarios show that, with the proposed feedforward compensator, the maximum path tracking error and the steering wheel angle oscillation are improved by 44.4% and 26.7%, respectively.
Multi-agent deep reinforcement learning for networked system control.
DGN Code
Deep Implicit Coordination Graphs
Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.