Git Product home page Git Product logo

Tianbing Xu's Projects

alpaca_farm icon alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

baselines icon baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

d3po icon d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

epg icon epg

Open-sourced code for Evolved Policy Gradients

gpt-fast icon gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

halos icon halos

A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).

lit-llama icon lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

llama-adapter icon llama-adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

llava icon llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

llm icon llm

Sharing LLM basic ideas and code

llm-course icon llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

mistral-src icon mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

mit-6.824 icon mit-6.824

Basic Sources for MIT 6.824 Distributed Systems Class

mlc-llm icon mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

mlresearch icon mlresearch

My machine learning, reinforcement learning and deep learning research and engineering work.

nanogpt icon nanogpt

The simplest, fastest repository for training/finetuning medium-sized GPTs.

neftune icon neftune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

openai icon openai

The repository for all Azure OpenAI Samples complementing the OpenAI cookbook.

openrlhf icon openrlhf

A Ray-based High-performance RLHF framework (for large models)

paddle icon paddle

PArallel Distributed Deep LEarning

peft icon peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.