Git Product home page Git Product logo

sirius93123's Projects

aitemplate icon aitemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

allreduce-proto icon allreduce-proto

A prototype implementation of AllReduce collective communication routine.

alpa icon alpa

Auto parallelization for large-scale neural networks

amos icon amos

Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators

apq icon apq

[CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy

atom icon atom

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

awesome-compilers icon awesome-compilers

:sunglasses: Curated list of awesome resources on Compilers, Interpreters and Runtimes

awesome-llm-inference icon awesome-llm-inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

baichuan-7b icon baichuan-7b

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

bladedisc icon bladedisc

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

blinkplus icon blinkplus

Blink+: Increase GPU group bandwidth by utilizing across tenant NVLink.

block-shuffle icon block-shuffle

A method for high-resolution Fast Style Transfer with limited memory

block-sparse-benchmark icon block-sparse-benchmark

Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.