sanchitintel Goto Github PK
Type: User
Location: San Francisco Bay Area
Type: User
Location: San Francisco Bay Area
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
fork of AlbanD'S subclass zoo
low level kernels to benchmark peak compute, cache bandwidth on various levels, memory bandwidth, and some basic compute routines
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
An end-to-end PyTorch framework for image and video classification
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
PyTorch extensions for high performance and large scale training.
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Unofficial mirror of sourceware glibc repository. Updated daily.
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Hydra is a framework for elegantly configuring complex applications
Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
OpenAI Triton backend for Intel® GPUs
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
Code samples using features from PyTorch's Lazy Tensor Core
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Bert Maher's llama2.so
Bert Maher's membench
FBResearch Metaseq fork
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.
Visualizer for neural network, deep learning, and machine learning models
Intel® Neural Compressor provides unified APIs for SOTA model compression techniques, such as low precision (INT8/INT4/FP4/NF4) quantization, sparsity, pruning, and knowledge distillation on mainstream AI frameworks such as TensorFlow, PyTorch, and ONNX Runtime.
Multithreaded Python without the GIL
oneAPI Deep Neural Network Library (oneDNN)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.