sanchitintel

followers: 10 following: 125 repos: 44 gists: 0

Type: User

Location: San Francisco Bay Area

sanchitintel's Projects

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

aitemplate

AITemplate is a Python framework which renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

archbenchsuite

Low-level kernels to benchmark peak compute, cache bandwidth at various levels, memory bandwidth, and some basic compute routines

benchmark

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

classyvision

An end-to-end PyTorch framework for image and video classification

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

examples

A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.

fairscale

PyTorch extensions for high performance and large scale training.

fbgemm

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

glibc

Unofficial mirror of sourceware glibc repository. Updated daily.

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

hydra

Hydra is a framework for elegantly configuring complex applications

ideep

Intel® Optimization for Chainer*, a Chainer module providing a NumPy-like API and DNN acceleration using MKL-DNN.

intel-extension-for-pytorch

A Python package that extends official PyTorch to easily improve performance on Intel platforms

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

lazy-tensor-samples

Code samples using features from PyTorch's Lazy Tensor Core

libxsmm

Library for specialized dense and sparse matrix operations, and deep learning primitives.

mmperf

MatMul Performance Benchmarks for a Single CPU Core comparing both hand-engineered and codegen kernels.

nanobench

A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.

netron-fork

Visualizer for neural network, deep learning, and machine learning models

Intel® Neural Compressor provides unified APIs for SOTA model compression techniques, such as low precision (INT8/INT4/FP4/NF4) quantization, sparsity, pruning, and knowledge distillation on mainstream AI frameworks such as TensorFlow, PyTorch, and ONNX Runtime.

nogil

Multithreaded Python without the GIL

onednn

oneAPI Deep Neural Network Library (oneDNN)
