Topic: cuda-kernels Goto Github
Some thing interesting about cuda-kernels
Some thing interesting about cuda-kernels
cuda-kernels,Triton implementation of FlashAttention2 that adds Custom Masks.
User: alexzhang13
cuda-kernels,Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops
User: aredden
cuda-kernels,CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable version of the Jacobi eigenvalue algorithm.
User: arneish
cuda-kernels,PatchMatch Stereo with Red-Black modifiaction and Row Parallel modification for massively parallel computing
User: artmortal93
cuda-kernels,(REOS) Radar and Electro-Optical Simulation Framework written in C++.
User: bgin
cuda-kernels,(REOS) Radar and ElectroOptical Simulation Framework written in Fortran.
User: bgin
cuda-kernels,Bandicoot: C++ library for GPU linear algebra & scientific computing - https://coot.sourceforge.io
User: conradsnicta
Home Page: https://coot.sourceforge.io
cuda-kernels,Safe rust wrapper around CUDA toolkit
User: coreylowman
cuda-kernels,Deep learning in Rust, with shape checked tensors and neural networks
User: coreylowman
cuda-kernels,Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
User: deepakkumar1984
cuda-kernels,🎉CUDA/C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.
User: deftruth
Home Page: https://github.com/DefTruth/cuda-learn-notes
cuda-kernels,C++ cross-platform gpu SDK
Organization: dehancer
cuda-kernels,Speed up image preprocess with cuda when handle image or tensorrt inference
User: emptysoal
cuda-kernels,Using custom CUDA kernels with Open CV Mat objects.
User: evlasblom
cuda-kernels,CUDA kernel author's tools
User: eyalroz
cuda-kernels,StiffMa: Fast finite element STIFFness MAtrix generation in MATLAB by using GPU computing.
User: fjramireg
cuda-kernels,The cuda code is mainly for nvidia hardware device. This repo will show how to run cuda c or cuda cpp code on the google colab platform for free.
User: flin3500
cuda-kernels,Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
User: harrism
cuda-kernels,This is a Lattice-Boltzmann simulation using CUDA GPU graphics optimization.
User: henryfriedlander
cuda-kernels,From zero to hero CUDA for accelerating maths and machine learning on GPU.
User: hmunachi
cuda-kernels,cuda编程学习入门
User: huangcongqing
cuda-kernels,LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Organization: internlm
Home Page: https://lmdeploy.readthedocs.io/en/latest/
cuda-kernels,This is an archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010
User: jaredhoberock
cuda-kernels,Kernel Tuner
Organization: kerneltuner
Home Page: https://kerneltuner.github.io/kernel_tuner/
cuda-kernels,A Complete beginner's introduction to programming with CUDA Fortran
User: koushikphy
cuda-kernels,Pytorch implementation of a message passing neural network with RNN sub-units
User: kovanostra
cuda-kernels,🚀 你的YOLO部署神器。TensorRT Plugin、CUDA Kernel、CUDA Graphs三管齐下,享受闪电般的推理速度。| Your YOLO Deployment Powerhouse. With the synergy of TensorRT Plugins, CUDA Kernels, and CUDA Graphs, experience lightning-fast inference speeds.
User: laugh12321
Home Page: https://github.com/laugh12321/TensorRT-YOLO
cuda-kernels,This is a cross-chip platform collection of operators and a unified neural network library.
User: matrix97317
cuda-kernels,Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research
Organization: microsoft
Home Page: https://microsoft.github.io/Accera
cuda-kernels,CUDA Guide
User: mikeroyal
cuda-kernels,
Organization: mis-wut
cuda-kernels,Allen Cahn CUDA (Phase-Field Simulation of Dendritic Solidification)
User: myousefi2016
cuda-kernels,Cahn Hilliard CUDA (Phase-Field Simulation of Spinodal Decomposition)
User: myousefi2016
cuda-kernels,CUDA Core Compute Libraries
Organization: nvidia
cuda-kernels,Samples for CUDA Developers which demonstrates features in CUDA Toolkit
Organization: nvidia
cuda-kernels,CUDA Kernel Benchmarking Library
Organization: nvidia
cuda-kernels,This repository contains examples CUDA usage in Cython code.
User: okatanaaa
cuda-kernels,Implementation of ConjugateGradients method using C and Nvidia CUDA
User: p-sto
cuda-kernels,Some CUDA design patterns and a bit of template magic for CUDA
User: patwie
cuda-kernels,Get started with CUDA programming
User: priteshgohil
cuda-kernels,Quantum-inspired evolutionary algorithms for Optimization problems
User: rnowotniak
cuda-kernels,Non Local Means Filter for Image Denoising in CUDA
User: rosevoul
cuda-kernels,maxas Scott Grey's maxas assembler sgemm explaining the (for me) missing parts https://github.com/NervanaSystems/maxas
User: stefan20162016
cuda-kernels,Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees
Organization: stellar-group
Home Page: http://octotiger.stellar-group.org/
cuda-kernels,Implement Neural Networks in Cuda from Scratch
User: thoenigadrian
Home Page: https://www.youtube.com/playlist?list=PLdVoL2No_-X9OK8-20KOyVRki5tBMrGGG
cuda-kernels,Spiking Neural Networks in C++ with strong GPU acceleration through CUDA
Organization: tudelft
cuda-kernels,Protecting Real-Time GPU Kernels on Integrated CPU-GPU SoC Platforms
User: wali-ku
cuda-kernels,High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.
User: wangsiping97
cuda-kernels,A tool for examining GPU scheduling behavior.
User: yalue
cuda-kernels,2D Game texture special effects
User: yoyoberenguer
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.