xlan09 / how_to_optimize_in_gpu_cuda Goto Github PK
View Code? Open in Web Editor NEWThis project forked from liu-xiandong/how_to_optimize_in_gpu
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
License: Apache License 2.0