Comments (2)
If someone need the similar function, here is my solution edimetia3d@8fc8e37
from cutlass.
It looks like you have addressed this reasonably well.
Fwiw, the CUTLASS 2.x codebase should support this case relatively well, accommodating any arbitrary functor applied to the epilogue including type conversion.
Thanks for posting!
from cutlass.
Related Issues (20)
- [QST]How to `Copy_Atom` with data type conversion? HOT 1
- [BUG] Stride is ignored for dst tensor of a Conv2dFprop HOT 29
- [FEA] Add Prefetching Hints Support for Global Memory Loading HOT 4
- random error with sm80_mma_multistage and sm70_epilogue_vectorized HOT 2
- [QST] different EVT-based epilogues for hopper and pre-hopper HOT 3
- [QST] What is "fast accumulation" for fp8? HOT 1
- [QST] Gather/Scatter in cute/cutlass 3 HOT 18
- [QST] Is there any other legal layout in cutlass? HOT 3
- [BUG] w4a8 mixed-input gemm for fine-grained quantization HOT 4
- [QST] Is it possible to detect output coordinates in elementwise epilogue ? HOT 5
- [QST] thread num assert in sm70_epilogue_vectorized HOT 1
- [QST] Sparse GEMM runs much worse than Dense GEMM in some cases HOT 15
- [DOC] Cannot compile the example code in quickstart.md HOT 2
- [QST] `sm70` `mma.sync` layout HOT 2
- [QST] Any rules to follow when set instruction shape for GemmTensorOp? HOT 5
- [QST] Is s8 * s8 = {s32, s8} supported in cuTe? HOT 4
- [QST] `TensorView` API HOT 4
- [QST] The performance of Hopper group gemm is not meeting expectation in some cases HOT 2
- [QST]Composition does not work as expected. HOT 3
- [QST] could you please help me understand how right_inverse work? HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cutlass.