CUBLASLt gemm for the candle ML framework
Owned by Nicolas Patry.
#236 in #tensor
528 downloads per month
31KB 752 lines
CublasLt Matmul operation for the Candle ML framework. Allows for bias and Relu/Gelu fusing.
~28MB ~519K SLoC