|
Mila 0.13.48
Deep Neural Network Library
|

Files | |
| CublasLtMatMulBias.ixx | |
| CUDA-accelerated matrix multiplication with bias addition using cuBLASLt. | |
| CudaLinearGeluOp.ixx | |
| CudaLinearOp.Dispatch.ixx | |
| CudaLinearOp.ixx | |
| CUDA implementation of Linear operation with two-phase cuBLASLt optimization. | |
| CudaLinearOp.Plans.ixx | |
| cuBLASLt plan builders for CudaLinearOp forward and backward passes. | |
| CudaLinearOp.Quantize.ixx | |
| Quantize partition of CudaLinearOp. | |
| CudaLinearOpTypeMap.ixx | |
| LinearOpTypeMap specializations for CUDA device targets. | |