|  | CUTLASS
    CUDA Templates for Linear Algebra Subroutines and Solvers | 
| File in include/cutlass/gemm | Includes file in include/cutlass/epilogue | 
|---|---|
| kernel / default_gemm.h | threadblock / default_epilogue_simt.h | 
| kernel / default_gemm.h | threadblock / default_epilogue_tensor_op.h | 
| kernel / default_gemm.h | threadblock / default_epilogue_volta_tensor_op.h | 
| kernel / default_gemm.h | threadblock / epilogue.h | 
| kernel / default_gemm.h | thread / linear_combination.h | 
| device / default_gemm_configuration.h | thread / linear_combination.h | 
| device / default_gemm_configuration.h | thread / linear_combination_clamp.h | 
| device / device/gemm_splitk_parallel.h | thread / conversion_op.h | 
 1.8.11
 1.8.11