Mila
Deep Neural Network Library
Functions
template<typename MR> constexpr size_t get_alignment()
Determines the appropriate memory alignment based on the memory resource type.
Variables
constexpr size_t CPU_SIMD_ALIGN = 64
AVX-512 alignment for CPU operations.
constexpr size_t CUDA_WARP_SIZE = 32
CUDA warp size alignment (32 threads).
template<typename MR> constexpr size_t get_alignment()
Determines the appropriate memory alignment based on the memory resource type.
For GPU memory resources, the alignment is based on the CUDA warp size (CUDA_WARP_SIZE). For CPU memory resources, it is based on AVX-512 requirements (CPU_SIMD_ALIGN).
Template Parameters
MR: The memory resource type.
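The selection described above could be sketched as follows. This is only an illustration, not the library's implementation: the tag types `HostMemoryResource` and `DeviceMemoryResource` and the trait `is_device_resource` are hypothetical stand-ins, since this page does not show how Mila distinguishes GPU from CPU memory resources. The sketch assumes C++17 for `if constexpr`.

```cpp
#include <cstddef>
#include <type_traits>

constexpr std::size_t CPU_SIMD_ALIGN = 64;  // value taken from this page
constexpr std::size_t CUDA_WARP_SIZE = 32;  // value taken from this page

// Hypothetical tag types; the library's real memory resources are not shown here.
struct HostMemoryResource {};
struct DeviceMemoryResource {};

// Hypothetical trait that marks a memory resource as GPU-side.
template <typename MR>
struct is_device_resource : std::false_type {};

template <>
struct is_device_resource<DeviceMemoryResource> : std::true_type {};

// Sketch of a compile-time alignment selector in the spirit of get_alignment().
template <typename MR>
constexpr std::size_t get_alignment() {
    if constexpr (is_device_resource<MR>::value) {
        return CUDA_WARP_SIZE;   // GPU resources: warp-size based alignment
    } else {
        return CPU_SIMD_ALIGN;   // CPU resources: AVX-512 based alignment
    }
}

// Both results are available at compile time.
static_assert(get_alignment<HostMemoryResource>() == 64);
static_assert(get_alignment<DeviceMemoryResource>() == 32);
```

Because the function is `constexpr`, the result can be used wherever a constant expression is required, for example as an `alignas` argument or a template parameter.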
constexpr size_t CPU_SIMD_ALIGN = 64
AVX-512 alignment for CPU operations.
This constant defines the alignment, in bytes, required for optimal SIMD operations using AVX-512 instructions on the CPU. A 512-bit AVX-512 register is 64 bytes wide.
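As an illustration of how a 64-byte alignment constant is typically applied, the sketch below aligns a small buffer type with `alignas` and checks an address against the boundary. `AlignedBlock` and `is_simd_aligned` are hypothetical names introduced for this example and are not part of the documented API. The size check assumes a 4-byte `float`.

```cpp
#include <cstddef>
#include <cstdint>

constexpr std::size_t CPU_SIMD_ALIGN = 64;  // value taken from this page

// Hypothetical buffer type: 16 floats fill one 64-byte AVX-512 register
// (assuming sizeof(float) == 4).
struct alignas(CPU_SIMD_ALIGN) AlignedBlock {
    float values[16];
};

static_assert(alignof(AlignedBlock) == CPU_SIMD_ALIGN);
static_assert(sizeof(AlignedBlock) == 64);

// Hypothetical helper: true when ptr sits on a CPU_SIMD_ALIGN boundary.
inline bool is_simd_aligned(const void* ptr) {
    return reinterpret_cast<std::uintptr_t>(ptr) % CPU_SIMD_ALIGN == 0;
}
```

Any `AlignedBlock` object starts on a 64-byte boundary, so `is_simd_aligned` returns true for its address and false for an address a few bytes into the block.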
constexpr size_t CUDA_WARP_SIZE = 32
CUDA warp size alignment (32 threads)
This constant defines the alignment required for optimal memory access patterns when using CUDA warp-level operations.
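One common way to use such a constant is to pad a size up to the next multiple of the warp size, so that each warp works on a full, aligned chunk. The sketch below shows that arithmetic. `round_up` is a hypothetical helper, and this page does not say whether Mila pads sizes this way.

```cpp
#include <cstddef>

constexpr std::size_t CUDA_WARP_SIZE = 32;  // value taken from this page

// Hypothetical helper: rounds n up to the next multiple of alignment.
// Requires alignment > 0.
constexpr std::size_t round_up(std::size_t n, std::size_t alignment) {
    return (n + alignment - 1) / alignment * alignment;
}

static_assert(round_up(0, CUDA_WARP_SIZE) == 0);
static_assert(round_up(1, CUDA_WARP_SIZE) == 32);
static_assert(round_up(32, CUDA_WARP_SIZE) == 32);
static_assert(round_up(100, CUDA_WARP_SIZE) == 128);
```

With this helper, a request for 100 units is padded to 128, which is four full warps of 32.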