SSE4.1 GEMM for Q5_0 quantized weights. More...
Go to the source code of this file.
Functions | |
| void | gemm_nt_q5_0_sse (const float *A, const void *B, const float *bias, float *C, int M, int N, int K) |
SSE4.1 GEMM for Q5_0 quantized weights.
After changes: make test && make llamacpp-parity-full
Compatible with Sandy Bridge/Ivy Bridge and later.
Definition in file gemm_kernels_q5_0_sse.c.
| void gemm_nt_q5_0_sse | ( | const float * | A, |
| const void * | B, | ||
| const float * | bias, | ||
| float * | C, | ||
| int | M, | ||
| int | N, | ||
| int | K | ||
| ) |
Definition at line 23 of file gemm_kernels_q5_0_sse.c.
References C, CK_FP16_TO_FP32, block_q5_0::d, block_q5_0::qh, QK5_0, and block_q5_0::qs.