AVX2 Q4_K x Q8_K matvec kernel (inference only) More...
Go to the source code of this file.
Functions | |
| void | gemv_q4_k_q8_k_avx2 (float *y, const void *W, const void *x_q8, int M, int K) |
| void | gemv_q4_k_q8_k_ref (float *y, const void *W, const void *x_q8, int M, int K) |
AVX2 Q4_K x Q8_K matvec kernel (inference only)
After changes: make test && make llamacpp-parity-full
Requires AVX2 for 256-bit integer operations.
Definition in file gemm_kernels_q4k_q8k_avx2.c.
| void gemv_q4_k_q8_k_avx2 | ( | float * | y, |
| const void * | W, | ||
| const void * | x_q8, | ||
| int | M, | ||
| int | K | ||
| ) |
Definition at line 89 of file gemm_kernels_q4k_q8k_avx2.c.
References gemv_q4_k_q8_k_ref().
Referenced by gemv_q4_k_q8_k(), and gemv_q4_k_q8_k_amx().
| void gemv_q4_k_q8_k_ref | ( | float * | y, |
| const void * | W, | ||
| const void * | x_q8, | ||
| int | M, | ||
| int | K | ||
| ) |
Definition at line 177 of file gemm_kernels_q4k_q8k.c.
Referenced by gemv_q4_k_q8_k_avx2().