← Back to C-Kernel-Engine Docs Doxygen Source Documentation
sigmoid_kernels_bf16.c File Reference

Sigmoid activation kernels for BF16 tensors. More...

#include <stddef.h>
#include <stdint.h>
#include "bf16_utils.h"
#include "ckernel_engine.h"

Go to the source code of this file.

Functions

void sigmoid_backward_bf16 (const uint16_t *input, const uint16_t *d_output, uint16_t *d_input, size_t n, float *scratch_input, float *scratch_d_output, float *scratch_d_input)
 
void sigmoid_forward_bf16 (const uint16_t *input, uint16_t *output, size_t n, float *scratch_input, float *scratch_output)
 

Detailed Description

Sigmoid activation kernels for BF16 tensors.

CK-ENGINE KERNEL RULES:

  1. NO malloc/free - memory via bump allocator, pointers passed in
  2. NO OpenMP - parallelization at orchestrator/codegen layer
  3. API must define: inputs, outputs, workspace, and memory layouts
  4. Pure computation - deterministic, no side effects

After changes: make test && make llamacpp-parity-full

Sigmoid: y = 1 / (1 + exp(-x))

Definition in file sigmoid_kernels_bf16.c.

Function Documentation

◆ sigmoid_backward_bf16()

void sigmoid_backward_bf16 ( const uint16_t *  input,
const uint16_t *  d_output,
uint16_t *  d_input,
size_t  n,
float *  scratch_input,
float *  scratch_d_output,
float *  scratch_d_input 
)

Definition at line 45 of file sigmoid_kernels_bf16.c.

52 {
53  if (!input || !d_output || !d_input || n == 0) return;
54  if (!scratch_input || !scratch_d_output || !scratch_d_input) return;
55 
56  bf16_tensor_to_float(input, scratch_input, n);
57  bf16_tensor_to_float(d_output, scratch_d_output, n);
58  sigmoid_backward(scratch_input, scratch_d_output, scratch_d_input, n);
59  float_tensor_to_bf16(scratch_d_input, d_input, n);
60 }
static void float_tensor_to_bf16(const float *src, uint16_t *dst, size_t count)
Definition: bf16_utils.h:271
static void bf16_tensor_to_float(const uint16_t *src, float *dst, size_t count)
Definition: bf16_utils.h:250
void sigmoid_backward(const float *input, const float *d_output, float *d_input, size_t n)

References bf16_tensor_to_float(), float_tensor_to_bf16(), and sigmoid_backward().

◆ sigmoid_forward_bf16()

void sigmoid_forward_bf16 ( const uint16_t *  input,
uint16_t *  output,
size_t  n,
float *  scratch_input,
float *  scratch_output 
)

Definition at line 27 of file sigmoid_kernels_bf16.c.

32 {
33  if (!input || !output || n == 0) return;
34  if (!scratch_input || !scratch_output) return;
35 
36  bf16_tensor_to_float(input, scratch_input, n);
37  sigmoid_forward(scratch_input, scratch_output, n);
38  float_tensor_to_bf16(scratch_output, output, n);
39 }
void sigmoid_forward(const float *input, float *output, size_t n)

References bf16_tensor_to_float(), float_tensor_to_bf16(), and sigmoid_forward().