← Back to C-Kernel-Engine Docs Doxygen Source Documentation
CKTokenizerConfig Struct Reference

#include <tokenizer.h>

Data Fields

bool add_bos
 
bool add_eos
 
bool add_space_prefix
 
bool lowercase
 
bool space_prefix_detected
 
CKSpacePrefixStyle space_prefix_style
 
CKSpmMode spm_mode
 
bool treat_whitespace_as_suffix
 
CKTokenizerType type
 
float unk_score
 
bool use_trie
 

Detailed Description

Definition at line 73 of file tokenizer.h.

Field Documentation

◆ add_bos

bool CKTokenizerConfig::add_bos

◆ add_eos

bool CKTokenizerConfig::add_eos

◆ add_space_prefix

bool CKTokenizerConfig::add_space_prefix

◆ lowercase

bool CKTokenizerConfig::lowercase

Definition at line 78 of file tokenizer.h.

◆ space_prefix_detected

bool CKTokenizerConfig::space_prefix_detected

◆ space_prefix_style

CKSpacePrefixStyle CKTokenizerConfig::space_prefix_style

◆ spm_mode

CKSpmMode CKTokenizerConfig::spm_mode

Definition at line 84 of file tokenizer.h.

Referenced by ck_tokenizer_create(), ck_tokenizer_encode(), and ck_tokenizer_set_spm_mode().

◆ treat_whitespace_as_suffix

bool CKTokenizerConfig::treat_whitespace_as_suffix

Definition at line 79 of file tokenizer.h.

◆ type

CKTokenizerType CKTokenizerConfig::type

Definition at line 74 of file tokenizer.h.

Referenced by ck_tokenizer_create(), and ck_tokenizer_encode().

◆ unk_score

float CKTokenizerConfig::unk_score

Definition at line 80 of file tokenizer.h.

Referenced by ck_tokenizer_create().

◆ use_trie

bool CKTokenizerConfig::use_trie

Definition at line 81 of file tokenizer.h.

Referenced by ck_tokenizer_set_use_trie(), and find_longest_match().


The documentation for this struct was generated from the following file: