LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16 constant
int
const LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16
tok_embeddings.weight and output.weight are F16
Implementation
static const int LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16 = 4;