LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16 constant

int const LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16

tok_embeddings.weight and output.weight are F16

Implementation

static const int LLAMA_FTYPE_MOSTLY_Q4_1_SOME_F16 = 4;