llama_context_params class
final
Properties
-
embedding
↔ bool
-
embedding mode only
getter/setter pair
-
f16_kv
↔ bool
-
use fp16 for KV cache, fp32 otherwise
getter/setter pair
-
hashCode
→ int
-
The hash code for this object.
no setter, inherited
-
logits_all
↔ bool
-
the llama_eval() call computes all logits, not just the last one
getter/setter pair
-
mul_mat_q
↔ bool
-
if true, use experimental mul_mat_q kernels (DEPRECATED - always true)
getter/setter pair
-
n_batch
↔ int
-
maximum batch size for prompt processing
getter/setter pair
-
n_ctx
↔ int
-
text context size, 0 = from model
getter/setter pair
-
n_threads
↔ int
-
number of threads to use for generation
getter/setter pair
-
n_threads_batch
↔ int
-
number of threads to use for batch processing
getter/setter pair
-
rope_freq_base
↔ double
-
RoPE base frequency, 0 = from model
getter/setter pair
-
rope_freq_scale
↔ double
-
RoPE frequency scaling factor, 0 = from model
getter/setter pair
-
rope_scaling_type
↔ int
-
RoPE scaling type, from enum llama_rope_scaling_type
getter/setter pair
-
runtimeType
→ Type
-
A representation of the runtime type of the object.
no setter, inherited
-
seed
↔ int
-
RNG seed, -1 for random
getter/setter pair
-
yarn_attn_factor
↔ double
-
YaRN magnitude scaling factor
getter/setter pair
-
yarn_beta_fast
↔ double
-
YaRN low correction dimension
getter/setter pair
-
yarn_beta_slow
↔ double
-
YaRN high correction dimension
getter/setter pair
-
yarn_ext_factor
↔ double
-
YaRN extrapolation mix factor, negative = from model
getter/setter pair
-
yarn_orig_ctx
↔ int
-
YaRN original context size
getter/setter pair
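Several properties above use a sentinel value to mean "take this from the model" or "pick one at random" (n_ctx, rope_freq_base, rope_freq_scale, seed). The sketch below illustrates how those sentinels could be read. It is not the binding's API: ContextParamsSketch is a hypothetical plain-Dart stand-in for a few fields of llama_context_params, and the helper functions and their modelCtx / modelBase arguments are assumptions made for illustration only.

```dart
// Hypothetical stand-in for a few fields of llama_context_params.
// The real class is an FFI struct and is not constructed like this.
class ContextParamsSketch {
  int seed = -1; // -1 requests a random seed
  int n_ctx = 0; // 0 means "take the context size from the model"
  double rope_freq_base = 0.0; // 0 means "take the value from the model"
  double rope_freq_scale = 0.0; // 0 means "take the value from the model"
}

// Context size that would be in effect. modelCtx is an assumed input
// (the model's own value), not a field of the struct.
int effectiveContextSize(ContextParamsSketch p, int modelCtx) =>
    p.n_ctx == 0 ? modelCtx : p.n_ctx;

// RoPE base frequency that would be in effect. modelBase is an assumed input.
double effectiveRopeFreqBase(ContextParamsSketch p, double modelBase) =>
    p.rope_freq_base == 0.0 ? modelBase : p.rope_freq_base;

// True when the documented "random seed" sentinel is set.
bool wantsRandomSeed(ContextParamsSketch p) => p.seed == -1;
```

With n_ctx left at 0 and an assumed model value of 4096, effectiveContextSize returns 4096. After setting n_ctx to 2048 it returns 2048.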
Methods
-
noSuchMethod(Invocation invocation)
→ dynamic
-
Invoked when a nonexistent method or property is accessed.
inherited
-
toString()
→ String
-
A string representation of this object.
inherited