llama_sampler_init_softmax method

Pointer<llama_sampler> llama_sampler_init_softmax()

Sorts candidate tokens by their logits in descending order and calculates probabilities from those logits. NOTE: Avoid using this sampler on the full vocabulary, as the sorting can become slow. For example, apply top-k or top-p sampling first so that only a reduced candidate set is sorted.
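The operation described above can be illustrated with a minimal pure-Dart sketch. It is not the native implementation behind this binding. `Candidate` and `softmaxInPlace` are hypothetical names introduced only for illustration, and the sketch assumes the usual numerically stable softmax, which subtracts the largest logit before exponentiating.

```dart
import 'dart:math' as math;

/// Hypothetical stand-in for a candidate token (not a type from the bindings).
class Candidate {
  Candidate(this.id, this.logit);
  final int id;
  final double logit;
  double p = 0.0; // probability, filled in by softmaxInPlace
}

/// Sorts candidates by logit in descending order, then assigns softmax
/// probabilities. The sort is O(n log n), which is why a smaller candidate
/// list (for example after top-k) is cheaper to process.
void softmaxInPlace(List<Candidate> candidates) {
  if (candidates.isEmpty) return;
  candidates.sort((a, b) => b.logit.compareTo(a.logit));

  // After sorting, the first element holds the largest logit.
  final maxLogit = candidates.first.logit;
  var sum = 0.0;
  for (final c in candidates) {
    c.p = math.exp(c.logit - maxLogit); // subtract max for stability
    sum += c.p;
  }
  for (final c in candidates) {
    c.p /= sum; // normalize so the probabilities sum to 1
  }
}

void main() {
  final candidates = [Candidate(7, 0.5), Candidate(3, 2.0), Candidate(9, 1.0)];
  softmaxInPlace(candidates);
  for (final c in candidates) {
    print('token ${c.id}: p=${c.p.toStringAsFixed(3)}');
  }
}
```

After the call, the list is ordered from most to least likely token and the `p` values sum to 1.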

Implementation

ffi.Pointer<llama_sampler> llama_sampler_init_softmax() {
  return _llama_sampler_init_softmax();
}