LocalLlamaProvider class - local_llama_provider library

Local in-process GGUF LLM inference provider for Neomage.

Implements the saia.LocalAiService interface from SAIA. Integrates with the HardwareProfiler from neom_ollama to profile device resources at runtime and tune the context size and thread settings accordingly.

Constructors

LocalLlamaProvider()

Properties

hashCode → int: The hash code for this object.
no setter, inherited
isReady → bool: Whether a local model is loaded and ready for inference.
no setter, override
modelInfo → Map<String, dynamic>: Returns metadata about the model (name, size, quantization).
no setter, override
runtimeType → Type: A representation of the runtime type of the object.
no setter, inherited
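
The two overridden getters suggest a guard pattern: check isReady before reading anything, and treat modelInfo as a loosely typed map. The sketch below uses a hypothetical stand-in class, StubProvider, that mirrors only those two getters so it runs without the package. The key names 'name', 'size', and 'quantization' are assumptions taken from the property description, and the values are made up.

```dart
// Hypothetical stand-in mirroring the documented getters; not the real class.
class StubProvider {
  bool _loaded = false;

  bool get isReady => _loaded;

  // Key names are assumed from the description "(name, size, quantization)".
  Map<String, dynamic> get modelInfo => _loaded
      ? {'name': 'example-model', 'size': 1234, 'quantization': 'Q4_K_M'}
      : <String, dynamic>{};

  // Stub-only helper that simulates a successful model load.
  void load() => _loaded = true;
}

// Builds a one-line summary without assuming any key is present.
String describe(StubProvider provider) {
  if (!provider.isReady) return 'no model loaded';
  final info = provider.modelInfo;
  final name = info['name'] ?? 'unknown';
  final quant = info['quantization'] ?? 'unknown';
  return '$name ($quant)';
}

void main() {
  final provider = StubProvider();
  print(describe(provider)); // no model loaded
  provider.load();
  print(describe(provider)); // example-model (Q4_K_M)
}
```

Falling back to 'unknown' keeps the summary usable if the real map omits a key or names it differently.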

Methods

chat(List<Map<String, String>> messages, {int maxTokens = 256}) → Future<String>: Generates a response with conversation history context.
override
dispose() → Future<void>: Releases the local model from memory to free up resources.
override
generate(String prompt, {int maxTokens = 256}) → Future<String>: Generates a response from a prompt (single turn, no conversation history).
override
initialize({String? modelPath}) → Future<bool>: Loads the local model.
override
noSuchMethod(Invocation invocation) → dynamic: Invoked when a nonexistent method or property is accessed.
inherited
toString() → String: A string representation of this object.
inherited
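
The method signatures above imply a lifecycle of initialize, then generate or chat, then dispose. The following is a minimal sketch of that order against a hypothetical stand-in, StubLocalProvider, which copies the documented signatures but only echoes its input. The path 'model.gguf', the 'role' and 'content' message keys, and the stub's handling of a missing modelPath are all assumptions, not documented behavior of LocalLlamaProvider.

```dart
// Hypothetical stand-in with the documented method signatures; the real
// LocalLlamaProvider runs GGUF inference, this stub only echoes its input.
class StubLocalProvider {
  bool _loaded = false;

  bool get isReady => _loaded;

  Future<bool> initialize({String? modelPath}) async {
    // Assumption: a null or empty path counts as a failed load in this stub.
    _loaded = modelPath != null && modelPath.isNotEmpty;
    return _loaded;
  }

  Future<String> generate(String prompt, {int maxTokens = 256}) async {
    if (!_loaded) throw StateError('call initialize() first');
    return 'echo: $prompt';
  }

  Future<String> chat(List<Map<String, String>> messages,
      {int maxTokens = 256}) async {
    if (!_loaded) throw StateError('call initialize() first');
    // Assumption: each message carries 'role' and 'content' keys.
    return 'echo: ${messages.last['content']}';
  }

  Future<void> dispose() async {
    _loaded = false;
  }
}

Future<void> main() async {
  final provider = StubLocalProvider();
  // 'model.gguf' is a placeholder path.
  final ok = await provider.initialize(modelPath: 'model.gguf');
  if (!ok) {
    print('model failed to load');
    return;
  }
  try {
    // Single turn, no history.
    print(await provider.generate('Hello', maxTokens: 64));
    // Multi-turn: the whole history is passed on every call.
    print(await provider.chat([
      {'role': 'user', 'content': 'Hi'},
      {'role': 'assistant', 'content': 'Hello!'},
      {'role': 'user', 'content': 'What is GGUF?'},
    ]));
  } finally {
    // Release the model even if inference throws.
    await provider.dispose();
  }
  print(provider.isReady); // false
}
```

Checking the bool returned by initialize and wrapping inference in try/finally keeps the model from staying resident in memory after an error.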

Operators

operator ==(Object other) → bool: The equality operator.
inherited