LocalLlamaProvider class
Local in-process GGUF LLM inference provider for Neomage.
Implements SAIA's saia.LocalAiService interface.
Integrates with the HardwareProfiler from neom_ollama to dynamically
profile device resources at runtime and optimize context size/thread settings.
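As a rough illustration of what tuning context size and thread settings from a device profile can look like, here is a minimal sketch. It does not reproduce the HardwareProfiler API from neom_ollama: `InferenceSettings`, `pickSettings`, and every threshold below are hypothetical names and values chosen for illustration only.

```dart
import 'dart:io' show Platform;
import 'dart:math' show max;

/// Hypothetical settings holder; not part of the documented API.
class InferenceSettings {
  final int threads;
  final int contextSize;
  const InferenceSettings(this.threads, this.contextSize);
}

/// Picks a thread count and context size from a device snapshot.
/// The thresholds are illustrative assumptions, not neom_ollama's rules.
InferenceSettings pickSettings({required int cores, required int freeRamMb}) {
  // Leave one core for the UI/event loop, but always use at least one thread.
  final threads = max(1, cores - 1);

  // Larger contexts need more memory, so scale the context with free RAM.
  final int contextSize;
  if (freeRamMb >= 8192) {
    contextSize = 4096;
  } else if (freeRamMb >= 4096) {
    contextSize = 2048;
  } else {
    contextSize = 1024;
  }
  return InferenceSettings(threads, contextSize);
}

void main() {
  // Platform.numberOfProcessors is a real dart:io API; the RAM figure is a
  // placeholder because this sketch does not measure memory itself.
  final settings =
      pickSettings(cores: Platform.numberOfProcessors, freeRamMb: 6000);
  print('threads=${settings.threads} contextSize=${settings.contextSize}');
}
```

Keeping the decision in a pure function that takes the measurements as arguments makes the heuristic easy to test without real hardware.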
Constructors
Properties
- hashCode → int
  The hash code for this object.
  no setter, inherited
- isReady → bool
  Whether a local model is loaded and ready for inference.
  no setter, override
- modelInfo → Map<String, dynamic>
  Returns metadata about the model (name, size, quantization).
  no setter, override
- runtimeType → Type
  A representation of the runtime type of the object.
  no setter, inherited
Methods
- chat(List<Map<String, String>> messages, {int maxTokens = 256}) → Future<String>
  Generates a response with conversation history context.
  override
- dispose() → Future<void>
  Releases the local model from memory to free up resources.
  override
- generate(String prompt, {int maxTokens = 256}) → Future<String>
  Generates a response from a prompt (single turn, no conversation history).
  override
- initialize({String? modelPath}) → Future<bool>
  Loads the local model.
  override
- noSuchMethod(Invocation invocation) → dynamic
  Invoked when a nonexistent method or property is accessed.
  inherited
- toString() → String
  A string representation of this object.
  inherited
Operators
- operator ==(Object other) → bool
  The equality operator.
  inherited
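
The members listed on this page suggest a load, use, release lifecycle. The sketch below walks through it with a stand-in class, `StubLocalProvider`, that only mirrors the documented signatures and echoes its input, so it runs without the Neomage package or a model file. The model path, the `role`/`content` message keys, and the choice to throw when no model is loaded are assumptions, not documented behavior of LocalLlamaProvider.

```dart
/// Stand-in mirroring the documented member signatures. The real
/// LocalLlamaProvider runs a GGUF model; this stub only echoes input.
class StubLocalProvider {
  bool _ready = false;
  String? _path;

  bool get isReady => _ready;

  // The real class reports name, size, and quantization; the stub has
  // only the path it was given.
  Map<String, dynamic> get modelInfo => {'name': _path ?? 'none'};

  Future<bool> initialize({String? modelPath}) async {
    _path = modelPath;
    _ready = true;
    return _ready;
  }

  Future<String> generate(String prompt, {int maxTokens = 256}) async {
    // Assumption: calling before initialize() is treated as an error.
    if (!_ready) throw StateError('Model not loaded');
    return 'echo: $prompt';
  }

  Future<String> chat(List<Map<String, String>> messages,
      {int maxTokens = 256}) async {
    if (!_ready) throw StateError('Model not loaded');
    // Assumption: each message map carries 'role' and 'content' keys.
    final last = messages.isEmpty ? '' : (messages.last['content'] ?? '');
    return 'echo: $last';
  }

  Future<void> dispose() async {
    _ready = false;
  }
}

Future<void> main() async {
  final provider = StubLocalProvider();

  // Hypothetical path; initialize() returns whether loading succeeded.
  final loaded = await provider.initialize(modelPath: 'models/example.gguf');
  if (!loaded || !provider.isReady) return;

  try {
    print(provider.modelInfo);
    // Single turn, no history.
    print(await provider.generate('Hello', maxTokens: 64));
    // Multi-turn: pass the whole conversation on every call.
    print(await provider.chat([
      {'role': 'user', 'content': 'Hi'},
      {'role': 'assistant', 'content': 'Hello!'},
      {'role': 'user', 'content': 'What can you do?'},
    ]));
  } finally {
    // Release the model even if inference throws.
    await provider.dispose();
  }
}
```

Wrapping inference in `try`/`finally` keeps `dispose()` on every exit path, which matters on memory-constrained devices where a loaded model can hold a large share of RAM.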