flutter_mind_local 0.1.0
flutter_mind_local: ^0.1.0

Published 2 days ago • Latest: 0.2.1

On-device AI inference for Flutter using llama.cpp via FFI. Local engine companion to flutter_mind.

0.1.0

Initial release.
LocalEngine — on-device LLM inference via llama.cpp, no API key or internet required.
LocalConfig — full configuration: model path, system prompt, temperature, context size, sampling parameters.
LocalModelType — built-in chat templates for Qwen, Llama 3, Gemma, Phi, Mistral, DeepSeek, with auto-detection from .gguf metadata.
Lifecycle events via onEvent: ModelLoadStarted, ModelReady, ModelFailed, InferenceStarted, InferenceCompleted, InferenceFailed, ContextCleared, ModelDisposed.
Android support via CMake + FetchContent (llama.cpp built from source).
iOS and macOS support via Swift Package Manager (llama.cpp built from source).
Model loading and inference run on background isolates — UI thread never blocked.
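
The background-isolate point above can be illustrated with plain Dart. This is a minimal sketch, not the package's implementation: `blockingGenerate` and `generateInBackground` are hypothetical names, and the busy loop merely stands in for a blocking native call such as llama.cpp inference. Only `Isolate.run` from `dart:isolate` (Dart 2.19 or later) is a real API here.

```dart
import 'dart:isolate';

/// Hypothetical stand-in for a blocking native call (for example, inference).
/// It burns some CPU and then returns a deterministic string.
String blockingGenerate(String prompt) {
  var spin = 0;
  for (var i = 0; i < 20000000; i++) {
    spin ^= i; // Simulated work; the value itself is not used.
  }
  return 'echo: $prompt';
}

/// Runs the blocking call on a short-lived background isolate, so the
/// calling isolate (the UI isolate in a Flutter app) stays responsive.
Future<String> generateInBackground(String prompt) {
  // The closure captures only `prompt`, a String, which can be sent
  // to the new isolate.
  return Isolate.run(() => blockingGenerate(prompt));
}

Future<void> main() async {
  final reply = await generateInBackground('Hello');
  print(reply);
}
```

`Isolate.run` spawns a fresh isolate per call, which keeps the sketch short. An engine that holds a loaded model in memory would more plausibly keep one long-lived isolate and exchange messages with it, but that design is an assumption and is not stated in these release notes.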