createModel abstract method

Future<InferenceModel> createModel({
  required ModelType modelType,
  ModelFileType fileType = ModelFileType.task,
  int maxTokens = 1024,
  PreferredBackend? preferredBackend,
  List<int>? loraRanks,
  int? maxNumImages,
  bool supportImage = false,
  bool supportAudio = false,
  bool? enableSpeculativeDecoding,
})

Creates and returns a new InferenceModel instance.

modelType — the model type to create.
fileType — model file format; defaults to ModelFileType.task.
maxTokens — maximum context length for the model.
preferredBackend — backend preference (e.g., CPU, GPU).
loraRanks — optional list of supported LoRA ranks.
maxNumImages — maximum number of images per request (multimodal models only).
supportImage — whether the model supports image input.
supportAudio — whether the model supports audio input (Gemma 3n E4B only).
enableSpeculativeDecoding — Multi-Token Prediction toggle for Gemma 4 E2B/E4B (LiteRT-LM v0.11.0+). null honors the model's default; true/false forces it on/off. Older .litertlm files without an MTP drafter ignore this flag at the SDK level.
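The call below is a minimal usage sketch. It assumes the flutter_gemma plugin's FlutterGemmaPlugin.instance entry point, a ModelType.gemmaIt variant, and an InferenceModel.close() method — check these names against the installed package version.

```dart
import 'package:flutter_gemma/flutter_gemma.dart';

Future<void> initModel() async {
  // Create a multimodal model instance, preferring the GPU backend.
  final InferenceModel model = await FlutterGemmaPlugin.instance.createModel(
    modelType: ModelType.gemmaIt,           // required: which model to load
    maxTokens: 2048,                        // larger context window than the default
    preferredBackend: PreferredBackend.gpu, // fall back handled by the SDK
    supportImage: true,                     // enable image input
    maxNumImages: 1,                        // at most one image per request
  );

  // ... run inference with the model ...

  // Release native resources when finished (method name assumed).
  await model.close();
}
```

Parameters left at their defaults (fileType, loraRanks, supportAudio, enableSpeculativeDecoding) can be omitted, as shown.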

Implementation

Future<InferenceModel> createModel({
  required ModelType modelType,
  ModelFileType fileType = ModelFileType.task,
  int maxTokens = 1024,
  PreferredBackend? preferredBackend,
  List<int>? loraRanks,
  int? maxNumImages, // Maximum number of images (multimodal models)
  bool supportImage = false, // Image support flag
  bool supportAudio = false, // Audio support flag (Gemma 3n E4B)
  bool? enableSpeculativeDecoding,
});