runStream method

Stream<SyniRuntimeStreamChunk> runStream(
  1. SyniRuntimeRequest request, {
  2. SyniPreset preset = SyniPreset.chat,
  3. int seed = 0,
})

Run inference and stream token chunks as they are generated.

Returns a Stream of SyniRuntimeStreamChunk:

V1 does not support mid-stream cancellation — let the stream complete or close it on the consumer side and ignore further deltas.

Implementation

Stream<SyniRuntimeStreamChunk> runStream(
  SyniRuntimeRequest request, {
  SyniPreset preset = SyniPreset.chat,
  int seed = 0,
}) async* {
  await initialize();
  if (_modelPath == null) {
    throw SyniRuntimeError(
      'Model not loaded. Call loadModel() or downloadModel() first.',
    );
  }
  yield* _worker!.runStream(
    preset.value,
    seed,
    jsonEncode(request.toJson()),
  );
}