LLM Dart Library


A modular Dart library for AI provider interactions. It provides a unified interface across providers and uses Dio for HTTP requests.

Quick Navigation

  • Get started → Quick Start
  • Build a chatbot → Chatbot example
  • Compare providers → Provider comparison
  • Use streaming → Streaming example
  • Call functions → Tool calling
  • Search the web → Web search
  • Generate embeddings → Embeddings
  • Moderate content → Content moderation
  • Access AI thinking → Reasoning models
  • Use MCP tools → MCP integration
  • Use local models → Ollama examples
  • See a production app → Yumcha

Features

  • Multi-provider support: OpenAI, Anthropic, Google, DeepSeek, Groq, Ollama, xAI, ElevenLabs
  • OpenAI Responses API: Stateful conversations with built-in tools (web search, file search, computer use)
  • Thinking process access: Model reasoning for Claude, DeepSeek, Gemini
  • Unified capabilities: Chat, streaming, tools, audio, images, files, web search, embeddings
  • MCP integration: Model Context Protocol for external tool access
  • Content moderation: Built-in safety and content filtering
  • Type-safe building: Compile-time capability validation
  • Builder pattern: Fluent configuration API
  • Production ready: Error handling, retry logic, monitoring

Supported Providers

Each provider implements a subset of these capabilities: chat, streaming, tools, thinking, audio, image, files, web search, embeddings, and moderation.

  • OpenAI: GPT models, DALL-E, o1 reasoning
  • Anthropic 🧠: Claude models with thinking
  • Google 🧠: Gemini models with reasoning
  • DeepSeek 🧠: DeepSeek reasoning models
  • Groq: Ultra-fast inference
  • Ollama: Local models, privacy-focused
  • xAI: Grok models with web search
  • ElevenLabs: Advanced voice synthesis

  • 🧠 Thinking Process Support: Access to model's reasoning and thought processes
  • 🎵 Audio Support: Text-to-speech, speech-to-text, and audio processing
  • 🖼️ Image Support: Image generation, editing, and multi-modal processing
  • 📁 File Support: File upload, management, and processing capabilities
  • 🔍 Web Search: Real-time web search across multiple providers
  • 🧮 Embeddings: Text embeddings for semantic search and similarity

Installation

Add this to your pubspec.yaml:

dependencies:
  llm_dart: ^0.8.0

Then run:

dart pub get

Or install directly using:

dart pub add llm_dart

Quick Start

Basic Usage

import 'package:llm_dart/llm_dart.dart';

void main() async {
  // Method 1: Using the new ai() builder with provider methods
  final provider = await ai()
      .openai()
      .apiKey('your-api-key')
      .model('gpt-4')
      .temperature(0.7)
      .build();

  // Method 2: Using provider() with string ID (extensible)
  final provider2 = await ai()
      .provider('openai')
      .apiKey('your-api-key')
      .model('gpt-4')
      .temperature(0.7)
      .build();

  // Method 3: Using convenience function
  final directProvider = await createProvider(
    providerId: 'openai',
    apiKey: 'your-api-key',
    model: 'gpt-4',
    temperature: 0.7,
  );

  // Simple chat
  final messages = [ChatMessage.user('Hello, world!')];
  final response = await provider.chat(messages);
  print(response.text);

  // Access thinking process (for supported models)
  if (response.thinking != null) {
    print('Model thinking: ${response.thinking}');
  }
}

Streaming with DeepSeek Reasoning

import 'dart:io';
import 'package:llm_dart/llm_dart.dart';

// Create DeepSeek provider for streaming with thinking
final provider = await ai()
    .deepseek()
    .apiKey('your-deepseek-key')
    .model('deepseek-reasoner')
    .temperature(0.7)
    .build();

final messages = [ChatMessage.user('What is 15 + 27? Show your work.')];

// Stream with real-time thinking process
await for (final event in provider.chatStream(messages)) {
  switch (event) {
    case ThinkingDeltaEvent(delta: final delta):
      // Show AI's thinking process in gray
      stdout.write('\x1B[90m$delta\x1B[0m');
      break;
    case TextDeltaEvent(delta: final delta):
      // Show final answer
      stdout.write(delta);
      break;
    case CompletionEvent(response: final response):
      print('\n✅ Completed');
      if (response.usage != null) {
        print('Tokens: ${response.usage!.totalTokens}');
      }
      break;
    case ErrorEvent(error: final error):
      print('Error: $error');
      break;
  }
}

🧠 Thinking Process Access

Access the model's internal reasoning and thought processes:

// Claude with thinking
final claudeProvider = await ai()
    .anthropic()
    .apiKey('your-anthropic-key')
    .model('claude-sonnet-4-20250514')
    .build();

final messages = [
  ChatMessage.user('Solve this step by step: What is 15% of 240?')
];

final response = await claudeProvider.chat(messages);

// Access the final answer
print('Answer: ${response.text}');

// Access the thinking process
if (response.thinking != null) {
  print('Claude\'s thinking process:');
  print(response.thinking);
}

// DeepSeek with reasoning
final deepseekProvider = await ai()
    .deepseek()
    .apiKey('your-deepseek-key')
    .model('deepseek-reasoner')
    .temperature(0.7)
    .build();

final reasoningResponse = await deepseekProvider.chat(messages);
print('DeepSeek reasoning: ${reasoningResponse.thinking}');

Web Search

// Enable web search across providers
final provider = await ai()
    .xai()
    .apiKey('your-xai-key')
    .model('grok-3')
    .enableWebSearch()
    .build();

final response = await provider.chat([
  ChatMessage.user('What are the latest AI developments this week?')
]);
print(response.text);

// Provider-specific configurations
final anthropicProvider = await ai()
    .anthropic()
    .apiKey('your-key')
    .webSearch(
      maxUses: 3,
      allowedDomains: ['wikipedia.org', 'arxiv.org'],
      location: WebSearchLocation.sanFrancisco(),
    )
    .build();

Embeddings

// Generate embeddings for semantic search
final provider = await ai()
    .openai()
    .apiKey('your-key')
    .buildEmbedding();

final embeddings = await provider.embed([
  'Machine learning fundamentals',
  'Deep learning neural networks',
  'Natural language processing',
]);

// Use embeddings for similarity search
final queryEmbedding = await provider.embed(['AI research']);
// Calculate cosine similarity with your embeddings
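
The similarity math itself is plain Dart and needs nothing from this library. A minimal cosine-similarity helper (standard vector math, shown here as a sketch):

import 'dart:math';

/// Cosine similarity between two equal-length embedding vectors.
double cosineSimilarity(List<double> a, List<double> b) {
  var dot = 0.0, normA = 0.0, normB = 0.0;
  for (var i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (sqrt(normA) * sqrt(normB));
}

// Rank the stored embeddings against the query:
// embeddings.map((e) => cosineSimilarity(queryEmbedding.first, e));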

Content Moderation

// Moderate content for safety
final provider = await ai()
    .openai()
    .apiKey('your-key')
    .buildModeration();

final result = await provider.moderate(
  ModerationRequest(input: 'User generated content to check')
);

if (result.results.first.flagged) {
  print('Content flagged for review');
} else {
  print('Content is safe');
}

Tool Calling

final tools = [
  Tool.function(
    name: 'get_weather',
    description: 'Get weather for a location',
    parameters: ParametersSchema(
      schemaType: 'object',
      properties: {
        'location': ParameterProperty(
          propertyType: 'string',
          description: 'City name',
        ),
      },
      required: ['location'],
    ),
  ),
];

final response = await provider.chatWithTools(messages, tools);
if (response.toolCalls != null) {
  for (final call in response.toolCalls!) {
    print('Tool: ${call.function.name}');
    print('Args: ${call.function.arguments}');
  }
}
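
To act on a returned tool call, decode its arguments and dispatch to your own implementation. A hedged sketch: getWeather is a hypothetical local handler, and the snippet assumes call.function.arguments is a JSON-encoded string, as in OpenAI-style APIs:

import 'dart:convert';

// Hypothetical local implementation of the get_weather tool above.
String getWeather(String location) => 'Sunny, 22°C in $location';

for (final call in response.toolCalls ?? const []) {
  if (call.function.name == 'get_weather') {
    // Assumption: arguments is a JSON-encoded string.
    final args =
        jsonDecode(call.function.arguments) as Map<String, dynamic>;
    print(getWeather(args['location'] as String));
  }
}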

Provider Examples

OpenAI

final provider = await createProvider(
  providerId: 'openai',
  apiKey: 'sk-...',
  model: 'gpt-4',
  temperature: 0.7,
  extensions: {'reasoningEffort': 'medium'}, // For reasoning models
);

Responses API (Stateful Conversations)

OpenAI's new Responses API provides stateful conversation management with built-in tools:

final provider = await ai()
    .openai((openai) => openai
        .useResponsesAPI()
        .webSearchTool()
        .fileSearchTool(vectorStoreIds: ['vs_123']))
    .apiKey('your-key')
    .model('gpt-4o')
    .build();

// Cast to access stateful features
final responsesProvider = provider as OpenAIProvider;
final responses = responsesProvider.responses!;

// Stateful conversation with automatic context preservation
final response1 = await responses.chat([
  ChatMessage.user('My name is Alice. Tell me about quantum computing'),
]);

final responseId = (response1 as OpenAIResponsesResponse).responseId;
final response2 = await responses.continueConversation(responseId!, [
  ChatMessage.user('Remember my name and explain it simply'),
]);

// Background processing for long tasks
final backgroundTask = await responses.chatWithToolsBackground([
  ChatMessage.user('Write a detailed research report'),
], null);

// Response lifecycle management
await responses.getResponse('resp_123');
await responses.deleteResponse('resp_123');
await responses.cancelResponse('resp_123');

Anthropic (with Thinking Process)

final provider = await ai()
    .anthropic()
    .apiKey('sk-ant-...')
    .model('claude-sonnet-4-20250514')
    .build();

final response = await provider.chat([
  ChatMessage.user('Explain quantum computing step by step')
]);

// Access Claude's thinking process
print('Final answer: ${response.text}');
if (response.thinking != null) {
  print('Claude\'s reasoning: ${response.thinking}');
}

DeepSeek (with Reasoning)

final provider = await ai()
    .deepseek()
    .apiKey('your-deepseek-key')
    .model('deepseek-reasoner')
    .build();

final response = await provider.chat([
  ChatMessage.user('Solve this logic puzzle step by step')
]);

// Access DeepSeek's reasoning process
print('Solution: ${response.text}');
if (response.thinking != null) {
  print('DeepSeek\'s reasoning: ${response.thinking}');
}

Ollama

final provider = ollama(
  baseUrl: 'http://localhost:11434',
  model: 'llama3.2',
  // No API key needed for local Ollama
);

xAI (with Web Search)

final provider = await ai()
    .xai()
    .apiKey('your-xai-key')
    .model('grok-3')
    .enableWebSearch()
    .build();

// Real-time web search
final response = await provider.chat([
  ChatMessage.user('What is the current stock price of NVIDIA?')
]);

// News search with date filtering
final newsProvider = await ai()
    .xai()
    .apiKey('your-xai-key')
    .newsSearch(
      maxResults: 5,
      fromDate: '2024-12-01',
    )
    .build();

Google (with Embeddings)

final provider = await ai()
    .google()
    .apiKey('your-google-key')
    .model('gemini-2.0-flash-exp')
    .buildEmbedding();

final embeddings = await provider.embed([
  'Text to embed for semantic search',
  'Another piece of text',
]);

// Use for similarity search, clustering, etc.

ElevenLabs (Audio Processing)

import 'dart:io';
import 'package:llm_dart/llm_dart.dart';

// Use buildAudio() for type-safe audio capability building
final audioProvider = await ai()
    .elevenlabs()
    .apiKey('your-elevenlabs-key')
    .voiceId('JBFqnCBsd6RMkjVDRZzb') // George voice
    .stability(0.7)
    .similarityBoost(0.9)
    .style(0.1)
    .buildAudio(); // Type-safe audio capability building

// Direct usage without type casting
final features = audioProvider.supportedFeatures;
print('Supports TTS: ${features.contains(AudioFeature.textToSpeech)}');

// Text to speech with advanced options
final ttsResponse = await audioProvider.textToSpeech(TTSRequest(
  text: 'Hello world! This is ElevenLabs speaking.',
  voice: 'JBFqnCBsd6RMkjVDRZzb',
  model: 'eleven_multilingual_v2',
  format: 'mp3_44100_128',
  includeTimestamps: true,
));
await File('output.mp3').writeAsBytes(ttsResponse.audioData);

// Speech to text (if supported)
if (features.contains(AudioFeature.speechToText)) {
  final audioData = await File('input.mp3').readAsBytes();
  final sttResponse = await audioProvider.speechToText(
    STTRequest.fromAudio(audioData, model: 'scribe_v1')
  );
  print(sttResponse.text);
}

// Convenience methods
final quickSpeech = await audioProvider.speech('Quick TTS');
final quickTranscription = await audioProvider.transcribeFile('audio.mp3');

Error Handling

try {
  final response = await provider.chatWithTools(messages, null);
  print(response.text);
} on AuthError catch (e) {
  print('Authentication failed: $e');
} on ProviderError catch (e) {
  print('Provider error: $e');
} on HttpError catch (e) {
  print('Network error: $e');
} catch (e) {
  print('Unexpected error: $e');
}
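
The library's error types also make it easy to retry only what is transient. A generic sketch (withRetry is not a library API; it retries HttpError with exponential backoff):

Future<T> withRetry<T>(
  Future<T> Function() action, {
  int maxAttempts = 3,
}) async {
  var delay = const Duration(milliseconds: 500);
  for (var attempt = 1; ; attempt++) {
    try {
      return await action();
    } on HttpError {
      if (attempt >= maxAttempts) rethrow;
      await Future.delayed(delay);
      delay *= 2; // exponential backoff between attempts
    }
  }
}

// Usage:
final response = await withRetry(() => provider.chat(messages));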

Architecture

Capability-Based Design

The library uses a capability-based interface design instead of monolithic "god interfaces":

// Core capabilities
abstract class ChatCapability {
  Future<ChatResponse> chat(List<ChatMessage> messages);
  Stream<ChatStreamEvent> chatStream(List<ChatMessage> messages);
}

abstract class EmbeddingCapability {
  Future<List<List<double>>> embed(List<String> input);
}

abstract class WebSearchCapability {
  // Web search is integrated into chat - no separate methods needed
  // Providers handle search automatically when enabled
}

abstract class ModerationCapability {
  Future<ModerationResponse> moderate(ModerationRequest request);
}

// Providers implement only the capabilities they support
class OpenAIProvider implements
    ChatCapability,
    EmbeddingCapability,
    WebSearchCapability,
    ModerationCapability {
  // Implementation
}

Type-Safe Capability Building

The library provides capability factory methods for compile-time type safety:

// Old approach - runtime type casting
final provider = await ai().openai().apiKey(apiKey).build();
if (provider is! AudioCapability) {
  throw Exception('Audio not supported');
}
final audioProvider = provider as AudioCapability; // Runtime cast!

// New approach - compile-time type safety
final audioProvider = await ai().openai().apiKey(apiKey).buildAudio();
// Direct usage without type casting - guaranteed AudioCapability!

// Available factory methods:
final chatProvider = await ai().openai().build(); // Returns ChatCapability
final audioProvider = await ai().openai().buildAudio();
final imageProvider = await ai().openai().buildImageGeneration();
final embeddingProvider = await ai().openai().buildEmbedding();
final fileProvider = await ai().openai().buildFileManagement();
final moderationProvider = await ai().openai().buildModeration();
final assistantProvider = await ai().openai().buildAssistant();
final modelProvider = await ai().openai().buildModelListing();

// Web search is enabled through configuration, not a separate capability
final webSearchProvider = await ai().openai().enableWebSearch().build();

// Clear error messages for unsupported capabilities
try {
  final audioProvider = await ai().groq().buildAudio(); // Groq doesn't support audio
} catch (e) {
  print(e); // UnsupportedCapabilityError: Provider "groq" does not support audio capabilities. Supported providers: OpenAI, ElevenLabs
}

Provider Registry

The library includes an extensible provider registry system:

// Check available providers
final providers = LLMProviderRegistry.getRegisteredProviders();
print('Available: $providers'); // ['openai', 'anthropic', ...]

// Check capabilities
final supportsChat = LLMProviderRegistry.supportsCapability('openai', LLMCapability.chat);
print('OpenAI supports chat: $supportsChat'); // true

// Create providers dynamically
final provider = LLMProviderRegistry.createProvider('openai', config);

Custom Providers

You can register custom providers:

// Create a custom provider factory
class MyCustomProviderFactory implements LLMProviderFactory<ChatCapability> {
  @override
  String get providerId => 'my_custom';

  @override
  Set<LLMCapability> get supportedCapabilities => {LLMCapability.chat};

  @override
  ChatCapability create(LLMConfig config) => MyCustomProvider(config);

  // ... other methods
}

// Register it
LLMProviderRegistry.register(MyCustomProviderFactory());

// Use it
final provider = await ai().provider('my_custom').build();

Configuration

All providers support common configuration options (a combined sketch follows the list):

  • apiKey: API key for authentication
  • baseUrl: Custom API endpoint
  • model: Model name to use
  • temperature: Sampling temperature (0.0-1.0)
  • maxTokens: Maximum tokens to generate
  • systemPrompt: System message
  • timeout: Request timeout
  • topP, topK: Sampling parameters
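
A combined sketch of these options on the builder. It assumes each option maps to a same-named builder method; verify the exact names against the API docs:

final provider = await ai()
    .openai()
    .apiKey('your-key')
    .baseUrl('https://api.openai.com/v1/') // assumed method name
    .model('gpt-4')
    .systemPrompt('You are a helpful assistant.') // assumed method name
    .temperature(0.7)
    .maxTokens(1024) // assumed method name
    .timeout(const Duration(seconds: 60)) // assumed method name
    .build();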

Provider-Specific Extensions

Use the extension system for provider-specific features:

final provider = await ai()
    .openai()
    .apiKey('your-key')
    .model('gpt-4')
    .reasoningEffort(ReasoningEffort.high)  // OpenAI-specific
    .extension('voice', 'alloy')           // OpenAI TTS voice
    .build();

Examples

See the example directory for comprehensive examples:

  • Getting Started: quick_start.dart, provider_comparison.dart
  • Core Features: chat, streaming, tool calling, audio, and image examples
  • Advanced: reasoning_models.dart, multi_modal.dart, semantic_search.dart
  • MCP Integration: MCP examples (Model Context Protocol for external tool access)
  • Use Cases: chatbot.dart, cli_tool.dart, web_service.dart, multimodal_app.dart
  • Production App: Yumcha, a cross-platform AI chat app built with LLM Dart

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

See Contributing Guide for details.

Thanks

This project exists thanks to all the people who have contributed.

Acknowledgments

This library is inspired by the Rust graniet/llm library and follows similar patterns adapted for Dart.

License

This project is licensed under the MIT License - see the LICENSE file for details.
