Multimodal
Send images and documents alongside text in your LLM requests.
Gateway accepts multimodal content natively — include image or document content blocks in your messages and Gateway routes to a capable model. No configuration needed. Gateway automatically detects which models support each modality and translates content to the provider’s format.
Supported content types
Quick example
Model compatibility
Gateway auto-detects multimodal capabilities from model metadata. Use GET /v1/models to check a model’s capabilities.input array and capabilities.vision field.
Context compression automatically protects multimodal messages. When trimming is needed, text-only messages are removed first — your images and documents are preserved.