v2026.5.1: New image models + Gemini catalog sync

Three new image-generation families are live — OpenAI gpt-image-2, Google Imagen 4 (Fast / Standard / Ultra), and Nano Banana Pro. Gemini catalog updated against Google's live model list. Bug fix: gateway no longer leaks OpenAI-only fields to Gemini.

May 13, 2026

At a glance

Category	What's new
OpenAI image generation	`gpt-image-2`, `gpt-image-2-2026-04-21`
Google Imagen 4	`imagen-4.0-fast-generate-001`, `imagen-4.0-generate-001`, `imagen-4.0-ultra-generate-001`
Google Gemini image	`nano-banana-pro-preview`
Google Gemini text / embeds	`gemini-3.1-flash-lite` (GA), `gemini-embedding-2` (GA)
Legacy Gemini (deprecated)	`gemini-2.0-flash`, `gemini-2.0-flash-lite` — Google shutdown 2026-06-01
Bug fix	`modalities` field no longer forwarded to Gemini

All new models work through the standard POST /v1/chat/completions endpoint. No SDK or client changes required.

OpenAI — gpt-image-2

OpenAI's flagship image model, accepting text and image inputs and returning image output. Available immediately on osmAPI under both its rolling alias and dated snapshot.

Model	Released
`gpt-image-2`	2026-04-21
`gpt-image-2-2026-04-21`	2026-04-21

Pricing — per 1M tokens (both models identical):

Token type	Standard	Cached
Text input	$5.00	$1.25
Image input	$8.00	—
Image output	$30.00	—

Tip: include an image_url part in the user message to edit an existing image instead of generating from scratch.

Google Imagen 4

Three quality tiers of Google's Imagen 4, billed per generated image (no token pricing).

Model	Price per image	Best for
`imagen-4.0-fast-generate-001`	$0.02	High-throughput, lowest cost
`imagen-4.0-generate-001`	$0.04	Standard quality, balanced cost
`imagen-4.0-ultra-generate-001`	$0.06	Highest fidelity, premium quality

Imagen uses Google's :predict endpoint with a different request/response shape than Gemini chat. The gateway handles the translation, so callers continue to use POST /v1/chat/completions with model: "imagen-4.0-..." and receive a standard OpenAI-style response.

Google Gemini — new entries

Image

Model	Pricing	Notes
`nano-banana-pro-preview`	Text in $2/1M · Text out $12/1M · Image out $120/1M	Google's marketing alias for `gemini-3-pro-image-preview` — same model

Text & embeddings

Model	Input (per 1M)	Output (per 1M)	Notes
`gemini-3.1-flash-lite`	$0.25	$1.50	GA promotion of the preview
`gemini-embedding-2`	$0.20	—	GA promotion of the preview

Deprecated by Google (action required by 2026-06-01)

Model	Input (per 1M)	Output (per 1M)	Shutdown date
`gemini-2.0-flash`	$0.10	$0.40	2026-06-01
`gemini-2.0-flash-lite`	$0.075	$0.30	2026-06-01

Google will stop serving these on 2026-06-01. Both models still respond today, but the gateway now flags them as deprecated and will auto-filter them from routing after the cutoff. Migrate to gemini-2.5-flash or gemini-3.1-flash-lite for similar pricing and behavior.

Bug fix — Gemini image 400 errors

Before this release, a small percentage of requests to Gemini image models (e.g. gemini-3-pro-image-preview) failed with:

HTTP 400 — Invalid JSON payload received. Unknown name "modalities": Cannot find field.

HTTP 400 — Invalid JSON payload received. Unknown name "modalities": Cannot find field.

The root cause was the gateway forwarding the OpenAI-style modalities request field to every provider, including Google AI Studio. Gemini's REST schema doesn't accept that field.

The fix scopes the forwarding to OpenAI only. gpt-audio models still receive modalities and audio parameters as before; Gemini requests are now clean.

Compatibility

No caller changes required. All new code paths are scoped to imageGenerations: true + the matching provider, so existing flows are untouched:

Gemini chat-image (gemini-2.5-flash-image, gemini-3-pro-image-preview, gemini-3.1-flash-image-preview) — unchanged
Alibaba qwen-image-* — unchanged
Z.AI cogview-* — unchanged

All six new image models were verified live against the upstream providers before release.