Veo 3.1 API

Text to VideoImage to VideoVideo to Video

Google's flagship video model with native audio. Generate, extend, and upscale cinematic 1080p/4K content.

·Features·FAQ

Documentation

Model

Output

Your generated video will appear here

Features

What Veo 3.1 API offers

Google Veo 3.1 with eight API variants: Lite, Lite Relaxed, Fast, Fast Relaxed, Quality, Extend, Upscale 1080p, and Upscale 4K

Lite, Fast, and Quality tiers for new clips; Lite Relaxed and Fast Relaxed are lower priority and cost less per video

Text-to-video from prompts; image-to-video with optional start and end frames (frame mode)

Fast and Fast Relaxed add reference-image mode (up to three images) and optional voice presets; Lite and Quality use frame mode only

Extend continues a completed generation using task ID, prompt, and a chosen base model (Lite through Quality)

Upscale turns in an existing task ID and targets 1080p or 4K (Veo 3.1 Upscale 1080p and Veo 3.1 Upscale 4K)

Aspect ratios 16:9 and 9:16; optional seed on new clips and on Extend when supported

REST API with async generation and polling

Use cases

Built for

Primary

Product marketing: short hero clips from a storyboard frame pair or reference look

Social verticals: 9:16 promos with Fast reference images aligned to brand shots

Story iteration: start on Lite or Fast Relaxed, then branch in Quality for a final pass

Longer narrative: generate a base clip, then use Extend to continue the same storyline

Delivery polish: Upscale 1080p for web, Upscale 4K for premium screens or edit timelines

FAQ

About Veo 3.1 API

Lite and Quality focus on standard frame-mode generation with different quality or latency tradeoffs. Fast adds reference images and optional voice. Lite Relaxed and Fast Relaxed use the same capabilities as Lite and Fast but run at lower priority for a lower listed price per video.

The Veo 3.1 Extend variant takes a completed task ID plus a prompt and a selected model (Lite, Lite Relaxed, Fast, Fast Relaxed, or Quality) to continue that video. Pricing is listed as Veo 3.1 Extend from $0.15 to $0.60 per video depending on options.

They take a finished generation task ID and return an upscaled file at 1080p or 4K. Listed prices are Veo 3.1 Upscale 1080p at $0.05 per video and Veo 3.1 Upscale 4K at $0.50 per video.

Text-to-video from a prompt. Image-to-video via start and end frames, or on Fast and Fast Relaxed via up to three reference images with optional voice. Video continuation via Extend using a prior task ID.

On the public price list, generation is Veo 3.1 Lite at $0.15, Veo 3.1 Lite Relaxed at $0.075, Veo 3.1 Fast at $0.30, Veo 3.1 Fast Relaxed at $0.15, and Veo 3.1 Quality at $0.60 per video, each with per-video units. Extend and Upscale lines are listed separately as Veo 3.1 Extend and Veo 3.1 Upscale 1080p / Veo 3.1 Upscale 4K.

Pass an optional numeric seed on generation or extend when the endpoint accepts it, using the same prompt and media so reruns stay aligned.