AI Model Playground

Access 50+ AI models for video, image, and audio generation

Some parameters or model details may differ from the actual documentation. This playground is meant for quick testing. Check the docs for the most current info.

Showing all 53 models

Video Generation

25 models

Veo 3.1

Google

Google's newest video model. Fast and Quality modes with text-to-video and image-to-video support.

Text to Video, Image to Video, Video to Video

Veo 3.1 Fast, Veo 3.1 Fast Relaxed, Veo 3.1 Quality, Veo 3.1 Lite, Veo 3.1 Lite Relaxed, Veo 3.1 Extend, Veo 3.1 Upscale

SeeDance 2.0

ByteDance

ByteDance's latest video model with omni-reference, first & last frame, and text-to-video at 720p. Available in Pro and Fast variants.

Text to Video, Image to Video, Reference to Video

SeeDance 2.0 Pro, SeeDance 2.0 Fast

Kling 3.0

Kling

Latest Kling model with multi-shot support, 3-15s duration, and sound generation.

Text to Video, Image to Video

Grok Imagine

xAI

xAI's multimodal generation platform. Create videos, images, and apply video edits with text guidance.

Text to Video, Image to Video, Text to Image, Image to Image, Video to Video

Grok Imagine Video, Grok Imagine Image, Grok Imagine Video Edit

Kling O1

Kling

Kling's reasoning model for complex prompts. Superior understanding and creative interpretation.

Text to Video, Image to Video

MiniMax Hailuo

MiniMax

Flexible video generation with motion control. Customizable duration and resolution options.

Text to Video, Image to Video

Kling 2.5 Turbo

Kling

Fast Kling model optimized for speed. 5s and 10s durations with multiple aspect ratios.

Image to Video

Kling 2.1

Kling

Stable Kling model with professional and standard quality tiers. Reliable results.

Image to Video

Kling 2.1 Master

Kling

Premium Kling model for highest quality output. Best for professional productions.

Image to Video

Higgsfield

Cinematic videos with 50+ camera effects. Dolly shots, zooms, and drone perspectives.

Image to Video

Kling 2.6 Motion Control

Kling

Transfer motion from a reference video onto a character image. Perfect for dance and movement replication.

Image to Video, Reference to Video

Kling 3.0 Motion Control

Kling

Transfer motion from reference videos onto characters with high facial consistency. Upgraded from v2.6.

Image to Video, Reference to Video

Higgsfield Cinematic Studio Video

Higgsfield

Professional cinematic videos with camera movements. AI sound effects and slow motion support.

Image to Video

Kling 2.6

Kling

Latest Kling model with enhanced dynamics and aesthetics. Supports text-to-video and image-to-video.

Text to Video, Image to Video

SeeDance 1.5 Pro

ByteDance

Professional videos with first & last frame support at 720p/1080p, 5-12 seconds.

Image to Video

Wan 2.6

Alibaba

Wan AI's latest model with extended 15s duration, multi-shot, and audio support. Includes Flash variant.

Text to Video, Image to Video, Reference to Video

Wan 2.6, Wan 2.6 Flash

SeeDance 1.0

ByteDance

ByteDance video generation with text-to-video, first & last frame, and multi-frame keypoint control at 720p/1080p.

Text to Video, Image to Video

SeeDance 1.0 Pro, SeeDance 1.0 Mini, SeeDance 1.0 Fast

Wan 2.5

Alibaba

Wan AI video generation with text-to-video and image-to-video capabilities.

Text to Video, Image to Video

Wan 2.2

Alibaba

Wan AI with advanced frame and quality step control. Includes Fast variant with auto-calculated frames.

Text to Video, Image to Video

Wan 2.2, Wan 2.2 Fast

Topaz Upscale

Topaz Labs

AI video upscaling to 4K with frame interpolation, slow motion, and enhancement presets.

Video to Video

Kling 3.0 Omni

Kling

Kling 3.0 Omni image-to-video with sound support. 5-10s at 720p-1080p.

Image to Video

Kling 3.0 Omni Edit

Kling

AI video editing with Kling 3.0 Omni. Edit videos using text prompts and reference images.

Video to Video

Kling O1 Edit

Kling

Reasoning-powered video editing. Transform videos with complex edit instructions and reference images.

Video to Video

Wan 2.7

Alibaba

Advanced video generation and editing with text-to-video, image-to-video, reference-to-video, and video editing modes.

Text to Video, Image to Video, Reference to Video, Video to Video

Wan 2.7, Wan 2.7 Edit

Kling 2.5

Kling

Kling 2.5 video generation with end frame support. Image-to-video with standard and pro modes.

Image to Video

Image Generation

25 models

Wan 2.7 Pro

Alibaba

Highest quality AI images with thinking mode, up to 4K resolution, and editing with up to 9 references.

Text to Image, Image to Image

Nano Banana 2

Google

Next-generation Google Gemini image model with up to 4K resolution and reference image support.

Text to Image, Image to Image

Flux.2

Black Forest Labs

Black Forest Labs image model. Supports 3 modes: Flex, Pro, and Max with reference images.

Text to Image

Nano Banana Pro

Google

Gemini 3 Preview with 1K/2K/4K resolution control. Advanced image generation capabilities.

Text to Image

Nano Banana

Google

Google's Gemini 2.5 Flash image model. Fast, high-quality image generation and editing.

Text to Image, Image to Image

Higgsfield Cinematic Studio Video

Higgsfield

Professional cinematic videos with camera movements. AI sound effects and slow motion support.

Text to Image

GPT Image 1.5

OpenAI

Enhanced OpenAI image model with improved quality. Better detail and prompt adherence.

Text to Image, Image to Image

GPT Image 1

OpenAI

OpenAI's photorealistic model. Excellent text rendering and complex prompt understanding.

Text to Image, Image to Image

Flux.2 Klein 9B

Black Forest Labs

Lightweight Flux.2 model for fast image generation at $0.0105/MP.

Text to Image

Flux.2 Klein 4B

Black Forest Labs

Smallest and fastest Flux.2 model for rapid image generation at $0.0098/MP.

Text to Image

SeeDream 4.6

ByteDance

ByteDance AI image generation with up to 6 reference images and 4K resolution.

Text to Image

SeeDream 4.0

ByteDance

ByteDance AI image generation with up to 8 reference images support.

Text to Image

SeeDream 4.1

ByteDance

ByteDance AI image generation with up to 6 reference images and 4K resolution.

Text to Image

SeeDream 4.5

ByteDance

Enhanced ByteDance model with 14 reference images and 4K resolution.

Text to Image

SeeDream 5.0 Lite

ByteDance

ByteDance lightweight image model with 14 reference images and 3K resolution.

Text to Image

Kling O1

Kling

Kling's reasoning model for complex prompts. Superior understanding and creative interpretation.

Text to Image

Kling 3.0 Omni

Kling

Kling 3.0 Omni image-to-video with sound support. 5-10s at 720p-1080p.

Text to Image

Topaz Upscale

Topaz Labs

AI image upscaling up to 16x with face enhancement and optimized presets for photos, text, and anime.

Image to Image

Qwen Image 2.0

Alibaba

Alibaba's latest image model with text rendering, image editing, and multi-image fusion. Available in Pro and Standard variants.

Text to Image, Image to Image

Qwen Image 2.0 Pro, Qwen Image 2.0

Qwen Image

Alibaba

Alibaba image generation with multiple tiers. Max for photorealism, Plus for artistic styles, and base for general use.

Text to Image, Image to Image

Qwen Image Max, Qwen Image Plus, Qwen Image Base

Z-Image Turbo

Alibaba

Lightweight fast text-to-image model with Chinese and English text rendering.

Text to Image

Wan 2.7

Alibaba

Fast AI image generation with thinking mode and editing support. Max 2K resolution.

Text to Image, Image to Image

Wan 2.6

Alibaba

AI image generation with style transfer and editing using 1-4 reference images.

Text to Image, Image to Image

Wan 2.5

Alibaba

AI image generation with editing support using 1-3 reference images.

Text to Image, Image to Image

Wan 2.2

Alibaba

Text-to-image generation with negative prompt support. Includes Flash variant for fastest generation.

Text to Image

Wan 2.2, Wan 2.2 Flash

Audio Generation

3 models

Suno

Complete AI audio platform. Generate music, extend songs, create covers, add vocals, extract stems, generate lyrics, and more.

Music, Sound Effects, Text to Speech

Suno Music, Suno Extend, Suno Cover, Suno Add Vocals, Suno Add Instrumental, Suno Extract Stems, Suno Lyrics, Suno WAV Export

Higgsfield TTS

Higgsfield

Text-to-speech with customizable voices. Adjustable speed, style, and stability.

Text to Speech

ElevenLabs

Complete audio AI platform. Text-to-speech, multi-voice dialogue, sound effects, vocal isolation, and speech-to-text.

Text to Speech, Sound Effects, Music, Speech to Text

ElevenLabs TTS, ElevenLabs Dialogue, ElevenLabs Sound Effects, ElevenLabs Audio Isolation, ElevenLabs Speech-to-Text