AI Model Directory — Video, Image & Audio Generation
Showing all 47 models
Video Generation
23
Up to 9 image/video/audio references, T2V or first/last frame, 4–15s at 720p or 1080p.
T2V or I2V up to 10s at 480p or 720p, 5 aspect ratios, with custom/spicy/fun/normal presets.
Grok Imagine
xAI
- Text to Video
- Image to Video
- Text to Image
- Image to Image
- Video to Video

T2V or I2V at 720p Standard or 1080p Pro, 5 or 10s, with optional end frame and audio.
Up to 7 references plus elements, video reference/transform, multi-shot 2–6, 4K mode.
Kling 3.0 Omni
Kling
- Text to Video
- Image to Video
- Reference to Video
- Video to Video

T2V, I2V, or R2V with up to 9 reference images, 720p or 1080p, 3–15s clips with joint audio-video.
HappyHorse 1.0
Alibaba
- Text to Video
- Image to Video
- Reference to Video
- Video to Video
Image Generation
22
T2I plus reference editing, 5 aspect ratios, smart prompt rewriting, negative prompts.