Google Veo 3.1 delivers industry-leading video generation with native audio synthesis, photorealistic quality, and resolution up to 1080p. The Extend variant adds video continuation and V2V capabilities for seamless clip extension.
Veo 3.1 is Google's flagship video model, combining text-to-video and image-to-video generation with built-in audio that is generated natively alongside the visual content rather than added as a separate step. It offers Fast and Standard generation tiers, supports reference images for style and character guidance, and produces output at 720p or 1080p. The Extend variant takes an existing video clip and continues it seamlessly, operating in V2V mode. Together they enable end-to-end video production from initial generation through extension, all with synchronized audio.
| Variant | Description |
|---|---|
| Veo 3.1 | T2V and I2V with native audio, reference support, Fast and Standard tiers, up to 1080p. |
| Veo 3.1 Extend | Seamless video extension and V2V continuation of existing clips. |
Higher quality tiers generally offer better detail and consistency, but require more credits and generation time.
Connect Google Veo with other AI models on Martini's infinite canvas. No GPU required — start free.
Get Started Free