3 Models Available
An indie filmmaker drafts a 3-5 shot narrative short on the canvas — same protagonist, same world — over a weekend, before booking any crew. Use Martini's storyboard generator to lock a character reference, fan out shot frames to Sora 2, Kling 3, or Google Veo 3.1, and chain last-frame to first-frame so cuts read as one continuous scene. The result is a festival-bound pre-viz reel cinematic enough to greenlight the production round. Pick a model below to walk through the workflow that fits your script.
OpenAI
Sora 2 is OpenAI's flagship for cinematic short film work — realistic lighting, believable reflections, and camera moves that read like a real DP shot them. The base Sora 2 handles text-to-video and image-to-video at 1080p; Sora 2 Pro lifts fidelity and unlocks 15-second clips with clarity control. For an indie filmmaker drafting a 3-5 shot festival short over a weekend, Sora 2 hits the bar where the pre-viz looks production-ready before any crew is booked.
Kling
Kling 3.0 is the first major video model to render native 4K (3840x2160) at the diffusion stage rather than via post-process upscaling — sharper textures, accurate film grain, finer hair, fabric, and skin detail than any upscaler can recover. For a short film bound for a festival projector, that detail floor matters. Kling also bakes Omni Native Audio into the same pass (English, Chinese, Japanese, Korean, Spanish), so dialogue lip-sync and ambience can ship without a separate audio chain.
Google Veo 3.1 ships native audio synthesis baked into the same generation pass as the video — describe the ambient sound right in the prompt and Veo synchronizes it to the picture. For an indie short film where the dialogue, footsteps, and music bed all have to land in time, Veo 3.1 is the cleanest end-to-end option. Output goes up to 1080p with Fast and Standard tiers, plus an Extend variant that continues an existing clip in V2V mode for seamless multi-clip assembly.