3 Models Available
A concept artist generates a clean interior reference on Nano Banana 2, then turns it into a navigable scene where the camera can orbit and capture matched-angle stills. On Martini's canvas, drop the reference image into a world node (or chain FLUX.2 as an alt-look reference), capture 5-10 angles, and feed each as a starting frame into Sora 2 video nodes for shots that all share the same world. Note: Martini does not export navigable worlds as glTF or USD — captured stills are the deliverable. Pick a model below to walk through the image-to-world workflow.
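The node chain above can be sketched as a small graph: one reference image feeds one world node, the world node yields several capture stills, and each still seeds its own video node. Martini is a GUI canvas and the source documents no scripting API, so every name below (`Node`, `connect`, the node kinds) is a hypothetical illustration of the wiring, not a real interface.

```python
from dataclasses import dataclass, field

# Hypothetical data model -- Martini exposes no public API in the source;
# these names are illustrative only.
@dataclass
class Node:
    kind: str                       # "image", "world", "capture", or "video"
    label: str
    inputs: list = field(default_factory=list)

def connect(src: Node, dst: Node) -> Node:
    """Wire src into dst and return dst, so chains read left to right."""
    dst.inputs.append(src)
    return dst

# 1. One reference image from Nano Banana 2 (or FLUX.2 for an alt look).
reference = Node("image", "Nano Banana 2 reference")

# 2. The world node turns the still into a navigable canvas-internal scene.
world = connect(reference, Node("world", "Image-to-3D-World"))

# 3. Orbit the camera and capture several matched-angle stills (5-10 in practice).
captures = [connect(world, Node("capture", f"angle {i}")) for i in range(1, 8)]

# 4. Each still seeds a Sora 2 video node, so every shot shares the same world.
shots = [connect(c, Node("video", f"Sora 2 shot {i}"))
         for i, c in enumerate(captures, 1)]

print(len(shots))  # one video node per captured angle
```

The point of the sketch is the fan-out shape: the location is locked once at the world node, and everything downstream inherits it through the capture stills rather than through re-prompting.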
Generate the canonical reference image for an Image-to-3D-World workflow on Martini using Nano Banana 2 — the cleaner the source, the more navigable the resulting scene. The output of the world node is a navigable canvas-internal scene preview you can orbit and screenshot, not a portable .obj, .fbx, .glb, or USD mesh file. Concept artists use this to lock a location once on Nano Banana 2, pass the locked still into the World Labs or Image-to-3D-World node, and capture matched-angle stills that feed downstream Sora 2 or Kling 3 nodes for shots that all share the same world.
Black Forest Labs
Generate the source reference image for an Image-to-3D-World workflow on Martini using FLUX.2 — its prompt-fidelity rendering produces clean, literal scene compositions that the world node can reconstruct reliably. The world node's output is a navigable canvas-internal scene preview you can orbit and screenshot, not a portable .obj, .fbx, .glb, or USD mesh file. Concept artists use FLUX.2 when they need an alt-look reference (a different palette, lighting, or style) from what Nano Banana 2 produces — same workflow, different aesthetic.
OpenAI
Use Sora 2 as the downstream camera-move engine for an Image-to-3D-World workflow on Martini — the captured stills from the navigable world feed directly into Sora 2 video nodes for matched-angle motion shots. The world node's output is a canvas-internal navigable scene preview, not a portable .obj, .fbx, .glb, or USD mesh. Sora 2 takes the captured stills as starting frames and produces video clips that all share the same locked location, with cinematographic camera moves that respect the spatial structure of the source world.
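To make the "matched-angle motion shots" step concrete, here is a minimal sketch of pairing each captured still with a camera-move prompt before it seeds a Sora 2 video node. The file paths, the move list, and the prompt template are all assumptions for illustration; the source describes this pairing as canvas wiring, not code.

```python
# Hypothetical pairing of captured stills with camera-move prompts.
# Paths and prompt wording are assumptions, not a documented Martini format.
stills = [f"captures/angle_{i:02d}.png" for i in range(1, 6)]
moves = [
    "slow dolly in",
    "orbit left",
    "crane up",
    "push past foreground",
    "static hold",
]

# One shot spec per still: the still is the starting frame, the prompt adds
# only the camera move, so every clip inherits the same locked location.
shots = [
    {
        "start_frame": frame,
        "prompt": f"{move}, same interior, consistent lighting",
    }
    for frame, move in zip(stills, moves)
]

for shot in shots:
    print(shot["start_frame"], "->", shot["prompt"])
```

Keeping the location description out of the per-shot prompt and in the starting frame is the design choice that makes the clips share one world: the prompt only has to describe motion, not re-describe the set.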