OpenAI
OpenAI GPT Image generates images directly from the GPT-4 architecture, combining deep language understanding with visual generation. Available in GPT Image 1 and 1.5 with quality and background controls.
GPT Image is OpenAI's native image generation model built on the GPT-4 architecture. Unlike earlier DALL-E models, GPT Image understands nuanced, multi-part prompts at a deeper level thanks to its foundation in language modeling. GPT Image 1 offers solid general-purpose generation with image editing support. GPT Image 1.5 adds quality tiers — Low, Medium, and High — plus background control (auto, transparent, opaque) for product photography and design workflows. Both support transparent PNG output. The Low tier on 1.5 is the lightest way to leverage GPT-4-level prompt understanding on Martini.
| Variant | Description |
|---|---|
| GPT Image 1 | Native OpenAI image generation with editing support and multiple sizes. |
| GPT Image 1.5 | Enhanced variant with quality tiers, background control, and transparent output. |
Connect GPT Image with other AI models on Martini's infinite canvas. No GPU required — start free.
Get Started FreeGPT Image 1 provides solid general-purpose generation with image editing support. GPT Image 1.5 adds quality tiers (Low/Medium/High), background control (transparent, opaque, auto), and improved detail — especially useful for product photography and design workflows.
Yes. Both GPT Image 1 and 1.5 support transparent PNG output. GPT Image 1.5 additionally offers explicit background control — set to "transparent" for product cutouts and compositing work.
GPT Image replaces DALL-E as OpenAI's image generation model. Built on the GPT-4 architecture rather than a separate diffusion model, it has significantly better prompt understanding, especially for complex multi-part descriptions and nuanced instructions.
OpenAI
OpenAI GPT Image 2 is a quality-first, reasoning-driven image model that plans the composition before generating. It delivers state-of-the-art text rendering, multilingual typography, and high-fidelity edits across up to 16 reference images, with output up to 4K.
View detailsMidjourney
Midjourney v7 is the most recognizable AI image generator, with the strongest aesthetic signature in the category. On Martini you get V7 for photoreal and painterly work, Niji 7 for anime, Omni Reference for character lock-in, and Stylization, Variety, and Weirdness sliders for fine control — all from the canvas, no Discord required.
View detailsBlack Forest Labs
FLUX by Black Forest Labs is a fast, high-quality image generation family known for photorealistic output and excellent prompt adherence. Variants span free-tier dev models to ultra-resolution Pro outputs.
View details