Alibaba
Alibaba Qwen Image family provides instruction-based image editing (Edit and Edit Plus) powered by the Qwen architecture, plus Z-Image for text-to-image generation. Excels at natural-language editing commands in both English and Chinese.
The Qwen Image family from Alibaba takes a different approach from most image models: rather than competing on raw generation quality, it specializes in instruction-based image editing — describe what you want to change in natural language, and the model modifies the image accordingly. Qwen Image Edit handles standard editing tasks like background replacement, object removal, style transfer, and color adjustments. Qwen Image Edit Plus adds enhanced contextual understanding for complex multi-step instructions like "remove the person on the left, change the sky to sunset, and add a vintage film grain." Z-Image rounds out the family with general-purpose text-to-image generation from Alibaba's pipeline. Together they form a complete generate-then-refine workflow. The family's standout feature is bilingual prompt excellence — Chinese-language editing instructions often produce more precise results than English, making it particularly valuable for Chinese-speaking creators. On Martini, the typical workflow is: generate a base image with a stronger model (FLUX, Imagen 4), then use Qwen Edit or Edit Plus as a refinement node to make targeted modifications without regenerating from scratch.
| Variant | Description |
|---|---|
| Qwen Image Edit | Instruction-based image editing with natural language commands. |
| Qwen Image Edit Plus | Enhanced editing model for complex multi-step instructions. |
| Z-Image | General-purpose text-to-image generation from Alibaba. |
Connect Qwen Image with other AI models on Martini's infinite canvas. No GPU required — start free.
Get Started FreeQwen Image excels at instruction-based image editing — modifying existing images using natural language commands. Use Qwen Edit for standard changes (background swap, object removal) and Edit Plus for complex multi-step instructions.
Yes. Built by Alibaba, Qwen has particularly strong Chinese-language understanding. Chinese editing instructions often produce more precise results, though English works well for most standard editing tasks.
Z-Image, part of the Qwen family, handles text-to-image generation. However, the family's strength is editing rather than generation. For best results, generate a base image with FLUX or Imagen, then refine with Qwen Edit.
Midjourney
Midjourney v7 is the most recognizable AI image generator, with the strongest aesthetic signature in the category. On Martini you get V7 for photoreal and painterly work, Niji 7 for anime, Omni Reference for character lock-in, and Stylization, Variety, and Weirdness sliders for fine control — all from the canvas, no Discord required.
View detailsBlack Forest Labs
FLUX by Black Forest Labs is a fast, high-quality image generation family known for photorealistic output and excellent prompt adherence. Variants span free-tier dev models to ultra-resolution Pro outputs.
View detailsBlack Forest Labs
FLUX Kontext is a context-aware image generation and editing model that uses reference images to maintain character and style consistency across outputs. Available in Pro and Max tiers.
View details