Alibaba
Alibaba Qwen Image family provides instruction-based image editing (Edit and Edit Plus) powered by the Qwen architecture, plus Z-Image for text-to-image generation. Excels at natural-language editing commands in both English and Chinese.
The Qwen Image family from Alibaba takes a different approach from most image models: rather than competing on raw generation quality, it specializes in instruction-based image editing — describe what you want to change in natural language, and the model modifies the image accordingly. Qwen Image Edit handles standard editing tasks like background replacement, object removal, style transfer, and color adjustments. Qwen Image Edit Plus adds enhanced contextual understanding for complex multi-step instructions like "remove the person on the left, change the sky to sunset, and add a vintage film grain." Z-Image rounds out the family with general-purpose text-to-image generation from Alibaba's pipeline. Together they form a complete generate-then-refine workflow. The family's standout feature is bilingual prompt excellence — Chinese-language editing instructions often produce more precise results than English, making it particularly valuable for Chinese-speaking creators. On Martini, the typical workflow is: generate a base image with a stronger model (FLUX, Imagen 4), then use Qwen Edit or Edit Plus as a refinement node to make targeted modifications without regenerating from scratch.
| Variant | Description |
|---|---|
| Qwen Image Edit | Instruction-based image editing with natural language commands. |
| Qwen Image Edit Plus | Enhanced editing model for complex multi-step instructions. |
| Z-Image | General-purpose text-to-image generation from Alibaba. |
Connect Qwen Image with other AI models on Martini's infinite canvas. No GPU required — start free.
Get Started FreeQwen Image excels at instruction-based image editing — modifying existing images using natural language commands. Use Qwen Edit for standard changes (background swap, object removal) and Edit Plus for complex multi-step instructions.
Yes. Built by Alibaba, Qwen has particularly strong Chinese-language understanding. Chinese editing instructions often produce more precise results, though English works well for most standard editing tasks.
Z-Image, part of the Qwen family, handles text-to-image generation. However, the family's strength is editing rather than generation. For best results, generate a base image with FLUX or Imagen, then refine with Qwen Edit.
Midjourney
Midjourney v7 delivers artistic, highly detailed images with an iconic aesthetic style. It excels at creative illustration, concept art, and photorealistic renders with strong prompt adherence and built-in Niji mode for anime styles.
View detailsBlack Forest Labs
FLUX by Black Forest Labs is a fast, high-quality image generation family known for photorealistic output and excellent prompt adherence. Variants span free-tier dev models to ultra-resolution Pro outputs.
View detailsBlack Forest Labs
FLUX Kontext is a context-aware image generation and editing model that uses reference images to maintain character and style consistency across outputs. Available in Pro and Max tiers.
View details