3 Models Available
Edit photos using natural language instructions. Upload an image, describe changes, and let AI handle the rest. Choose a model below for editing-specific workflows.
Black Forest Labs
FLUX Kontext is the most precise AI photo editor available. It understands which elements to change and which to preserve, making it the best choice for targeted, surgical edits. Where GPT Image 1 excels at abstract intent ("make this look more professional"), FLUX Kontext excels at literal, specific transformations ("change the red shirt to a blue blazer," "swap the background to a beach at sunset"). Two quality tiers are available: Pro (6 credits/image) for standard edits and Max (12 credits/image) for edits requiring fine detail preservation. Both support 1 to 4 output images and 9 aspect ratios. For edits that combine multiple source photos, the FLUX Kontext Multi variant accepts several reference images at once.
OpenAI
GPT Image 1 is the most intelligent image editor on Martini. It understands abstract, intent-based instructions that other models cannot interpret. Where FLUX Kontext and Qwen Image Edit require precise, literal edit commands ("remove the red car"), GPT Image 1 understands conceptual requests like "make this look more professional," "add a festive holiday feel," or "make this photo feel warmer and more inviting." This language understanding comes from GPT's foundation as a language model, which makes editing feel closer to working with a human designer than with an AI tool.
Alibaba
Qwen Image Edit is the cheapest and fastest image editor on Martini: just 4 credits per image with near-instant results. It excels at object-level edits: adding, removing, replacing, or recoloring specific elements with clean, natural blending. Where GPT Image 1 understands abstract concepts ("make it more professional"), Qwen Image Edit works best with concrete, literal instructions ("replace the red car with a blue bicycle"). This specificity, paired with low cost, makes it the go-to choice for batch editing workflows where you need to process 10 to 50 images quickly.
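To make the credit prices quoted on this page easier to compare, here is a minimal Python sketch that estimates the cost of a batch. It assumes credits are charged per output image and scale linearly with batch size, which the page does not state explicitly. The dictionary keys and the `batch_cost` helper are hypothetical names for illustration, not Martini API identifiers. GPT Image 1 is left out because no price is quoted for it here.

```python
# Credits per output image, as quoted on this page.
# The keys are illustrative labels, not real API model names.
CREDITS_PER_IMAGE = {
    "flux-kontext-pro": 6,
    "flux-kontext-max": 12,
    "qwen-image-edit": 4,
}


def batch_cost(model: str, num_edits: int, outputs_per_edit: int = 1) -> int:
    """Estimate total credits for a batch of edits.

    Assumption: cost = credits per image * edits * outputs per edit.
    """
    if model not in CREDITS_PER_IMAGE:
        raise KeyError(f"no quoted price for model: {model}")
    if num_edits < 0:
        raise ValueError("num_edits must be non-negative")
    if outputs_per_edit < 1:
        raise ValueError("outputs_per_edit must be at least 1")
    return CREDITS_PER_IMAGE[model] * num_edits * outputs_per_edit


if __name__ == "__main__":
    # Compare the quoted tiers at both ends of the 10 to 50 image batch range.
    for num_edits in (10, 50):
        for model in CREDITS_PER_IMAGE:
            credits = batch_cost(model, num_edits)
            print(f"{model}: {num_edits} edits -> {credits} credits")
```

Under these assumptions, a 50-image batch comes to 200 credits with Qwen Image Edit, 300 with FLUX Kontext Pro, and 600 with FLUX Kontext Max.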