Skip to main content

GPT Image 2

OpenAI's flagship image model — prompt-faithful generation and reference-based edits, up to 4K with three quality tiers.

Capabilities

FeatureSupport
Text-to-ImageYes
Image-to-Image (editing)Yes
Max Resolution4K (3840 x 3840)
Reference ImagesYes
Aspect Ratios1:1, 1:3, 2:3, 3:1, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9
Quality Tierslow, medium, high
Size Tiers1K, 2K, 4K
Negative PromptNo
Inpainting / MaskYes

Prompt-faithfulness

GPT Image 2's strength is following the prompt literally — typography, layout, and explicit visual elements come through more reliably than with diffusion-based image models. Best for cases where the prompt names the elements directly ("a poster with the title 'Summer Festival' centered, in art-deco style on a teal background").

Quality vs. cost

Each size (1K / 2K / 4K) can be rendered at one of three quality tiers (low / medium / high). The credit cost scales with both axes. For iteration, 1K + low is the cheapest combination; switch to higher tiers once you've locked the composition.

Prompting Tips

  • Spell out text exactly as you want it rendered. GPT Image 2 reads quoted strings literally.
  • Describe layout, not just subject. "A 3-column infographic with section headers" lands more reliably than "an infographic".
  • Use reference images for style transfer. Pass the look you want as a reference rather than describing it in prose.
  • Use @-mentions when multiple references compete. Type @ to insert a reference inline so the model knows which image plays which role. See Referencing Images in Prompts.

Limitations

  • No negative prompt support
  • Rate-limited to 3 requests per minute on the upstream API