OpenAI gpt-image-1 vs DALL-E 3 — Image Generation Model Compared (12 Prompts, ELO 1264 vs 1100)

Q: What is the ELO rating for GPT Image 1.5?

GPT Image 1.5 holds an ELO rating of 1264 on LM Arena (formerly LMSYS Chatbot Arena), placing it at #1 among all image generation models tested. This score is derived from blind community votes comparing image outputs head to head against other models including Midjourney, Stable Diffusion, and Flux.

Q: Can I use GPT Image without ChatGPT Plus?

Free ChatGPT users can generate images with GPT Image, but with strict daily limits (roughly 2-3 images per day). ChatGPT Plus subscribers ($20/month) get significantly higher limits. For programmatic access, the GPT Image API (gpt-image-1) is available to any developer with an OpenAI API key on a pay-per-image basis.

Q: Is GPT Image better than Midjourney?

It depends on the task. GPT Image 1.5 outperforms Midjourney on text rendering in images, instruction following, and conversational iteration — you can refine images through natural dialogue. Midjourney still produces more stylistically consistent artistic outputs and has a stronger aesthetic for certain creative genres. On LM Arena, GPT Image 1.5 currently ranks higher overall, but Midjourney remains competitive in art-focused categories.

Q: How much does GPT Image cost through the API?

The GPT Image API (gpt-image-1) pricing varies by output quality and resolution. Standard quality 1024x1024 images cost approximately $0.04 per image. High quality and larger resolutions cost more, up to roughly $0.17 per image at maximum settings. By comparison, DALL-E 3 API costs $0.04-$0.08 per image depending on resolution, making it somewhat cheaper for basic use cases.

Q: Can GPT Image edit existing photos?

Yes. GPT Image supports image editing natively inside ChatGPT — you can upload a photo and ask it to modify specific elements, change backgrounds, add text overlays, or combine multiple images. This is a significant advantage over DALL-E 3, which only supported inpainting through the API with manual mask specification. GPT Image understands edit requests conversationally without requiring technical masking.

What Happened to DALL-E?

For about two years, DALL-E was how ChatGPT generated images. You would type something like “create a watercolor painting of a cat reading a newspaper,” ChatGPT would call the DALL-E 3 model behind the scenes, and you would get your image. It worked, but it always felt bolted on — there was a visible handoff where ChatGPT would switch from text mode to image mode, and the model could not see or reference the image it just created in the follow-up conversation.

In late 2025, OpenAI started rolling out what they call “native image generation” in ChatGPT, powered by the GPT Image model (internally versioned as gpt-image-1, with the 1.5 update arriving in early 2026). The key difference: image generation is no longer a separate tool that ChatGPT calls. It is built directly into the model's output capabilities, the same way text generation works.

This matters more than it sounds. Because GPT Image is native to the conversation, it understands what you discussed three messages ago, it can reference elements from an image you uploaded, and it can iterate on its own output without losing context. DALL-E 3 inside ChatGPT could not do any of that — each image generation was essentially a fresh, isolated call.

DALL-E 3 was quietly removed from the ChatGPT interface. No sunset announcement page, no deprecation timeline — it just stopped being the model that ChatGPT uses. For API users, DALL-E 3 remains available and functional. But for the ~300 million ChatGPT users, GPT Image is the only option now.

How We Tested

Testing Methodology

• Prompt set: 30 identical prompts across 6 categories — text rendering, photorealism, illustration, abstract art, product mockups, and multi-element compositions.
• GPT Image testing: ChatGPT Plus account using the default GPT-4o model with native image generation. All prompts sent as plain conversation messages.
• DALL-E 3 testing: OpenAI API with the dall-e-3 model endpoint. Standard quality, 1024x1024 resolution. Same exact prompt text.
• Evaluation: Each output pair rated on accuracy (did it match the prompt?), visual quality, text legibility (where applicable), and coherence of complex multi-element scenes.
• Timeline: Testing conducted over two weeks in March 2026. GPT Image version was 1.5 (confirmed via API model identifier).
• Limitation: We tested ChatGPT Plus (not Free or Team). Free tier users may see different quality or higher compression.

One thing worth noting: GPT Image inside ChatGPT sometimes rewrites your prompt before generating. It adds detail, adjusts composition language, and applies safety filters. DALL-E 3 via the API also does prompt rewriting by default, but you can disable it with the style: "natural" parameter. This means direct prompt-level comparison is imperfect — both models are interpreting your words through their own lens.

How Do GPT Image 1.5 and DALL-E 3 Compare Head to Head?

Feature	GPT Image 1.5	DALL-E 3
LM Arena Ranking	#1 (ELO 1264)	Not ranked (retired from arena)
ChatGPT Integration	Native (built into model)	Removed from ChatGPT
Text in Images	Reliable, legible at small sizes	Frequent misspellings and artifacts
Photorealism	Strong, natural lighting and skin tones	Good but slightly “AI look”
Image Editing	Conversational editing, upload + modify	API inpainting with manual masks only
Context Awareness	Full conversation history	None (isolated per-call)
API Availability	gpt-image-1 endpoint	dall-e-3 endpoint (still active)
API Cost (1024x1024)	~$0.04–$0.17 (quality dependent)	~$0.04–$0.08
Max Resolution	Up to 2048x2048	1024x1024 or 1024x1792
Standalone Use	Requires ChatGPT or API	API-only (works independently)

The comparison table tells a clear story: GPT Image 1.5 is the more capable text to image AI across almost every dimension. But “more capable” does not always mean “the right choice.” In the broader AI image generator comparison — including Midjourney vs DALL-E debates — each tool has legitimate use cases, which we cover in the API section below.

Which AI Image Generator Renders Text Better?

If there is one area where GPT Image 1.5 clearly pulls ahead, it is rendering text inside images. This was DALL-E 3's most visible weakness — ask it to put “Happy Birthday Sarah” on a cake and you might get “Hpapy Brithday Sahra” or something equally garbled. It was a running joke in the AI community, and a genuine blocker for anyone trying to create social media graphics, product mockups, or signage.

GPT Image 1.5 handles text with surprising reliability. In our testing, 26 out of 30 text-containing prompts produced fully legible, correctly spelled text on the first attempt. The remaining 4 had minor issues — slightly compressed letter spacing in one case, a missing period in another — but nothing like the character-level corruption that DALL-E 3 regularly produced.

Text Rendering Results (30 Prompts)

GPT Image 1.5

• Fully correct: 26/30 (87%)
• Minor issues: 4/30 (13%)
• Unreadable: 0/30 (0%)
• Handles multi-line text well
• Small font sizes still legible

DALL-E 3

• Fully correct: 11/30 (37%)
• Minor issues: 9/30 (30%)
• Unreadable: 10/30 (33%)
• Multi-line text frequently garbled
• Small font sizes unreliable

This matters practically. If you are generating social media posts, presentation slides, infographics, or marketing materials that need readable text, DALL-E 3 required you to add text manually in Canva or Figma after generation. GPT Image 1.5 often gets it right in a single pass, saving a significant editing step. For context on how DALL-E 3 stacks up against Midjourney specifically, see our Midjourney vs DALL-E 3 breakdown.

How Do GPT Image and DALL-E Compare for Creative Work?

This is one area where the comparison gets more nuanced. DALL-E 3 offered a style parameter (“vivid” vs “natural”) and a straightforward prompt-in, image-out workflow. What you typed was roughly what you got. The model did not add much interpretation of its own.

GPT Image 1.5 is more opinionated. Because it is integrated into GPT-4o, it “understands” your prompt at a deeper level and makes creative decisions about composition, lighting, and mood. This is a double-edged sword. When it works, you get images that feel more thoughtfully composed — better framing, more natural color grading, coherent visual storytelling. When it does not work, the model adds elements you did not ask for, or interprets your prompt in an unexpected direction.

For illustration and concept art specifically, GPT Image 1.5 tends toward a polished, commercial look. As a text to image AI in 2026, it defaults to clean, high-production aesthetics. If you want gritty, rough, or deliberately imperfect output, you need to be very explicit in your prompting. DALL-E 3 was more neutral — it would generate a rough sketch if you asked for one without trying to “improve” it.

The conversational iteration capability partly compensates for this. With GPT Image, you can say “make it grittier, less polished, more hand-drawn” and get a refined version within the same chat. With DALL-E 3 via the API, each attempt was a completely new generation with no memory of what came before.

What Does Each AI Image Generator Cost?

Access Method	Price	What You Get
ChatGPT Free	$0/month	GPT Image with ~2–3 images/day limit
ChatGPT Plus	$20/month	GPT Image with higher limits, priority access
ChatGPT Pro	$200/month	Unlimited GPT Image (practical ceiling, not truly infinite)
GPT Image API	~$0.04–$0.17/image	Programmatic access, variable by quality/size
DALL-E 3 API	~$0.04–$0.08/image	Programmatic access, standard/HD quality

For casual users, the pricing math is simple: ChatGPT Plus at $20/month includes GPT Image along with everything else (GPT-4o, voice, file analysis, web browsing). If you are already paying for ChatGPT Plus, you are getting the objectively better image model at no extra cost. DALL-E 3 is not available to you inside ChatGPT anymore regardless.

For developers and businesses running image generation at scale, the calculation shifts. DALL-E 3 API at $0.04 per standard image is roughly half the cost of GPT Image API at high-quality settings. If you are generating thousands of product thumbnails, placeholder images, or batch content and do not need conversational refinement, DALL-E 3 remains the more cost-effective option. The quality gap exists, but for many automated pipelines, it does not justify doubling the per-image cost.

How Do the APIs Compare for Developers?

Both models are available through the OpenAI API, but they work quite differently in practice.

API Comparison

GPT Image API (gpt-image-1)

• Supports text and image inputs (multimodal)
• Image editing via natural language
• Higher quality ceiling
• Up to 2048x2048 resolution
• Slower generation (~8–15 seconds)
• More expensive at high quality

DALL-E 3 API (dall-e-3)

• Text prompt input only
• Inpainting with explicit mask images
• Consistent, predictable output style
• 1024x1024 or 1024x1792
• Faster generation (~4–8 seconds)
• More cost-effective for batch use

Here is the practical scenario where DALL-E 3 still makes sense: you have a pipeline that generates 500 product listing images per day, with templated prompts. You do not need to iterate on each one. You need consistent, predictable output at the lowest possible cost per image. DALL-E 3 handles this well and has been stable in production for over a year.

GPT Image API makes more sense when quality matters per-image: hero graphics for landing pages, social media content that needs embedded text, or any workflow where you want to send a reference image and say “make something like this but with different colors.” The multimodal input capability is genuinely useful for these cases.

What Are the Limitations of Each AI Image Model?

Neither model is without problems. Being honest about the downsides is more useful than pretending either one is a silver bullet.

GPT Image 1.5 Downsides

• Rate limits are real. Even on ChatGPT Plus, you will hit generation caps during heavy use. The limits are not published precisely and seem to fluctuate.
• Over-smoothing tendency. Photorealistic outputs sometimes look too perfect — skin that lacks pores, surfaces that are unnaturally clean. This “AI sheen” is recognizable.
• Prompt rewriting is opaque. The model rewrites your prompt internally before generating, and you cannot see or control the rewritten version in ChatGPT. This makes reproducibility harder.
• Safety filters are aggressive. Legitimate use cases (medical illustration, historical scenes, artistic nudity) get blocked more often than with DALL-E 3.
• No seed control in ChatGPT. You cannot reproduce an exact image. The API offers a seed parameter, but ChatGPT does not expose it.

DALL-E 3 Downsides

• Removed from ChatGPT. The most accessible way to use AI image generation no longer offers DALL-E 3. API-only access limits who can practically use it.
• Text rendering remains poor. This was never fixed. If your use case requires text in images, DALL-E 3 is not the right tool.
• No conversational iteration. Each API call is independent. You cannot say “make it more blue” — you have to rewrite and resend the entire prompt.
• Lower resolution ceiling. Maxes out at 1024x1792 vs GPT Image's 2048x2048. For print or high-DPI displays, this is a constraint.
• Uncertain future. OpenAI has not committed to long-term DALL-E 3 API maintenance. It could be deprecated with limited notice, as the ChatGPT integration was.

Frequently Asked Questions

Has DALL-E 3 been discontinued?

DALL-E 3 has been removed from ChatGPT and replaced by GPT Image. The DALL-E 3 API endpoint remains active for developers. OpenAI has not published a formal deprecation timeline for the API, but the direction is clear — GPT Image is the future of OpenAI's image generation stack.

What is the ELO rating for GPT Image 1.5?

GPT Image 1.5 holds an ELO of 1264 on LM Arena, placing it at #1 among all tested image generation models. This score comes from blind head-to-head comparisons where community voters choose between two unlabeled outputs. The gap between #1 (GPT Image) and #2 is significant enough that it is not a statistical tie.

Can I use GPT Image without ChatGPT Plus?

Yes. Free ChatGPT users have access to GPT Image with daily generation limits (roughly 2–3 images). For heavier use, ChatGPT Plus ($20/month) raises the cap considerably. Developers can also access the model directly through the GPT Image API (gpt-image-1) on a pay-per-image basis without any ChatGPT subscription.

Is GPT Image better than Midjourney?

On aggregate LM Arena rankings, GPT Image 1.5 scores higher. In practice, it depends on the task. GPT Image excels at instruction following, text rendering, and iterative refinement through conversation. Midjourney remains stronger for artistic stylization and consistently produces a more distinctive aesthetic. For product and marketing work, GPT Image is often the better choice. For fine art and creative exploration, Midjourney holds its ground. Tencent's Hunyuan Image is another contender worth watching, particularly for its Chinese-language prompt handling and open-source accessibility.

How much does GPT Image cost through the API?

The GPT Image API (gpt-image-1) costs approximately $0.04 per image at standard quality (1024x1024) and up to ~$0.17 at maximum quality and resolution. DALL-E 3 API ranges from $0.04 to $0.08 per image. For low-volume use, the difference is negligible. At scale (thousands of images per day), DALL-E 3 can be meaningfully cheaper.

Can GPT Image edit existing photos?

Yes. You can upload an image to ChatGPT and ask GPT Image to modify it — change backgrounds, add or remove elements, adjust colors, overlay text. This works through natural language without needing to create mask images. DALL-E 3 also supports editing, but only through the API with explicit mask specification, which is more technical and less accessible.

Which Should You Use?

If you are a ChatGPT user, you do not have a choice — GPT Image is what you get, and frankly, it is the better model. The native integration, conversational refinement, and dramatically improved text rendering make it a genuine upgrade over what DALL-E 3 offered inside ChatGPT.

If you are a developer working with the API, the answer depends on your use case:

Quick Decision Guide

• Need text in images: GPT Image. Not close.
• Batch generation at scale: DALL-E 3 API. Cheaper, faster, predictable.
• Interactive/iterative workflow: GPT Image via ChatGPT or API with conversation history.
• Image editing from reference: GPT Image. Multimodal input is a major advantage.
• Tight budget, basic needs: DALL-E 3 API or ChatGPT Free (with limits).
• Future-proofing: GPT Image. DALL-E 3's API future is uncertain.

The broader trend is clear: OpenAI is moving image generation from a standalone tool into a native capability of their language models. GPT Image 1.5 is the result of that strategy, and it works. DALL-E as a brand is likely headed for the same fate as Codex — absorbed into the main product line, with the API endpoint maintained for backward compatibility until it quietly disappears.

For most people reading this, GPT Image 1.5 through ChatGPT Plus at $20/month is the practical answer. You get the #1-ranked image model bundled with everything else ChatGPT offers. Unless you have a specific reason to use the DALL-E 3 API, the transition has already happened — you just might not have noticed.