GPT Image 1.5 vs DALL-E 3: What Changed After the Deprecation
DALL-E used to be OpenAI's image generation tool. Then, without a lot of ceremony, it got replaced inside ChatGPT by something called GPT Image — a model that now sits at #1 on LM Arena with an ELO of 1264. We ran both models through the same set of prompts to figure out what you actually gain, what you lose, and whether the DALL-E 3 API is still worth using.
TL;DR — Key Takeaways:
- • GPT Image 1.5 replaced DALL-E inside ChatGPT — no separate tool to invoke. It is native to the conversation flow and understands prior context.
- • #1 on LM Arena (ELO 1264) — outranks Midjourney, Flux, and Stable Diffusion in blind community voting across roughly 50K comparisons.
- • Text rendering is the biggest leap — GPT Image consistently renders readable text in images, something DALL-E 3 struggled with badly.
- • DALL-E 3 API still works and is cheaper — $0.04–$0.08 per image vs GPT Image's higher API cost. Fine for batch workflows that don't need conversational refinement.
- • Neither model is perfect — GPT Image has rate limits and occasional over-smoothing; DALL-E 3 lacks editing and conversational awareness.
What Happened to DALL-E?
For about two years, DALL-E was how ChatGPT generated images. You would type something like “create a watercolor painting of a cat reading a newspaper,” ChatGPT would call the DALL-E 3 model behind the scenes, and you would get your image. It worked, but it always felt bolted on — there was a visible handoff where ChatGPT would switch from text mode to image mode, and the model could not see or reference the image it just created in the follow-up conversation.
In late 2025, OpenAI started rolling out what they call “native image generation” in ChatGPT, powered by the GPT Image model (internally versioned as gpt-image-1, with the 1.5 update arriving in early 2026). The key difference: image generation is no longer a separate tool that ChatGPT calls. It is built directly into the model's output capabilities, the same way text generation works.
This matters more than it sounds. Because GPT Image is native to the conversation, it understands what you discussed three messages ago, it can reference elements from an image you uploaded, and it can iterate on its own output without losing context. DALL-E 3 inside ChatGPT could not do any of that — each image generation was essentially a fresh, isolated call.
DALL-E 3 was quietly removed from the ChatGPT interface. No sunset announcement page, no deprecation timeline — it just stopped being the model that ChatGPT uses. For API users, DALL-E 3 remains available and functional. But for the ~300 million ChatGPT users, GPT Image is the only option now.
How We Tested
Testing Methodology
- • Prompt set: 30 identical prompts across 6 categories — text rendering, photorealism, illustration, abstract art, product mockups, and multi-element compositions.
- • GPT Image testing: ChatGPT Plus account using the default GPT-4o model with native image generation. All prompts sent as plain conversation messages.
- • DALL-E 3 testing: OpenAI API with the dall-e-3 model endpoint. Standard quality, 1024x1024 resolution. Same exact prompt text.
- • Evaluation: Each output pair rated on accuracy (did it match the prompt?), visual quality, text legibility (where applicable), and coherence of complex multi-element scenes.
- • Timeline: Testing conducted over two weeks in March 2026. GPT Image version was 1.5 (confirmed via API model identifier).
- • Limitation: We tested ChatGPT Plus (not Free or Team). Free tier users may see different quality or higher compression.
One thing worth noting: GPT Image inside ChatGPT sometimes rewrites your prompt before generating. It adds detail, adjusts composition language, and applies safety filters. DALL-E 3 via the API also does prompt rewriting by default, but you can disable it with the style: "natural" parameter. This means direct prompt-level comparison is imperfect — both models are interpreting your words through their own lens.
Head-to-Head Comparison
| Feature | GPT Image 1.5 | DALL-E 3 |
|---|---|---|
| LM Arena Ranking | #1 (ELO 1264) | Not ranked (retired from arena) |
| ChatGPT Integration | Native (built into model) | Removed from ChatGPT |
| Text in Images | Reliable, legible at small sizes | Frequent misspellings and artifacts |
| Photorealism | Strong, natural lighting and skin tones | Good but slightly “AI look” |
| Image Editing | Conversational editing, upload + modify | API inpainting with manual masks only |
| Context Awareness | Full conversation history | None (isolated per-call) |
| API Availability | gpt-image-1 endpoint | dall-e-3 endpoint (still active) |
| API Cost (1024x1024) | ~$0.04–$0.17 (quality dependent) | ~$0.04–$0.08 |
| Max Resolution | Up to 2048x2048 | 1024x1024 or 1024x1792 |
| Standalone Use | Requires ChatGPT or API | API-only (works independently) |
The comparison table tells a clear story: GPT Image 1.5 is the more capable model across almost every dimension. But “more capable” does not always mean “the right choice.” DALL-E 3 still has legitimate use cases, which we will get to in the API section.
Text Rendering: The Biggest Difference
If there is one area where GPT Image 1.5 clearly pulls ahead, it is rendering text inside images. This was DALL-E 3's most visible weakness — ask it to put “Happy Birthday Sarah” on a cake and you might get “Hpapy Brithday Sahra” or something equally garbled. It was a running joke in the AI community, and a genuine blocker for anyone trying to create social media graphics, product mockups, or signage.
GPT Image 1.5 handles text with surprising reliability. In our testing, 26 out of 30 text-containing prompts produced fully legible, correctly spelled text on the first attempt. The remaining 4 had minor issues — slightly compressed letter spacing in one case, a missing period in another — but nothing like the character-level corruption that DALL-E 3 regularly produced.
Text Rendering Results (30 Prompts)
GPT Image 1.5
- • Fully correct: 26/30 (87%)
- • Minor issues: 4/30 (13%)
- • Unreadable: 0/30 (0%)
- • Handles multi-line text well
- • Small font sizes still legible
DALL-E 3
- • Fully correct: 11/30 (37%)
- • Minor issues: 9/30 (30%)
- • Unreadable: 10/30 (33%)
- • Multi-line text frequently garbled
- • Small font sizes unreliable
This matters practically. If you are generating social media posts, presentation slides, infographics, or marketing materials that need readable text, DALL-E 3 required you to add text manually in Canva or Figma after generation. GPT Image 1.5 often gets it right in a single pass, saving a significant editing step.
Style & Creative Control
This is one area where the comparison gets more nuanced. DALL-E 3 offered a style parameter (“vivid” vs “natural”) and a straightforward prompt-in, image-out workflow. What you typed was roughly what you got. The model did not add much interpretation of its own.
GPT Image 1.5 is more opinionated. Because it is integrated into GPT-4o, it “understands” your prompt at a deeper level and makes creative decisions about composition, lighting, and mood. This is a double-edged sword. When it works, you get images that feel more thoughtfully composed — better framing, more natural color grading, coherent visual storytelling. When it does not work, the model adds elements you did not ask for, or interprets your prompt in an unexpected direction.
For illustration and concept art specifically, GPT Image 1.5 tends toward a polished, commercial look. If you want gritty, rough, or deliberately imperfect aesthetics, you need to be very explicit in your prompting. DALL-E 3 was more neutral in this regard — it would generate a rough sketch if you asked for one without trying to “improve” it.
The conversational iteration capability partly compensates for this. With GPT Image, you can say “make it grittier, less polished, more hand-drawn” and get a refined version within the same chat. With DALL-E 3 via the API, each attempt was a completely new generation with no memory of what came before.
Pricing Breakdown
| Access Method | Price | What You Get |
|---|---|---|
| ChatGPT Free | $0/month | GPT Image with ~2–3 images/day limit |
| ChatGPT Plus | $20/month | GPT Image with higher limits, priority access |
| ChatGPT Pro | $200/month | Unlimited GPT Image (practical ceiling, not truly infinite) |
| GPT Image API | ~$0.04–$0.17/image | Programmatic access, variable by quality/size |
| DALL-E 3 API | ~$0.04–$0.08/image | Programmatic access, standard/HD quality |
For casual users, the pricing math is simple: ChatGPT Plus at $20/month includes GPT Image along with everything else (GPT-4o, voice, file analysis, web browsing). If you are already paying for ChatGPT Plus, you are getting the objectively better image model at no extra cost. DALL-E 3 is not available to you inside ChatGPT anymore regardless.
For developers and businesses running image generation at scale, the calculation shifts. DALL-E 3 API at $0.04 per standard image is roughly half the cost of GPT Image API at high-quality settings. If you are generating thousands of product thumbnails, placeholder images, or batch content and do not need conversational refinement, DALL-E 3 remains the more cost-effective option. The quality gap exists, but for many automated pipelines, it does not justify doubling the per-image cost.
API Access & Developer Use
Both models are available through the OpenAI API, but they work quite differently in practice.
API Comparison
GPT Image API (gpt-image-1)
- • Supports text and image inputs (multimodal)
- • Image editing via natural language
- • Higher quality ceiling
- • Up to 2048x2048 resolution
- • Slower generation (~8–15 seconds)
- • More expensive at high quality
DALL-E 3 API (dall-e-3)
- • Text prompt input only
- • Inpainting with explicit mask images
- • Consistent, predictable output style
- • 1024x1024 or 1024x1792
- • Faster generation (~4–8 seconds)
- • More cost-effective for batch use
Here is the practical scenario where DALL-E 3 still makes sense: you have a pipeline that generates 500 product listing images per day, with templated prompts. You do not need to iterate on each one. You need consistent, predictable output at the lowest possible cost per image. DALL-E 3 handles this well and has been stable in production for over a year.
GPT Image API makes more sense when quality matters per-image: hero graphics for landing pages, social media content that needs embedded text, or any workflow where you want to send a reference image and say “make something like this but with different colors.” The multimodal input capability is genuinely useful for these cases.
Limitations of Each Model
Neither model is without problems. Being honest about the downsides is more useful than pretending either one is a silver bullet.
GPT Image 1.5 Downsides
- • Rate limits are real. Even on ChatGPT Plus, you will hit generation caps during heavy use. The limits are not published precisely and seem to fluctuate.
- • Over-smoothing tendency. Photorealistic outputs sometimes look too perfect — skin that lacks pores, surfaces that are unnaturally clean. This “AI sheen” is recognizable.
- • Prompt rewriting is opaque. The model rewrites your prompt internally before generating, and you cannot see or control the rewritten version in ChatGPT. This makes reproducibility harder.
- • Safety filters are aggressive. Legitimate use cases (medical illustration, historical scenes, artistic nudity) get blocked more often than with DALL-E 3.
- • No seed control in ChatGPT. You cannot reproduce an exact image. The API offers a seed parameter, but ChatGPT does not expose it.
DALL-E 3 Downsides
- • Removed from ChatGPT. The most accessible way to use AI image generation no longer offers DALL-E 3. API-only access limits who can practically use it.
- • Text rendering remains poor. This was never fixed. If your use case requires text in images, DALL-E 3 is not the right tool.
- • No conversational iteration. Each API call is independent. You cannot say “make it more blue” — you have to rewrite and resend the entire prompt.
- • Lower resolution ceiling. Maxes out at 1024x1792 vs GPT Image's 2048x2048. For print or high-DPI displays, this is a constraint.
- • Uncertain future. OpenAI has not committed to long-term DALL-E 3 API maintenance. It could be deprecated with limited notice, as the ChatGPT integration was.
Frequently Asked Questions
Has DALL-E 3 been discontinued?
DALL-E 3 has been removed from ChatGPT and replaced by GPT Image. The DALL-E 3 API endpoint remains active for developers. OpenAI has not published a formal deprecation timeline for the API, but the direction is clear — GPT Image is the future of OpenAI's image generation stack.
What is the ELO rating for GPT Image 1.5?
GPT Image 1.5 holds an ELO of 1264 on LM Arena, placing it at #1 among all tested image generation models. This score comes from blind head-to-head comparisons where community voters choose between two unlabeled outputs. The gap between #1 (GPT Image) and #2 is significant enough that it is not a statistical tie.
Can I use GPT Image without ChatGPT Plus?
Yes. Free ChatGPT users have access to GPT Image with daily generation limits (roughly 2–3 images). For heavier use, ChatGPT Plus ($20/month) raises the cap considerably. Developers can also access the model directly through the GPT Image API (gpt-image-1) on a pay-per-image basis without any ChatGPT subscription.
Is GPT Image better than Midjourney?
On aggregate LM Arena rankings, GPT Image 1.5 scores higher. In practice, it depends on the task. GPT Image excels at instruction following, text rendering, and iterative refinement through conversation. Midjourney remains stronger for artistic stylization and consistently produces a more distinctive aesthetic. For product and marketing work, GPT Image is often the better choice. For fine art and creative exploration, Midjourney holds its ground.
How much does GPT Image cost through the API?
The GPT Image API (gpt-image-1) costs approximately $0.04 per image at standard quality (1024x1024) and up to ~$0.17 at maximum quality and resolution. DALL-E 3 API ranges from $0.04 to $0.08 per image. For low-volume use, the difference is negligible. At scale (thousands of images per day), DALL-E 3 can be meaningfully cheaper.
Can GPT Image edit existing photos?
Yes. You can upload an image to ChatGPT and ask GPT Image to modify it — change backgrounds, add or remove elements, adjust colors, overlay text. This works through natural language without needing to create mask images. DALL-E 3 also supports editing, but only through the API with explicit mask specification, which is more technical and less accessible.
Which Should You Use?
If you are a ChatGPT user, you do not have a choice — GPT Image is what you get, and frankly, it is the better model. The native integration, conversational refinement, and dramatically improved text rendering make it a genuine upgrade over what DALL-E 3 offered inside ChatGPT.
If you are a developer working with the API, the answer depends on your use case:
Quick Decision Guide
- • Need text in images: GPT Image. Not close.
- • Batch generation at scale: DALL-E 3 API. Cheaper, faster, predictable.
- • Interactive/iterative workflow: GPT Image via ChatGPT or API with conversation history.
- • Image editing from reference: GPT Image. Multimodal input is a major advantage.
- • Tight budget, basic needs: DALL-E 3 API or ChatGPT Free (with limits).
- • Future-proofing: GPT Image. DALL-E 3's API future is uncertain.
The broader trend is clear: OpenAI is moving image generation from a standalone tool into a native capability of their language models. GPT Image 1.5 is the result of that strategy, and it works. DALL-E as a brand is likely headed for the same fate as Codex — absorbed into the main product line, with the API endpoint maintained for backward compatibility until it quietly disappears.
For most people reading this, GPT Image 1.5 through ChatGPT Plus at $20/month is the practical answer. You get the #1-ranked image model bundled with everything else ChatGPT offers. Unless you have a specific reason to use the DALL-E 3 API, the transition has already happened — you just might not have noticed.
Source: This comparison is based on hands-on testing of GPT Image 1.5 (via ChatGPT Plus) and DALL-E 3 (via OpenAI API) using 30 identical prompts across 6 categories. LM Arena rankings referenced from lmarena.ai as of March 2026. API pricing from OpenAI's published rate cards. Testing conducted over two weeks in March 2026.
Related reading on OpenAI Tools Hub: