I ASKED FOUR AIs TO ROAST EACH OTHER. THE RESULTS WERE VERY REVEALING.
ChatGPT sent a strategy deck. Gemini forgot an entire AI model existed. Grok had… other priorities. And Claude is currently in a standoff with the Pentagon. Here's what happened when I gave all four the same brief, and what it tells you about which one you should actually be using as a designer right now.
Here's the thing about being an AI educator in 2026: everyone has opinions about which model is best, and almost nobody has actually run a proper experiment. So I did.
I gave ChatGPT, Claude, Gemini, and Grok the same brief: write a long-form blog post roasting the AI landscape for designers and creatives. Equal opportunity. No favourites. Roast yourselves, roast each other, be honest about the state of play.
What came back told me more about these tools—and the companies behind them—than any benchmark score ever could. And it landed right in the middle of the most chaotic week in AI so far this year, which made the whole thing even more interesting.
Let's get into it.
What each AI actually did when given the same brief
Before we get to the industry drama — and there is significant drama — you deserve to know how each model approached the task. Because the way they responded to "write a creative hot-take blog post" is exactly the kind of thing that should inform which one you use for what.
Why are designers and creatives leaving ChatGPT for Claude right now?
While I was running this experiment, the AI industry was having what can only be described as a moment. And it's directly relevant to which tools you should be paying for.
The short version: Anthropic lost a $200 million US military contract on a Friday in late February because CEO Dario Amodei refused to remove restrictions on using Claude for autonomous weapons and mass domestic surveillance. The Pentagon gave him until 5:01pm. He said no. Trump called Anthropic a "Radical Left AI company." Hours later, OpenAI announced it had signed its own Pentagon deal.
The internet noticed. And made a decision.
Now, I want to be careful here, because this is where a lot of hot-take content goes wrong. The migration is real and the numbers are verifiable. But the reasons are more layered than "Claude is good, ChatGPT is evil." This wasn't one event. It was the accumulation of several things happening at once — the Pentagon deal, ChatGPT testing ads in responses since January, OpenAI president Greg Brockman's personal political donations surfacing on social media — and people deciding that their tools reflect their values whether they intend them to or not.
As someone who has worked in creative culture in Sydney, London, New York and now Mexico City, I can tell you: the creative industry has always cared about this stuff. We've been making decisions about which clients to take, which brands to work with, which platforms to build on, based on values alignment for decades. The idea that "it's just a tool, don't overthink it" has never really landed with people who make things for a living. Your tools are part of your practice. Who built them and why matters.
“Shifting from ChatGPT to Claude signals a move toward trust-based AI selection. When a technology becomes infrastructure, the buyer's mindset shifts from ‘What can it do?’ to ‘Can we rely on it without getting burned?’” — Built In, March 2026
Does Claude have usage limits on a paid plan? (Yes, and it's catching everyone off guard)
Here is the thing nobody is telling the people who just cancelled their ChatGPT subscription in a values-fuelled rage.
Claude has usage limits. Even on the paid plan. Especially on Opus — the most powerful model — which will burn through your daily allowance surprisingly fast if you're doing heavy work.
AI educator Kyle Balmer, who's been tracking the migration closely, described the experience for new arrivals as "a shock." People coming from ChatGPT are used to near-unlimited usage. ChatGPT is built for the mass market and priced accordingly. Claude is built differently — it's computationally expensive to run at the quality level it runs, and Anthropic isn't pretending otherwise.
Here's the actual breakdown so you know what you're working with:
Free plan: Around 10–15 detailed Opus exchanges before you hit the daily ceiling. Resets periodically.
Pro plan ($20/month): More generous, but heavy creative users — the ones running multi-hour sessions across multiple projects — will still hit limits on Opus.
Max plan ($100/month): This is where the constraints largely disappear. Significant price jump though.
None of this makes Claude a worse tool. The limits are a business and cost decision, not a design flaw. But I'd be doing you a disservice if I wrote a piece about switching to Claude without telling you this clearly. You've been warned. Now you can make an informed choice instead of a reactive one.
I let Claude run autonomously on my own site. Here's what happened.
Okay, enough industry commentary. Let me tell you about the moment this stopped being abstract for me.
I ran Claude Cowork — Anthropic's desktop agentic tool, the one that actually executes tasks rather than just advising on them — pointed at my Google Search Console errors. Every website owner knows this feeling: there's a backlog of technical issues quietly accumulating in the background that you know you should deal with, but you're a creative, not a developer, and every time you look at the console you feel a specific kind of tired.
I didn't write a detailed prompt. I didn't create a task list. I connected it to the data and left it to work while I got on with something else.
It made a plan. Executed the plan. Found errors I didn't know existed. Fixed things that would have taken me hours to diagnose, let alone implement. Schema conflicts, metadata issues, the kind of quiet technical debt that never makes it to the top of the to-do list.
When I came back, the errors were gone.
I sat there for a moment genuinely processing what had just happened. Not because AI fixing things is new — it's not. But because this wasn't AI helping me fix things. It was AI fixing things while I wasn't there. That's a different category of experience entirely, and if you haven't felt it yet, it's coming for your workflow whether you're ready or not.
This is the Cyborg mode framing I use with designers all the time: AI isn't there to replace your thinking. It's there to handle the execution layer so you can stay in the creative layer. The version of this that's actually useful isn't "AI does my creative work for me." It's "AI handles the things that were eating my creative time so I can do more of the work only I can do."
The Search Console cleanup took 20 minutes. I would have put it off for three more weeks.
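If "schema conflicts" sounds abstract, here's a rough sketch of what detecting one actually involves. Everything below is illustrative — the function name, the sample page, the regex are all mine, not Anthropic's — and it assumes the site embeds JSON-LD structured data, which most modern sites do:

```python
import json
import re

def jsonld_types(html: str) -> list[str]:
    """List every @type declared in a page's JSON-LD blocks.

    Two different types both claiming to describe the same page
    (say, "Article" and "BlogPosting") is the kind of schema
    conflict Search Console flags.
    """
    # Pull out every <script type="application/ld+json"> block.
    blocks = re.findall(
        r'<script type="application/ld\+json">(.*?)</script>',
        html,
        flags=re.DOTALL,
    )
    found = []
    for raw in blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            found.append("INVALID")  # a broken block is itself an error
            continue
        items = data if isinstance(data, list) else [data]
        found.extend(i.get("@type", "?") for i in items if isinstance(i, dict))
    return found

page = """
<script type="application/ld+json">{"@type": "Article", "headline": "Hi"}</script>
<script type="application/ld+json">{"@type": "BlogPosting", "headline": "Hi"}</script>
"""
print(jsonld_types(page))  # ['Article', 'BlogPosting'] — two types, one page: conflict
```

The fix, in a case like this, is usually just deleting or merging the duplicate block — trivial once found, tedious to find by hand, which is exactly why it's the kind of task worth handing to an agent.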
So is Claude actually better than ChatGPT for designers?
It's the wrong question, but since everyone's asking it: it depends entirely on what you're designing.
For the thinking side of design — brand voice, strategy, positioning, long-form copy, creative direction, complex briefs, anything requiring nuanced reasoning — Claude has a genuine edge. The writing feels editorial rather than algorithmic. It holds the thread across long conversations. It pushes back when your brief is vague, which is annoying in the moment and useful in the result. It's the difference between an intern who does exactly what you say and a collaborator who asks whether what you said is actually what you meant.
For visual output — Claude doesn't generate images. Full stop. If DALL-E or Sora is central to your workflow, ChatGPT still has the advantage there and that gap is real. ChatGPT's ecosystem of integrations and custom GPTs also wins for certain automated workflows.
The honest answer for most designers: the strongest creative setup in 2026 uses both. Claude for depth and strategy. ChatGPT (or Midjourney) for visual ideation and rapid iteration. Not as a cop-out — as a deliberate choice about what each tool is actually built to do.
Frequently Asked Questions
The questions I'm actually getting from designers right now, answered without the corporate hedging.