DUMB DEV Community

Grok 4 vs. Claude Opus 4 vs. Gemini 2.5 Pro Coding Comparison 🚀

Shrijal Acharya on July 16, 2025

With the recent release of Grok 4, supposedly the most intelligent AI model, there's a significant question about how well this model performs in c...
 
Josef Röyem

In all honesty, in terms of pure coding ability, Claude 3.7 Sonnet still wins: it often produces cleaner, more reliable code and excels at complex reasoning. Grok has its benefits, but it falls short on instruction adherence and hits rate limits really fast under heavy usage. Opus edges ahead in raw performance, which is noticeable on extra-long or intricate tasks. I guess whatever suits the use case is best.

Shrijal Acharya

I've moved on from 3.7 Sonnet, though; currently I'm sticking with Sonnet 4 and Opus. I don't have a great use case for Grok; it's not that good at coding, after all. Once they launch the code-tuned model next month, we'll see if things change for me. It's true, you go with whatever suits the situation best.

Josef Röyem

Makes perfect sense, Sonnet 4 and Opus are both solid choices! I'm also curious to see how Grok's code-tuned model performs. I'd even go as far as to add that if you truly want to get the most out of your tokens, switching between models usually generates better results (e.g., one for writing, one for code, one for bugs), provided you can keep each model within the proper context/scope.

Shrijal Acharya

I'm so looking forward to their code model release. It's always a good idea to use different models based on the use case. I'll most likely test their code model this August. Will ping you then.

Aravind d

I've just started trying Grok 4 and, TBH, I'm not happy.

Shrijal Acharya

Really? Where did it fall short? I guess it's in coding? Opus all the way.

Paul Towers

Great comparison. Didn't Grok say they were releasing a coding-optimised version in the coming months? It would be interesting to redo the comparison when they do.

Shrijal Acharya

Sure. It's coming next month, in August. I'll do a quick coding test with the coding-tuned model then.

Nettie

I don't plan to use this on a regular basis. What's the pricing?

Joan Roling

This is free to use. The terminal app costs a little, but you can run it on your local machine or the server.

Shrijal Acharya

It depends on which coding agent and model you're using. Otherwise, the MCP server is free.

Shrijal Acharya

For this entire test, I've used Cursor. It comes with some free requests, but it's not completely free, and to use any other custom models, you need to have the Pro plan.

Shrijal Acharya

Folks, let me know how your experience with Grok 4 has been so far! ✌️

Aavash Parajuli

Great comparison indeed. Thanks for sharing this, Shrijal! Love your work. ☺️

Shrijal Acharya

Thanks for checking it out, Aavash! ✌️