sjt-at-rev
yesterday at 2:01 PM
I think the other side of that coin is how much effort it takes to get a model to do what you need. Our pipeline is a sequence of very precise tasks where subtle contextual cues matter a lot, and there are large classes of related error modes.
So yes, we can eventually get any of these models to do what we need -- e.g. by tuning prompts to their particular style, adding more examples, or breaking tasks into smaller steps -- but their instruction following has a huge impact on how quickly we can move as a team.
When I say "stinks": for me, if we do three rounds of optimization and testing and a model is still performing inconsistently across a class of related traps, then using that model is going to slow us down, and I think it stinks.
In my experience, gemini3.1pro tends to work consistently with light nudging, GLM gets there after 2-ish rounds of optimization, and GPT5.4 provided no improvement over prior models -- it would slow us down meaningfully compared to the others, and costs too much for the effort.
So, meh, I still think it stinks, skill level considered.