OpenAI Says Its New GPT-5.5 Model Is More Efficient and Better At Coding (theverge.com)
OpenAI released its new GPT-5.5 model today, which the company calls its "smartest and most intuitive to use model yet, and the next step toward a new way of getting work done on a computer." The Verge reports: OpenAI just released GPT-5.4 last month, but says that the new GPT-5.5 "excels" at tasks like writing and debugging code, doing research online, making spreadsheets and documents, and doing that work across different tools. "Instead of carefully managing every step, you can give GPT-5.5 a messy, multi-part task and trust it to plan, use tools, check its work, navigate through ambiguity, and keep going," according to OpenAI. The company also notes that GPT-5.5 will have its "strongest set of safeguards to date" and can use "significantly fewer" tokens to complete tasks in Codex. GPT-5.5 is rolling out on Thursday for Plus, Pro, Business, and Enterprise ChatGPT tiers and Codex, with GPT-5.5 Pro coming to Pro, Business, and Enterprise users.
Sure (Score:3)
My butcher says meat is healthier than bread, and my baker says just the opposite.
I eat both with a grain of salt. :-)
Re: (Score:2)
My butcher says meat is healthier than bread, and my baker says just the opposite.
I eat both with a grain of salt. :-)
By "both" do you mean meat and bread, or your butcher and baker? The latter seems low in bread.
Re: (Score:2)
Sure, everybody touts their own products. But OpenAI has some reason to brag.
In my own comparison tests of coding LLMs, I've found Anthropic and OpenAI models superior. And OpenAI's are much faster than Anthropic's, with similar results.
It's not *just* hot air.
Re: (Score:2)
Do me a favor: get or rent a GPU somewhere and run your own instance of Qwen 3.6 35B A3B. But make sure you give it the Playwright MCP and, for a bonus, toss in a web search MCP. I think you'll find that the other two still have a speed advantage as far as tokens spent. But that Qwen fini
Better (Score:2)
Re: (Score:2)
Well, the 'nice' thing about this sort of language is that it can frequently be true multiple times. It's "better," but how close to "good enough" is unspecified.
The Anthropic one was interesting because the original person behind it was fairly nuanced and honest. The stunt needed an existing reference implementation as a basis as well as a boatload of unit tests, needed hand-holding, and still didn't quite pass the big test of compiling the kernel (needed to borrow missing bits that Claude couldn't figure ou
Re: (Score:2)
The C compiler was just an interesting experiment by some guy, not a claim of anything by any company.
I still get a laugh from the guy who pointed out his even-more-powerful AI model, 'cp', which he asked to write a gcc-compatible compiler via 'cp /src/gcc/* /$home/compiler/'.
Re: (Score:1)
Indeed. It is all smoke and mirrors and abysmally bad business numbers. I have decided to amuse myself by doing some research into AI failing at things in the meantime.
Re: Problem is, they said that last time. (Score:1)
Re: (Score:1)
You are projecting. How pathetic.
Translation: Still sucks (Score:2, Funny)
Just a teeny bit less. Not that the mindless fans will care.
Oh, and how are those revenue numbers? Still "certain death soon" level?
Re: Translation: Still sucks (Score:1)
Re: (Score:2)
How much "mindless fanboi" can you get?
more faster (Score:3)
Now with more slop delivered faster!
Re: (Score:2)
The latest coding models have moved beyond slop. They actually write decent code.
Just a few months ago, I had to micromanage every code change. These days, with GPT-5.4, it usually gets it right the first time, even for larger code updates. It does a great job of following the coding patterns and conventions YOU demonstrate in your code base. It's actually not hard to read or...sloppy.
So what happened to... (Score:1)
But I thought you told us that by now no programmer would have a job?
And I still have one.
Go bust already, Dirty Sam and friends.
Re: So what happened to... (Score:1)
Re: (Score:2)
>> Literally in the last 6 months, maybe less, everything has changed
Same experience here. I'm giving AI some very complicated coding tasks to do for me and, with sufficient flogging on my part, it can do them amazingly well. Sometimes I have to switch models if the one I'm using bogs down, but no prob. There are several premium and second-tier models to choose from.
I frequently get a couple of weeks' worth of work done in a day. I can try stretch objectives I would have never had time for and often th
Slop (Score:2)
Now we have 99% of the people in the industry who joined for the money, not because they enjoy programming. They write more slop.
AI is just the next iteration. Doesn't work in edge cases? No problem, write a ticket and I'll fix that. I'm ju