Some developers say they’ve had largely positive experiences with GPT-5 so far. Jenny Wang, an engineer, investor, and creator of the personal styling agent Alta, told WIRED the model appears to be better at completing complex coding tasks in one shot than other models. She compared it to OpenAI’s o3 and 4o, which she uses frequently for code generation and straightforward fixes “like formatting, or if I want to create an API endpoint similar to what I already have,” Wang says.
In her tests of GPT-5, Wang says she asked the model to generate code for a press page for her company’s website, including specific design elements that would match the rest of the site’s aesthetic. GPT-5 completed the task in one take, whereas in the past, Wang would have had to revise her prompts during the process. There was one significant error, though: “It hallucinated the URLs,” Wang says.
Another developer, who spoke on the condition of anonymity because their employer didn’t authorize them to speak to the press, says GPT-5 excels at solving deep technical problems.
The developer’s current hobby project is writing a programmatic network analysis tool, one that would require code isolation for security purposes. “I basically presented my project and some paths I was considering, and GPT-5 took it all in and gave back a few recommendations along with a realistic timeline,” the developer explains. “I’m impressed.”
A handful of OpenAI’s enterprise partners and customers, including Cursor, Windsurf, and Notion, have publicly vouched for GPT-5’s coding and reasoning skills. (OpenAI included many of these remarks in its own blog post announcing the new model.) Notion also shared on X that it’s “fast, thorough, and handles complex work 15 percent better than other models we’ve tested.”
But within days of GPT-5’s release, some developers were weighing in online with complaints. Many said that GPT-5’s coding abilities seemed behind the curve for what was supposed to be a state-of-the-art, ultra-capable model from the world’s buzziest AI company.
“OpenAI’s GPT-5 is very good, but it seems like something that would have been released a year ago,” says Kieran Klassen, a developer who has been building an AI assistant for email inboxes. “Its coding capabilities remind me of Sonnet 3.5,” he adds, referring to an Anthropic model that launched in June 2024.
Amir Salihefendić, founder of the startup company Doist, said in a social media post that he’s been using GPT-5 in Cursor and has found it “pretty underwhelming” and that “it’s especially bad at coding.” He said the release of GPT-5 felt like a “Llama 4 moment,” referring to Meta’s AI model, which had also disappointed some people in the AI community.
On X, developer Mckay Wrigley wrote that GPT-5 is a “phenomenal everyday chat model,” but when it comes to coding, “I will still be using Claude Code + Opus.”
Other developers describe GPT-5 as “exhaustive,” at times helpful, but often irritating in its long-windedness. Wang, who was pleased overall with the frontend coding project she assigned to GPT-5, says that she did notice that the model was “more redundant. It clearly could have come up with a cleaner or shorter solution.” (Kapoor points out that the verbosity of GPT-5 can be adjusted, so that users can ask it to be less chatty or even do less reasoning in exchange for better performance or cheaper pricing.)
