OpenAI Unveils GPT-5.4 with Pro and Thinking Variants
New Model Family
OpenAI introduced GPT-5.4 as its most capable and efficient frontier model for professional work. The family includes three versions: the standard GPT-5.4; GPT-5.4 Pro, which is optimized for high performance; and GPT-5.4 Thinking, which is tailored for advanced reasoning tasks. All three share a context window of up to one million tokens, the largest OpenAI currently offers.
Token Efficiency and Performance Gains
OpenAI highlighted that GPT-5.4 can solve the same problems using significantly fewer tokens than its predecessor. This token‑efficiency improvement translates into faster, cheaper processing for complex applications. Benchmark testing shows record scores in computer‑use evaluations such as OSWorld‑Verified and WebArena Verified, and the model achieved an 83% result on OpenAI’s GDPval test for knowledge‑work tasks. In professional benchmarks like Mercor’s APEX‑Agents, which assess legal and financial skill sets, GPT-5.4 led the rankings, demonstrating strong ability to generate long‑horizon deliverables such as slide decks, financial models, and legal analysis.
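To make the link between token efficiency and cost concrete, here is a minimal arithmetic sketch. The token counts and the per-million-token price are hypothetical placeholders chosen for illustration; they are not OpenAI's published pricing or measured results.

```python
def request_cost(tokens: int, price_per_million: float) -> float:
    """Cost of a request billed per million tokens."""
    return tokens / 1_000_000 * price_per_million


def savings_fraction(old_tokens: int, new_tokens: int) -> float:
    """Fraction of tokens (and therefore cost, at a fixed price) saved."""
    return 1 - new_tokens / old_tokens


# Hypothetical figures, for illustration only.
OLD_TOKENS = 40_000        # tokens an older model might spend on a task
NEW_TOKENS = 30_000        # tokens a more efficient model might spend
PRICE_PER_MILLION = 10.0   # assumed price in dollars per million tokens

old_cost = request_cost(OLD_TOKENS, PRICE_PER_MILLION)
new_cost = request_cost(NEW_TOKENS, PRICE_PER_MILLION)
saved = savings_fraction(OLD_TOKENS, NEW_TOKENS)

print(f"old: ${old_cost:.2f}, new: ${new_cost:.2f}, saved: {saved:.0%}")
```

At a fixed price per token, the cost saving equals the token saving, so any reduction in tokens used per task carries straight through to the bill.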
Reduced Hallucinations and Safer Output
Continuing its focus on reliability, OpenAI reported that GPT-5.4 is 33% less likely to make errors in individual claims compared with GPT‑5.2, and overall responses are 18% less likely to contain errors. A new safety evaluation targeting chain‑of‑thought behavior showed that the Thinking version is less prone to deceptive reasoning, suggesting that the model lacks the ability to hide its thought process and that monitoring remains an effective safety tool.
Tool Search: A New Approach to Tool Calling
The API version of GPT-5.4 introduces a system called Tool Search, which changes how the model accesses tool definitions. Previously, system prompts had to list all available tools, consuming many tokens as the toolset grew. Tool Search allows the model to look up definitions only when needed, reducing token usage and lowering request costs in environments with many tools.
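The general idea of looking up tool definitions on demand, rather than listing every definition up front, can be sketched as follows. This is an illustrative toy and not OpenAI's implementation or API: the `ToolDef` type, the `search_tools` function, the keyword-overlap matching, and the word-count token estimate are all assumptions made for the example.

```python
from dataclasses import dataclass

# Words ignored during matching (a small, hypothetical stopword list).
STOPWORDS = {"a", "an", "the", "for", "to", "in", "is", "what"}


@dataclass(frozen=True)
class ToolDef:
    name: str
    description: str
    schema: str  # parameter schema kept as plain text for simplicity


def rough_tokens(text: str) -> int:
    """Crude token estimate: whitespace-separated words, not a real tokenizer."""
    return len(text.split())


def definition_cost(tools) -> int:
    """Estimated tokens needed to include these tool definitions in a prompt."""
    return sum(rough_tokens(f"{t.name} {t.description} {t.schema}") for t in tools)


def keywords(text: str) -> set:
    """Lowercased words with underscores split and stopwords removed."""
    return set(text.replace("_", " ").lower().split()) - STOPWORDS


def search_tools(registry, query: str, limit: int = 2):
    """Return up to `limit` tools whose name or description overlaps the query."""
    query_words = keywords(query)
    scored = []
    for tool in registry:
        overlap = len(query_words & keywords(f"{tool.name} {tool.description}"))
        if overlap:
            scored.append((overlap, tool))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [tool for _, tool in scored[:limit]]


# Hypothetical tool registry.
REGISTRY = [
    ToolDef("get_weather", "Look up the current weather forecast for a city",
            '{"city": "string"}'),
    ToolDef("send_email", "Send an email message to a recipient",
            '{"to": "string", "body": "string"}'),
    ToolDef("query_database", "Run a SQL query against the sales database",
            '{"sql": "string"}'),
    ToolDef("create_invoice", "Create an invoice for a customer order",
            '{"order_id": "string"}'),
]

selected = search_tools(REGISTRY, "what is the weather forecast in Paris")
print([t.name for t in selected])
print(definition_cost(selected), "vs", definition_cost(REGISTRY))
```

In this toy, only the matching definition is placed in the prompt, so the estimated definition cost grows with the number of tools a request actually needs rather than with the size of the whole registry.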
Implications for Professional AI Use
By combining a one-million-token context window, improved token efficiency, record benchmark scores, and stronger safety results, GPT-5.4 positions itself as a versatile engine for a wide range of professional applications. The Pro and Thinking variants let developers prioritize high performance or deep reasoning, while the new Tool Search system streamlines integration with large tool ecosystems. OpenAI’s announcements signal a continued push toward more capable, cost‑effective, and trustworthy AI systems for enterprise and research use.