Lo nuevo en Article Factory y lo último en el mundo de la IA generativa

Anthropic Launches Claude Opus 4.6 with Enhanced Capabilities and Safety

Anthropic Launches Claude Opus 4.6 with Enhanced Capabilities and Safety
Anthropic announced Claude Opus 4.6, branding it as a direct upgrade that handles complex, multi‑step tasks with higher quality on the first try. The model expands beyond coding to improve work in documents, spreadsheets, and presentations, and adds a one‑million token context window in beta. New features include agent‑team collaboration for developers and expanded cybersecurity safeguards. Pricing remains the same as the predecessor, and the model is positioned as a more production‑ready solution for a broad range of knowledge‑work applications. Leer más →

AI Models Fall Short on New Professional Benchmark, Researchers Find

AI Models Fall Short on New Professional Benchmark, Researchers Find
A new benchmark called APEX-Agents, designed to test AI performance on real-world professional tasks in consulting, investment banking, and law, reveals that current AI models struggle to meet the demands of knowledge work. Researchers from Mercur report that even top-performing models answer only about a quarter of the questions correctly, highlighting challenges in multi-domain reasoning and information retrieval across tools like Slack and Google Drive. The findings suggest that AI is still far from replacing skilled professionals in high‑value roles. Leer más →