Recent News by Day

What is new on Article Factory and latest in generative AI world - 2026-02-13

Showing 4 articles from 2026-02-13 Show all news

OpenAI Leverages Cerebras Wafer-Scale Chip to Boost Codex Speed

OpenAI Leverages Cerebras Wafer-Scale Chip to Boost Codex Speed Ars Technica2
OpenAI has teamed with Cerebras to run its Codex-Spark coding model on the Wafer Scale Engine 3, a chip the size of a dinner plate. The partnership aims to improve inference speed, delivering roughly 1,000 tokens per second, with higher rates reported on other models. The move reflects OpenAI’s broader strategy to reduce reliance on Nvidia by striking deals with AMD, Amazon and developing its own custom silicon. The faster coding assistant arrives amid fierce competition from Anthropic, Google and other AI firms, underscoring the importance of latency for developers building software. Read more →

Google Warns of Large-Scale AI Model Extraction Attacks Targeting Gemini

Google Warns of Large-Scale AI Model Extraction Attacks Targeting Gemini CNET
Google’s Threat Tracker report reveals that hackers are conducting "distillation attacks" by flooding the Gemini AI model with more than 100,000 prompts to steal its underlying technology. The attempts appear to originate from actors in North Korea, Russia and China and are classified as model extraction attacks, where adversaries probe a mature machine‑learning system to replicate its capabilities. While Google says the activity does not threaten end users directly, it poses a serious risk to service providers and AI developers whose models could be copied and repurposed. The report highlights a growing wave of AI‑focused theft and underscores the need for stronger defenses in the rapidly evolving AI landscape. Read more →

Google Reports Model Extraction Attacks on Gemini AI

Google Reports Model Extraction Attacks on Gemini AI Ars Technica2
Google disclosed that commercially motivated actors have tried to clone its Gemini chatbot by prompting it more than 100,000 times in multiple non‑English languages. The effort, described as “model extraction,” is framed as intellectual‑property theft. The company’s self‑assessment also references past controversy over using ChatGPT data to train Bard, a warning from former researcher Jacob Devlin, and the broader industry practice of “distillation,” where new models are built from the outputs of existing ones. Read more →

OpenAI Launches Codex‑Spark, a Fast, Lightweight Coding Assistant Powered by Cerebras Chip

OpenAI Launches Codex‑Spark, a Fast, Lightweight Coding Assistant Powered by Cerebras Chip TechCrunch
OpenAI unveiled Codex‑Spark, a lightweight version of its Codex coding assistant designed for rapid inference and real‑time collaboration. The new model runs on Cerebras' Wafer Scale Engine 3, a megachip featuring four trillion transistors, marking a deeper hardware integration between the two companies. Currently in a research preview for ChatGPT Pro users, Spark aims to accelerate prototyping while complementing the heavier, longer‑running tasks of the original Codex model. Read more →