Nvidia launches DGX Spark, a compact AI supercomputer for desktops
Compact Design, Powerful Software
The DGX Spark is an ARM‑based system that runs Nvidia's DGX OS, an Ubuntu Linux‑based operating system built specifically for GPU processing. It comes with Nvidia's AI software stack preinstalled, including CUDA libraries and the company's NIM microservices.
Pricing and Performance
Prices for the DGX Spark start at US $3,999. According to The Register, the GPU computing performance of the GB10 chip is roughly equivalent to an RTX 5070. However, the RTX 5070 is limited to 12 GB of video memory, while the DGX Spark provides 128 GB of unified memory, allowing it to run far larger AI models.
Memory Advantage
With 128 GB of unified memory, the DGX Spark can handle models that would exceed the capacity of typical consumer GPUs. For example, a 120‑billion‑parameter language model would require about 80 GB of memory, far more than most single‑GPU solutions can accommodate.
CEO Jensen Huang’s Personal Delivery
Nvidia founder and CEO Jensen Huang marked the occasion of the DGX Spark launch by personally delivering one of the first units to Elon Musk at SpaceX's Starbase facility in Texas. This echoes a similar delivery Huang made to Musk at OpenAI in 2016. In a statement, Huang said, "In 2016, we built DGX‑1 to give AI researchers their own supercomputer. I hand‑delivered the first system to Elon at a small startup called OpenAI, and from it came ChatGPT. DGX‑1 launched the era of AI supercomputers and unlocked the scaling laws that drive modern AI. With DGX Spark, we return to that mission."
Market Position
While the DGX Spark is not as powerful as high‑end GPUs like the RTX 5090 or AI server GPUs such as the H100, its lower price point and substantial memory make it a less expensive option for organizations seeking on‑premise AI capability without the cost of large‑scale server infrastructure.
Usado: News Factory APP - descoberta e automação de notícias - ChatGPT para Empresas