From somewhere in the middle of Nvidia’s endless press waffle:
“The RTX Spark superchip features an NVIDIA Blackwell RTX GPU with 6,144 CUDA cores and fifth-generation Tensor Cores with FP4 precision, connected via the NVIDIA NVLink®-C2C chip-to-chip interconnect to a high-performance, 20-core NVIDIA Grace™ CPU.
MediaTek, a market leader in Arm-based system-on-a-chip designs, collaborated with NVIDIA on the custom CPU design, contributing to its best-in-class power efficiency, performance and connectivity.“
As best as I can tell its something like the Apple M series SoC, but for Windows: CPU + GPU with unified memory.
It has 6,144 CUDA cores is similar to a RTX 4070 (5,888) but a lot less than a 4090 (16,384), but what it does have is support for FP4.
When they claim "1 Petaflop AI compute", thats what they mean. For comparison, a RTX 4090 has ~1.3 Petaflops of FP8 processing.
The second big deal is the NVLink-C2C interconnect, which provides up to 900 GB/s of bidirectional bandwidth between GPU and CPU. For comparison, the Apple M4 has 120 GB/s and the M3 Ultra has 819 GB/s. Notably, the Apple M series does not have FP4 support, so this could mean a significant performance improvement over Apple's offerings.
This reads as lip service for investors with no real life value. "Remember guys, we are still AI frontier material!"
Skimmed through the page no mention of what spark is. Is it a new ISA? SoC with CPU, GPU and NPU? Or just GPU+AI?
From somewhere in the middle of Nvidia’s endless press waffle:
“The RTX Spark superchip features an NVIDIA Blackwell RTX GPU with 6,144 CUDA cores and fifth-generation Tensor Cores with FP4 precision, connected via the NVIDIA NVLink®-C2C chip-to-chip interconnect to a high-performance, 20-core NVIDIA Grace™ CPU.
MediaTek, a market leader in Arm-based system-on-a-chip designs, collaborated with NVIDIA on the custom CPU design, contributing to its best-in-class power efficiency, performance and connectivity.“
https://nvidianews.nvidia.com/news/nvidia-microsoft-windows-...
Nvidia Grace is an ARM core.
As best as I can tell its something like the Apple M series SoC, but for Windows: CPU + GPU with unified memory.
It has 6,144 CUDA cores is similar to a RTX 4070 (5,888) but a lot less than a 4090 (16,384), but what it does have is support for FP4.
When they claim "1 Petaflop AI compute", thats what they mean. For comparison, a RTX 4090 has ~1.3 Petaflops of FP8 processing.
The second big deal is the NVLink-C2C interconnect, which provides up to 900 GB/s of bidirectional bandwidth between GPU and CPU. For comparison, the Apple M4 has 120 GB/s and the M3 Ultra has 819 GB/s. Notably, the Apple M series does not have FP4 support, so this could mean a significant performance improvement over Apple's offerings.