
In addition, the company announced the HPE Cray Supercomputing GX240 liquid-cooled compute blade for its GX5000 platform. The GX240 starts with 16 Nvidia Vera CPUs per blade and scales to 40 blades per rack, supporting up to 640 Nvidia Vera CPUs and 56,320 ARM cores per rack.
In addition, HPE said new network connectivity—Nvidia Quantum-X800 InfiniBand—optimized for large-scale system connectivity is now available with HPE Cray Supercomputing GX5000. The Quantum-X800 InfiniBand switches provide 144 ports of 800 Gb/s connectivity per port with power efficiency features, the vendor stated.
The vendor also rolled out the HPE Compute XD700, an AI server built on Nvidia HGX Rubin NVL8. The system is designed to deliver higher GPU density per rack and reduce space, power, and cooling costs while increasing AI training and inference throughput. Each rack of XD700 servers supports up to 128 Rubin GPUs, providing double the GPU density compared to the previous generation, according to HPE.
During his GTC opening keynote, Nvidia CEO Jensen Huang said: “Vera is arriving at a turning point for AI. As intelligence becomes agentic—capable of reasoning and acting—the importance of the systems orchestrating that work is elevated. The CPU is no longer simply supporting the model; it’s driving it. With breakthrough performance and energy efficiency, Vera unlocks AI systems that think faster and scale further.”
