
Instinct MI350P PCIe cards are available in air-cooled systems with up to eight accelerator cards, making them suited to inference and retrieval-augmented generation (RAG) pipelines on small, medium, and large AI models. Each card has 144GB of HBM3E high-bandwidth memory with up to 4TB/s of bandwidth.
Performance is estimated at 2,299 teraflops (TFLOPS) and up to 4,600 peak TFLOPS at MXFP4, which AMD says is the highest performance currently available in an enterprise PCIe card. The card natively supports the lower-precision MXFP6 and MXFP4 formats, which deliver high throughput, and it uses sparsity support to accelerate most mainstream 8- and 16-bit precisions.
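To give a feel for what lower-precision formats mean for the 144GB of memory per card, here is a rough weights-only estimate in Python. It is a back-of-the-envelope sketch, not an AMD sizing tool: the model sizes are hypothetical, the function names are made up for illustration, and the MXFP4 and MXFP6 figures assume one shared 8-bit scale per block of 32 values, as described in the OCP Microscaling specification. KV cache, activations, and runtime overhead are ignored.

```python
import math

CARD_MEMORY_GB = 144  # per-card HBM3E capacity cited in the article
MAX_CARDS = 8         # largest air-cooled configuration cited in the article

# Approximate storage cost in bits per parameter.
# MX entries assume a shared 8-bit scale per 32-element block (assumption).
BITS_PER_PARAM = {
    "BF16": 16.0,
    "INT8": 8.0,
    "MXFP6": 6.0 + 8.0 / 32,
    "MXFP4": 4.0 + 8.0 / 32,
}


def weights_gb(params_billions: float, fmt: str) -> float:
    """Approximate weight storage in GB (10^9 bytes) for a model."""
    return params_billions * BITS_PER_PARAM[fmt] / 8.0


def cards_needed(params_billions: float, fmt: str) -> int:
    """Minimum number of cards whose combined memory holds the weights."""
    return math.ceil(weights_gb(params_billions, fmt) / CARD_MEMORY_GB)


if __name__ == "__main__":
    # Hypothetical model sizes, in billions of parameters.
    for size in (70, 405):
        for fmt in BITS_PER_PARAM:
            gb = weights_gb(size, fmt)
            n = cards_needed(size, fmt)
            fits = "fits" if n <= MAX_CARDS else "exceeds"
            print(f"{size}B @ {fmt}: {gb:.1f} GB, {n} card(s), {fits} an 8-card system")
```

Under these assumptions, a hypothetical 70-billion-parameter model needs about 140GB for weights at BF16 but only about 37GB at MXFP4, which leaves far more room on a single card for everything else an inference server keeps in memory.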
The MI350P card supports a technique called sparsity, in which zero values in data sets and matrices are skipped, reducing processing time. Sparsity support means higher-precision formats such as INT8 and BF16 also deliver efficient performance, according to AMD.
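The idea behind sparsity can be illustrated in a few lines of Python. This is a conceptual sketch only: the article does not describe how the MI350P implements sparsity in hardware, and the function names and sample values below are invented for the example. It simply compares a dot product that multiplies every pair of values with one that skips pairs where the weight is zero.

```python
def dot_dense(weights, activations):
    """Multiply every weight/activation pair, zeros included."""
    total = 0.0
    multiplies = 0
    for w, a in zip(weights, activations):
        total += w * a
        multiplies += 1
    return total, multiplies


def dot_sparse(weights, activations):
    """Skip pairs whose weight is zero; they cannot change the sum."""
    total = 0.0
    multiplies = 0
    for w, a in zip(weights, activations):
        if w == 0:
            continue
        total += w * a
        multiplies += 1
    return total, multiplies


if __name__ == "__main__":
    # Illustrative values: five of the eight weights are zero.
    weights = [0.5, 0.0, 0.0, -1.0, 0.0, 2.0, 0.0, 0.0]
    activations = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]

    print(dot_dense(weights, activations))
    print(dot_sparse(weights, activations))
```

Both functions return the same sum, but the sparse version performs three multiplications instead of eight. The saving grows with the share of zeros in the data, which is why sparsity can make the wider INT8 and BF16 formats cheaper to process.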
