https://resources.nvidia.com/en-us-blackwell-architecture/blackwell-architecture-technical-brief?ncid=no-ncid
Table 3: all of the petaFLOPS and petaOPS figures are **with Sparsity**, except FP64, which is dense.
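For anyone reading those spec sheets, here is a minimal sketch of how to back the quoted sparse throughput out to a dense-equivalent figure, assuming Nvidia's usual 2:4 structured-sparsity convention (sparse rate = 2x dense); the numbers in the example are placeholders, not values from the brief:

```python
# Minimal sketch: convert spec-sheet throughput quoted "with sparsity" back to a
# dense-equivalent figure, assuming the usual 2:4 structured-sparsity convention
# (sparse rate = 2x dense rate). The example values are hypothetical placeholders,
# not numbers taken from the Blackwell technical brief.

SPARSITY_FACTOR = 2.0  # 2:4 structured sparsity doubles the quoted rate

def dense_equivalent(quoted_pflops: float, quoted_with_sparsity: bool = True) -> float:
    """Return the dense petaFLOPS implied by a quoted spec-sheet figure."""
    return quoted_pflops / SPARSITY_FACTOR if quoted_with_sparsity else quoted_pflops

if __name__ == "__main__":
    # Hypothetical rows in the shape of a "Table 3"-style spec sheet.
    quoted_specs = {
        "FP8 (with sparsity)": (10.0, True),
        "FP64 (dense)": (0.04, False),  # FP64 is quoted dense, so no adjustment
    }
    for label, (pflops, sparse) in quoted_specs.items():
        print(f"{label}: quoted {pflops} PF -> dense-equivalent {dense_equivalent(pflops, sparse)} PF")
```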
Buyers don’t spend CapEx on one thing, despite the AMD talking point you seem to be repeating. The folks who cut big checks for clusters want to make sure the stuff they are buying is not a one-trick pony. That is the selling point for Nvidia, despite all the claims by AMD. BTW, why don’t you ask them to submit these claims to MLPerf?
Tim, as much as I do not want to shoot the messenger, you are much more than a messenger (hopefully).
I think that’s a little unfair. I added pricing to it to show the relative cost, and if and when I have performance data for Llama 3.1, I will add that. It was food for thought, not a meal.
Where are the performance numbers, across a variety of applications and hardware configurations, like every other piece of high-tech hardware that makes a claim has to provide?
AMD’s refusal to lay its cards on the table (with MLPerf, for example) is a cynical attempt to stake out a high-ground narrative while nobody in the press holds them to account.
Based on the links back to this article, it’s working.
The reason is that the weights for a large model, depending on its size, fit in 8, 16, or sometimes 32 GPUs when it comes to inference. Hence the comparison. Training is tens of thousands of GPUs trying to do one thing at once. Inference is, conceptually, like really freaking huge Web serving: you get a big enough server to run the Web server and Java or whatever, and you round-robin across as many of them as possible to serve the number of streams you have. So, I don’t agree.
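To make the weight-footprint point concrete, here is a rough back-of-envelope sketch of how many GPUs it takes just to hold a model's weights for inference; the parameter counts, precision, and per-GPU HBM capacities are illustrative assumptions, and real deployments also need headroom for the KV cache and activations:

```python
import math

# Back-of-envelope sketch: how many GPUs are needed just to hold a model's
# weights for inference? Parameter counts, precision, and per-GPU HBM capacity
# below are illustrative assumptions; real deployments also need room for the
# KV cache, activations, and framework overhead.

BYTES_PER_PARAM = {"fp16": 2, "fp8": 1}

def gpus_to_hold_weights(params_billions: float, precision: str, hbm_gb_per_gpu: float) -> int:
    weight_gb = params_billions * 1e9 * BYTES_PER_PARAM[precision] / 1e9
    return math.ceil(weight_gb / hbm_gb_per_gpu)

if __name__ == "__main__":
    for params in (70, 405):            # Llama-class model sizes, in billions of parameters
        for hbm in (80, 141, 192):      # assumed per-GPU HBM capacities, in GB
            n = gpus_to_hold_weights(params, "fp16", hbm)
            print(f"{params}B params @ FP16 on {hbm} GB GPUs: >= {n} GPUs for weights alone")
```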
No, it’s not, for two reasons:
1. You and everyone else only check inference and ignore that customers may also want to do training. Public AI models are nice, but training on company-specific data is an untapped market that will primarily benefit Nvidia.
2. Every benchmark so far is on 8- to 16-GPU systems, which is a bit strange. What does benchmarking look like at scale? How do AMD and Nvidia compare in a cluster with hundreds or thousands of GPUs? Everyone talks about their clusters of thousands of GPUs, yet we only benchmark inference on 8 GPUs. It’s time for AMD to present itself at MLPerf. Until then, it’s all cherry-picked.
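For what it’s worth, the at-scale question is usually reported as a scaling-efficiency number; here is a minimal sketch, assuming you have measured aggregate training throughput at each cluster size (the figures below are made up for illustration):

```python
# Minimal sketch: weak-scaling efficiency for a multi-node training benchmark,
# i.e. how much of the ideal N-times speedup a cluster actually delivers.
# The throughput figures below are made-up illustrations, not measured results.

def scaling_efficiency(n_gpus: int, throughput: float,
                       base_gpus: int, base_throughput: float) -> float:
    """Measured throughput as a fraction of perfect linear scaling from the base point."""
    ideal = base_throughput * (n_gpus / base_gpus)
    return throughput / ideal

if __name__ == "__main__":
    base_gpus, base_tps = 8, 1.0                   # normalize to an 8-GPU node
    measured = {64: 7.3, 512: 52.0, 4096: 350.0}   # hypothetical aggregate throughputs
    for n, tps in measured.items():
        eff = scaling_efficiency(n, tps, base_gpus, base_tps)
        print(f"{n} GPUs: {eff:.0%} of linear scaling")
```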
It’s all about supply.