Nvidia evaluating multi chip module GPU designs

by Mark Tyson on 4 July 2017, 10:00

Tags: NVIDIA (NASDAQ:NVDA)

Researchers from Arizona State University, Nvidia, the University of Texas, and the Barcelona Supercomputing Centre have published a paper (PDF) that looks at improving GPU performance using Multi-Chip-Module (MCM) GPUs. The team see MCM GPUs as one way to sidestep the deceleration of Moore's Law and the performance plateau predicted for single monolithic GPUs.

Transistor scaling can no longer continue at historical rates, and chipmakers are staying with a given manufacturing process for longer while optimising performance in other ways. As "the performance curve of single monolithic GPUs will ultimately plateau," researchers are looking at how to make better performing GPUs from package-level integration of multiple GPU modules.

It is proposed that easily manufacturable basic GPU Modules (GPMs) are integrated on a package "using high bandwidth and power efficient signalling technologies," to create multi-chip-module GPU designs. To see whether the proposal is worthwhile, the research team has been evaluating designs using Nvidia's in-house GPU simulator. Theoretical performance comparisons against multi-GPU solutions were also made.

MCM GPUs could do wonders for increasing the SM count, and many GPU applications "scale very well with increasing number of SMs," observe the scientists. The research team looked at the possibilities of a 256 SM MCM-GPU in the paper, and is pleased by its potential. Using the simpler GPM building blocks and advanced interconnects, this 256 SM design "achieves 45.5% speedup over the largest possible monolithic GPU with 128 SMs," assert the researchers.

In further tests the 256 SM equipped MCM-GPU "performs 26.8% better than an equally equipped discrete multi-GPU, and its performance is within 10% of that of a hypothetical monolithic GPU that cannot be built based on today’s technology roadmap," concluded the research paper.
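
As a rough sanity check, the quoted percentages can be put on one scale. The sketch below normalises the 128 SM monolithic GPU to 1.0 and derives the other configurations from the figures quoted above. The function and constant names are illustrative, and reading "within 10%" as "the MCM-GPU reaches at least 90% of the hypothetical monolithic part" is an assumption, not something stated in the paper.

```python
# Relative performance, normalised so the 128 SM monolithic GPU = 1.0.
# The three percentages are the ones quoted from the paper; the derived
# numbers are illustrative arithmetic only, not results from the paper.

MONOLITHIC_128_SM = 1.0
MCM_SPEEDUP_OVER_MONOLITHIC = 0.455  # "45.5% speedup"
MCM_GAIN_OVER_MULTI_GPU = 0.268      # "performs 26.8% better"
MCM_GAP_TO_IDEAL = 0.10              # "within 10%" (assumed: MCM >= 90% of ideal)


def relative_performance():
    """Return each configuration's performance relative to the 128 SM baseline."""
    mcm_256 = MONOLITHIC_128_SM * (1 + MCM_SPEEDUP_OVER_MONOLITHIC)
    # The MCM-GPU is 26.8% faster than the multi-GPU, so divide to go back.
    multi_gpu_256 = mcm_256 / (1 + MCM_GAIN_OVER_MULTI_GPU)
    # If the MCM-GPU is at least 90% of the ideal, the ideal is at most this.
    ideal_256_upper_bound = mcm_256 / (1 - MCM_GAP_TO_IDEAL)
    return {
        "monolithic_128": MONOLITHIC_128_SM,
        "multi_gpu_256": multi_gpu_256,
        "mcm_256": mcm_256,
        "ideal_monolithic_256_upper_bound": ideal_256_upper_bound,
    }


if __name__ == "__main__":
    for name, value in relative_performance().items():
        print(f"{name}: {value:.3f}x")
```

On these assumptions the MCM-GPU lands at 1.455x the 128 SM baseline, the discrete multi-GPU at about 1.15x, and the unbuildable 256 SM monolithic GPU at no more than about 1.62x.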

Research-to-reality delays mean we shouldn't expect MCM GPU graphics cards for enthusiasts from Nvidia for a couple of hardware generations.



HEXUS Forums :: 6 Comments

So kind of like Ryzen and Infinity Fabric. Looks interesting.
already happened once. Hope this time will end much better:
http://www.tomshardware.com/reviews/gigabyte,954-2.html
Gunbuster
So kind of like Ryzen and Infinity Fabric. Looks interesting.

That is exactly what AMD are hoping to achieve with Infinity fabric, as well as using it for better CPU/GPU interconnects.
Goodman2576
already happened once. Hope this time will end much better:
http://www.tomshardware.com/reviews/gigabyte,954-2.html

It's a bit different to that; it's more akin to how Threadripper uses 4 CPUs for one big SoC than, say, the GTX 295, which has two different SoCs.
Yeah, basically IF can be internal, i.e. between the two 4-core blocks in Ryzen, or external, say between 2 on Threadripper, 4 on Epyc, or between the sockets in a dual Epyc board.
Nvidia's GPUs are already pretty modular internally; making the connection between the clusters more flexible would allow for easier-to-scale architectures, i.e. a 2080 could just be two 2060s on one chip.

FYI my language is muddled as I am a layman in these kinds of things.