The Nvidia Volta GPU architecture is still likely to be the basis for the next generation of GeForce graphics card. Right now Volta silicon is only being packed into pro-level machine, but with the announcement of real-time raytracing hitting PC games by the end of the year we're going to need a new breed of GPU with the power of Volta inside them.
Check out our pick of the best graphics cards you can buy today.
Q3 2018 seems like the most likely release for Volta-based gaming cards, because they're definitely not launching anything before then. But if you can't wait Nvidia have launched the $3,000 Titan V...
The professional-class Volta – the Tesla V100 – uses TSMC's 12nm FinFET design and the full GV100 GPU has 5,376 CUDA cores. Imagine a GTX 2080 Ti with that.
Don't shoot the messenger, but it's possible the consumer Volta cards will push
prices up again. $699 for the GTX 2080? Not beyond the realms of possibility...
We're too early in the release cycle for there to be any performance figures flying around, but we're hoping for greater efficiency and better with DX12 and Vulkan.
Nvidia’s Volta tech is the silicon successor to the Pascal generation of graphics cards; the generation that brought us the mighty GTX 1080 Ti and Titan Xp cards. But can it deliver the same generational performance boost offered by Nvidia's impressive last-gen graphics silicon?
The first professional-level, ultra-expensive processor, the V100, officially unveiled last year. Since then we’ve seen the $3,000 Titan V and some server-style monster rigs, but nothing gaming related. But if the gaming performance of the gaming Nvidia Volta GPUs provide anything like the performance delta the first Tesla V100 silicon is displaying against the last-gen professional Pascal cards then we're in for a treat.
Volta, in the pro space, is out-performing Pascal by 132%.
As is Nvidia’s wont the new GPU architecture is taking its name from a famous historical scientist. Alessandro Volta gave his name to the Volt having been a pioneer of electrical energy and its storage. He was also the discoverer of butt-gas - fun little science fact for you there.
But recently there have been the most paper-thin rumours about new Nvidia Ampere and Nvidia Turing codenames being released as the next graphics chips from the Nvidia skunkworks. Hell, we can make up spurious GPU codenames in our sleep...
Both GPU names have come from a single line in two disparate articles surrounding Nvidia financial results, and have no sources to make the names in any way believable. If they're anything they could purely be what Nvidia use to reference the gaming variants of the Nvidia Volta tech.
Volta, however, is a GPU architecture that definitely exists and was first unveiled, at least in theoretical form, at Nvidia’s Graphics Technology Conference way back in 2013. It was originally meant to be the GPU silicon which followed directly on from the Maxwell architecture (which made up the GTX 900-series of graphics cards), but a year later up pops the Pascal design used in the most recent 10-series GeForce parts, pushing the prospective Nvidia Volta chips further back.
Which brings us onto...
Rumour has it that Nvidia's Volta graphics cards could be gearing up to launch in Q3, 2018. SK Hynix are reportedly ramping up production of GDDR6, which Nvidia are also reported to be utilising exclusively with their unreleased graphics cards.
When they first announced the arrival of GDDR6 Hynix mentioned they were “planning to mass produce the product for a client to release high-end graphics card [sic] by early 2018 equipped with high performance GDDR6 DRAMs.” I’m not the only one to think that is a reference to Nvidia and the release of GDDR6-based Volta GPUs.
Samsung have also announced their own GDDR6 memory is set to "play a critical role in early launches of next-generation graphics cards" which is playing into our suspicions of a relatively early 2018 launch. Though that could just because we're desperate for a new GPU launch to potentially salve the cryptocurrency crisis that's stolen all our graphics cards...
These memory chips are also expected to be 20% more expensive than GDDR5 for manufacturers from the get-go - although eventually dropping in price as manufacturing capacity moves away from the older standards.
We had thought that maybe Nvidia would launch something at GDC this year, but our sources within the green team told us they were categorically not going to be launching a new GPU in March. Nor did they be launching anything game-related at GTC a week later either - it was all just deep-learning, AI, and wheely things.
The closest we’ve come to a genuine Nvidia Volta-based graphics card we could jam into our PCs is the $3,000 Titan V. But that’s still sporting the GV100 GPU, not the GV104 we’re expecting to be the base of a potential GTX 2080 of the next generation.
All the released Volta GPUs, from the Tesla V100, to the Titan V and Quadro GV100, have been running the same 12nm TSMC lithography. TSMC have also said they were entering volume production of their 12nm FinFET chips in the fourth quarter of 2017, which is why we were expecting a full launch of Nvidia's consumer-facing Volta cards relatively early in 2018.
Originally, Volta was supposed to be built using TSMC's new 10nm process, but the pace of transistor shrinkage has become rather laggardly in recent years. Other rumours had Nvidia sticking with TSMC’s existing 16nm tech in order to be able to stick to their roadmap and get actual Nvidia Volta cards on the shelves in 2018. It's now looking like somewhere between the two.
TSMC’s 10nm process, unlike Intel’s Cannonlake design, is expected to be more of a stop-gap measure between their current process and the almost mythical 7nm lithography. To put that into perspective 7nm is about the height of three Tom Cruises standing on each other’s shoulders. But getting down to 10nm seems to be more of a challenge than maybe even they expected. I’m guessing either the price is prohibitive or the yields too low to offset the performance benefits of a newer design. Or a combination of both.
The 12nm node is apparently based on TSMC’s existing 16nm design, but with density, performance and energy efficiency improvements. Whether this 12nm node will genuinely be packed with 12nm transistors, or whether it’s just going to be clever marketing, is currently about as clear as thermal paste.
The layout of the Volta GPU is a little different to previous Nvidia GPUs, but that's mostly so they can squeeze in all of those 5,120 CUDA cores. In fact, the full GV100 GPU is actual capable of housing 5,376 CUDA cores, so the Titan V and Tesla V100 chips aren't even the most powerful potential Nvidia Volta GPUs.
The actual setup is unlikely to change a huge amount with the gaming variants, apart from the double precision cores being stripped out. If Nvidia follow tradition each of the streaming multiprocessors (SMs) that make up the GV104 will have a certain amount of non-gaming silicon stripped out of them to make the more mainstream graphics cards.
But Nvidia have said there is specific hardware in the Volta architecture that benefits ray-tracing for games, but that's not necessarily the added Tensor cores Nvidia have dropped into the Volta silicon. Those do help, so there's a good chance we'll see Tensor cores crop up in our gaming chips too. That means it's possible that there won't be as sweeping changes to the consumer chip as we've seen in the past.
In terms of the memory configuration the released Nvidia Volta GPUs are running second-gen high bandwidth memory (HBM2), but from the way SK Hynix and Samsung have been talking about GDDR6 we don’t expect the gaming-focused Volta cards to follow the same setup. Which is a good thing. HBM2 may deliver serious levels of memory bandwidth, but GDDR6 is still pretty rapid, and quicker than GDDR5X. It offers a data rate of 16Gbps, as opposed to the 14Gbps of GDDR5X, and more memory bandwidth too.
Hynix have stated that, with a 384-bit memory bus - the sort of design usually favoured by Nvidia’s high-end graphics cards - they are able to offer memory bandwidth of up to 768GB/s, which isn't far off the 900GB/s of the V100's HBM2 design. The Pascal-based Titan Xp uses GDDR5X and can only manage 548GB/s with its mighty 12GB setup.
GDDR6 is around 20% more expensive to produce than GDDR5, but it’s far cheaper than the prohibitive pricing of HBM2. That’s essentially what crippled AMD’s Vega on a price/performance balance, and Nvidia aren’t likely to make the same mistake.
Oh hai, welcome to price speculation corner. Obviously we don’t know how Nvidia are going to price their new cards, but we can still make some educated guesses based on Nvidia passim. And based on the ludicrous pricing of the Titan V.
In short, they’re likely to be more expensive than the old Pascal generation of cards. Well, compared with the original, non-mining-boom-time price anyways.
The prices of single cards are being adjusted upwards, and that would make the Volta GeForce cards the second generation in a row where Nvidia have pushed prices skywards. With the 10-series Pascal-based cards the GTX 1080 was released at an unprecedented level, especially when taking the reference/Founders Edition shenanigans into account.
If $699 is going to become the de facto standard for Nvidia’s high-end graphics cards, as shown with the GTX 1080 Ti cards, then these are worrying times. Unless you’re AMD and confident you can keep undercutting Nvidia’s GPUs.Gotta get them built in volume first guys…
But we’re unlikely to see any Founders Edition shenanigans in terms of pricing. They may still call their reference cards the Founders Edition, but I think the times of charging extra for the basic Nvidia blower design are thankfully over.
In a pure generation-on-generation test between the pro-level Pascal P100 and Volta V100 GPUs the newer Nvidia architecture is posting performance that is 132% faster than the last-gen chips. If we get anywhere near that level of performance boost from gaming applications Volta will be stunning. Realistically that's unlikely because these tests are based on Geekbench benchmarks, run in a Linux environment, using the specific CUDA API, and not Shadow of War at 4K.
The professional Nvidia Volta cards have been tuned especially for AI workloads, with a new Tensor core design at its heart. These new silicon slices won't have anything to do with gaming frame rates, at least for the moment, so if those are the only real improvements with Volta it's possible there won't be a huge performance uplift over Pascal at all.
Obviously we’re still expecting some increased gaming performance from Volta, and I’ve already spoken about the necessity for it to better deal with the low-level APIs of Vulkan and DirectX 12, but we should also expect some improved efficiency born both of the slightly shrunken GPU production process as well as the redesigned architecture itself.
Given that Volta is the namesake of the Italian gentleman credited with the invention of the battery you’d certainly hope for some improved efficiency. On the notebook side Pascal made great strides forward for the performance of mobile GPUs, and Volta ought to carry that progress even further.
When he announced the existence of the Nvidia Volta code-name at GTC 2013 Jen-Hsun Huang explained that "I love that name Volta because it would suggest that it will be even more energy efficient."
Fingers crossed it’s not just a suggestion…
The biggest indicator for the Volta tech being built into a gaming GPU is the introduction of real-time raytracing at the end of the year.
At GDC Nvidia and Microsoft announced a partnership which will bring real-time raytracing to games this year. The big M are bringing the industry standard via the DirectX Raytracing API and Nvidia are building that out with their own RTX hardware accelerator for their Volta "and future" graphics architectures.
RTX is built using a set of software and hardware algorithms specifically designed for the Nvidia Volta architecture, though they wouldn't tell us exactly what part of the GPU design leverages the raytracing feature.
“There’s definitely functionality in Volta that accelerates raytracing,” Nvidia's Tony Tomasi told us, “but I can’t comment on what it is.”
But Volta's Tensor cores have something to do with it, though seemingly only indirectly. The machine learning power of the Tensor cores allows Volta to do something called AI de-noising on an image which looks to be integral on high-end raytracing.
“It’s also called reconstruction,” says Tomasi. “What it does is it uses fewer rays, and very intelligent filters or processing, to essentially reconstruct the final picture or pixel. Tensor cores have been used to create, what we call, an AI de-noiser.
“Using artificial intelligence we can train a neural network to reconstruct an image using fewer samples, so in fact Tensor cores can be used to drive this AI denoiser which can produce a much higher quality image using fewer samples. And that’s one of the key components that helps to unleash the capability of real-time raytacing.”
The key thing is that Tomasi believes there will be games shipping this year which use their real-time raytracing effects, and that surely means there will be Volta-based gaming GPUs arriving before the end of the year.
Either that or a whole new generation of GPUs which still retain the same AI-focused Tensor cores of Volta, but without the name. Which would seem a little odd.
We've also just had released our first glimpses at possibly the first game to include Nvidia's Volta-accelerated RTX platform, Metro: Exodus. 4A Games GDC demo shows their latest title implementing the ambient occlusion and indirect lighting raytracing of RTX in a title that will launch this year.
This page is regularly updated with all the latest news, rumours and information about the Nvidia Volta GPU architecture. Bookmark away and let us know your thoughts about it in the comments below.
Click on the quick links to get back up to the section of your choice.