NVIDIA’s next-gen RTX 40 series graphics cards are slated to launch in the third quarter of 2022. Based on the Ada Lovelace microarchitecture, the next-gen GeForce lineup is set to improve compute density and double down on ray-tracing performance. There will likely be some tweaks to the SM (Streaming Multiprocessor) as we’ve seen over the last several generations, and that’s about it. At least on a microarchitectural level.
According to Greymon55, the Lovelace-based RTX 4070, RTX 4080, RTX 4090, along with their brethren will essentially be a miniaturization of their RTX 30 series predecessors on TSMC’s N5 (5nm) node. NVIDIA also plans to increase the die size to nearly 1000mm^2, and stuff up to 18,000 FP32 ALUs or CUDA cores on the top-end AD102 die. Of course, ray-tracing performance will get special attention as more and more titles adopt the technology. You can expect either a doubling of RT cores or some sparse matrix-grade optimization to significantly enhance the RT capabilities.
|Process||TSMC 12nm||Sam 8nm LPP||TSMC 5nm||3nm?|
|TFLOPs||16.1||37.6||90 TFLOPs?||150 TFLOPs+|
|Memory||11GB GDDR6||24GB GDDR6X||24GB GDDR6X||32GB GDDR7?|
|Launch||Sep 2018||Sep 20||Aug-Sep 2022||2024|
Other than that, as already mentioned, the SM will almost certainly get a shakedown either in terms of ALU count and/or partitioning. Other than that, there’s not much else that should change. The Tensor cores should more or less stay unchanged, perhaps even getting a trim. This sheer focus on compute density and ray-tracing performance are what will likely push the power consumption of the RTX 4080/4090 over 500W despite leveraging TSMC’s cutting-edge 5nm node.