AMD’s Next-Gen FSR 3.0 Tech Could Feature Hardware Acceleration Through RDNA 3 ‘GFX11’ GPUs’ Brand-New WMMA Instructions
One of the key advantages of FSR 1.0 and FSR 2.0 over NVIDIA’s DLSS has been that they do not rely on any hardware assistance, such as dedicated machine learning (ML) blocks, but that may soon be coming to an end. AMD has done a great job with FSR, offering visual quality on par with NVIDIA’s solution while also making it open-source, and it now looks like the coming generation might go a step further and use dedicated machine learning hardware to boost the performance and visual quality that FSR has to offer. As spotted by @0x22h, the LLVM repository was recently updated with a new commit that introduces WMMA (Wave Matrix Multiply-Accumulate) instructions on GFX11 hardware. GFX11 is the internal codename for AMD’s RDNA 3 GPU family, which will be featured in the next-generation Radeon RX 7000 and Radeon Pro graphics cards.
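For readers unfamiliar with the term, a matrix multiply-accumulate computes D = A × B + C on small matrix tiles, which is the core operation behind most neural network workloads. The snippet below is only a minimal software sketch of that arithmetic in plain Python. The function name `matrix_multiply_accumulate` and the 16×16 tile size are assumptions made for illustration; they are not taken from the LLVM commit, and this is not how the GFX11 instructions are actually exposed or executed.

```python
# Illustrative software model of a matrix multiply-accumulate: D = A x B + C.
# The function name and tile size are hypothetical, chosen for this example;
# they do not represent AMD's instruction set or any real API.

TILE = 16  # assumed tile edge length, for illustration only


def matrix_multiply_accumulate(a, b, c):
    """Return D = A x B + C for square matrices given as lists of rows."""
    n = len(a)
    # Start from a copy of the accumulator matrix C.
    d = [[c[i][j] for j in range(n)] for i in range(n)]
    for i in range(n):
        for j in range(n):
            acc = d[i][j]
            # Dot product of row i of A with column j of B, added onto C[i][j].
            for k in range(n):
                acc += a[i][k] * b[k][j]
            d[i][j] = acc
    return d


def identity(n):
    """Return the n x n identity matrix."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


# One tile: A holds arbitrary values, B is the identity, C is all ones,
# so every element of D equals the matching element of A plus one.
a = [[float(i + j) for j in range(TILE)] for i in range(TILE)]
b = identity(TILE)
c = [[1.0] * TILE for _ in range(TILE)]
d = matrix_multiply_accumulate(a, b, c)
```

Dedicated matrix hardware performs this whole tile operation as a single instruction rather than as the nested loops shown here, which is where the potential speedup for ML workloads comes from.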
Similar to how NVIDIA uses matrix multiplications for deep learning operations through its latest Tensor Core architecture, the AMD WMMA instructions would be implemented at the hardware level to speed up machine learning and deep neural network (DNN) operations. There aren’t a lot of details yet, but this recent LLVM update could hint at a major graphics pipeline overhaul in the RDNA 3 GPUs.
In its first year, FSR has seen roughly twice the adoption rate of its competitor: over 113 games gained FidelityFX Super Resolution support in just one year (about 113 titles per year), compared to 180+ titles for DLSS in 3.4 years (about 53 per year). Making the technology open-source for both PCs and consoles (Microsoft Xbox) will open up room for further adoption.
If AMD were to rely on hardware acceleration for FSR moving forward, that would also suggest that NVIDIA was right to implement Tensor Cores on gaming hardware as early as its Turing generation of GPUs. With that said, NVIDIA is expected to implement an even better and more optimized Tensor Core architecture in its next-gen GeForce RTX 40 series graphics cards for DLSS 3.0, and it will be interesting to compare it against FSR 3.0.