Skip to content Skip to footer
AMD’s RDNA 2: Shooting for the Top by rbanffy

AMD’s RDNA 2: Shooting for the Top by rbanffy

11 Comments

  • Post Author
    newZWhoDis
    Posted February 20, 2023 at 2:19 pm

    RDNA2’s anemic RT performance was such a letdown. RDNA3 wasn’t much better, hopefully AMD gets it together with RDNA4

  • Post Author
    unpopularopp
    Posted February 20, 2023 at 2:20 pm

    Can anyone tell me why review this in 2023 when we already have RDNA3 GPUs on the market? (RX 7900 XT/XTX)

  • Post Author
    villgax
    Posted February 20, 2023 at 2:22 pm

    AMD really dropped the ball on GPGPU focus, neither for Blender nor any ML framework…..

  • Post Author
    oldycombinator
    Posted February 20, 2023 at 2:40 pm

    [flagged]

  • Post Author
    avtolik
    Posted February 20, 2023 at 3:01 pm

    Unfortunately the gaming hardware prices are also shooting for the top
    https://www.pcgamer.com/ultra-enthusiast-hardware-is-strangl…

  • Post Author
    fulafel
    Posted February 20, 2023 at 3:10 pm

    > Each WGP, or workgroup processor, features four SIMDs. […] Each SIMD has 32-wide execution units for the most common operations, a 128 KB vector register file, and can track up to 16 wavefronts.

    What do they mean by a "SIMD", is it a independent core with its own instruction pointer(s) etc?

    Nice to read a rare explanation mof what the actual HW functionality that can help with ray tracing actually is:

    > At the CU level […] the texture unit can now perform ray intersection tests, at a rate of four box tests per cycle or one triangle test per cycle. Box tests take place at upper levels of the BVH, while triangle tests happen at the last level. A BVH, or bounded vertex hierarchy, speeds up raytracing with a divide-and-conquer methodology

    I wonder what else BVH speedups could be used for in eg GPGPU applications. DB queries that have 3 dimensions eg timespan, attribute set, value span?

  • Post Author
    raphlinus
    Posted February 20, 2023 at 3:27 pm

    These chipsandcheese articles are fantastic, they really go into depth.

    Last week I had occasion to dig deeply into the microarchitecture of the Apple M1 and M2 GPUs (which are very similar to the A series on mobile). I found the metal-benchmarks[1] repo to be very useful, and, relevant to this article, it contains a comparison to the RDNA generations. From that it appears that GPU microarchitectures are starting to converge, even if terminology is not. The standard configuration these days seems to be blocks of 128 threads (WGP's in AMD terminology, "cores" in Apple), organized into 4 SIMDs. Resources like workgroup memory (LDS in AMD lingo) are shared among each of these 128-blocks.

    The linked repo also has links to other resources, including the incredible work being done by the Asahi Linux people to reverse engineer M1 and deliver working open source GPU drivers.

    [1]: https://github.com/philipturner/metal-benchmarks

  • Post Author
    Kukumber
    Posted February 20, 2023 at 3:35 pm

    RDNA2 is a beast, low TDP and great perf, i bought the RX 6600 XT (from ebay ~250) for my VM PCI passthrough needs (gaming) and it never go over 50°C, it's quite impressive

    It's even better than the newest Intel GPUs (except maybe for raytracing, but that's a thing for 'tomorrow')

  • Post Author
    abecedarius
    Posted February 20, 2023 at 5:20 pm

    For anyone else to whom GCN and RDNA are mysterious TLAs, this post is about GPU architectures.

    > To simplify, RDNA and GCN are the codenames for both ISA and Microarchitecture for AMD GPUs.
    https://www.gpumag.com/rdna-vs-navi-vs-gcn/

    And WGP is "workgroup processor".
    https://en.wikipedia.org/wiki/RDNA_(microarchitecture)

    With that much background it's no longer alphabet soup and I can judge whether to read it.

  • Post Author
    sylware
    Posted February 20, 2023 at 5:33 pm

    god this site is good.

  • Post Author
    bitL
    Posted February 20, 2023 at 5:42 pm

    It's weird how AMD managed to make RDNA 3 such a letdown given RDNA 2 fought with 3090/3090 Ti neck to neck in raster performance whereas 7900XTX is at the 4080 level. They use a much better node compared to RDNA 2 as well but are stuck at similar frequencies and gained only around 30% on average while 4090 doubled 3090 performance.

Leave a comment

0/100

the Kick-ass Multipurpose WordPress Theme

© 2023 Kicker. All Rights Reserved.

Sign Up to Our Newsletter

Be the first to know the latest updates

Whoops, you're not connected to Mailchimp. You need to enter a valid Mailchimp API key.