AMDGPU: Refine gfx950 xdl-write-vgpr hazard cases #117285

Merged: 1 commit, Nov 25, 2024

Commits on Nov 25, 2024

  1. AMDGPU: Refine gfx950 xdl-write-vgpr hazard cases

    The case of a 2-pass XDL instruction writing a VGPR that is then
    read by a non-XDL SGEMM/DGEMM was 1 wait state too conservative.
    Previously, on gfx940, the XDL and non-XDL consumer cases happened
    to need the same number of cycles in every case. Now, on gfx950,
    the XDL consumer case needs one additional wait state for 2-pass
    sources.
    arsenm committed Nov 25, 2024
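
The distinction described in the commit message can be sketched as a small lookup. This is an illustrative model only, not the code from the LLVM AMDGPU hazard recognizer: the names `Consumer`, `kHypotheticalBaseWaitStates2Pass`, and `waitStatesFor2PassXdlWrite` are hypothetical, the base value is a placeholder (the commit message gives no absolute wait-state counts), and the sketch assumes the non-XDL consumer count is the same on gfx940 and gfx950.

```cpp
// Illustrative model of the relationship stated in the commit message.
// All names here are hypothetical and do not come from LLVM.
enum class Consumer { NonXDL, XDL };

// Placeholder value: the commit message does not give absolute counts.
constexpr int kHypotheticalBaseWaitStates2Pass = 4;

// Wait states required after a 2-pass XDL instruction writes a VGPR,
// before the given kind of SGEMM/DGEMM consumer reads it.
constexpr int waitStatesFor2PassXdlWrite(Consumer C, bool IsGFX950) {
  int WaitStates = kHypotheticalBaseWaitStates2Pass;
  // gfx940: XDL and non-XDL consumers share the same count.
  // gfx950: only the XDL consumer needs one additional wait state.
  // Adding it to the non-XDL consumer as well would be the
  // "1 wait state overly conservative" behavior the commit removes.
  if (IsGFX950 && C == Consumer::XDL)
    WaitStates += 1;
  return WaitStates;
}
```

Keying the extra wait state on both the target and the consumer kind keeps the non-XDL path from inheriting the stricter XDL requirement.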
    Commit edb4823