Refactor dx shaders #1621

Merged
merged 3 commits into LizardByte:nightly on Oct 6, 2023
Conversation

ns6089 (Contributor) commented on Sep 11, 2023

Description

Based on #1602

Shader refactoring.
Mainly deduplication, but it also makes it pretty trivial to add yuv444 and type 2 chroma subsampling (also called topleft; recommended for bt.2020, and it looks smoother since it uses more pixels for averaging).

Screenshot

Issues Fixed or Closed

Type of Change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Dependency update (updates to dependencies)
  • Documentation update (changes to documentation)
  • Repository update (changes to repository files, e.g. .github/...)

Checklist

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have added or updated the in code docstring/documentation-blocks for new or existing methods/components

Branch Updates

LizardByte requires that branches be up-to-date before merging. This means that after any PR is merged, this branch must be updated before it can be merged. You must also Allow edits from maintainers.

  • I want maintainers to keep my branch updated

ns6089 (Contributor, Author) commented on Sep 12, 2023

Should be ready (after #1602).

The shader file naming scheme is done with future work in mind.
I have these files locally:
[screenshot: local shader file listing]

cgutman (Collaborator) commented on Sep 17, 2023

Looks like there are some conflicts. Can you rebase?

ns6089 (Contributor, Author) commented on Sep 18, 2023

> Looks like there are some conflicts. Can you rebase?

Not real conflicts, just the commits from the rotation PR. Done.

float3 rgb_top_left = image.Sample(def_sampler, input.tex_right_left_top.xz).rgb;
float3 rgb_top_right = image.Sample(def_sampler, input.tex_right_left_top.yz).rgb;
float3 rgb_bottom_left = image.Sample(def_sampler, input.tex_right_left_bottom.xz).rgb;
float3 rgb_bottom_right = image.Sample(def_sampler, input.tex_right_left_bottom.yz).rgb;
float3 rgb = CONVERT_FUNCTION((rgb_top_left + rgb_top_right + rgb_bottom_left + rgb_bottom_right) * 0.25);
Collaborator:
Maybe I'm missing something, but based on this illustration, it seems like we're doing top and center instead of left and top-left.

Contributor (Author):
Alright, buckle in.
Let's concentrate on "Type 0"; this is the only one we currently use, and it's what ffmpeg calls AVCHROMA_LOC_LEFT.
[image: numbered source pixels, with the area occupied by the target chroma subsample marked as a red square]
I've numbered the pixels in the source image; the red square is the area that the target chroma subsample occupies.

We calculate that chroma subsample using a box filter: in other words, we just average all the source pixels that are "covered" by the target pixel; if the coverage is partial, the corresponding pixel's weight is reduced.

Going back to our subsample, we need pixels 2 and 5 with weight 1, and pixels 1, 3, 4 and 6 with weight 0.5.

This could be done with 6 texture fetches, but we're too smart (for our own good) and do it in 2.
If we ask the linear texture sampler for a point between pixels 2, 3, 5 and 6, it will give us the average of these 4 pixels with even weights. The same can be done with pixels 1, 2, 4 and 5.
Now if we sum these two fetches, we get the weights we want, because pixels 2 and 5 are counted twice; we just need to divide by 2.

So our vertex shader generates these in-between texture coordinates. Type 2, or AVCHROMA_LOC_TOPLEFT, is handled in a similar manner: 9 texture fetches are reduced to 4 optimized ones.
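
To make the two-fetch trick concrete, here is a minimal sketch of the pixel-shader side for Type 0. This is not the shader from this PR; the function name, parameters, and bindings are illustrative, and it assumes the vertex shader has already placed one texture coordinate between pixels 1, 2, 4, 5 and the other between pixels 2, 3, 5, 6.

Texture2D image : register(t0);          // bindings are illustrative
SamplerState def_sampler : register(s0); // must use bilinear (linear) filtering

// Hypothetical sketch, not the code under review.
float3 sample_chroma_type0(float2 tex_between_1245, float2 tex_between_2356)
{
    // Each bilinear fetch returns the even average of the 4 surrounding pixels.
    float3 left = image.Sample(def_sampler, tex_between_1245).rgb;
    float3 right = image.Sample(def_sampler, tex_between_2356).rgb;
    // The sum counts pixels 2 and 5 twice and pixels 1, 3, 4, 6 once;
    // halving it normalizes the weights (0.25 for 2 and 5, 0.125 for the rest).
    return (left + right) * 0.5;
}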

Contributor (Author):

This is also why literally any scaling breaks our chroma: that requires a box filter with a different radius, and the whole math needs to be redone.
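
For illustration, the general case looks roughly like the sketch below: once the chroma sample's footprint is scaled, each source pixel it touches needs its own coverage weight instead of the fixed 1 and 0.5 weights above. Everything here is hypothetical, including the coverage helper; nothing like it exists in the current shaders.

Texture2D image;  // same source texture as in the snippets above

// Hypothetical helper: overlap area of the 1x1 source pixel starting at
// pixel_min with the footprint rectangle (footprint.xy = min corner,
// footprint.zw = max corner, in texel units).
float coverage(float2 pixel_min, float4 footprint)
{
    float2 lo = max(pixel_min, footprint.xy);
    float2 hi = min(pixel_min + 1.0, footprint.zw);
    float2 overlap = max(hi - lo, 0.0);
    return overlap.x * overlap.y;
}

// Hypothetical general box filter over an arbitrary footprint.
float3 box_filter(int2 first_pixel, int2 last_pixel, float4 footprint)
{
    float3 sum = 0;
    float weight_sum = 0;
    for (int y = first_pixel.y; y <= last_pixel.y; ++y)
    {
        for (int x = first_pixel.x; x <= last_pixel.x; ++x)
        {
            float w = coverage(float2(x, y), footprint);
            sum += w * image.Load(int3(x, y, 0)).rgb;
            weight_sum += w;
        }
    }
    return sum / weight_sum;
}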

Contributor (Author):

And the last part, which we currently don't do: that linear texture sampling and shader averaging must be done in linear light. Since we currently do it in sRGB gamma (unless HDR), we end up with less saturated colors than we should. But that's harder to notice than the outright blocky artifacts we had until recently.
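
As a rough illustration of that point (not something the current shaders do), averaging in linear light means linearizing the values before the box filter and re-encoding afterwards. The sketch below uses a plain gamma-2.2 approximation of the sRGB transfer function; the real curve is piecewise, and the bilinear fetch itself would still average gamma-encoded values unless the texture were sampled through an sRGB view.

// Hypothetical sketch; gamma 2.2 is only an approximation of sRGB.
float3 srgb_to_linear(float3 c) { return pow(c, 2.2); }
float3 linear_to_srgb(float3 c) { return pow(c, 1.0 / 2.2); }

float3 average_in_linear_light(float3 a, float3 b)
{
    // Average light intensity rather than gamma-encoded values, then re-encode.
    return linear_to_srgb((srgb_to_linear(a) + srgb_to_linear(b)) * 0.5);
}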

Collaborator:

Ok, thanks for the detailed explanation.

ns6089 mentioned this pull request on Sep 22, 2023
cgutman merged commit 974c4bd into LizardByte:nightly on Oct 6, 2023 (43 checks passed)
ns6089 mentioned this pull request on Oct 29, 2023