bring back torch.autograd.Function for float8 matmul #344

vkuzo · 2024-07-26T15:48:05Z

Stack from ghstack (oldest at bottom):

Summary:

This is a redo of #316.

With upcoming support for scaling granularities other than tensorwise,
we need a good way to control which gemm kernel to call and how to scale
the input tensors in the forward and backward passes. A
torch.autograd.Function override is the cleanest way to do that, and as
of 2024 it works with torch.compile.
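
For readers unfamiliar with the pattern, here is a minimal sketch of how a torch.autograd.Function override gives explicit control over scaling in fwd and bwd. It is not the code from this PR: the names `ScaledMatmul` and `tensorwise_scale` are hypothetical, and the float8 gemm is replaced by a plain high-precision matmul on scaled inputs so the sketch runs on any device. Only tensorwise scaling is shown.

```python
import torch

# Largest finite values of the two common float8 formats.
E4M3_MAX = 448.0     # float8 e4m3fn, typically used for fwd inputs
E5M2_MAX = 57344.0   # float8 e5m2, typically used for gradients


def tensorwise_scale(x: torch.Tensor, fmax: float) -> torch.Tensor:
    # Hypothetical helper: one scale for the whole tensor, chosen so
    # that the largest magnitude in x maps onto fmax.
    amax = x.detach().abs().max().clamp(min=1e-12)
    return fmax / amax


class ScaledMatmul(torch.autograd.Function):
    # Hypothetical stand-in for a float8 matmul. A real implementation
    # would cast the scaled tensors to float8 and call a float8 gemm.

    @staticmethod
    def forward(ctx, a, b):
        ctx.save_for_backward(a, b)
        sa = tensorwise_scale(a, E4M3_MAX)
        sb = tensorwise_scale(b, E4M3_MAX)
        # Scale the inputs, run the gemm, then undo both scales.
        return ((a * sa) @ (b * sb)) / (sa * sb)

    @staticmethod
    def backward(ctx, grad_out):
        a, b = ctx.saved_tensors
        # The backward pass picks its own scale for the gradient,
        # independently of the choices made in forward.
        sg = tensorwise_scale(grad_out, E5M2_MAX)
        g = grad_out * sg
        grad_a = (g @ b.t()) / sg
        grad_b = (a.t() @ g) / sg
        return grad_a, grad_b


a = torch.randn(4, 8, requires_grad=True)
b = torch.randn(8, 3, requires_grad=True)
out = ScaledMatmul.apply(a, b)
out.sum().backward()
```

Because forward and backward are separate static methods, each gemm can choose its own scaling and kernel. The sketch does not exercise torch.compile.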

Test Plan:

./test/test_everything.sh

Differential Revision: D60291446

vkuzo · 2024-07-26T15:49:10Z

@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-07-26T23:09:30Z

This pull request has been merged in 345b3a5.

facebook-github-bot added the CLA Signed label Jul 26, 2024

This was referenced Jul 26, 2024

[1/x] clean up casting functions #345

Closed

[2/x] clean up casting functions: delayed scaling #346

Closed

drisspg approved these changes Jul 26, 2024

facebook-github-bot closed this in 345b3a5 Jul 26, 2024

facebook-github-bot added the Merged label Jul 26, 2024
