Implement forward- and reverse mode AD in the interpreter #2186

vox9 · 2024-10-13T21:18:15Z

Apologies for closing the old PR; I am quite new to this.

So, as promised, I cleaned up my code a bit. That being said, more work needs doing.

Here is a list of tasks that immediately spring to mind:

Clean up vjp2 and jvp2. They are ugly, and hugely inefficient. It seems implementing them is my kryptonite. I look forward to seeing them have the beauty they deserve ;)
Fix the horrible time complexity of deriveTape. I'm thinking this can be achieved by either (1) implementing Tape as a graph instead of a tree, or (2) assigning each TapeOp a unique ID using a counter in EvalM. deriveTape would have to initially run through the Tape, putting each unique TapeOp in a lookup table, and counting the references to it. The Tape can then be derived starting from the output. Each time a reference to a TapeOp is encountered, the sensitivity, which is propagated to it, is added into a pool kept in the lookup table, and the number of references is decreased by one. When the number of references reaches zero, the Tape is derived. Thus, each Tape is derived only once.
Make sure the error messages fit in.
Perhaps move more responsibility from Interpreter.hs to AD.hs. I feel like the former uses a lot of functions from the latter, making the code unnecessarily complex to read.
Perhaps use doOp for computations of ValuePrims. This would make the code for applying mathematical operations cleaner. Currently, it contains a lot of similar or duplicate code.

I have also littered the code with TODOs just ripe for the taking, and added a lot of explanatory text, as you mentioned that you would use this in your teaching. I have probably added too much, so feel free to delete it.

vox9 · 2024-10-13T21:23:19Z

It is probably worth mentioning that the old version of the code can be found here: interpreter-ad-old

athas · 2024-10-13T21:59:30Z

Will you fix the remaining style errors or shall I?

vox9 · 2024-10-13T22:12:34Z

Honestly, I'd love to, but I'm not entirely sure that I can, within a reasonable time frame. I'm not yet comfortable enough to feel that I understand the "Haskell" way of doing things, nor even the functional way - my ugly implementation of vjp2 and jvp2 are a great example of this. However, if you're up for having a chat about some of the details, I could probably shine it up pretty well. I'm also quite curious as to what you will be using the code for.

athas · 2024-10-13T22:15:01Z

You literally just have to run the ormolu formatter on the source code: https://github.com/diku-dk/futhark/blob/master/STYLE.md#ormolu

vox9 · 2024-10-13T22:16:02Z

Wow, doesn't get any easier than that ;) I'll give it a try right away

vox9 · 2024-10-13T22:23:48Z

Alright, I did it, but it changed a bunch of files which have nothing to do with this PR. I imagine you only want me to commit the changes to AD.hs, Values.hs, and Interpreter.hs?

vox9 · 2024-10-13T22:27:11Z

Oh, my bad, my tired eyes missed the If you find yourself working on such code, please reformat it while you are there. Committing now

This reverts commit 2e21d68.

athas · 2024-10-14T11:20:08Z

Thank you for the work. I have merged your implementation and created this issue to address the most significant remaining problem: #2187

I would certainly welcome further contributions, but the current implementation is operational.

clean up the interpreter implementation of automatic differentiation

b246be0

vox9 and others added 16 commits October 14, 2024 00:27

reformat

2e21d68

Revert "reformat"

70a0105

This reverts commit 2e21d68.

Style fixes.

64851d3

Note in CHANGELOG.

d9e9d6b

This dataset is too large to interpret.

f083cab

More natural argument order.

0485253

More style rewrites.

dee2adc

Style and Haddock comments.

2a71149

No oclgrind for these.

449aea9

Oops.

295c599

Consistent naming.

fc0766e

Style.

1cb5db7

More style improvements.

d30773b

Support negation.

1d0ba96

More style fixes.

3ef612a

Add missing --exclude.

8133201

athas merged commit 25c73ee into diku-dk:master Oct 14, 2024
31 checks passed

This was referenced Oct 14, 2024

Implement AD in interpreter #1556

Closed

Improve asymptotic efficiency of reverse-mode AD in interpreter #2187

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement forward- and reverse mode AD in the interpreter #2186

Implement forward- and reverse mode AD in the interpreter #2186

vox9 commented Oct 13, 2024

vox9 commented Oct 13, 2024

athas commented Oct 13, 2024

vox9 commented Oct 13, 2024

athas commented Oct 13, 2024

vox9 commented Oct 13, 2024

vox9 commented Oct 13, 2024

vox9 commented Oct 13, 2024

athas commented Oct 14, 2024

Implement forward- and reverse mode AD in the interpreter #2186

Implement forward- and reverse mode AD in the interpreter #2186

Conversation

vox9 commented Oct 13, 2024

vox9 commented Oct 13, 2024

athas commented Oct 13, 2024

vox9 commented Oct 13, 2024

athas commented Oct 13, 2024

vox9 commented Oct 13, 2024

vox9 commented Oct 13, 2024

vox9 commented Oct 13, 2024

athas commented Oct 14, 2024