Autodiff batching2 #139351

ZuseZ4 · 2025-04-04T05:16:10Z

~~I will rebase it once my first PR landed.~~ done.
This autodiff batch mode is more similar to scalar autodiff, since it still only takes one shadow argument.
However, that argument is supposed to be width times larger.

r? @oli-obk

Tracking:

Tracking Issue for autodiff #124509

ZuseZ4 · 2025-04-09T01:10:42Z

@oli-obk I'm almost done with features, but there are two paths forward here, so I'd appreciate some help with the design.

The *v variants (dupv, dualvonly) allow better vectorization, by accepting larger shadow arguments.
Each shadow of a slice &[type] is supposed to be width * num_elements_of_primal_slice * byte_sizeof(type) bytes large.
We currently don't support generics but we should keep them in mind, and we already support aliases.

If you look at rustc_codegen_llvm you'll see that I hardcoded the byte_size_of(type) to 4, since my tests use floats.
An upstream version of course needs to figure that out more reliably.

In my typetree work (the only part I have not upstreamed from my fork) I have a little bit of logic here to handle them, but I haven't used it yet to figure out the byte size, so I'm not sure if that's legal: https://github.com/EnzymeAD/rust/blob/322f2226c1f672c9b5e934b15d255ae0d66bd0e2/compiler/rustc_middle/src/ty/typetree.rs#L196

If you say it's too hard for now, I could merge a workaround which analyzes the types in the ast frontend, which wouldn't support aliases or generics, but at least could handle &[f32] vs &[f64]. It's getting late for me so I might miss something obvious, but I feel like we should be able to figure out the size in rustc_monomorphize to handle more than that.

Also, there are reasons due to which a user might specify a larger stride than what I'd compute by default,
so I'll allow users under all combinations to provide an extra integer after *v arguments, which would replace whatever we computed here. But this way they could easily index out of bounds, so I'll mark generated functions in that case as unsafe. Once we figured out the part above, I'll add the code and tests for this to clarify it.

oli-obk

Generic code will require a lot of extra work anyway (we'll need to find the right trait bounds and such so that a generic function will never fail to monomorphize)

compiler/rustc_builtin_macros/src/autodiff.rs

compiler/rustc_monomorphize/src/partitioning/autodiff.rs

compiler/rustc_ast/src/expand/autodiff_attrs.rs

ZuseZ4 · 2025-04-09T07:29:57Z

Generic code will require a lot of extra work anyway (we'll need to find the right trait bounds and such so that a generic function will never fail to monomorphize)

If my old trick still work (and I don't see why not) it's trivial to add, as long as we don't move to the rustc_intrinsic which we discussed.

A dummy function calls the source function in a black-boxed call. In the past it used to do this with the same generics, in the latest rework from proc-macros to builtin_macros however I didn't implement it anymore. We could re-add it though.
So if you call df with T=f32, then df will (in it's dummy body) call f with T=f32.
Now on llvm-ir level I delete the function body, but I first safe which function we're calling.
So even if a source function is instantiated with different types, I always remember which instantiation of f belongs to which instantiation of df. Now, if we remove the body (which would help with the other things discussed) then of course we'd start from zero.

compiler/rustc_ast/src/expand/autodiff_attrs.rs

oli-obk · 2025-04-17T09:13:02Z

@bors r+ rollup

bors · 2025-04-17T09:13:07Z

📌 Commit d7c0c32 has been approved by oli-obk

It is now in the queue for this repository.

…iaskrgr Rollup of 8 pull requests Successful merges: - rust-lang#139351 (Autodiff batching2) - rust-lang#139483 (f*::NAN: guarantee that this is a quiet NaN) - rust-lang#139498 (Ignore zero-sized types in wasm future-compat warning) - rust-lang#139967 (Introduce and use specialized `//@ ignore-auxiliary` for test support files instead of using `//@ ignore-test`) - rust-lang#139969 (update libc) - rust-lang#139971 (Make C string merging test work on MIPS) - rust-lang#139974 (Change `InterpCx::instantiate*` function visibility to pub) - rust-lang#139977 (Fix drop handling in `hint::select_unpredictable`) r? `@ghost` `@rustbot` modify labels: rollup

Rollup merge of rust-lang#139351 - EnzymeAD:autodiff-batching2, r=oli-obk Autodiff batching2 ~I will rebase it once my first PR landed.~ done. This autodiff batch mode is more similar to scalar autodiff, since it still only takes one shadow argument. However, that argument is supposed to be `width` times larger. r? `@oli-obk` Tracking: - rust-lang#124509

rustbot assigned oli-obk Apr 4, 2025

rustbot added A-attributes Area: Attributes (`#[…]`, `#![…]`) F-autodiff `#![feature(autodiff)]` S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Apr 4, 2025

This comment has been minimized.

Sign in to view

ZuseZ4 force-pushed the autodiff-batching2 branch from 4b84744 to 2935695 Compare April 4, 2025 23:01

This comment has been minimized.

Sign in to view

ZuseZ4 force-pushed the autodiff-batching2 branch from 2935695 to 88389b5 Compare April 5, 2025 07:10

This comment has been minimized.

Sign in to view

ZuseZ4 force-pushed the autodiff-batching2 branch from 88389b5 to ce3ab30 Compare April 9, 2025 01:00

ZuseZ4 marked this pull request as ready for review April 9, 2025 01:03

oli-obk reviewed Apr 9, 2025

View reviewed changes

compiler/rustc_builtin_macros/src/autodiff.rs Show resolved Hide resolved

compiler/rustc_monomorphize/src/partitioning/autodiff.rs Outdated Show resolved Hide resolved

oli-obk reviewed Apr 9, 2025

View reviewed changes

compiler/rustc_ast/src/expand/autodiff_attrs.rs Outdated Show resolved Hide resolved

oli-obk reviewed Apr 9, 2025

View reviewed changes

compiler/rustc_ast/src/expand/autodiff_attrs.rs Show resolved Hide resolved

ZuseZ4 force-pushed the autodiff-batching2 branch from ce3ab30 to ae6247c Compare April 10, 2025 09:22

This comment has been minimized.

Sign in to view

ZuseZ4 added 2 commits April 16, 2025 17:13

working dupv and dupvonly for fwd mode

a68ae0c

passing test for dualv

d7c0c32

ZuseZ4 force-pushed the autodiff-batching2 branch from ae6247c to d7c0c32 Compare April 16, 2025 21:37

ZuseZ4 requested a review from oli-obk April 16, 2025 22:22

oli-obk approved these changes Apr 17, 2025

View reviewed changes

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Apr 17, 2025

matthiaskrgr mentioned this pull request Apr 17, 2025

Rollup of 8 pull requests #139992

Merged

bors merged commit 87a1635 into rust-lang:master Apr 18, 2025
6 checks passed

rustbot added this to the 1.88.0 milestone Apr 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Autodiff batching2 #139351

Autodiff batching2 #139351

ZuseZ4 commented Apr 4, 2025 •

edited

Loading

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

ZuseZ4 commented Apr 9, 2025 •

edited

Loading

oli-obk left a comment

ZuseZ4 commented Apr 9, 2025 •

edited

Loading

This comment has been minimized.

oli-obk commented Apr 17, 2025

bors commented Apr 17, 2025

Autodiff batching2 #139351

Autodiff batching2 #139351

Conversation

ZuseZ4 commented Apr 4, 2025 • edited Loading

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

ZuseZ4 commented Apr 9, 2025 • edited Loading

oli-obk left a comment

Choose a reason for hiding this comment

ZuseZ4 commented Apr 9, 2025 • edited Loading

This comment has been minimized.

oli-obk commented Apr 17, 2025

bors commented Apr 17, 2025

ZuseZ4 commented Apr 4, 2025 •

edited

Loading

ZuseZ4 commented Apr 9, 2025 •

edited

Loading

ZuseZ4 commented Apr 9, 2025 •

edited

Loading