[docs] Regional compilation docs #11556

sayakpaul · 2025-05-15T04:01:20Z

What does this PR do?

sayakpaul · 2025-05-15T04:02:06Z

docs/source/en/optimization/torch2.0.md

+the repeated blocks of the provided `nn.Module`.
+
+```py
+# Make sure you're on the latest `accelerate`: `pip install -U accelerate`.


Merge after the new `accelerate` version is released this week.

HuggingFaceDocBuilderDev · 2025-05-15T04:08:06Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

IlyasMoutawwakil · 2025-05-15T11:23:59Z

docs/source/en/optimization/torch2.0.md

+Enabling regional compilation might require simple yet intrusive changes to the
+modeling code. However, 🤗 Accelerate provides a utility [`compile_regions()`](https://huggingface.co/docs/accelerate/main/en/usage_guides/compilation#how-to-use-regional-compilation) which automatically _only_ compiles
+the repeated blocks of the provided `nn.Module`.


No, we actually compile the rest of the model as well 😅 I found out from my post that some people thought only the encoder/decoder blocks get compiled in regional compilation, which is not true.
I changed the docs to be more explicit: huggingface/accelerate#3572 (comment)

But https://docs.pytorch.org/tutorials/recipes/regional_compilation.html suggests a completely different recipe, no? It does no full compilation, only regional, and I always thought that is what should be done.

What am I missing?

Regional compilation is simply: cut the model into regions and then compile those regions. I didn't compare the two approaches, but I believe the PyTorch tutorial was simply trying to reduce cold start, not trying to keep inference optimized as well (they didn't benchmark inference).
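To make "cut into regions and then compile those regions" concrete, here is a minimal sketch using plain `torch.compile`. It is not how `compile_regions()` in Accelerate is implemented; `ToyModel`, `Block`, and `compile_by_region` are hypothetical names made up for illustration, and the `backend` argument is only exposed so the sketch can be tried with `backend="eager"` on machines without a working Inductor toolchain.

```python
import torch
import torch.nn as nn


class Block(nn.Module):
    """Toy repeated block (a stand-in for a transformer layer)."""

    def __init__(self, dim: int):
        super().__init__()
        self.linear = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + torch.relu(self.linear(x))


class ToyModel(nn.Module):
    """Hypothetical model: embedding, a stack of identical blocks, and a head."""

    def __init__(self, dim: int = 16, depth: int = 4, num_classes: int = 3):
        super().__init__()
        self.embed = nn.Linear(dim, dim)  # non-repeated part
        self.blocks = nn.ModuleList([Block(dim) for _ in range(depth)])
        self.head = nn.Linear(dim, num_classes)  # task-specific head

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.embed(x)
        for block in self.blocks:
            x = block(x)
        return self.head(x)


def compile_by_region(model: ToyModel, backend: str = "inductor") -> ToyModel:
    """Compile each region on its own instead of the whole model at once."""
    # Repeated blocks: each one is wrapped separately, so the compiler only
    # ever sees a single block's forward pass.
    for i, block in enumerate(model.blocks):
        model.blocks[i] = torch.compile(block, backend=backend)
    # The rest of the model is compiled too, as discussed in this thread.
    model.embed = torch.compile(model.embed, backend=backend)
    model.head = torch.compile(model.head, backend=backend)
    return model


if __name__ == "__main__":
    model = ToyModel().eval()
    x = torch.randn(2, 16)
    with torch.no_grad():
        reference = model(x)
        # "eager" skips code generation; use the default backend for real speedups.
        out = compile_by_region(model, backend="eager")(x)
    print(torch.allclose(out, reference, atol=1e-6))
```

Compilation is lazy, so nothing is actually traced until the first forward pass. Note that wrapping submodules with `torch.compile` changes their `state_dict` key prefixes, which is one reason to prefer a maintained utility over a hand-rolled loop like this one.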

So

inference latency of compiling full model >= inference latency of regionally compiling repeated blocks + compiling additional blocks in a model

cold start time of compiling full model >> cold start time of regionally compiling repeated blocks + compiling additional blocks in a model

Is my understanding right or is it still fragmented?

Do you think providing an option to NOT compile the rest of the blocks could still make sense?

Yes, that is how it works!

> Do you think providing an option to NOT compile the rest of the blocks could still make sense?

It doesn't make sense to me personally, since you would miss out on the tuning of the task-specific head. Do you have any specific cases where we wouldn't want to compile the rest of the model?
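The cold-start comparison from this thread can be illustrated with a toy cost model. This is a back-of-the-envelope sketch, not a benchmark: it assumes that compiling the full model costs roughly one block's compile time per block (because every block is traced as part of one large graph), while regional compilation pays for one block and reuses the result for the identical ones. `cold_start_estimate` and all numbers are hypothetical.

```python
def cold_start_estimate(num_blocks: int, block_cost: float, rest_cost: float):
    """Return (full, regional) compile-time estimates in arbitrary units.

    Assumptions (illustrative only):
      * full compilation pays `block_cost` once per repeated block;
      * regional compilation pays `block_cost` once and reuses the compiled
        code for the remaining identical blocks;
      * both approaches pay `rest_cost` for the non-repeated parts
        (embeddings, task-specific head, ...), since those get compiled too.
    """
    full = num_blocks * block_cost + rest_cost
    regional = block_cost + rest_cost
    return full, regional


# Hypothetical numbers: 24 identical blocks, 2.0 units per block, 1.0 for the rest.
full, regional = cold_start_estimate(num_blocks=24, block_cost=2.0, rest_cost=1.0)
print(f"full: {full}, regional: {regional}")  # full: 49.0, regional: 3.0
```

Under these assumptions the gap grows linearly with the number of repeated blocks, which matches the intuition behind the `>>` in the cold-start line above. Actual numbers depend on the model, backend, and hardware and have to be measured.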

SunMarc

Thanks!

sayakpaul added 2 commits May 15, 2025 09:30

add regional compilation docs.

b87a962

minor.

581cba4

sayakpaul requested review from SunMarc and IlyasMoutawwakil May 15, 2025 04:01

sayakpaul mentioned this pull request May 15, 2025

[gguf] Refactor __torch_function__ to avoid unnecessary computation #11551

Merged

6 tasks

sayakpaul commented May 15, 2025

Merge branch 'main' into regional-compilation-docs

ea889c1

IlyasMoutawwakil reviewed May 15, 2025

SunMarc approved these changes May 15, 2025

sayakpaul added 2 commits May 15, 2025 18:42

Merge branch 'main' into regional-compilation-docs

c7eb7fe

reviwer feedback.

8881dc6

sayakpaul requested a review from IlyasMoutawwakil May 15, 2025 13:14

IlyasMoutawwakil reviewed May 15, 2025

Update docs/source/en/optimization/torch2.0.md

bacd403

Co-authored-by: Ilyas Moutawwakil <[email protected]>

sayakpaul merged commit 9836f0e into main May 15, 2025
5 checks passed

sayakpaul deleted the regional-compilation-docs branch May 15, 2025 13:41

DN6 added the roadmap Add to current release roadmap label Jun 5, 2025

github-project-automation bot added this to Diffusers Roadmap 0.34 Jun 5, 2025

github-project-automation bot moved this to In Progress in Diffusers Roadmap 0.34 Jun 5, 2025

DN6 moved this from In Progress to Done in Diffusers Roadmap 0.34 Jun 5, 2025
