Performance Tuning Guide is very out of date #2861

Closed
msaroufim opened this issue May 9, 2024 · 9 comments · Fixed by #2889 or #2912

Comments

@msaroufim
Member

msaroufim commented May 9, 2024

🚀 Describe the improvement or the new tutorial

The first thing you see when you Google PyTorch performance is this recipe. It's well written, but it's very much out of date today:
https://pytorch.org/tutorials/recipes/recipes/tuning_guide.html

Some concrete things we should fix

  1. For fusions we should talk about torch.compile instead of jit.script
  2. We should mention overhead reduction with CUDA graphs
  3. We should point to the *-fast series as places people can learn more
  4. For CPU-specific optimizations, the most important one is launcher core pinning, so we should either make it a default or explain it in more detail
  5. Instead of the CPU section, we could go deeper into the Inductor CPU backend
  6. The AMP section is fine, but maybe expand it to cover quantization
  7. The DDP section should be moved elsewhere, together with an FSDP performance guide
  8. The GPU sync section is good
  9. Mention tensor cores, how to enable them, and why they're not enabled by default

cc @sekyondaMeta @svekars @kit1980 @drisspg who first made me aware of this with an internal note that was important enough to make public

Existing tutorials on this topic

No response

Additional context

No response

@svekars
Contributor

svekars commented May 15, 2024

Related: #2695

@orion160
Contributor

orion160 commented Jun 4, 2024

It's pretty interesting. I'll open a PR updating it in the coming days!

@msaroufim
Member Author

feel free to ping me here or on https://discord.gg/FBMQJQJn whenever you're ready for a review

@orion160
Contributor

orion160 commented Jun 4, 2024

/assigntome

@orion160
Contributor

orion160 commented Jun 5, 2024

@msaroufim What are the *-fast series?

@orion160
Contributor

orion160 commented Jun 6, 2024

[x] Ops fusion with torch.compile -> done by @desertfire
[ ] CUDA graphs
[ ] *-fast -> ???
[ ] Core pinning -> already in the doc; add a minor section explaining it
[ ] Inductor CPU backend -> ???
[ ] Tensor core explanation

@msaroufim
Member Author

@orion160
Contributor

orion160 commented Jun 7, 2024

Hmmm, I read it. I could mention it, but it feels a bit out of place... The tuning guide covers high-level tweaks, while these blog entries cover optimizations in an ad hoc, model-centric way.

@orion160
Contributor

orion160 commented Jun 7, 2024

It could be an epilogue mentioning these case studies.

@msaroufim Do you think the issue can be considered complete with those changes?
