wgpu should cache pipelines #7716

Open

jimblandy opened this issue May 22, 2025 · 8 comments · May be fixed by #7729
Labels: area: performance · backend: dx12 · backend: vulkan · type: enhancement

Comments

@jimblandy
Member

wgpu is too slow when render or compute pipelines are created repeatedly.

At present, each call to wgpu_core::global::Global::device_create_compute_pipeline results in a separate call to wgpu_hal::Device::create_compute_pipeline. Render pipelines are similar. However, the WebGPU specification says (§2.2.4 User Agent State):

... It is expected that user agents will have compilation caches for the result of expensive compilation like GPUShaderModule, GPURenderPipeline and GPUComputePipeline.

This means that applications are within their rights to assume that calling createRenderPipeline in every animation frame, with the same parameters, should be cheap.
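
For concreteness, a minimal sketch of that pattern in Rust against the wgpu crate (the shader module, entry point, and labels here are hypothetical, and the descriptor fields track recent wgpu releases):

```rust
// Called once per animation frame with the same inputs. Per the spec text
// above, an application may assume the repeated call is cheap because the
// implementation caches the compilation result.
fn per_frame_pipeline(
    device: &wgpu::Device,
    module: &wgpu::ShaderModule, // same shader module every frame
) -> wgpu::ComputePipeline {
    device.create_compute_pipeline(&wgpu::ComputePipelineDescriptor {
        label: Some("per-frame compute pipeline"),
        layout: None, // derive the layout from the shader
        module,
        entry_point: Some("main"),
        compilation_options: Default::default(),
        cache: None,
    })
}
```

Today wgpu_core forwards each such call straight to wgpu_hal, so the backend redoes the compilation every time.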

This Firefox profile shows that the Marching Cubes demo ends up pegging the Canvas Renderer thread running DXC. The profile shows other problems too, like each requestAnimationFrame call taking 270ms, but the time spent in DXC is what limits the requestAnimationFrame rate to only around 1fps.

@jimblandy
Member Author

It seems like using wgpu_hal's existing pipeline cache feature would actually fix this for Vulkan. Even if you just create an empty pipeline cache, Vulkan says:

Pipeline cache objects allow the result of pipeline construction to be reused between pipelines and between runs of an application. Reuse between pipelines is achieved by passing the same pipeline cache object when creating multiple related pipelines. Reuse across runs of an application is achieved by retrieving pipeline cache contents in one run of an application, saving the contents, and using them to preinitialize a pipeline cache on a subsequent run.

The dx12 backend supplies only a dummy implementation. I don't know if Direct3D has anything that behaves the way Vulkan's pipeline cache objects do. cc @cwfitzgerald @magcius
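
For the Vulkan path, wgpu already exposes a thin wrapper over that hal feature to users behind wgpu::Features::PIPELINE_CACHE. A minimal sketch of the mechanism, written against the user-facing API (labels and persistence handling are illustrative, and the feature must be requested at device creation):

```rust
// Sketch only: create an initially empty pipeline cache and attach it to every
// pipeline, letting the Vulkan driver reuse compilation work between pipelines
// and across runs. Requires wgpu::Features::PIPELINE_CACHE.
fn build_with_cache(
    device: &wgpu::Device,
    module: &wgpu::ShaderModule,
) -> (wgpu::ComputePipeline, Option<Vec<u8>>) {
    let cache = unsafe {
        device.create_pipeline_cache(&wgpu::PipelineCacheDescriptor {
            label: Some("device-wide pipeline cache"),
            data: None,      // or bytes saved from a previous run
            fallback: true,  // fall back to an empty cache if `data` is stale
        })
    };

    let pipeline = device.create_compute_pipeline(&wgpu::ComputePipelineDescriptor {
        label: Some("cached compute pipeline"),
        layout: None,
        module,
        entry_point: Some("main"),
        compilation_options: Default::default(),
        cache: Some(&cache), // shared between related pipelines
    });

    // The cache contents can be written to disk and fed back via `data` next run.
    (pipeline, cache.get_data())
}
```

Presumably wgpu_core could do the equivalent internally with a device-owned cache.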

@cwfitzgerald
Member

There is ID3D12PipelineLibrary, but from what I've heard it focuses on serializing pipelines between runs rather than deduplicating them within a run. D3D12 does have implicit pipeline caches, though our biggest cost on d3d12 is compiling HLSL -> DXIL/DXBC, which is a user-space operation, so we'd need to cache those.

@magcius
Collaborator

magcius commented May 23, 2025 via email

@hakolao
Contributor

hakolao commented May 23, 2025

I gotta ask: what is the use case for pipeline caching? I create mine once (or recreate them on shader reload). In other words, what is this caching for if one shouldn't keep recreating pipelines every frame anyway?

Vulkan says: The big advantage of a pipeline cache is that the pipeline state can be saved to a file to be used between runs of an application

Is this something different? (Please explain as if I know nothing.)

@magcius
Collaborator

magcius commented May 23, 2025 via email

@teoxoy
Member

teoxoy commented May 23, 2025

One of the Unity demos (https://vfx-demo.cds.unity3d.com) also stutters occasionally because DxcCompiler::Compile is too slow (profile: https://share.firefox.dev/4kx1wq2). Looking at its API calls (trace.zip):

  • 13 calls to CreateComputePipeline
  • 163 calls to CreateRenderPipeline
  • 39 calls to CreateShaderModule
    • entry points in those modules:
      • 13 @compute
      • 12 @fragment
      • 14 @vertex

though our biggest cost on d3d12 is compiling HLSL -> DXIL/DXBC which is a user space operation, so we'd need to cache those.

I think some cache keyed off the shader module (and codegen-relevant state) would prevent a lot of duplicate work here.

I also think we can try caching just the bytecode (to avoid calling into FXC/DXC); that will probably get us far enough. Caching whole pipelines is more involved, since we'd also need to cache everything else that makes up a pipeline.
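
A rough sketch of what such a bytecode cache could look like (everything here is hypothetical and not wgpu's actual internals): key the compiled blob on the shader module plus whatever state affects codegen, and only call into DXC/FXC on a miss.

```rust
use std::collections::HashMap;
use std::sync::Mutex;

// Hypothetical cache key: everything that can change the generated DXIL/DXBC.
#[derive(Clone, PartialEq, Eq, Hash)]
struct ShaderKey {
    module_hash: u64,    // hash of the shader module's source / naga IR
    entry_point: String, // e.g. "main"
    codegen_state: u64,  // hash of pipeline-layout and backend options that affect codegen
}

#[derive(Default)]
struct BytecodeCache {
    blobs: Mutex<HashMap<ShaderKey, Vec<u8>>>,
}

impl BytecodeCache {
    /// Return a cached blob, invoking `compile` (standing in for the FXC/DXC
    /// call) only on a cache miss. For simplicity the sketch holds the lock
    /// while compiling.
    fn get_or_compile(&self, key: ShaderKey, compile: impl FnOnce() -> Vec<u8>) -> Vec<u8> {
        self.blobs
            .lock()
            .unwrap()
            .entry(key)
            .or_insert_with(compile)
            .clone()
    }
}
```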

@teoxoy
Member

teoxoy commented May 23, 2025

Regarding the Marching Cubes demo, this is what it's calling every frame: https://github.com/tcoppex/webgpu-marchingcubes/blob/e464ccf192dcd9ded794dae5593a0e4cbedf487a/js/utils.js#L296

@jimblandy
Member Author

P1 for Firefox because it blocks Marching Cubes.

@cwfitzgerald added the type: enhancement, area: performance, backend: dx12, and backend: vulkan labels on May 28, 2025
Projects
Status: Todo

5 participants