Model offload #3889

pcuenca · 2023-06-28T10:43:56Z

GPU memory use stays < 8 GB with model offload (without the refiner).

HuggingFaceDocBuilderDev · 2023-06-28T10:49:54Z

The documentation is not available anymore as the PR was closed or merged.

pcuenca · 2023-06-28T11:23:47Z

When running the refiner, we see a peak of ~9 GB because the call sequence is:

text_encoder_2 -> vae.encode -> [unet loop] -> vae.decode

However, the offload chain goes text_encoder_2 -> unet -> vae. Therefore, the vae.encode call does not automatically offload the text encoder, which is still on the GPU at that point. If it did, we'd have < 8 GB memory consumption.

Not sure if it makes sense to hardcode a .to("cpu") in this case. How do you feel about it, @patrickvonplaten @sayakpaul @williamberman?
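To make the mismatch between the call sequence and the offload chain concrete, here is a minimal pure-Python sketch. It is a simplified model, not diffusers or accelerate code: it assumes each module's hook offloads only the module that precedes it in the chain, and the names `OFFLOAD_CHAIN`, `CALL_SEQUENCE`, and `trace_residency` are hypothetical. It tracks which modules are resident on the GPU at each call, with and without an explicit text-encoder offload before vae.encode.

```python
# Simplified model of chained offload hooks (illustrative only, not library code).
# Assumption: calling a module first offloads its predecessor in the chain,
# then moves the module itself onto the GPU.
OFFLOAD_CHAIN = ["text_encoder_2", "unet", "vae"]

# (call label, module used), in the order described in the comment above.
CALL_SEQUENCE = [
    ("text_encoder_2", "text_encoder_2"),
    ("vae.encode", "vae"),
    ("unet loop", "unet"),
    ("vae.decode", "vae"),
]


def trace_residency(offload_encoder_before_encode=False):
    """Return {call label: sorted list of modules on the GPU during that call}."""
    previous = {
        module: (OFFLOAD_CHAIN[i - 1] if i > 0 else None)
        for i, module in enumerate(OFFLOAD_CHAIN)
    }
    on_gpu = set()
    trace = {}
    for call, module in CALL_SEQUENCE:
        if offload_encoder_before_encode and call == "vae.encode":
            # Stand-in for the hardcoded .to("cpu") on the text encoder.
            on_gpu.discard("text_encoder_2")
        # The hook offloads the chain predecessor (a no-op if it is not resident).
        on_gpu.discard(previous[module])
        on_gpu.add(module)
        trace[call] = sorted(on_gpu)
    return trace


if __name__ == "__main__":
    for flag in (False, True):
        print("explicit offload:", flag)
        for call, resident in trace_residency(flag).items():
            print(f"  {call:15s} -> {resident}")
```

In this model, the text encoder is still resident during vae.encode unless it is offloaded explicitly, which matches the behaviour described above. The sketch only tracks residency; it says nothing about actual memory figures.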

Saves some GPU RAM in img2img / refiner tasks so it remains below 8 GB.

HuggingFaceDocBuilderDev · 2023-06-30T10:43:24Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

* Add new text encoder
* add transformers depth
* More
* Correct conversion script
* Fix more
* Fix more
* Correct more
* correct text encoder
* Finish all
* proof that in works in run local xl
* clean up
* Get refiner to work
* Add red castle
* Fix batch size
* Improve pipelines more
* Finish text2image tests
* Add img2img test
* Fix more
* fix import
* Fix embeddings for classic models (#3888)
  Fix embeddings for classic SD models.
* Allow multiple prompts to be passed to the refiner (#3895)
* finish more
* Apply suggestions from code review
* add watermarker
* Model offload (#3889)
  * Model offload.
  * Model offload for refiner / img2img
  * Hardcode encoder offload on img2img vae encode
    Saves some GPU RAM in img2img / refiner tasks so it remains below 8 GB.
  ---------
  Co-authored-by: Patrick von Platen <[email protected]>
* correct
* fix
* clean print
* Update install warning for `invisible-watermark`
* add: missing docstrings.
* fix and simplify the usage example in img2img.
* fix setup for watermarking.
* Revert "fix setup for watermarking."
  This reverts commit 491bc9f.
* fix: watermarking setup.
* fix: op.
* run make fix-copies.
* make sure tests pass
* improve convert
* make tests pass
* make tests pass
* better error message
* fiinsh
* finish
* Fix final test
---------
Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Sayak Paul <[email protected]>


Model offload.

1c7c6ba

pcuenca marked this pull request as draft June 28, 2023 10:44

Model offload for refiner / img2img

0393c4d

pcuenca and others added 2 commits June 28, 2023 18:52

Hardcode encoder offload on img2img vae encode

6fd4aaa

Saves some GPU RAM in img2img / refiner tasks so it remains below 8 GB.

Merge branch 'sd_xl' into sd_xl_offload

fe3464f

patrickvonplaten marked this pull request as ready for review June 30, 2023 10:37

patrickvonplaten merged commit 558ef96 into sd_xl Jun 30, 2023

patrickvonplaten deleted the sd_xl_offload branch June 30, 2023 10:38
