
experiment(backend): autocast dtype in CustomLinear #7843

Open · wants to merge 2 commits into main
Conversation

@psychedelicious (Collaborator) commented Mar 26, 2025

Summary

This resolves an issue where specifying `float32` precision causes FLUX Fill to error.

I noticed that our other customized torch modules do some dtype casting themselves, so maybe this is a fine place to do this? Maybe this could break things...

See #7836

Related Issues / Discussions

Closes #7836

QA Instructions

Try various model combos. I don't know what I'm doing and this could be a Bad Idea™️.

To reproduce the problem in the linked issue, set precision: float32 in invokeai.yaml, then try to use FLUX Fill.
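For reference, a minimal `invokeai.yaml` excerpt for this repro (all other settings omitted):

```yaml
# invokeai.yaml -- only the setting relevant to this repro
precision: float32
```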

Merge Plan

n/a

Checklist

  • The PR has a short but descriptive title, suitable for a changelog
  • Tests added / updated (if applicable)
  • Documentation added / updated (if applicable)
  • Updated What's New copy (if doing a release after this PR)

@github-actions bot added the python (PRs that change python files) and backend (PRs that change backend files) labels on Mar 26, 2025
```diff
@@ -73,6 +74,10 @@ def _autocast_forward_with_patches(self, input: torch.Tensor) -> torch.Tensor:
     def _autocast_forward(self, input: torch.Tensor) -> torch.Tensor:
         weight = cast_to_device(self.weight, input.device)
         bias = cast_to_device(self.bias, input.device)
 
+        weight = cast_to_dtype(weight, input.dtype)
```
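For context, here is a minimal, self-contained sketch of what the patched `_autocast_forward` ends up doing. The helper implementations below are simplified stand-ins for the repo's `cast_to_device` / `cast_to_dtype`, and the bias cast is an assumption for completeness — only the weight cast is visible in the excerpted hunk:

```python
from typing import Optional

import torch
import torch.nn.functional as F


def cast_to_device(t: Optional[torch.Tensor], device: torch.device) -> Optional[torch.Tensor]:
    # Simplified stand-in for the repo's cast_to_device helper.
    return t if t is None or t.device == device else t.to(device)


def cast_to_dtype(t: Optional[torch.Tensor], dtype: torch.dtype) -> Optional[torch.Tensor]:
    # Simplified stand-in for the repo's cast_to_dtype helper.
    return t if t is None or t.dtype == dtype else t.to(dtype)


class CustomLinearSketch(torch.nn.Linear):
    def _autocast_forward(self, input: torch.Tensor) -> torch.Tensor:
        # Existing behavior: move parameters to the input's device.
        weight = cast_to_device(self.weight, input.device)
        bias = cast_to_device(self.bias, input.device)

        # New in this PR: also match the input's dtype, so a float32 activation
        # can pass through a layer whose weights were loaded in a lower precision.
        weight = cast_to_dtype(weight, input.dtype)
        # The bias cast is assumed here to keep the example self-contained.
        bias = cast_to_dtype(bias, input.dtype) if bias is not None else None

        return F.linear(input, weight, bias)
```

The practical effect is that the dtype mismatch that previously surfaced as an error inside the linear layer is resolved by casting the weights to the activation's dtype at call time.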
A collaborator commented on this change:
This is probably fine, but some models may act weirdly due to potential precision loss if we provide inputs with less precision than the model 🤔 In an ideal world I'd think we'd want to ensure the precision of the inputs is compatible with the model before calling it
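For comparison, the caller-side alternative hinted at in that comment — normalizing input precision before invoking the module — might look roughly like this. This is a hypothetical helper for illustration, not part of this PR:

```python
import torch


def match_input_to_module_dtype(x: torch.Tensor, module: torch.nn.Module) -> torch.Tensor:
    # Hypothetical caller-side guard: cast the activation to the module's
    # parameter dtype before the forward call, instead of casting weights
    # inside the module as this PR does. This avoids touching the weights,
    # but can still lose precision on the activation itself.
    target = next(module.parameters()).dtype
    return x if x.dtype == target else x.to(target)
```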

Successfully merging this pull request may close these issues.

[bug]: Flux Fill inpainting does not work