[Inference Providers] Support for LoRAs #3005

hanouticelina · 2025-04-15T09:38:43Z

This work mirrors the PR in the js client (see huggingface.js#1347).

This PR introduces the support of LoRAs for fal-ai, this should not be merged until the server-side PR is merged (private).

Changes

update _prepare_mapped_model to return a ProviderMappingInfo object. This object returns the provider id, the hf model id (optional) and the lora weights path (optional) that is expected to be sent in the payload.
update _prepare_payload_as_dict and _prepare_payload_as_bytes signatures to accept a ProviderMappingInfo object as an argument.

the logic is implemented within the TaskProviderHelper base class, to avoid duplicating similar logic when we support LoRAs for the other providers. Note that LoRAs can also be used with other tasks like text-to-video as instance.

Wauplin

I haven't tested locally yet but left some high-level comments related to naming + structure to match as much as possible the JS counterpart.

src/huggingface_hub/inference/_providers/_common.py

HuggingFaceDocBuilderDev · 2025-04-15T15:41:54Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

… into fal-ai-loras

Wauplin

Looking forward to this!

src/huggingface_hub/inference/_providers/_common.py

src/huggingface_hub/inference/_providers/fal_ai.py

… into fal-ai-loras

@Wauplin

) Following @Wauplin 's comment [here](huggingface/huggingface_hub#3005 (comment)) Pre-requisite: merge and deploy [this internal PR first](huggingface-internal/moon-landing#13498)

SBrandeis

lgtm

src/huggingface_hub/inference/_providers/_common.py

…l-ai-loras

hanouticelina · 2025-04-25T13:58:36Z

✅ I tested the PR with :

from huggingface_hub import InferenceClient


client = InferenceClient(
    provider="fal-ai",
)
image = client.text_to_image(
    prompt="a boy and a girl looking out of a window with a cat perched on the window sill",
    model="openfree/flux-chatgpt-ghibli-lora",
)

output image :

Wauplin

All good! Pre-approving but still curious about this hardcoded model_name

src/huggingface_hub/inference/_providers/fal_ai.py

src/huggingface_hub/hf_api.py

Co-authored-by: Lucain <[email protected]>

hanouticelina · 2025-04-29T14:59:27Z

i'm merging, CI is red for unrelated reasons (hub-ci seems to be down)

hanouticelina added 2 commits April 15, 2025 11:17

add loras support

c161e60

nit

ea59870

hanouticelina requested review from julien-c, Wauplin and SBrandeis April 15, 2025 09:38

Wauplin reviewed Apr 15, 2025

View reviewed changes

src/huggingface_hub/inference/_providers/_common.py Outdated Show resolved Hide resolved

src/huggingface_hub/inference/_providers/_common.py Outdated Show resolved Hide resolved

src/huggingface_hub/inference/_providers/_common.py Outdated Show resolved Hide resolved

review suggestions

99e14a0

hanouticelina added 4 commits April 15, 2025 17:46

update inference provider mapping object

422f5c5

Merge branch 'main' into fal-ai-loras

26f06c0

fix tests

9558ea8

Merge branch 'fal-ai-loras' of github.com:huggingface/huggingface_hub…

51255a7

… into fal-ai-loras

hanouticelina requested a review from Wauplin April 16, 2025 09:17

Wauplin reviewed Apr 16, 2025

View reviewed changes

SBrandeis self-assigned this Apr 16, 2025

fixes

312a210

hanouticelina requested a review from Wauplin April 17, 2025 10:26

Merge branch 'main' into fal-ai-loras

89f060c

SBrandeis mentioned this pull request Apr 23, 2025

[Inference] LoRA: use the precomputed adapterWeightsPath property huggingface/huggingface.js#1379

Merged

hanouticelina added 2 commits April 24, 2025 22:24

use the precomputed adapterWeightsPath property

f55e628

Merge branch 'fal-ai-loras' of github.com:huggingface/huggingface_hub…

0f21d6a

… into fal-ai-loras

SBrandeis approved these changes Apr 25, 2025

View reviewed changes

src/huggingface_hub/inference/_providers/_common.py Outdated Show resolved Hide resolved

hanouticelina added 2 commits April 25, 2025 15:49

remove unnecessary function

b59a123

Merge branch 'main' of github.com:huggingface/huggingface_hub into fa…

49d725c

…l-ai-loras

Wauplin approved these changes Apr 29, 2025

View reviewed changes

src/huggingface_hub/inference/_providers/fal_ai.py Outdated Show resolved Hide resolved

src/huggingface_hub/hf_api.py Outdated Show resolved Hide resolved

hanouticelina and others added 3 commits April 29, 2025 15:45

Update src/huggingface_hub/hf_api.py

b982612

Co-authored-by: Lucain <[email protected]>

style

b300bce

add comment

a2974bf

Co-authored-by: Lucain <[email protected]>

Wauplin approved these changes Apr 29, 2025

View reviewed changes

hanouticelina merged commit bebc1f7 into main Apr 29, 2025
24 of 25 checks passed

hanouticelina deleted the fal-ai-loras branch April 29, 2025 14:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Inference Providers] Support for LoRAs #3005

[Inference Providers] Support for LoRAs #3005

hanouticelina commented Apr 15, 2025

Wauplin left a comment

HuggingFaceDocBuilderDev commented Apr 15, 2025

Wauplin left a comment

SBrandeis left a comment

hanouticelina commented Apr 25, 2025 •

edited

Loading

Wauplin left a comment

hanouticelina commented Apr 29, 2025

[Inference Providers] Support for LoRAs #3005

[Inference Providers] Support for LoRAs #3005

Conversation

hanouticelina commented Apr 15, 2025

Changes

Wauplin left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Apr 15, 2025

Wauplin left a comment

Choose a reason for hiding this comment

SBrandeis left a comment

Choose a reason for hiding this comment

hanouticelina commented Apr 25, 2025 • edited Loading

Wauplin left a comment

Choose a reason for hiding this comment

hanouticelina commented Apr 29, 2025

hanouticelina commented Apr 25, 2025 •

edited

Loading