huggingface / transformers Public

Pull requests: huggingface/transformers

756 Open 19,736 Closed

Pull requests list

[chat] improvements for thinking models and reduce default verbosity

#38322 opened May 23, 2025 by gante

[FlexAttention] Reenable flex for encoder-decoder and make the test more robust

#38321 opened May 23, 2025 by vasqu

[Falcon H1] Fix slow path forward pass

#38320 opened May 23, 2025 by dhiaEddineRhaiem

Fix some tests (especially compile with fullgraph=True on Python<3.11)

#38319 opened May 23, 2025 by Cyrilvallez

5 tasks

[VLM] modeling updates

#38317 opened May 23, 2025 by zucchini-nlp

[Tests] Clean up test cases for few models

#38315 opened May 23, 2025 by yaswanth19 • Draft

5 tasks

Use Gradient Checkpointing Layer in Jamba & Blip Related Models

#38310 opened May 22, 2025 by alex-jw-brooks

Update altCLIP model card

#38306 opened May 22, 2025 by EmileAydar

Remove duplicate docstring: resample

#38305 opened May 22, 2025 by qqii

2 of 5 tasks

[custom_generate] don't forward custom_generate and trust_remote_code

#38304 opened May 22, 2025 by gante

Updated the model card for ViTMAE

#38302 opened May 22, 2025 by mreraser

2 of 4 tasks

🔴[Attention] Bert-based Models Attention Refactor

#38301 opened May 22, 2025 by vasqu • Draft

new failure CI reports

#38298 opened May 22, 2025 by ydshieh

Fix from_args_and_dict ProcessorMixin

#38296 opened May 22, 2025 by yonigozlan

Fix image token mask in Gemma3

#38295 opened May 22, 2025 by Cyrilvallez

[OPT] Fix attention scaling

#38290 opened May 22, 2025 by vasqu

Early-error

#38288 opened May 22, 2025 by ArthurZucker

Fix pytorch DTensor import issue.(version 2)

#38287 opened May 22, 2025 by syog1ne

fix total batch size calculation in trainer

#38286 opened May 22, 2025 by inkcherry

5 tasks

Utility script for generating CI reports locally

#38285 opened May 22, 2025 by ahadnagy • Draft

5 tasks

align xpu's autocast behavior w/ cuda by using device agnostic torch.autocast

#38284 opened May 22, 2025 by yao-matrix

Add zero dim tensor check when using flash_attention

#38280 opened May 22, 2025 by ranzhejiang

[performance_optim] reduce frequency of declaring attention_mask in Ascend NPU flash attention

#38278 opened May 22, 2025 by FightingZhen

1 of 5 tasks

Add Normalized-GPT Architecture

#38276 opened May 22, 2025 by shan18

4 of 5 tasks

Fix the shape of ModernBertForMaskedLM's output hidden_states

#38272 opened May 21, 2025 by sheryc • Draft

1 of 5 tasks
