-
-
Notifications
You must be signed in to change notification settings - Fork 7.8k
[FalconH1] Fix output dtype in RMSNorm fallback path for Falcon-H1 (e.g. 0.5B) #18500
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
57 commits
Select commit
Hold shift + click to select a range
8e27206
implement falcon mamba 2 architecture
dhiaEddineRhaiem 191a588
push changes
younesbelkada d54a010
more changes
younesbelkada d86dd28
more changes
younesbelkada a614d9b
more fixes
younesbelkada 004d4e4
more fixes
younesbelkada d2e74be
remove prints
younesbelkada 12f2dab
fix
younesbelkada 4e131c5
some clean up
younesbelkada d882e9f
Merge remote-tracking branch 'upstream/main' into falcon_mamba2
younesbelkada 331ea6d
fixes
younesbelkada 6379856
final fixes
younesbelkada 02e9d89
fix
younesbelkada 615f234
add docs
younesbelkada 76e4554
fix hard-coded tied weights
younesbelkada cfc959f
fix logits issue
younesbelkada 3834a92
Merge remote-tracking branch 'upstream/main' into falcon_mamba2
younesbelkada 536898c
chore: rename to `FalconH1`
younesbelkada 54cc7af
chore: rename file
younesbelkada a8762f2
fix issue
younesbelkada 4ede3a2
fix Tensor parallelism Falcon H1
dhiaEddineRhaiem 85f4292
Merge branch 'falcon-h1-clean' of https://github.com/tiiuae/vllm-inte…
dhiaEddineRhaiem ab427c5
adress maintainer comment on the PR
dhiaEddineRhaiem d585425
formatting
dhiaEddineRhaiem 087cf17
fix pp and residual
ilyasch2 82a365c
clean
ilyasch2 ce57226
Merge remote-tracking branch 'upstream/main' into final_fh1
ilyasch2 958a159
fix
ilyasch2 d0e3c31
clean
ilyasch2 6579ec8
remove unnecessary arguments
ilyasch2 32296aa
clean
ilyasch2 6f738ad
clean
ilyasch2 0189427
modify default rope theta
ilyasch2 37fa5c0
Merge remote-tracking branch 'origin/main' into final_fh1
younesbelkada 4068ff0
fix
younesbelkada 20a84ea
chore: update supported models
JingweiZuo 9fdbf19
chore: clean CI issue
JingweiZuo 2d7ce65
adress TODO tasks and fix formatting
dhiaEddineRhaiem 91e3b2d
fix ruff formatting
dhiaEddineRhaiem 55b132d
fix format
dhiaEddineRhaiem 6df6407
style
dhiaEddineRhaiem bd08b81
fix pre-commit checks
dhiaEddineRhaiem 57d0332
Update requirements/test.txt from pip-compile
dhiaEddineRhaiem e201b92
fix pre-compile issues
dhiaEddineRhaiem baa2722
adress comments
dhiaEddineRhaiem cd51b64
fix: ruff
dhiaEddineRhaiem 4a58965
Merge remote-tracking branch 'public/main' into final_fh1
dhiaEddineRhaiem 559457b
adress comments
dhiaEddineRhaiem 79e94f1
small fix
dhiaEddineRhaiem 1f890d3
adress comment b2
dhiaEddineRhaiem 5f39764
add type annotations
dhiaEddineRhaiem 647f93a
fix test model TYPO
dhiaEddineRhaiem 14e14c7
Merge remote-tracking branch 'upstream/main' into final_fh1
dhiaEddineRhaiem f630452
add min_transformers_version
dhiaEddineRhaiem 87be690
fix norm output dtype when use_rms_norm is not used
dhiaEddineRhaiem e0811a0
Merge branch 'main' into final_fh1
dhiaEddineRhaiem 939927a
add comment for silu precision
dhiaEddineRhaiem File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.