WIP: Integrate SpQR + FSDP functionalities #840

Titus-von-Koeller · 2023-10-23T13:37:37Z

No description provided.

…cfgs

github-actions · 2023-12-20T15:09:33Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

@TimDettmers

This PR adds initial FSDP support for training QLoRA models. It enables basic FSDP and CPU Offload support, with low memory training via FSDP.sync_module_states option unsupported. This PR builds off of #840 commit 8278fca and BNB FSDP by @TimDettmers and @Titus-von-Koeller. An example of using this PR to finetune QLoRA models with FSDP can be found in the demo repo: AnswerDotAi/fsdp_qlora. * Minimal changes for fp32 4bit storage from BNB commit 8278fca * Params4bit with selectable storage dtype * possible fix for double quantizing linear weight & quant storage dtype * minor fixes in Params4bit for peft tests * remove redundant * add float16 * update test * Remove float16 quant cast as there are fp32, bf16, & fp16 quant kernels --------- Co-authored-by: Kerem Turgutlu <[email protected]>

TimDettmers and others added 30 commits August 5, 2023 16:47

Updated 8-bit optimizers to blocksize 256.

bc33a49

Removed non-blockwise optimizers.

5672e15

Added fsdp test.

61eaf7b

Some more custom data type and fsdp test.

bda3722

Added simple swapping for Params4bit.

c28984c

Added missing funcions.

98b57ba

Small fixes to swapping logic

0076227

Switched subclasses.

fe7e0a6

Working swapping.

6d15795

Added swapping benchmark.

dca5a7e

spqr: initial implementation outline for collaborative review

91aa1f8

Simple packing code with test that fails.

3d6ce45

inline spqr quantized model loading, only formatting changed

1a79a80

refactor spqr easier for layer-wise operations

a49fed2

chore: improve project setup - dev environment, formatting + linting …

1fdd218

…cfgs

Added 3-bit packing code with tests (all green).

f78a6d1

Comments for permutation order quantization.

050b345

Added row-wise correct kernel. Test needs to be adjusted.

3bfd4af

format + lint nn/modules.py

4b89b6b

Better FSDP tests. Failing mixed grads test.

3ce14e9

Added manually wrapped mixed gradient test.

9e23d14

Added fixes to Linear4bit for FSDP.

8278fca

spqr: packing, custom linear layer, improvements, bug in cuda

de5259d

updated conda dev env

441960d

improve linting config

b11a663

add pytest config for default args, etc

ba1d319

Fixed indexing error in kPack3Bits.

361af91

fsdp: outline nested module wrapping integration

b5acc7d

fsdp: starting point for tests (Titus)

a2461dd

Added random port number to fsdp tests.

9a391f9

Titus-von-Koeller added 10 commits October 10, 2023 11:55

fsdp: preliminary implementation + tests

68ed65a

updated dev environment

cb97b46

added small improvements to nn introspection helpers

bfc113a

auto-formatting

ca404d0

delete superfluous files

b2f3082

fsdp test: add delay to allow GC to collect remains of old process group

b0d4249

fsdp: manual wrapping w/o use_orig_params test working

ea0a096

enable isort

baec8cb

auto-formatting

27c7acd

fsdp: add LoRA toy model

2e6ac0a

Titus-von-Koeller changed the title ~~WIP: SpQR and FSDP integrations~~ WIP: Integrate SpQR + FSDP functionalities Oct 23, 2023

Titus-von-Koeller added 7 commits October 24, 2023 13:51

update dependencies

407b8c0

fix f-string

9c6212c

setup: minor fixes + auto formatting

85246f0

utils: polish replace_linear

3449684

polish + test replace_linear

a580011

format utils.py

aceeb1d

utils: add further tests for replace_linear

e38c474

152334H mentioned this pull request Dec 22, 2023

8-bit optimizers dont work with FSDP #89

Open

github-actions bot closed this Dec 30, 2023

TimDettmers reopened this Jan 1, 2024

TimDettmers added High Priority (first issues that will be worked on) High Risk Risk of bugs in transformers and other libraries labels Jan 1, 2024

github-actions bot closed this Jan 10, 2024

younesbelkada reopened this Jan 10, 2024

warner-benjamin mentioned this pull request Jan 17, 2024

Initial FSDP Support for QLoRA Finetuning #970

Merged

Titus-von-Koeller closed this Mar 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

WIP: Integrate SpQR + FSDP functionalities #840

WIP: Integrate SpQR + FSDP functionalities #840

Uh oh!

Titus-von-Koeller commented Oct 23, 2023

Uh oh!

github-actions bot commented Dec 20, 2023

Uh oh!

Uh oh!

WIP: Integrate SpQR + FSDP functionalities #840

WIP: Integrate SpQR + FSDP functionalities #840

Uh oh!

Conversation

Titus-von-Koeller commented Oct 23, 2023

Uh oh!

github-actions bot commented Dec 20, 2023

Uh oh!

Uh oh!