add lora finetuning + example #113
base: main
Conversation
Force block size to max supported by model
@orangetin How can I merge the LoRA model into the base model?
@madroidmaq What do you mean by this? If you're talking about loading the LoRA model for inference, see this: https://github.com/togethercomputer/OpenChatKit/blob/main/training/lora/example/redpajama-incite-chat-3b_inference.py
@orangetin I am attempting to run the RedPajama-3B model on a mobile device, currently using some logic from the MLC-LLM project, and it's working well so far. However, I would like to fine-tune the model on some of my private data. I have used LoRA for fine-tuning and have seen some results, which has been a smooth process up to this point.

The issue I am encountering is that I need to merge the LoRA model with the RedPajama-3B model, because the MLC-LLM project currently only supports loading a single model file. The inference logic in the current example works well on a PC, but it does not run on a mobile device. My proposed solution is to merge the two models. This approach is feasible in the Chinese-LLaMA-Alpaca project, which internally uses the merge_and_unload() function from the peft library.

As a beginner in AI, I have attempted this part but have not been successful. I would greatly appreciate it if you could add support for the merge functionality. Thank you very much.
@madroidmaq See this comment for an example of merging the LoRA model with the base model: #127 (comment)
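For context, peft's merge_and_unload() works by folding each low-rank adapter update into the corresponding frozen base weight, leaving a single plain model with no adapter layers, which is why the result can be loaded as one model file. A minimal sketch of that arithmetic (illustrative shapes and names; W, A, B, alpha, and r are placeholders, not actual peft internals):

```python
import numpy as np

# LoRA replaces a frozen weight W with W + (alpha / r) * B @ A,
# where A (r x k) and B (d x r) are the trained low-rank factors.
# merge_and_unload() folds that update into W so no adapter
# layers remain at inference time.
rng = np.random.default_rng(0)
d, k, r, alpha = 8, 8, 2, 4

W = rng.standard_normal((d, k))   # frozen base weight
A = rng.standard_normal((r, k))   # LoRA down-projection
B = rng.standard_normal((d, r))   # LoRA up-projection
scale = alpha / r

# The merge: one dense weight replaces base + adapter.
W_merged = W + scale * (B @ A)

# Forward passes agree: merged weight == base + adapter path.
x = rng.standard_normal(k)
y_adapter = W @ x + scale * (B @ (A @ x))
y_merged = W_merged @ x
assert np.allclose(y_merged, y_adapter)
```

In peft itself this is typically done by loading the adapter with PeftModel.from_pretrained() on top of the base model and calling merge_and_unload(), then saving the returned model; see the linked comment in #127 for the actual script.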
Thank you very much, the model merged correctly. I turned the above code into PR #136 so that others with similar needs can use it directly.
Todo: