Skip to content

Commit 82d138f

Browse files
committed
Fix: default repeat_penalty
1 parent a3cc7bf commit 82d138f

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

Diff for: llama_cpp/server/app.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -146,7 +146,7 @@ def get_llama():
146146
)
147147

148148
repeat_penalty_field = Field(
149-
default=0.0,
149+
default=1.1,
150150
ge=0.0,
151151
description="A penalty applied to each token that is already generated. This helps prevent the model from repeating itself.\n\n"
152152
+ "Repeat penalty is a hyperparameter used to penalize the repetition of token sequences during text generation. It helps prevent the model from generating repetitive or monotonous text. A higher value (e.g., 1.5) will penalize repetitions more strongly, while a lower value (e.g., 0.9) will be more lenient.",

0 commit comments

Comments
 (0)