File tree
8 files changed
+198
-10
lines changed- docs/source/features/quantization
- vllm
- engine
- model_executor
- layers
- fused_moe
- quantization
- model_loader
8 files changed
+198
-10
lines changedLines changed: 7 additions & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
29 | 29 |
| |
30 | 30 |
| |
31 | 31 |
| |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
32 | 39 |
| |
33 | 40 |
| |
34 | 41 |
| |
|
Lines changed: 6 additions & 3 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
229 | 229 |
| |
230 | 230 |
| |
231 | 231 |
| |
| 232 | + | |
232 | 233 |
| |
233 | 234 |
| |
234 | 235 |
| |
| |||
259 | 260 |
| |
260 | 261 |
| |
261 | 262 |
| |
| 263 | + | |
262 | 264 |
| |
263 | 265 |
| |
264 | 266 |
| |
| |||
321 | 323 |
| |
322 | 324 |
| |
323 | 325 |
| |
324 |
| - | |
325 |
| - | |
| 326 | + | |
| 327 | + | |
| 328 | + | |
326 | 329 |
| |
327 | 330 |
| |
328 | 331 |
| |
| |||
947 | 950 |
| |
948 | 951 |
| |
949 | 952 |
| |
950 |
| - | |
| 953 | + | |
951 | 954 |
| |
952 | 955 |
| |
953 | 956 |
| |
|
Lines changed: 8 additions & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
93 | 93 |
| |
94 | 94 |
| |
95 | 95 |
| |
| 96 | + | |
96 | 97 |
| |
97 | 98 |
| |
98 | 99 |
| |
| |||
262 | 263 |
| |
263 | 264 |
| |
264 | 265 |
| |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
265 | 272 |
| |
266 | 273 |
| |
267 | 274 |
| |
| |||
1076 | 1083 |
| |
1077 | 1084 |
| |
1078 | 1085 |
| |
| 1086 | + | |
1079 | 1087 |
| |
1080 | 1088 |
| |
1081 | 1089 |
| |
|
Lines changed: 21 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
5 | 5 |
| |
6 | 6 |
| |
7 | 7 |
| |
| 8 | + | |
8 | 9 |
| |
9 | 10 |
| |
10 | 11 |
| |
| |||
514 | 515 |
| |
515 | 516 |
| |
516 | 517 |
| |
517 |
| - | |
| 518 | + | |
| 519 | + | |
| 520 | + | |
| 521 | + | |
| 522 | + | |
| 523 | + | |
518 | 524 |
| |
519 | 525 |
| |
520 | 526 |
| |
| |||
524 | 530 |
| |
525 | 531 |
| |
526 | 532 |
| |
| 533 | + | |
| 534 | + | |
| 535 | + | |
| 536 | + | |
| 537 | + | |
| 538 | + | |
| 539 | + | |
| 540 | + | |
| 541 | + | |
| 542 | + | |
| 543 | + | |
| 544 | + | |
| 545 | + | |
| 546 | + | |
527 | 547 |
| |
528 | 548 |
| |
529 | 549 |
| |
|
Lines changed: 14 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
235 | 235 |
| |
236 | 236 |
| |
237 | 237 |
| |
| 238 | + | |
| 239 | + | |
| 240 | + | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
| 245 | + | |
| 246 | + | |
| 247 | + | |
| 248 | + | |
238 | 249 |
| |
239 | 250 |
| |
240 | 251 |
| |
241 |
| - | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
242 | 255 |
| |
243 | 256 |
| |
244 | 257 |
| |
|
Lines changed: 125 additions & 2 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
2 | 2 |
| |
3 |
| - | |
| 3 | + | |
4 | 4 |
| |
5 | 5 |
| |
6 | 6 |
| |
7 | 7 |
| |
8 | 8 |
| |
9 | 9 |
| |
10 | 10 |
| |
| 11 | + | |
| 12 | + | |
| 13 | + | |
11 | 14 |
| |
12 | 15 |
| |
13 | 16 |
| |
| |||
29 | 32 |
| |
30 | 33 |
| |
31 | 34 |
| |
32 |
| - | |
| 35 | + | |
33 | 36 |
| |
34 | 37 |
| |
35 | 38 |
| |
| |||
49 | 52 |
| |
50 | 53 |
| |
51 | 54 |
| |
| 55 | + | |
| 56 | + | |
52 | 57 |
| |
53 | 58 |
| |
54 | 59 |
| |
| |||
184 | 189 |
| |
185 | 190 |
| |
186 | 191 |
| |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
| 221 | + | |
| 222 | + | |
| 223 | + | |
| 224 | + | |
| 225 | + | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
| 229 | + | |
| 230 | + | |
| 231 | + | |
| 232 | + | |
| 233 | + | |
| 234 | + | |
| 235 | + | |
| 236 | + | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
| 240 | + | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
| 245 | + | |
| 246 | + | |
| 247 | + | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
| 256 | + | |
| 257 | + | |
| 258 | + | |
| 259 | + | |
| 260 | + | |
| 261 | + | |
| 262 | + | |
| 263 | + | |
| 264 | + | |
| 265 | + | |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
| 279 | + | |
| 280 | + | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
| 286 | + | |
| 287 | + | |
| 288 | + | |
| 289 | + | |
| 290 | + | |
| 291 | + | |
| 292 | + | |
| 293 | + | |
| 294 | + | |
| 295 | + | |
| 296 | + | |
| 297 | + | |
| 298 | + | |
| 299 | + | |
| 300 | + | |
| 301 | + | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
| 306 | + | |
| 307 | + | |
| 308 | + | |
| 309 | + | |
187 | 310 |
| |
188 | 311 |
| |
189 | 312 |
| |
|
Lines changed: 17 additions & 2 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1245 | 1245 |
| |
1246 | 1246 |
| |
1247 | 1247 |
| |
| 1248 | + | |
1248 | 1249 |
| |
1249 | 1250 |
| |
1250 | 1251 |
| |
| 1252 | + | |
| 1253 | + | |
| 1254 | + | |
| 1255 | + | |
| 1256 | + | |
| 1257 | + | |
| 1258 | + | |
| 1259 | + | |
| 1260 | + | |
| 1261 | + | |
| 1262 | + | |
| 1263 | + | |
| 1264 | + | |
| 1265 | + | |
1251 | 1266 |
| |
1252 | 1267 |
| |
1253 | 1268 |
| |
| |||
1258 | 1273 |
| |
1259 | 1274 |
| |
1260 | 1275 |
| |
1261 |
| - | |
| 1276 | + | |
| 1277 | + | |
1262 | 1278 |
| |
1263 | 1279 |
| |
1264 |
| - | |
1265 | 1280 |
| |
1266 | 1281 |
| |
1267 | 1282 |
| |
|
Lines changed: 0 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
496 | 496 |
| |
497 | 497 |
| |
498 | 498 |
| |
499 |
| - | |
500 | 499 |
| |
501 | 500 |
| |
502 | 501 |
| |
|
0 commit comments