Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
fix: 4171 - Model loading gets stuck on stop
1 parent a3849c4, commit 541a37b
Showing 8 changed files with 115 additions and 22 deletions.
@@ -1 +1 @@
-1.0.4-rc4
+1.0.4-rc5
extensions/inference-cortex-extension/resources/default_settings.json
58 changes: 42 additions & 16 deletions
@@ -1,33 +1,59 @@
 [
   {
-    "key": "test",
-    "title": "Test",
-    "description": "Test",
+    "key": "cont_batching",
+    "title": "Continuous batching",
+    "description": "The number of parallel operations",
+    "controllerType": "checkbox",
+    "controllerProps": {
+      "value": true
+    }
+  },
+  {
+    "key": "n_parallel",
+    "title": "Parallel operations",
+    "description": "The number of parallel operations",
     "controllerType": "input",
     "controllerProps": {
-      "placeholder": "Test",
-      "value": ""
+      "value": "4",
+      "placeholder": "4"
     }
   },
   {
-    "key": "embedding",
-    "title": "Embedding",
-    "description": "Whether to enable embedding.",
+    "key": "flash_attn",
+    "title": "Flash Attention enabled",
+    "description": "To enable Flash Attention, default is true",
     "controllerType": "checkbox",
     "controllerProps": {
       "value": true
     }
   },
+
   {
-    "key": "ctx_len",
-    "title": "Context Length",
-    "description": "The context length for model operations varies; the maximum depends on the specific model used.",
-    "controllerType": "slider",
+    "key": "caching_enabled",
+    "title": "Caching enabled",
+    "description": "To enable prompt caching or not",
+    "controllerType": "checkbox",
     "controllerProps": {
-      "min": 0,
-      "max": 4096,
-      "step": 128,
-      "value": 2048
+      "value": true
+    }
+  },
+  {
+    "key": "cache_type",
+    "title": "KV Cache Type",
+    "description": "KV cache type: f16, q8_0, q4_0, default is f16 (change this could break the model).",
+    "controllerType": "input",
+    "controllerProps": {
+      "placeholder": "f16",
+      "value": "f16"
+    }
+  },
+  {
+    "key": "use_mmap",
+    "title": "To enable mmap",
+    "description": "To enable mmap, default is true",
+    "controllerType": "checkbox",
+    "controllerProps": {
+      "value": true
     }
   }
 ]
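For reviewers who want to see how these entries might be consumed, here is a minimal TypeScript sketch that collapses a settings array of this shape into a key-to-default map. The `SettingEntry` interface and the `defaultValues` helper are hypothetical names inferred from the JSON in this diff; they are not taken from the extension's source, and the real loader may work differently. Note that `n_parallel` stores its default as the string `"4"`, so a consumer would likely need to parse it before use.

```typescript
// Hypothetical shape of one entry in default_settings.json,
// inferred only from the fields visible in this diff.
type SettingValue = boolean | string | number;

interface SettingEntry {
  key: string;
  title: string;
  description: string;
  controllerType: "checkbox" | "input" | "slider";
  controllerProps: { value: SettingValue; placeholder?: string };
}

// Hypothetical helper: map each setting key to its default value.
function defaultValues(settings: SettingEntry[]): Record<string, SettingValue> {
  const out: Record<string, SettingValue> = {};
  for (const s of settings) {
    out[s.key] = s.controllerProps.value;
  }
  return out;
}

// Three of the entries added by this commit, copied from the diff.
const sample: SettingEntry[] = [
  {
    key: "cont_batching",
    title: "Continuous batching",
    description: "The number of parallel operations",
    controllerType: "checkbox",
    controllerProps: { value: true },
  },
  {
    key: "n_parallel",
    title: "Parallel operations",
    description: "The number of parallel operations",
    controllerType: "input",
    controllerProps: { value: "4", placeholder: "4" },
  },
  {
    key: "cache_type",
    title: "KV Cache Type",
    description:
      "KV cache type: f16, q8_0, q4_0, default is f16 (change this could break the model).",
    controllerType: "input",
    controllerProps: { placeholder: "f16", value: "f16" },
  },
];

const defaults = defaultValues(sample);

// "input" controllers carry string values, so numeric settings need parsing.
const nParallel = Number.parseInt(String(defaults["n_parallel"]), 10);
```

Keeping the raw values untyped until the point of use mirrors the JSON as written; a stricter design would attach a parser per key.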