Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

baby-llama : rename llama_layer to baby_llama_layer examples
#9557 opened Sep 20, 2024 by danbev Loading…
2 of 4 tasks
Update CUDA graph on scale change plus clear nodes/params Nvidia GPU Issues specific to Nvidia GPUs
#9550 opened Sep 19, 2024 by agray3 Loading…
1 of 3 tasks
server: disable context shift examples server
#9544 opened Sep 19, 2024 by VJHack Draft
2 of 4 tasks
add solar pro support python python script changes
#9541 opened Sep 18, 2024 by mxyng Loading…
2 of 4 tasks
Implementations for Q4_0_8_8 quantization based functions - AVX512 version of ggml_gemm_q4_0_8x8_q8_0 ggml changes relating to the ggml tensor library for machine learning
#9532 opened Sep 18, 2024 by Srihari-mcw Loading…
2 of 4 tasks
musa: enable building fat binaries, enable VMM support, and disable Flash Attention on QY1 (MTT S80) Nvidia GPU Issues specific to Nvidia GPUs
#9526 opened Sep 18, 2024 by yeahdongcn Loading…
5 of 7 tasks
llama : add reranking support examples python python script changes server
#9510 opened Sep 16, 2024 by ggerganov Draft
3 tasks
Update clip.cpp examples
#9482 opened Sep 14, 2024 by Tejaakshaykumar Loading…
2 of 4 tasks
CI: Provide prebuilt windows binary for hip devops improvements to build systems and github actions
#9467 opened Sep 13, 2024 by no1wudi Loading…
2 of 4 tasks
RWKV v6: RWKV_WKV op CUDA implementation Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#9454 opened Sep 12, 2024 by MollySophia Loading…
4 tasks done
IBM Granite MoE Architecture python python script changes
#9438 opened Sep 11, 2024 by gabe-l-hart Loading…
3 of 5 tasks
vulkan : do not use tensor->extra ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#9407 opened Sep 10, 2024 by rgerganov Loading…
2 of 4 tasks
naming : normalize the name of callback-related identifiers Apple Metal https://en.wikipedia.org/wiki/Metal_(API) breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. examples ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#9405 opened Sep 10, 2024 by ggerganov Loading…
feat: support internvl examples python python script changes
#9403 opened Sep 10, 2024 by qlylangyu Loading…
2 of 4 tasks
imatrix : use GGUF to store importance matrices breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. enhancement New feature or request examples python python script changes refactoring Refactoring Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#9400 opened Sep 10, 2024 by compilade Draft
3 of 8 tasks
convert : refactor rope_freqs generation bugfix fixes an issue or bug python python script changes refactoring Refactoring Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#9396 opened Sep 10, 2024 by compilade Loading…
6 of 10 tasks
ggml: Add run-time detection of neon, i8mm and sve ggml changes relating to the ggml tensor library for machine learning
#9331 opened Sep 6, 2024 by eddnjjn Loading…
2 of 4 tasks
ggml/kompute: Add multi-GPU support ggml changes relating to the ggml tensor library for machine learning Kompute https://github.com/KomputeProject/kompute/
#9328 opened Sep 6, 2024 by flynnjiang Loading…
2 of 4 tasks
Enable use to the rebar feature to upload buffers to the device. ggml changes relating to the ggml tensor library for machine learning Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix Vulkan Issues specific to the Vulkan backend
#9251 opened Aug 30, 2024 by mtavenrath Loading…
2 of 4 tasks
metadata: Allow model creators to use modelcard datasets and base_model as a list of dicts examples python python script changes Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#9221 opened Aug 28, 2024 by mofosyne Draft
2 of 4 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.