This website requires JavaScript.
Explore
Help
Sign In
CS348Project
/
llama.cpp
Watch
5
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2025-11-14 11:07:10 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
0f7e8f389d21482470ffa38fc067fee973f2d7b0
llama.cpp
/
ggml
/
src
/
ggml-cuda
/
quantize.cuh
Johannes Gäßler
808aba3916
CUDA: optimize and refactor MMQ (
#8416
)
...
* CUDA: optimize and refactor MMQ * explicit q8_1 memory layouts, add documentation
2024-07-11 16:47:47 +02:00
979 B
Raw
Blame
History
View Raw
Reference in New Issue
View Git Blame
Copy Permalink