CS348Project/llama.cpp
Mirror of https://github.com/ggml-org/llama.cpp.git (synced 2025-11-09 10:17:06 +00:00)
48b73b849880ff43f0dd818252cf00cea5a83061
llama.cpp/ggml/src/ggml-quants.c
Francis Couture-Harpin
48b73b8498
ggml-quants : substract 1 when back in epi8
This makes the 1.625 bpw type go faster than q4_0. Still not the fastest.
2024-06-27 02:06:28 -04:00
657 KiB