llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-27 08:21:30 +00:00

Files

unbounded 5f939498d5 ggml : unit test for quantization functions (#953 )

* Unit test for quantization functions

Use the ggml_internal_get_quantize_fn function to loop through all
quantization formats and run a sanity check on the result.

Also add a microbenchmark that times these functions directly without
running the rest of the GGML graph.

* test-quantize-fns: CI fixes

Fix issues uncovered in CI
 - need to use sizes divisible by 32*8 for loop unrolling
 - use intrinsic header that should work on Mac

* test-quantize: remove

Per PR comment, subsumed by test-quantize-fns

* test-quantize: fix for q8_0 intermediates

2023-04-22 12:10:39 +03:00

CMakeLists.txt

ggml : unit test for quantization functions (#953 )

2023-04-22 12:10:39 +03:00

test-double-float.c

all : be more strict about converting float to double (#458 )

2023-03-28 19:48:20 +03:00

test-quantize-fns.cpp

ggml : unit test for quantization functions (#953 )

2023-04-22 12:10:39 +03:00

test-quantize-perf.cpp

ggml : unit test for quantization functions (#953 )

2023-04-22 12:10:39 +03:00

test-tokenizer-0.cpp

llama : well-defined static initialization of complex objects (#927 )

2023-04-17 17:41:53 +03:00