llama.cpp/convert_hf_to_gguf.py at a41139723d41aba243148ab8098a6c64aa7fce01

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-04 09:32:00 +00:00


Bartowski 732b5fbf5e convert : avoid calls to tokenizer.added_tokens_decoder (#12473)

tokenizer.added_tokens_decoder returns a fresh dict on every access, and does so relatively slowly (~0.04s on average), which causes massive slowdowns when there is a huge number of added tokens.

2025-03-20 08:36:37 +02:00
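
The slowdown described in the commit message can be illustrated with a minimal sketch. It does not reproduce the actual change in convert_hf_to_gguf.py: FakeTokenizer, collect_slow, and collect_fast are hypothetical names introduced here, and the stand-in property only mimics the "fresh dict on every access" behaviour stated above, using plain strings as values. The idea is to read the property once and reuse the result instead of reading it inside a loop.

```python
class FakeTokenizer:
    """Hypothetical stand-in for a tokenizer; not the transformers class."""

    def __init__(self, n_added):
        self._added = {i: f"<extra_{i}>" for i in range(n_added)}
        self.decoder_calls = 0  # counts how often the property is read

    @property
    def added_tokens_decoder(self):
        # Assumption for illustration: every access builds a new dict,
        # so the cost grows with the number of added tokens.
        self.decoder_calls += 1
        return dict(self._added)


def collect_slow(tokenizer, vocab_size):
    """Reads the property inside the loop: one rebuild per lookup."""
    out = []
    for i in range(vocab_size):
        if i in tokenizer.added_tokens_decoder:
            out.append(tokenizer.added_tokens_decoder[i])
    return out


def collect_fast(tokenizer, vocab_size):
    """Reads the property once and reuses the cached dict."""
    added = tokenizer.added_tokens_decoder  # single access
    out = []
    for i in range(vocab_size):
        if i in added:
            out.append(added[i])
    return out


if __name__ == "__main__":
    slow_tok = FakeTokenizer(n_added=100)
    fast_tok = FakeTokenizer(n_added=100)
    collect_slow(slow_tok, vocab_size=200)
    collect_fast(fast_tok, vocab_size=200)
    print("property reads (slow):", slow_tok.decoder_calls)
    print("property reads (fast):", fast_tok.decoder_calls)
```

Both functions return the same list; only the number of property reads differs, which is where the time goes when each read costs on the order of the ~0.04s quoted above.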

243 KiB

Executable File
