llama.cpp/convert_hf_to_gguf.py at 7ea75035b67f44c22ed7039967f718011fd35ce5

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-02 09:12:03 +00:00

Files

Bartowski 732b5fbf5e convert : avoid calls to tokenizer.added_tokens_decoder (#12473 )

tokenizer.added_tokens_decoder returns a fresh dict every time relatively slowly (~0.04s on average) which results in massive slowdowns when we have a huge number of added tokens

2025-03-20 08:36:37 +02:00

243 KiB

Executable File

Raw Blame History

View Raw

243 KiB Executable File Raw Blame History

243 KiB

Executable File

Raw Blame History