llama.cpp/convert_hf_to_gguf.py at 3d82dbcbce2c677fc35fbf99574ccd107d95a1f8

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Files

Bartowski 732b5fbf5e convert : avoid calls to tokenizer.added_tokens_decoder (#12473 )

tokenizer.added_tokens_decoder returns a fresh dict every time relatively slowly (~0.04s on average) which results in massive slowdowns when we have a huge number of added tokens

2025-03-20 08:36:37 +02:00

243 KiB

Executable File

Raw Blame History

View Raw

243 KiB Executable File Raw Blame History

243 KiB

Executable File

Raw Blame History