llama.cpp/convert-hf-to-gguf.py at 1debe72737ea131cb52975da3d53ed3a835df3a6

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-08 10:07:01 +00:00

Files

fairydreaming 9b82476ee9 Add missing inference support for GPTNeoXForCausalLM (Pythia and GPT-NeoX base models) (#7461 )

* convert-hf : add conversion of bloom-style qkv tensor to gpt-style qkv (code borrowed from BloomModel)

* llama : add inference support for LLM_ARCH_GPTNEOX

* llama : add model types for every Pythia variant and GPT-NeoX

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

2024-05-23 11:49:53 +02:00

113 KiB

Executable File

Raw Blame History

View Raw

113 KiB Executable File Raw Blame History

113 KiB

Executable File

Raw Blame History