llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-27 08:21:30 +00:00

Files

compilade d6bd4d46dd llama : support StableLM 2 1.6B (#5052)

* llama : support StableLM 2 1.6B

* convert : fix Qwen's set_vocab wrongly naming all special tokens [PAD{id}]

* convert : refactor Qwen's set_vocab to use it for StableLM 2 too

* nix : add tiktoken to llama-python-extra

* convert : use presence of tokenizer.json to determine StableLM tokenizer loader

It's a less arbitrary heuristic than the vocab size.

2024-01-22 13:21:52 +02:00

nix

llama : support StableLM 2 1.6B (#5052)

2024-01-22 13:21:52 +02:00

cloud-v-pipeline

ci : Cloud-V for RISC-V builds (#3160)

2023-09-15 11:06:56 +03:00

full-cuda.Dockerfile

python : add check-requirements.sh and GitHub workflow (#4585)

2023-12-29 16:50:29 +02:00

full-rocm.Dockerfile

python : add check-requirements.sh and GitHub workflow (#4585)