gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

* gguf-py: Refactor and add file reading support

* Replay changes from #3871

Credit to @cebtenzzre for that pull

* Various type annotation fixes.

* sort imports with isort (again)

* Fix missing return statement in add_tensor

* style cleanup with flake8

* fix NamedTuple and Enum usage

* Fix an issue with state init in GGUFReader

Move examples to an examples/ directory

Clean up examples

Add an example of modifying keys in a GGUF file

Update documentation with info on examples

Try to support people importing gguf/gguf.py directly

* Damagage is not a word.

* Clean up gguf-py/examples/modify_gguf.py whitespace

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

* Update gguf-py/examples/modify_gguf.py formatting

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

* Update gguf-py/gguf/gguf_reader.py type hint

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

* Make examples executable, formatting changes

* Add more information to GGUFReader and examples comments

* Include a gguf Python package version bump

* Add convert-gguf-endian.py script

* cleanup

* gguf-py : bump minor version

* Reorganize scripts

* Make GGUFReader endian detection less arbitrary

* Add JSON dumping support to gguf-dump.py

Which I kind of regret now

* A few for gguf-dump.py cleanups

* Murder accidental tuple in gguf-py/scripts/gguf-dump.py

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

* cleanup

* constants : remove unneeded type annotations

* fix python 3.8 compat

* Set up gguf- scripts in pyproject.toml

* And include scripts/__init__.py, derp

* convert.py: We can't currently support Q8_0 on big endian.

* gguf-py: SpecialVocab: Always try available sources for special token ids

gguf-py: SpecialVocab: Try to load merges from merges.txt if not in tokenizer.json

gguf-py: SpecialVocab: Add 'add_bos_token' type bools to GGUF metadata
u

* cleanup

* Promote add_X_token to GGUF metadata for BOS and EOS

---------

Co-authored-by: Jared Van Bortel <jared@nomic.ai>
Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

This commit is contained in:

Kerfuffle

2023-11-10 22:04:50 -07:00

committed by

GitHub

parent 4a4fd3eefa

commit 34b0a08207

20 changed files with 1982 additions and 1176 deletions

									
										12

gguf-py/scripts/__init__.py
									
										Normal file
									
												View File
												
				@@ -0,0 +1,12 @@

				import os

				from importlib import import_module

				os.environ["NO_LOCAL_GGUF"] = "TRUE"

				gguf_convert_endian_entrypoint = import_module("scripts.gguf-convert-endian").main

				gguf_dump_entrypoint           = import_module("scripts.gguf-dump").main

				gguf_set_metadata_entrypoint   = import_module("scripts.gguf-set-metadata").main

				del import_module, os

gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)

12 gguf-py/scripts/__init__.py Normal file Unescape Escape View File

12

gguf-py/scripts/init.py Normal file

View File