mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-29 08:41:22 +00:00

Files

Aman Gupta 8a4a856277 Add LLaDA 8b Diffusion model (#14771)

* Add support for Llada-8b: diffusion model

* Add README

* Fix README and convert_hf_to_gguf

* convert_hf_to_gguf.py: address review comments

* Make everything in a single example

* Remove model-specific sampling

* Remove unused argmax

* Remove braced initializers, improve README.md a bit

* Add diffusion specific gguf params in set_vocab, remove setting rope_theta and rms_norm_eps

* Remove adding the mask token

* Move add_add_bos_token to set_vocab

* use add_bool in gguf_writer.py

2025-07-31 19:49:09 +08:00

CMakeLists.txt

Support diffusion models: Add Dream 7B (#14644)

2025-07-16 20:03:51 +08:00

diffusion-cli.cpp

Add LLaDA 8b Diffusion model (#14771)

2025-07-31 19:49:09 +08:00

README.md

Add LLaDA 8b Diffusion model (#14771)

2025-07-31 19:49:09 +08:00

README.md

Diffusion Text Generation

This directory contains implementations for diffusion LLMs (DLLMs).

More info: see the pull requests that added these models, #14644 (Dream 7B) and #14771 (LLaDA 8B).

Example of using the Dream architecture: llama-diffusion-cli -m dream7b.gguf -p "write code to train MNIST in pytorch" -ub 512 --diffusion-eps 0.001 --diffusion-algorithm 3 --diffusion-steps 256 --diffusion-visual

Example of using the LLaDA architecture: llama-diffusion-cli -m llada-8b.gguf -p "write code to train MNIST in pytorch" -ub 512 --diffusion-block-length 32 --diffusion-steps 256 --diffusion-visual
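
The two invocations above share most of their flags and differ only in the architecture-specific ones (`--diffusion-eps` and `--diffusion-algorithm` for Dream, `--diffusion-block-length` for LLaDA). The following is a minimal sketch of a helper that assembles either command line from the values shown in the examples. The names `build_command`, `ARCH_FLAGS`, `COMMON_HEAD` and `COMMON_TAIL` are hypothetical and not part of llama.cpp; the sketch only builds and prints the command and assumes `llama-diffusion-cli` would be found on your `PATH` if you ran it.

```python
import shlex

# Flags shared by both README examples, in the order they appear there.
COMMON_HEAD = ["-ub", "512"]
COMMON_TAIL = ["--diffusion-steps", "256", "--diffusion-visual"]

# Architecture-specific flags, copied verbatim from the README examples.
ARCH_FLAGS = {
    "dream": ["--diffusion-eps", "0.001", "--diffusion-algorithm", "3"],
    "llada": ["--diffusion-block-length", "32"],
}


def build_command(arch, model_path, prompt):
    """Return the argv list for llama-diffusion-cli (hypothetical helper)."""
    if arch not in ARCH_FLAGS:
        raise ValueError(f"unknown architecture: {arch}")
    return [
        "llama-diffusion-cli",
        "-m", model_path,
        "-p", prompt,
        *COMMON_HEAD,
        *ARCH_FLAGS[arch],
        *COMMON_TAIL,
    ]


if __name__ == "__main__":
    prompt = "write code to train MNIST in pytorch"
    for arch, model in [("dream", "dream7b.gguf"), ("llada", "llada-8b.gguf")]:
        # shlex.join quotes the prompt so the line can be pasted into a shell.
        print(shlex.join(build_command(arch, model, prompt)))
```

To actually run a command rather than print it, the returned list can be passed to `subprocess.run(cmd, check=True)`.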