* Add profiling
* More detailed profiling
* Rework command submission to avoid global locks (sketched below)
* Update wait handling
* Try new method of waiting on futures (sketched below)
* Add serializing of command submission in some cases
* Add new pool for timestamp queries and clean up logging (sketched below)
* Serialize command submission in CI and leave a TODO note
* Update webgpu CI
* Add myself as WebGPU codeowner
* Deadlock avoidance
* Leave WebGPU/Vulkan CI serialized
* Fix divide by 0
* Fix logic in division by inflight_threads (sketched below)
* Update CODEOWNERS and remove serialize submit option
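Reworking submission to avoid global locks while still serializing it "in some cases" (e.g. in CI) can be pictured as a mutex taken only behind a flag, leaving the common path lock-free. A minimal sketch, assuming the webgpu.h C API; `submit_ctx` and its fields are illustrative names, not the actual llama.cpp symbols, and per the last item the explicit serialize option was later removed again:

```cpp
#include <cstddef>
#include <mutex>

#include <webgpu/webgpu.h>

// Illustrative context; these are not the actual llama.cpp field names.
struct submit_ctx {
    std::mutex submit_mutex;       // taken only when serialization is enabled
    bool       serialize = false;  // e.g. turned on in CI to avoid driver races
};

static void submit(submit_ctx & ctx, WGPUQueue queue,
                   WGPUCommandBuffer const * bufs, size_t n) {
    if (ctx.serialize) {
        // Serialized path: one submission at a time.
        std::lock_guard<std::mutex> lock(ctx.submit_mutex);
        wgpuQueueSubmit(queue, n, bufs);
    } else {
        // Common path: submit without taking any global lock.
        wgpuQueueSubmit(queue, n, bufs);
    }
}
```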
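The "new method of waiting on futures" refers to the webgpu.h futures API, in which asynchronous operations such as wgpuQueueOnSubmittedWorkDone return a WGPUFuture that can be blocked on with wgpuInstanceWaitAny. A minimal sketch; the struct and enum names follow the upstream header, but that API is still evolving and may differ between Dawn versions:

```cpp
#include <cstdint>

#include <webgpu/webgpu.h>

// Block until a single future completes, e.g. one returned by
// wgpuQueueOnSubmittedWorkDone. Returns true on successful completion.
static bool wait_for(WGPUInstance instance, WGPUFuture future) {
    WGPUFutureWaitInfo info = {};
    info.future = future;
    WGPUWaitStatus status = wgpuInstanceWaitAny(instance, /*futureCount=*/1,
                                                &info, /*timeoutNS=*/UINT64_MAX);
    return status == WGPUWaitStatus_Success && info.completed;
}
```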
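A pool for timestamp queries can be built on a single WGPUQuerySet whose slots are handed out per profiled pass and later resolved into a buffer for readback. A sketch under that assumption; the helper name and capacity are hypothetical, and the device must expose the timestamp-query feature:

```cpp
#include <cstdint>

#include <webgpu/webgpu.h>

// Hypothetical helper: create a query set that acts as a pool of timestamp
// slots. The device must have been created with the timestamp-query feature.
static WGPUQuerySet create_timestamp_pool(WGPUDevice device, uint32_t capacity) {
    WGPUQuerySetDescriptor desc = {};
    desc.type  = WGPUQueryType_Timestamp;
    desc.count = capacity; // number of timestamp slots available to hand out
    return wgpuDeviceCreateQuerySet(device, &desc);
}
// Slots are written via a pass's timestampWrites and later copied out with
// wgpuCommandEncoderResolveQuerySet() into a buffer for CPU readback.
```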
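The divide-by-zero fix points at a divisor derived from a count of in-flight threads, which can momentarily be zero while no work is outstanding. One conventional guard is clamping the divisor to 1; a sketch with hypothetical names:

```cpp
#include <algorithm>
#include <atomic>
#include <cstdint>

// Hypothetical counter of threads with submitted-but-unfinished GPU work.
static std::atomic<uint32_t> inflight_threads{0};

// Clamp the divisor to at least 1 so a momentarily-zero thread count
// cannot cause a division by zero.
static uint32_t per_thread_share(uint32_t total_work) {
    return total_work / std::max<uint32_t>(inflight_threads.load(), 1);
}
```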