llama.cpp Web UI
A modern, feature-rich web interface for llama.cpp built with SvelteKit. This UI provides an intuitive chat interface with advanced file handling, conversation management, and comprehensive model interaction capabilities.
Features
- Modern Chat Interface - Clean, responsive design with dark/light mode
- File Attachments - Support for images, text files, PDFs, and audio with rich previews and drag-and-drop support
- Conversation Management - Create, edit, branch, and search conversations
- Advanced Markdown - Code highlighting, math formulas (KaTeX), and content blocks
- Reasoning Content - Support for models with thinking blocks
- Keyboard Shortcuts - Keyboard navigation (Shift+Ctrl/Cmd+O for new chat, Shift+Ctrl/Cmd+E for edit conversation, Shift+Ctrl/Cmd+D for delete conversation, Ctrl/Cmd+K for search, Ctrl/Cmd+V for paste, Ctrl/Cmd+B for opening/collapsing the sidebar)
- Request Tracking - Monitor processing with slots endpoint integration (see the sketch after this list)
- UI Testing - Storybook component library with automated tests
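The request-tracking feature builds on the server's slots endpoint. Below is a minimal TypeScript sketch of how such a check could look; it is not the actual store implementation, and the response fields (`id`, `is_processing`) and helper names are assumptions, so check your llama-server version for the exact shape and whether the endpoint needs to be enabled with a server flag.

```typescript
// Sketch only: poll the llama-server /slots endpoint to see whether a
// generation request is still being processed. Field names below are
// assumptions for illustration.
interface SlotState {
	id: number;
	is_processing: boolean;
}

export async function fetchSlots(baseUrl = ''): Promise<SlotState[]> {
	const response = await fetch(`${baseUrl}/slots`);
	if (!response.ok) {
		throw new Error(`Failed to fetch slots: ${response.status}`);
	}
	return (await response.json()) as SlotState[];
}

// Example usage: report whether any slot is currently busy.
export async function anySlotBusy(baseUrl = ''): Promise<boolean> {
	const slots = await fetchSlots(baseUrl);
	return slots.some((slot) => slot.is_processing);
}
```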
Development
Install dependencies:
npm install
Start the development server + Storybook:
npm run dev
This starts both the SvelteKit dev server and Storybook, with Storybook served on port 6006.
Building
Create a production build:
npm run build
The build outputs static files to the ../public directory for deployment with the llama.cpp server.
Testing
Run the test suite:
# E2E tests
npm run test:e2e
# Unit tests
npm run test:unit
# UI tests
npm run test:ui
# All tests
npm run test
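For orientation, here is a minimal sketch of what a Playwright E2E test for the chat flow might look like. The route, selectors, and assertions are assumptions for illustration, not the project's actual test code.

```typescript
// Hypothetical Playwright E2E sketch; selectors are assumptions.
import { test, expect } from '@playwright/test';

test('user can send a message and receive a reply', async ({ page }) => {
	await page.goto('/');

	// Type a prompt into the chat input (role-based selector is an assumption).
	const input = page.getByRole('textbox');
	await input.fill('Hello!');
	await input.press('Enter');

	// Expect an assistant message to appear once streaming finishes.
	await expect(page.locator('.chat-message').last()).toBeVisible();
});
```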
Architecture
- Framework: SvelteKit with Svelte 5 runes
- Components: ShadCN UI + bits-ui design system
- Database: IndexedDB with Dexie for local storage (see the sketch after this list)
- Build: Static adapter for deployment with llama.cpp server
- Testing: Playwright (E2E) + Vitest (unit) + Storybook (components)
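To illustrate the Dexie-backed local storage, the sketch below shows one way conversations and messages could be modeled. The database name, table names, and fields are assumptions for illustration; the actual schema lives in the webui source.

```typescript
// Illustrative Dexie schema sketch; names and fields are assumptions.
import Dexie, { type Table } from 'dexie';

interface Conversation {
	id: string;
	title: string;
	updatedAt: number;
}

interface Message {
	id: string;
	conversationId: string;
	role: 'user' | 'assistant';
	content: string;
}

class ChatDatabase extends Dexie {
	conversations!: Table<Conversation, string>;
	messages!: Table<Message, string>;

	constructor() {
		super('llamacpp-webui-example');
		// Only indexed fields are listed; full objects are stored regardless.
		this.version(1).stores({
			conversations: 'id, updatedAt',
			messages: 'id, conversationId'
		});
	}
}

export const db = new ChatDatabase();

// Example usage: list conversations, most recently updated first.
export function listConversations(): Promise<Conversation[]> {
	return db.conversations.orderBy('updatedAt').reverse().toArray();
}
```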