mirror of
				https://github.com/ggml-org/llama.cpp.git
				synced 2025-11-02 09:12:03 +00:00 
			
		
		
		
	* model-conversion: add model card template for embeddings [no ci] This commit adds a separate model card template (model repository README.md template) for embedding models. The motivation for this is that there server command for the embedding model is a little different and some addition information can be useful in the model card for embedding models which might not be directly relevant for causal models. * squash! model-conversion: add model card template for embeddings [no ci] Fix pyright lint error. * remove --pooling override and clarify embd_normalize usage
		
			
				
	
	
		
			14 lines
		
	
	
		
			194 B
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
			
		
		
	
	
			14 lines
		
	
	
		
			194 B
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
---
 | 
						|
base_model:
 | 
						|
- {base_model}
 | 
						|
---
 | 
						|
# {model_name} GGUF
 | 
						|
 | 
						|
Recommended way to run this model:
 | 
						|
 | 
						|
```sh
 | 
						|
llama-server -hf {namespace}/{model_name}-GGUF -c 0 -fa
 | 
						|
```
 | 
						|
 | 
						|
Then, access http://localhost:8080
 |