mirror of
				https://github.com/ggml-org/llama.cpp.git
				synced 2025-10-28 08:31:25 +00:00 
			
		
		
		
	mtmd : support InternVL 2.5 and 3 (#13422)
* convert : internvl support * InternVL3-1B working * fix regression * rm mobilevlm from test * fix conversion * add test for internvl * add to list of pre-quant * restore boi/eoi check * add clarify comment for norm eps
This commit is contained in:
		| @@ -252,6 +252,13 @@ int32_t mtmd_tokenize(mtmd_context * ctx, | ||||
|  | ||||
|     } | ||||
|  | ||||
|     else if (proj_type == PROJECTOR_TYPE_INTERNVL) { | ||||
|         // <img> ... (image embeddings) ... </img> | ||||
|         marker_modified = "<img>" + ctx->image_marker + "</img>"; | ||||
|         string_replace_all(prompt_modified, ctx->image_marker, marker_modified); | ||||
|  | ||||
|     } | ||||
|  | ||||
|     // llava-1.5, llava-1.6, Yi-VL, Yi-34B, granite: don't need to add prefix and suffix | ||||
|     // for glm-edge, BOI and EOI token's embeddings are not present in the text model | ||||
|  | ||||
|   | ||||
		Reference in New Issue
	
	Block a user
	 Xuan-Son Nguyen
					Xuan-Son Nguyen