mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-10-31 08:51:55 +00:00
6e84b0ab8e
Implemented F16 src1 (mask) support in ggml_sycl_op_soft_max(), for which a pragma deprecation warning was added in #5021. This required decoupling it from ggml_sycl_op_flatten, which always assumes src1 is of type fp32 (many OP functions depend on that assumption).

* SYCL: SOFTMAX F16 mask support and other fixes
* test-backend-ops: Add F16 mask test cases
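To illustrate why the mask type has to be decoupled from a fixed fp32 assumption, here is a minimal CPU-side sketch of a soft max over one row that is templated on the mask element type. It is not the SYCL kernel from this commit: `half_bits`, `to_float` and `soft_max_row` are hypothetical names chosen for the example, the F16 value is modeled as raw IEEE 754 binary16 bits, and only a scale factor and an additive mask are modeled.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <limits>
#include <vector>

// Hypothetical storage type: raw IEEE 754 binary16 bits (an assumption for
// this sketch, not a ggml or SYCL type).
struct half_bits { uint16_t bits; };

// fp32 masks need no conversion.
inline float to_float(float v) { return v; }

// Convert binary16 bits to fp32 (handles zero, subnormals, inf and NaN).
inline float to_float(half_bits h) {
    const uint32_t sign = (uint32_t(h.bits) & 0x8000u) << 16;
    const uint32_t exp  = (h.bits >> 10) & 0x1Fu;
    const uint32_t man  = h.bits & 0x3FFu;
    uint32_t out;
    if (exp == 0) {
        // Zero or subnormal: value = man * 2^-24, with the sign applied.
        const float f = std::ldexp(float(man), -24);
        return sign ? -f : f;
    } else if (exp == 31) {
        out = sign | 0x7F800000u | (man << 13);          // inf or NaN
    } else {
        out = sign | ((exp + 112u) << 23) | (man << 13); // rebias 15 -> 127
    }
    float f;
    std::memcpy(&f, &out, sizeof f);
    return f;
}

// Soft max over one row: y = softmax(x * scale + mask).
// mask_t may be float or half_bits; mask may be nullptr (no mask).
template <typename mask_t>
std::vector<float> soft_max_row(const std::vector<float> & x,
                                const mask_t * mask, float scale) {
    assert(!x.empty()); // precondition of this sketch: non-empty row
    std::vector<float> y(x.size());
    float max_val = -std::numeric_limits<float>::infinity();
    for (std::size_t i = 0; i < x.size(); ++i) {
        y[i] = x[i] * scale + (mask ? to_float(mask[i]) : 0.0f);
        max_val = std::max(max_val, y[i]);
    }
    float sum = 0.0f;
    for (float & v : y) {
        v = std::exp(v - max_val); // subtract the max for numerical stability
        sum += v;
    }
    for (float & v : y) {
        v /= sum;
    }
    return y;
}
```

Because the mask is converted element by element through the `to_float` overloads, the same row routine serves both mask types. A helper that hard-codes a `float *` for src1 could not be reused this way, which is the kind of coupling the commit message describes.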
		
			
				
	
	
	
		
162 KiB