Johannes Gäßler
							
						 
					 | 
					
						
						
							
						
						dc685be466
					 | 
					
						
						
							
							CUDA: add FP32 FlashAttention vector kernel (#7188)
						
						
						
						
						
						
						
						* CUDA: add FP32 FlashAttention vector kernel
* fixup! CUDA: add FP32 FlashAttention vector kernel
* fixup! fixup! CUDA: add FP32 FlashAttention vector kernel
* fixup! fixup! fixup! CUDA: add FP32 FlashAttention vector kernel 
						
						
					 | 
					
						2024-05-12 19:40:45 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 |