Fix flash attention

The latest FlashAttention-3 release changed the return shape of the varlen function to be consistent with FA2. This PR fixes the FA3 attention call accordingly, as done in https://github.com/Wan-Video/Wan2.2/pull/64.
Emanuele Bugliarello 2025-08-27 11:43:35 +02:00 committed by GitHub
parent 7c81b2f27d
commit ca23a2fc59


@@ -107,7 +107,7 @@ def flash_attention(
             max_seqlen_k=lk,
             softmax_scale=softmax_scale,
             causal=causal,
-            deterministic=deterministic)[0].unflatten(0, (b, lq))
+            deterministic=deterministic).unflatten(0, (b, lq))
     else:
         assert FLASH_ATTN_2_AVAILABLE
         x = flash_attn.flash_attn_varlen_func(
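
For codebases that need to run against both the old and the new FA3 return convention, a small compatibility shim along these lines could absorb the difference in return shape (a minimal sketch, not part of this commit; unwrap_varlen_output is a hypothetical helper name):

def unwrap_varlen_output(ret):
    # Older FA3 builds return a (output, softmax_lse) tuple from the
    # varlen function; newer builds return just the output tensor,
    # matching FA2. Accept either form.
    return ret[0] if isinstance(ret, tuple) else ret

# Usage at the call site shown in the diff above, e.g.:
#   x = unwrap_varlen_output(
#       flash_attn_interface.flash_attn_varlen_func(...)
#   ).unflatten(0, (b, lq))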