expand attention_bias to (B, N, S, S) instead of (B, 1, S, S) for backwards compatibility with JS EP
· Sign up or log in to comment