fix: masked fill and imports
- src/__init__.py +0 -0
- src/modules/__init__.py +0 -0
- src/modules/multihead_attention.py +1 -1
- src/utils/__init__.py +0 -0
src/__init__.py
ADDED
File without changes
src/modules/__init__.py
ADDED
File without changes
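The empty __init__.py files are presumably the "imports" half of the commit title: they mark src, src.modules, and src.utils as regular Python packages so that absolute imports of the attention module resolve. A minimal usage sketch (the import path is inferred from the file layout shown here, not confirmed elsewhere):

# Assumes the repository root is on sys.path / PYTHONPATH.
from src.modules.multihead_attention import MultiheadAttention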
src/modules/multihead_attention.py
CHANGED
@@ -124,7 +124,7 @@ class MultiheadAttention(nn.Module):
             # don't attend to padding symbols
             attn_weights = attn_weights.view(bsz, self.num_heads, tgt_len, src_len)
             attn_weights = attn_weights.float().masked_fill(
-                key_padding_mask.unsqueeze(1).unsqueeze(2),
+                key_padding_mask.unsqueeze(1).unsqueeze(2).bool(),
                 float('-inf'),
             ).type_as(attn_weights)  # FP16 support: cast to float and back
             attn_weights = attn_weights.view(bsz * self.num_heads, tgt_len, src_len)
src/utils/__init__.py
ADDED
File without changes
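Why the .bool() cast: newer PyTorch releases require the mask argument of masked_fill to be a boolean tensor, and passing a uint8 (byte) key padding mask either warns or errors outright depending on the version. A minimal, self-contained sketch of the patched masking step, with made-up shapes and a synthetic byte mask purely for illustration (none of these values come from the repository):

import torch

bsz, num_heads, tgt_len, src_len = 2, 4, 5, 6

# Illustrative inputs: FP16 attention scores and a uint8 key padding mask,
# with 1 marking padded source positions (the dtype older code often used).
attn_weights = torch.randn(bsz * num_heads, tgt_len, src_len).half()
key_padding_mask = torch.zeros(bsz, src_len, dtype=torch.uint8)
key_padding_mask[:, -1] = 1  # pretend the last source position is padding

# Same pattern as the patched line: broadcast the (bsz, src_len) mask over
# heads and target positions, cast it to bool so masked_fill accepts it,
# and do the fill in float32 before casting back for FP16 support.
attn_weights = attn_weights.view(bsz, num_heads, tgt_len, src_len)
attn_weights = attn_weights.float().masked_fill(
    key_padding_mask.unsqueeze(1).unsqueeze(2).bool(),
    float('-inf'),
).type_as(attn_weights)
attn_weights = attn_weights.view(bsz * num_heads, tgt_len, src_len)

print(attn_weights[0, :, -1])  # masked source column is -inf before any softmax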