tfa.layers.MultiHeadAttention | TensorFlow Addons
www.tensorflow.org › tfa › layersNov 15, 2021 · If the layer's call method takes a mask argument (as some Keras layers do), its default value will be set to the mask generated for inputs by the previous layer (if input did come from a layer that generated a corresponding mask, i.e. if it came from a Keras layer with masking support. If the layer is not built, the method will call build.