Transformer encoder blocks include:
Answer options
A. Self-attention + feed-forward layers
B. Only LSTM cells
C. Only convolutional layers
D. No nonlinearities
Correct answer: Self-attention + feed-forward layers
Explanation
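Each Transformer encoder block combines a multi-head self-attention sublayer with a position-wise feed-forward network, and each sublayer is wrapped in a residual connection followed by layer normalization. The block contains no LSTM cells or convolutional layers, and the feed-forward network applies a nonlinear activation (ReLU in the original design), so options B, C, and D are incorrect.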
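
To make the structure concrete, here is a minimal sketch of one encoder block in plain Python. It is an illustration under simplifying assumptions, not a reference implementation: it uses a single attention head instead of multi-head attention, omits biases and the learnable layer-norm scale and shift, and fills the weights with random numbers. All names (encoder_block, params, d_model, d_ff) are hypothetical and chosen only for this example.

```python
import math
import random

def matmul(a, b):
    # Plain matrix product of two lists of rows.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    # Numerically stable softmax over one row of scores.
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def layer_norm(x, eps=1e-5):
    # Normalize each token vector to zero mean and unit variance
    # (assumption: no learnable gain or bias).
    out = []
    for row in x:
        mean = sum(row) / len(row)
        var = sum((v - mean) ** 2 for v in row) / len(row)
        out.append([(v - mean) / math.sqrt(var + eps) for v in row])
    return out

def add(a, b):
    # Element-wise sum, used for the residual connections.
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def self_attention(x, wq, wk, wv):
    # Single-head scaled dot-product attention: softmax(QK^T / sqrt(d_k)) V.
    q, k, v = matmul(x, wq), matmul(x, wk), matmul(x, wv)
    d_k = len(q[0])
    k_t = [list(col) for col in zip(*k)]
    scores = matmul(q, k_t)
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v)

def feed_forward(x, w1, w2):
    # Position-wise feed-forward network with a ReLU nonlinearity.
    hidden = [[max(0.0, h) for h in row] for row in matmul(x, w1)]
    return matmul(hidden, w2)

def encoder_block(x, p):
    # Sublayer 1: self-attention, then residual connection and layer norm.
    x = layer_norm(add(x, self_attention(x, p["wq"], p["wk"], p["wv"])))
    # Sublayer 2: feed-forward, then residual connection and layer norm.
    x = layer_norm(add(x, feed_forward(x, p["w1"], p["w2"])))
    return x

def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
d_model, d_ff, seq_len = 4, 8, 3  # toy sizes, chosen arbitrarily
params = {
    "wq": random_matrix(d_model, d_model, rng),
    "wk": random_matrix(d_model, d_model, rng),
    "wv": random_matrix(d_model, d_model, rng),
    "w1": random_matrix(d_model, d_ff, rng),
    "w2": random_matrix(d_ff, d_model, rng),
}
tokens = random_matrix(seq_len, d_model, rng)  # stand-in for embedded tokens
output = encoder_block(tokens, params)
```

The output has the same shape as the input (one d_model-sized vector per token), which is what allows encoder blocks to be stacked. The sketch places layer normalization after each residual addition; some implementations normalize before each sublayer instead.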