Which paper introduced the Transformer architecture?
Answer options
A
"Improving Language Understanding by Generative Models"
B
"Learning Deep Architectures"
C
"Attention Is All You Need"
D
"Neural Machine Translation"
E
"Mastering the Game of Go"
Correct answer: "Attention Is All You Need"
Explanation
The source marks the correct answer as: "Attention Is All You Need".