Capgemini Interview Question

Transformer architecture, self attention machanism

Interview Answer

Anonymous

Jan 21, 2026

Transformer: Attention-based model for fast and parallel sequence processing. Self-Attention: Mechanism that helps words understand context by attending to other words.