Question d’entretien chez TikTok

Why LLM uses Layer Normalization not Batch Normalization