Question d’entretien chez Go Vivace

Why and when do we use multi-headed attention module in Natural Language Processing