Interview questions at togetherAI

Implement multi-head attention in code, explain how to implement speculative decoding, etc.
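For anyone preparing for the speculative decoding part, here is a minimal sketch of the idea, not necessarily what the interviewers expect. It shows only the greedy-verification variant: a cheap draft model proposes `k` tokens, the target model checks them, and the longest agreeing prefix is kept. The full method uses rejection sampling over probabilities, which is not shown here. The two toy "models" (`target_next`, `draft_next`), `VOCAB`, and the parameter `k` are hypothetical stand-ins made up for illustration.

```python
from typing import Callable, List

# Hypothetical toy "models": each maps a token prefix to its greedy next token.
NextToken = Callable[[List[int]], int]

VOCAB = 10  # assumed toy vocabulary size


def target_next(prefix: List[int]) -> int:
    """Stand-in for the large, slow model (purely illustrative)."""
    return (sum(prefix) * 3 + len(prefix)) % VOCAB


def draft_next(prefix: List[int]) -> int:
    """Stand-in for the small, fast model: agrees with the target
    except when len(prefix) % 4 == 3, to force some rejections."""
    tok = target_next(prefix)
    return (tok + 1) % VOCAB if len(prefix) % 4 == 3 else tok


def greedy_decode(model: NextToken, prompt: List[int], n_new: int) -> List[int]:
    """Plain autoregressive greedy decoding, one model call per token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(model(out))
    return out


def speculative_decode(target: NextToken, draft: NextToken,
                       prompt: List[int], n_new: int, k: int = 4) -> List[int]:
    """Greedy speculative decoding: the output matches greedy_decode(target, ...)."""
    out = list(prompt)
    goal = len(prompt) + n_new
    while len(out) < goal:
        # 1) The draft model proposes k tokens autoregressively.
        proposal: List[int] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))

        # 2) The target model checks each proposed position. In a real system
        #    this is a single batched forward pass; here it is a plain loop.
        for i in range(k):
            expected = target(out + proposal[:i])
            if proposal[i] != expected:
                # First mismatch: keep the accepted prefix, then use the
                # target's own token in place of the rejected one.
                out.extend(proposal[:i])
                out.append(expected)
                break
        else:
            # All k tokens accepted: the target's pass also gives a bonus token.
            out.extend(proposal)
            out.append(target(out))
    return out[:goal]


if __name__ == "__main__":
    prompt = [1, 2]
    fast = speculative_decode(target_next, draft_next, prompt, n_new=12, k=4)
    slow = greedy_decode(target_next, prompt, n_new=12)
    print(fast == slow)
```

The point to make in an interview is that the target model's output is unchanged: the draft only changes how many target passes are needed, so the speedup depends on how often the draft agrees with the target.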