Posts tagged #Ai

Worth reading

A paper on pruning transformer attention heads during inference without retraining.