Pruning | Wadhwani School of Data Science and Artificial Intelligence

3rd Edition of "WSAI Annual Research Showcase 2026" is scheduled on May 18, 2026 — Click here to Register for the event

On the weak link between importance and prunability of attention heads

Publications

Given the success of Transformer-based models, two directions of study have emerged: interpreting role of individual attention heads and down-sizing the models for efficiency. Our work straddles these two streams: We …

Tags: NLP, Attention, BERT, Pruning