Authors
Narayanaswami, Sai Kiran; Srinivasan, Gopalakrishnan; Ravindran, Balaraman
Preprint Server
arXiv

Sai Kiran Narayanaswami, Gopalakrishnan Srinivasan, Balaraman Ravindran. “QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities.”

Preprint link: https://arxiv.org/abs/2412.00408