Sai Kiran Narayanaswami, Gopalakrishnan Srinivasan, Balaraman Ravindran. “QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities”
Preprint link: https://arxiv.org/abs/2412.00408