• Home
  • Preprints
  • QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for …

QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities

Authors
Narayanaswami, Sai Kiran , Srinivasan, Gopalakrishnan , Ravindran, Balaraman
Preprint Server
arXiv

Sai Kiran Narayanaswami, Gopalakrishnan Srinivasan, Balaraman Ravindran. “QuAKE: Speeding up Model Inference Using Quick and Approximate Kernels for Exponential Non-Linearities”

Preprint link: https://arxiv.org/abs/2412.00408