Optimizing Your Model for Inference with PyTorch Quantization
Editor’s Note: Jerry is a speaker for ODSC East 2022. Be sure to check out his talk, “Quantization in PyTorch,” to learn more about PyTorch quantization! Quantization is a common technique that people use to make their model run faster, with lower memory footprint and lower... Read more