Paper Review/Knowledge Distillation
This is a review of "Asymmetric Temperature Scaling Makes Larger Networks Teach Well Again", presented at NeurIPS 2022.