Understanding Knowledge Distillation in Deep Learning