Gradient Scaling: Enhancing Neural Network Training