Understanding Scaling Laws in Neural Language Models