The size of an NLP model also has other unintended consequences. The more parameters a model has, the more power it consumes, the more it costs to run, and the longer each request takes to complete. There is always a sweet spot that balances speed, quality and cost.
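To make the scaling concrete, here is a rough back-of-envelope sketch in Python. It is not a benchmark. It assumes a dense model whose forward pass costs roughly 2 floating-point operations per parameter per token (a common rule of thumb), weights stored in 16-bit precision, and a hypothetical accelerator sustaining 1e14 operations per second. The function name `estimate_footprint` and all default values are illustrative assumptions, not figures from any particular model or device.

```python
def estimate_footprint(num_params, bytes_per_param=2, device_flops_per_sec=1e14):
    """Rough estimate of memory and compute for a dense model.

    Assumptions (illustrative only):
      - bytes_per_param=2 corresponds to 16-bit weights
      - a forward pass costs about 2 FLOPs per parameter per token
      - device_flops_per_sec is a hypothetical sustained throughput
    """
    # Memory needed just to hold the weights, in gigabytes (1 GB = 1e9 bytes).
    weight_gb = num_params * bytes_per_param / 1e9

    # Approximate compute needed to process one token.
    flops_per_token = 2 * num_params

    # Compute-only lower bound on time per token; real latency is higher
    # because of memory bandwidth, batching, and other overhead.
    seconds_per_token = flops_per_token / device_flops_per_sec

    return {
        "weight_gb": weight_gb,
        "flops_per_token": flops_per_token,
        "seconds_per_token": seconds_per_token,
    }


if __name__ == "__main__":
    # Compare a few hypothetical model sizes.
    for params in (100_000_000, 1_000_000_000, 10_000_000_000):
        est = estimate_footprint(params)
        print(params, est)
```

Under these assumptions every quantity grows linearly with the parameter count, so a model with ten times the parameters needs about ten times the memory and compute per token. Whether the added quality justifies that is the trade-off described above.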