Bigger is not better in machine learning
I heard an interview today featuring Jonathan Frankle, the Chief Scientist of MosaicML. One of the things he said is that bigger is not better. He was, of course, talking about machine learning model size (e.g., number of parameters).
I posit that we can agree that bigger models may not