2022-11-11T04:58:13+00:00 | 🔗
Bigger models somehow didn't do better (even very very big models and also modest models) in the end the model is fairly small in parameter count