> Is avoiding CF potentially just a matter of sheer scale ? My intuition would b...

		t-vi on Sept 7, 2023 \| parent \| context \| favorite \| on: Can LLMs learn from a single example? > Is avoiding CF potentially just a matter of sheer scale ? My intuition would be that you get more orthogonal directions to the gradient (of previous samples) if you have larger model.