Eh, robotics is going through explosive growth right now with the same computing power that's being used on LLMs. You can take human motion capture of a task, dump it in a robotics simulator for a few hours and get a model that can operate autonomously better than something that would have taken a half a year to teach just a few years back.
In the end, it'll probably require something like model-based RL like Yann LeCun talks about and that's totally different to the LLMs.