You raise a great point. And the Amazon picking staff are onshore in wealthy cou...

whiplash451 · 2026-05-02T08:36:16 1777710976

Probably both vision and dexterity, and the first mistake we make as roboticists/engineers might be to treat the two as separate problems to solve, or to assume a solution exists where the two live separate lives.

https://rodneybrooks.com/why-todays-humanoids-wont-learn-dex...

grumbelbart · 2026-05-02T10:10:18 1777716618

Agreed. The solution will likely be some vision foundation model that directly sends controls to the robot ("move here, grab, move there"), trained by Amazon with RL to integrate collision avoidance, object detection, grasping point detection, grasp verification etc.

fluoridation · 2026-05-02T10:35:30 1777718130

If we're talking about picking objects at random from one bin and putting them in another, I don't need my eyes for that. Proprioception (shape and location) and touch (texture) are enough.

jagged-chisel · 2026-05-02T14:02:36 1777730556

But we still don’t have good sensors for this, so our robots try to rely on vision.

fluoridation · 2026-05-02T22:04:57 1777759497

You need those anyway to know how much grip to apply and how to hold on to the object. You can't determine that visually.

jagged-chisel · 2026-05-02T23:18:46 1777763926

Nah, your robot visually determines what it’s grabbing, then looks in a database for the best grip.
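
A minimal sketch of that lookup idea, assuming a vision model that outputs a text label per object. The labels, grip types, and force values below are all hypothetical placeholders, not taken from any real system:

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Grip:
    kind: str       # hypothetical grip type, e.g. "pinch" or "wrap"
    force_n: float  # grip force in newtons (made-up value)


# Hypothetical database keyed by the label a vision model would output.
GRIP_DB: Dict[str, Grip] = {
    "paperback_book": Grip("pinch", 8.0),
    "plastic_bottle": Grip("wrap", 5.0),
}


def best_grip(label: str, db: Dict[str, Grip] = GRIP_DB) -> Optional[Grip]:
    """Return the stored grip for a recognized object, or None if unknown."""
    return db.get(label)
```

An unrecognized label returns None here, which is the case where a pure lookup would need some fallback, such as the touch feedback discussed above.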