This is backwards. Algorithms that can be parallelized are inherently superior, ...

Scene_Cast2 · 2025-10-25T03:12:36 1761361956

There are large classes of parallel workloads that GPUs can't run fast. Anything sparse (or even just shuffled) is one example. There are lots of architectures that are theoretically superior but aren't popular because they aren't GPU-friendly.

danielmarkbruce · 2025-10-25T03:27:31 1761362851

That’s not a flaw in parallelism. The mathematical reality remains that independent operations scale better than sequential ones. Even if we were stuck with current CPU designs, transformers would have won out over RNNs.
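To make the dependency argument concrete, here is a minimal sketch in plain Python. The `step` and `pointwise` functions are hypothetical stand-ins, not real RNN or transformer layers: the first forms a chain where each state needs the previous one, the second has no cross-position dependency and can be handed to a pool of workers in any order.

```python
import math
from concurrent.futures import ThreadPoolExecutor

def step(h, x):
    # Hypothetical recurrent update: the new state depends on the previous state.
    return math.tanh(0.5 * h + x)

def recurrent_scan(xs, h0=0.0):
    # Iteration t cannot start until iteration t-1 has produced h.
    h = h0
    states = []
    for x in xs:
        h = step(h, x)
        states.append(h)
    return states

def pointwise(x):
    # Hypothetical per-position operation with no dependency on other positions.
    return 2.0 * x + 1.0

xs = [1.0, 2.0, 3.0, 4.0]

# Sequential: a chain of len(xs) dependent steps.
sequential_states = recurrent_scan(xs)

# Independent: every position can be computed at the same time.
with ThreadPoolExecutor(max_workers=4) as pool:
    parallel_out = list(pool.map(pointwise, xs))
```

The thread pool is only there to show that the work can be split freely; it says nothing about actual speed on any particular hardware.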

Unless you are pushing back on my comment "all kinds" - if so, I meant "all kinds" in the way someone might say "there are all kinds of animals in the forest". It just means "lots of types".

Scene_Cast2 · 2025-10-25T12:18:23 1761394703

I was pushing back against "all kinds". The reason is that I've been seeing a number of inherently parallel architectures that existing GPUs handle poorly because of some aspect of them (usually the memory access pattern).
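A toy model of the memory access point, as a rough sketch only. It assumes 32 threads per warp, 128-byte memory segments, and 4-byte elements; these are illustrative assumptions, not figures from this thread, and the helper names are made up. It counts how many distinct segments one warp touches when reading contiguous indices versus shuffled ones. The amount of parallel work is identical in both cases; only the access pattern changes.

```python
import random

WARP_SIZE = 32        # assumed threads per warp
SEGMENT_BYTES = 128   # assumed size of one memory transaction
ELEM_BYTES = 4        # assumed element size (e.g. float32)

def segments_touched(indices):
    # Number of distinct memory segments needed to serve one warp's reads.
    return len({(i * ELEM_BYTES) // SEGMENT_BYTES for i in indices})

def avg_segments_per_warp(indices):
    # Split the index stream into warps and average the segment count.
    warps = [indices[i:i + WARP_SIZE] for i in range(0, len(indices), WARP_SIZE)]
    return sum(segments_touched(w) for w in warps) / len(warps)

n = 1 << 16
contiguous = list(range(n))
shuffled = contiguous[:]
random.Random(0).shuffle(shuffled)  # fixed seed for repeatability

contiguous_cost = avg_segments_per_warp(contiguous)  # one segment per warp
shuffled_cost = avg_segments_per_warp(shuffled)      # close to one per thread
```

Under these assumptions the contiguous case needs one segment per warp, while the shuffled case needs nearly one per thread, even though both do exactly the same number of independent reads.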

danielmarkbruce · 2025-10-25T15:40:15 1761406815

yeah, bad writing on my part.