Die size increases cost exponentially, by decreasing chips per wafer and decreas...

robkop · 2026-04-19T12:37:39 1776602259

You can ablate surprisingly large chunks of a model with near to no effect, you can try this easily - download an open weight model in torch.

Obviously it’s not ideal but you could likely have single digit % of all weights affected and still have a useful model (many caveats here: e.g. locality of damaged weights matters, distribution of errors matters, fail high/low matters, …)

hdndjsbbs · 2026-04-18T19:30:31 1776540631

I mean, you probably can just turn off defective parts of the network. You better believe if this becomes popular they would salvage yields by selling "dumber" chips at a discount.

vrighter · 2026-04-19T04:50:05 1776574205

except that if you do, you've just implemented a different model, with no way to tell which part of it is wrong

hdndjsbbs · 2026-04-26T12:53:08 1777207988

Could you tell that the original model was "right"?