I think OpenAI was chilled out, and mostly lucky. Chilled in the sense that Goog...

ethbr1 · on June 18, 2024

> it had so much real, transformative value, that it spread out with zero marketing

IMHO, in retrospect, the failure gradient of early LLMs is underappreciated in driving adoption.

Windows 95 failure: blue screen with inscrutable error code. Everyone noticed that.

LLM failure: run-around non-answer (user shrugs and tries again) or confident and plausible incorrect answer (user doesn't recognize this without research).

Essentially, the ways in which LLMs didn't work were the most hidden and hardest to discover failure mode.

Which was perfectly tuned for the "I'm going to try this thing for 5 minutes and be amazed" first impression.

Which allowed subsequent generations to backfill the capability gaps.

Tl;dr - We shouldn't underappreciate quiet-failing as a product adoption driver.

FactKnower69 · on June 18, 2024

>We shouldn't underappreciate quiet-failing as a product adoption driver.

I'm certain it drives a lot of early user retention in the short term, but I feel strongly that this is ultimately a very myopic view which will prove catastrophic in the long term in much the same way that swallowing exceptions at runtime builds compounding technical debt you'll have to reckon with sooner or later

more broadly, there is just so much handwaving away all the black box parts of deep neural networks that are completely opaque and there seems to be very little interest in building the tooling to properly visualize, explore, and DEBUG latent space; until those priorities change this whole thing is a huge time bomb.

imagine if instead of coming with full memory dumps and diagnostic codes, BSODs just said "sorry, your computer had an oopsie!", and not a single engineer at Microsoft had a complete understanding of why the BSOD happened in the first place; sometimes it just does that! whoops!

ethbr1 · on June 18, 2024

> imagine if instead of coming with full memory dumps and diagnostic codes, BSODs just said "sorry, your computer had an oopsie!"

So, MacOS? ;)

In all seriousness, I wasn't opining on the usefulness of opaque/hidden errors, but rather the effectiveness of them.

In an alternate reality where the first LLMs instead spit back an error reference instead of English, I don't think we would have seen nearly as rapid mass market adoption.

And, not to put too fine a point on it, early conversational LLMs and image diffusion models were literally trained so their junk output is as plausible as possible.

cma · on June 18, 2024

GPT-3 was out for years already for Googlw to see, but with what OpenAI saw of it they began training an expensive GPT-4, before chatgpt success, maybe started before RLHF.