Performances of local models are pretty bad compared to what AI vendors offer, t...

jqpabc123 · 2026-06-06T14:01:15 1780754475

Dumb idea: how about we limit local models to specific domains, medicine for example?

Most doctors don't care much about engineering or accounting or software development or 10000 other things that big vendor models address.

This area has yet to be really explored. Nvidia aims to provide the hardware to do so.

CamperBob2 · 2026-06-06T17:59:50 1780768790

That's a fairly obvious idea, not dumb at all, but unfortunately it doesn't seem to pan out. Trying to specialize an LLM in one area harms its 'cognition' in all areas. For instance, if you train a coding model without all the Shakespeare and soap operas and Wikipedia and pirated Stephen King books and ancient Roman history and whatever, you end up with a worse coding model.

I'm not sure anyone really understands why.

jqpabc123 · 2026-06-07T12:27:26 1780835246

https://www.ibm.com/think/topics/domain-specific-llm

CamperBob2 · 2026-06-07T16:57:49 1780851469

The article is not backed up by reality. Why would anyone use anything but a domain-specific LLM, if they actually worked?

The author is probably confusing RAG with pretraining. You can RAG on PubMed but you can't arrive at a competitive model by pretraining solely on it.
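To make the RAG side of that distinction concrete, here is a rough sketch of the retrieval step: the model's weights stay untouched, and domain text is looked up at query time and pasted into the prompt. Everything in it is an assumption for illustration: the three toy sentences stand in for a real corpus such as PubMed, the bag-of-words cosine scoring stands in for a real embedding index, and `retrieve` and `build_prompt` are hypothetical helper names, not any library's API.

```python
import math
from collections import Counter

PUNCT = ".,;:?!()"

def tokenize(text):
    # Lowercase and strip surrounding punctuation; drop empty tokens.
    words = (w.strip(PUNCT).lower() for w in text.split())
    return [w for w in words if w]

def cosine(a, b):
    # Cosine similarity between two bag-of-words Counters.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, docs, k=1):
    # Rank documents by similarity to the query and keep the top k.
    q = Counter(tokenize(query))
    ranked = sorted(docs, key=lambda d: cosine(q, Counter(tokenize(d))), reverse=True)
    return ranked[:k]

def build_prompt(query, docs, k=1):
    # The retrieved text is prepended to the question; a general-purpose
    # model would then answer from this context (model call not shown).
    context = "\n".join(retrieve(query, docs, k))
    return f"Context:\n{context}\n\nQuestion: {query}"

# Toy placeholder corpus (assumption: stands in for a real document store).
DOCS = [
    "Metformin is a first-line treatment for type 2 diabetes.",
    "Amoxicillin is an antibiotic used for bacterial infections.",
    "Ibuprofen reduces inflammation and pain.",
]

if __name__ == "__main__":
    print(build_prompt("What is a first-line treatment for type 2 diabetes?", DOCS))
```

Pretraining solely on the corpus would instead mean learning all of the model's weights from those documents, which is the part claimed above not to produce a competitive model.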