It's also a good play to try to take resources away from local, self-hosted "Fea...

dhruvdh · on Feb 26, 2024

It's not like Microsoft is working on "Windows AI Studio" [1], or released Orca, or Phi. It's not like there's any talk of AI PCs with mandatory TOPs requirements for Windows 12. Big bad Microsoft coming for your local AI, beware.

[1] https://github.com/microsoft/windows-ai-studio

drakenot · on Feb 27, 2024

The whole embrace, extend.. ?

> Mistral Remove "Committing to open models" from their website

That was 5 hours ago.

Without having insider details it is hard to know why, but the coincidence of timing with the Microsoft deal is not lost on me. It could have even been a stipulation.

Aerbil313 · on Feb 27, 2024

I have no explanation for why Microsoft has started aggressively innovating again (with the introduction of Satya) than my theory that US DoD realized the country's tool of dominance in the future will be predominantly with tech superiority instead of military power. Microsoft's new strategy of running everything on the cloud aligns with this, even if it may have been also motivated by the fact that most people now only own a battery-constrained mobile device and laptops getting smaller and thinner.

whimsicalism · on Feb 26, 2024

You’re downvoted for the snarky tone I guess, but you’re absolutely right

peteradio · on Feb 26, 2024

Even easier to rug pull your own teams project than someone else's.

isoprophlex · on Feb 26, 2024

Moving straight from Embrace to Extinguish, why not!

Zuiii · on Feb 27, 2024

Antitrust. Using their dominance in one market to destroy another. I hope the EU tares them a new one if the us doesn't.

loceng · on Feb 26, 2024

From my understanding, which may be wrong, you only need the massive compute resources initially to create a compiled vector space LLM - and then that LLM once compiled can be run locally?

This is why anti-CSAM measures policy is possible so compiled-release LLMs can have certain vector spaces removed before release; but apparently people are creating cracks for these types of locks?

extr · on Feb 26, 2024

You are a little confused. There’s no “compiling” of LLMs. It’s just once it’s trained, inference takes less compute than further training. So you can run things locally that you couldn’t necessarily train locally.

Not sure where you are getting the CSAM bit. We aren’t that good at blanking out weights in any kind of model, certainly not good enough to lobotomize specific types of content.

loceng · on Feb 26, 2024

Thanks for the clarification.

The CSAM bit seems to then be propaganda from at least one AI company putting out PR to falsely quell people's concerns about their LLMs being able to generate content involving children that's sexualized.

I've yet to see details of how much compute-minimum server requirements are necessary to run LLMs. Maybe you know a source who's compiling a list in a feature matrix that includes such details?

dwaltrip · on Feb 27, 2024

Large LLMs like gpt-3 and gpt-4 need very serious hardware. They have hundreds of billions of parameters (or more) which need to be loaded in memory all at once.

JyB · on Feb 26, 2024

I love that you are using the word lobotomize.

smoldesu · on Feb 26, 2024

I don't see why Mistral would acquiesce. Like the other comment says, Microsoft has a lot of chips on the table for local AI. They didn't even mention DirectML, ONNX or Microsoft's other local AI frameworks - suffice to say Microsoft does care about on-device AI.

So... would Mistral deliberately sabotage their low-end models to appease Microsoft's cloud demand? I don't think so. Microsoft probably knows that letting Mistral fall behind would devalue their investment. It makes more sense to bolster the small models to increase demand for the larger ones, at least from where I'm standing.

foobiekr · on Feb 29, 2024

Money. How do they get revenue in your model?

moneywoes · on Feb 27, 2024

what’s the most promising?

smoldesu · on Feb 27, 2024

If you're asking about Microsoft's APIs - I'd keep an eye on ONNX. It's the most ambitious, but also supports an insane amount of acceleration targets. It would be the proverbial "big guns" if vendors continued investing in more insular frameworks like Metal and CUDA.

teh_infallible · on Feb 26, 2024

So.. Embrace, Extend, Extinguish?