Some people are speculating that Opus 4.7 is distilled from Mythos because of the new tokenizer (which would mean Opus 4.7 is a new base model, not just an improved Opus 4.6).
The new tokenizer is interesting, but it is definitely possible to adapt a base model to a new tokenizer without much additional training, especially if you're distilling from a model that already uses the new tokenizer (see, e.g., https://openreview.net/pdf?id=DxKP2E0xK2).
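One common trick for this kind of tokenizer swap is to initialize each new token's embedding from the embeddings of the old-tokenizer subtokens it decomposes into, then fine-tune or distill from there. A toy numpy sketch (the vocabularies and the decomposition mapping are made up for illustration, not from any real model):

```python
import numpy as np

rng = np.random.default_rng(0)
d_model = 8

# Old tokenizer's vocabulary and trained embedding table.
old_vocab = {"un": 0, "believ": 1, "able": 2, "token": 3}
old_emb = rng.normal(size=(len(old_vocab), d_model))

# New tokenizer merges some old pieces into single tokens.
new_vocab = {"unbelievable": 0, "token": 1}
decompose = {  # how each new token splits under the OLD tokenizer
    "unbelievable": ["un", "believ", "able"],
    "token": ["token"],
}

# Initialize each new embedding as the mean of its old subtoken embeddings.
new_emb = np.zeros((len(new_vocab), d_model))
for tok, i in new_vocab.items():
    pieces = [old_emb[old_vocab[p]] for p in decompose[tok]]
    new_emb[i] = np.mean(pieces, axis=0)

# Tokens shared between vocabularies keep their old embedding exactly.
assert np.allclose(new_emb[new_vocab["token"]], old_emb[old_vocab["token"]])
```

From an initialization like this, a relatively short distillation run against a teacher that already speaks the new tokenizer can close most of the gap.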
Yes, I was thinking that. But it could just as well be the other way around: using the pretrained 4.7 (1T?) to speed up ~70% of the Mythos (10T?) pretraining run.
It's just speculative decoding, but for training. If they did it at this scale, it's quite an achievement, because training is very fragile when you do these kinds of tricks.
Reverse distillation: using small models to bootstrap large ones. You get a richer signal early in the run when gradients are hectic, and it gets the large model past early-training instability hell. Mad, but it does work somewhat.
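The "richer signal early, fade out later" idea can be sketched as a blended loss: early in the run the large student matches a small pretrained teacher's distribution via KL, and the teacher's weight anneals to zero as training stabilizes. The schedule and blend below are purely illustrative, not any lab's actual recipe:

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def early_run_loss(student_logits, teacher_logits, targets, step, warmup=1000):
    """Hypothetical reverse-distillation loss: blend cross-entropy on the data
    with KL toward a smaller pretrained teacher, annealing the teacher away."""
    p_s = softmax(student_logits)
    p_t = softmax(teacher_logits)
    # Standard next-token cross-entropy against the data.
    ce = -np.log(p_s[np.arange(len(targets)), targets]).mean()
    # KL(teacher || student): the rich early-training signal.
    kl = (p_t * (np.log(p_t) - np.log(p_s))).sum(axis=-1).mean()
    alpha = max(0.0, 1.0 - step / warmup)  # teacher fades out over warmup
    return (1 - alpha) * ce + alpha * kl

logits_s = np.array([[2.0, 0.5, -1.0]])
logits_t = np.array([[1.5, 1.0, -0.5]])
loss0 = early_run_loss(logits_s, logits_t, np.array([0]), step=0)       # all KL
loss_end = early_run_loss(logits_s, logits_t, np.array([0]), step=1000)  # all CE
```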
Not really similar to speculative decoding?
I don't think that's what they've done here though. It's still black magic, I'm not sure if any lab does it for frontier runs, let alone 10T scale runs.
Nah, those are completely different beasts. DeepSeek's MLA solves the KV cache issue via low-rank projection: they literally squeeze the matrix through a latent vector at train time. TurboQuant is just post-training quantization, where they mathematically compress existing weights and activations using polar coordinates.
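A back-of-the-envelope sketch of the low-rank KV idea: instead of caching full per-head K and V for every token, you cache one small latent vector per token and reconstruct K/V from it with up-projection matrices at attention time. All the dimensions here are toy numbers, not DeepSeek's actual config:

```python
import numpy as np

rng = np.random.default_rng(1)
d_model, d_latent, n_heads, d_head = 64, 8, 4, 16

W_down = rng.normal(size=(d_model, d_latent)) * 0.1         # trained jointly
W_up_k = rng.normal(size=(d_latent, n_heads * d_head)) * 0.1
W_up_v = rng.normal(size=(d_latent, n_heads * d_head)) * 0.1

h = rng.normal(size=(d_model,))              # hidden state for one token
c = h @ W_down                               # this latent is all we cache
k = (c @ W_up_k).reshape(n_heads, d_head)    # reconstructed at attention time
v = (c @ W_up_v).reshape(n_heads, d_head)

full_cache = 2 * n_heads * d_head  # floats per token for plain K+V caching
mla_cache = d_latent               # floats per token with the latent
print(full_cache, mla_cache)       # 128 vs 8
```

Crucially the low-rank structure is baked in during training, which is exactly why it isn't comparable to a post-training quantization scheme applied to an already-trained model.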
Somebody took a deeper look at Claude Code and claims to find evidence of Anthropic's PaaS offering [1]. There's certainly money to be made by offering a nice platform where "citizen developers" can push their code.
From Astral, the (fast) linter and type checker are pretty useful companions for agentic development.
I don't think this holds because we're talking about developers who know how to use a package manager, on a piece of software you have to install anyways. The friction of "uv add $other_llm_software" is too low for it to have a real impact.
I think they're more into the extra context they can build for the LLM with ruff/ty.
I don’t think they’re targeting the C suite with it, because they don’t use uv and Microsoft already has Copilot for the “it’s bad but bundled with stuff you’re already paying for” market.
idk, i think it's the other way around. I imagine in 5 years my new laptop setup will look like:
$ curl 'claude.ai/install?key=abcd123' | bash -e
$ claude 'finish laptop setup from http://github.com/justjake/Dotfiles'
claude will be the one to install / set up the system, not the other way around. claude was certainly the one who installed `uv` on my current machine.
The most straightforward one: They run a lot of computational sandboxes that need fast setup. Making sure you can shape the package manager to your needs is a pretty straightforward desire.
I'm not so sure. I sort of wish they hadn't been acquired, because these sorts of acquihires usually result in stifling the competition while the incumbent stagnates. It's definitely an acquihire, given that OpenAI explicitly states they'll be joining the Codex team and says only that their existing open-source projects will remain "maintained".
The value is controlling the tool chain from idea to production so it can be automated by agents. It's no secret that the final goal is to fully replace developers across the whole "idea to production" flow. It's easier to control that flow if you control every tool and every step.
I won't be surprised if the next step is to acquire CI/CD tools.
There is the literal benefit of "we use the hell out of this tool, we need to make sure it stays usable for us" and then there is what they can learn from or coerce the community in to doing.
I don't know about OpenAI using a lot of Python, but Astral builds all their tools in Rust and just exposes Python bindings. Codex is all Rust. It feels like a reasonable acquisition from that perspective: they're banking, at least in part, on the Astral team being able to integrate with and supercharge Codex.
Why do you think that uv, etc. will stay maintained? They will for now, but as soon as cash is tight at OpenAI, they'll get culled so fast that you won't see it coming. This is the risk.
I share the feeling; but the people using it are mostly non-technical (despite the 50+ config files, lol) and are just running it constantly to do random things.
But a message bot + Claude Code/Codex would be the better version
I tried it for 2 days and honestly don't see the usefulness either, although the big reason is that I paired it with Claude, which only offers per-token billing. Here are the few improvements over plain Claude usage:
- As you mentioned, the message bot thing was kind of cool.
- It can browse the internet and act (like posting on MoltBook, which I tried).
- It has a permanent "memory" (loads of .md files, so nothing fancy).
- It can be scheduled via cron jobs.
Overall, nothing really impressive. It is very gimmicky and it felt very unsafe the whole time (I had already read about the security issues, but sometimes you gotta live dangerously). The most annoying part was the huge token consumption (conversations start at 20k+ tokens because of all the .md files), and it cost me roughly $12 for a few hours of testing.
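For what it's worth, the cron scheduling is presumably nothing more exotic than a regular crontab entry; something along these lines (the CLI name, flags, and paths are entirely made up for illustration):

```shell
# Hypothetical crontab entry: run the agent's daily summary task at 8am
# and append its output to a log file.
0 8 * * * /usr/local/bin/agent run --task "summarize yesterday's messages" >> /var/log/agent.log 2>&1
```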
Non-technical people haven't even heard of OpenClaw or GitHub, let alone know how to use and deploy them. Non-technical people don't even know what the OS on their Samsung or iPhone is called.
If you can find something on GitHub and deploy it on your system, you're part of the technical crowd.
>My hairdresser knew all about it and had ordered a Mac mini.
Your hairdresser can't be a technical person because they're a hairdresser? I know a surgeon who writes FOSS software as a hobby. What does profession have to do with being technical or not? Most technical people are self-taught anyway.
I know them very well, and they are not a coder, or a 'technical person' by a broad HN definition.
What I'm saying is that we are at the point where technology is so pervasive in our society, and the lure of AI so seductive, that many more people are excited to try things out than I might have expected.
I suppose it has similarities to the early to mid 1980s and the home computing revolution. Where many people thought they should have a computer at home, even if they were not sure what they'd do with it.
It will maybe be solved soon if we train yet another neural network on scanning GitHub activity; but also by adding other forges like Codeberg, GitLab, self-hosted Forgejo, etc., so as not to lock non-GitHub users out.
Yeah, scanning non-GitHub forges is on the roadmap and really should be done. I expect there would be value in understanding all of the current GitHub competitors, and I think the forecast of new GitHub competitors being launched (likely by AI companies) will become relevant in the near future.
The 1M window might be usable, but it will probably underperform against a smaller window of course.