More

ntonozzi · 2026-03-24T22:30:33 1774391433

Something like Cloudflare's Code Mode fixes both of these! No privileged bash environment, no VM necessary, no exposing credentials to the LLM.

As the article states, LLMs are fantastic at writing code, and not so good at issuing tool calls.

m11a · 2026-03-25T04:08:15 1774411695

Cloudflare's Code Mode is conceptually the same as Anthropic's Code Mode (https://www.anthropic.com/engineering/code-execution-with-mc...), or the various open source implementations that predate and postdate those blog posts.

tbh, that companies tried to make something proprietary of this concept is probably why its adoption has been weak and why we have "MCP vs CLI/Skills/etc" debates in the first place. In contrast, CLI tools only require a general a bash shell (potentially in a sandbox environment), which is very standardised.

ntonozzi · 2026-03-16T22:27:46 1773700066

The UK is begging for people to build datacenters: https://www.theguardian.com/technology/2026/mar/09/revealed-...

hdgvhicv · 2026-03-16T22:43:01 1773700981

National governments maybe. Local ones aren’t, and it’s those that fill with nimbys and maintain the majority of control over planning

nxobject · 2026-03-17T00:01:52 1773705712

Everyone wants the datacenter somewhere in their country for sovereignty... just not next to them. Quelle surprise. At this point you may as well build supermarkets on top of them just to sell 'em to people.

fc417fc802 · 2026-03-17T04:16:34 1773720994

They're so absurdly capital intensive at this point that they probably ought to be buried at least 50 meters down. If any reasonably capable countries ever face off directly they'll probably be one of the first things to go.

aitchnyu · 2026-03-17T11:29:07 1773746947

US suburban development followed nuclear war threats. Will history repeat itself by unconcentrating servers?

fc417fc802 · 2026-03-17T12:18:26 1773749906

Given the rapidly increasing power densities I expect it would be far more straightforward to bury them. I believe a single 42u rack of last gen nvidia hardware is already more energy intensive than the HVAC for a mcmansion.

However it occurs to me that the electrical grid becomes a high priority military target in this scenario. Maybe datacenters should go all in on building their own power plants.

ntonozzi · 2026-03-16T16:38:30 1773679110

Perhaps because he is a journalist whose job is to report reality, not avoid threats?

abracadaniel · 2026-03-16T17:46:47 1773683207

Until they start trying the carrot instead of the stick. Then it becomes a bidding war to determine the "reality"

zadikian · 2026-03-16T23:34:19 1773704059

They can also just do that bidding war in the resolution contract

asdff · 2026-03-16T18:55:43 1773687343

Always is. 'Reality' is a subjective accounting.

bigyabai · 2026-03-16T17:11:12 1773681072

Avoiding threats is non-optional for most Israeli journalists:

https://en.wikipedia.org/wiki/List_of_journalists_killed_in_...

https://en.wikipedia.org/wiki/Israeli_Military_Censor

yonixw · 2026-03-16T17:57:24 1773683844

The reason no one responds to this list is because it's just one big gish gallop

It's enough to see that you brought a link to the Israeli Military Censor to hint at a conspiracy, to understand who you're dealing with.

But even if you go into the list, you'll see at the top that those who were shot were in the middle of the battle, where Israeli forces were surprised, to the point of massacre alongside civilians. And there, it turns out, they didn't shoot to save themselves by the skin of their teeth, but simply wanted to kill journalists.

Also, a quick search shows that "Mohammad Jarghoun" ("מוחמד ג'רגון") was not a journalist at all, but a media worker, that according to the CPJ [1], during wartime he receives journalistic status. (Also not mentioned in AJ [2]. what a surprise...)

Another comment to the pantheon of "the most logical failures, in the fewest words". And then no one understands why the ICC will never consider such reports..

[1]: https://www.the7eye.org.il/501320

[2]: https://www.aljazeera.com/news/2023/10/10/at-least-six-pales...

b345 · 2026-03-17T07:54:44 1773734084

How does one say that a media worker is fair game, but a journalist is not? Both are classified as civilians under international law [1]. There are several airstrikes as the method of execution of these journalists as well. Good job cherry-picking the "favourable" examples. Also, I don't know what you're on about, but ICC is clearyly investigating Israeli war crimes of targeted journalist executions [3]. [1]: https://cpj.org/2023/10/journalist-casualties-in-the-israel-... [2]: https://cpj.org/data-methodology/ [3]: https://ifex.org/iccs-israel-palestine-investigation-will-in...

yonixw · 2026-03-17T10:08:29 1773742109

> Both are classified as civilians

Then say civilians. Don't claim what you can't provide. And DO provide context (like, was it still while the massacre was ongoing [1]). But all I can do is to suggest.

> Good job cherry-picking

Me cherry-picking: Taking the literally first entries, Array[0] and Array[1].

Also, you can't claim cherry-picking as invalid against gish-gallop. Since you can't enjoy the size argument only to retract items on the list after the smallest pushback.

Otherwise I can prove God. How? Every sentence in the Bible... Oh, you found some that are wrong? "Good job cherry-picking"!

And not to mention dozens of more problems with the list (no mentioning any IDF comments, no source for titles, etc.). This is just a bad list. Simple as.

> ICC is clearly investigating

Investigating != Judgment. But good, send them more. But please send them a list starting with items that might hold the smallest of scrutinies. And don't prove it by hinting at conspiracies just because Israel has a security censor. But all I can do is to suggest.

[1]: https://13tv.co.il/item/news/abroad/dynw9-903794689/

bigyabai · 2026-03-16T18:21:57 1773685317

[flagged]

yonixw · 2026-03-16T18:55:29 1773687329

Another "banger" comment that shows you did not read your sources links, here is one from the the wiki (Israeli Military Censor):

https://www.academia.edu/10481823/The_Israeli_paradox_The_mi...

Maybe your third comment will finally succeed...

bigyabai · 2026-03-16T19:00:23 1773687623

Both researchers in that paper are Israeli residents. Do you have an independent report that corroborates their findings?

yonixw · 2026-03-16T19:29:44 1773689384

The wiki sources FROM YOUR LINKS, are suddenly not enough once they are against you?

Now now... one may mistakenly think you have some inconsistencies in your theory... and you out to revisit them first before demanding more..

bigyabai · 2026-03-16T20:45:49 1773693949

The link corroborates my claim; the State of Israel does not protect press freedom.

I am asking you to cite a better counterarguement, if you want to disprove it. Or concede that Israeli journalists are regularly threatened by their government.

munificent · 2026-03-17T04:14:40 1773720880

I feel like being a journalist in a warzone is already exposure to a sufficient number of threats for the benefit of human society that we shouldn't simply accept them being exposed to any entirely different set of completely unnecessary threats from a pile of sociopaths running their own sick gambling dead pools.

ntonozzi · 2026-03-10T18:55:18 1773168918

Why do they need to run benchmarks to confirm performance? Can't they run an example prompt and verify they get the exact same output token probabilities for all prompts? The fact that they are not doing this makes me suspicious that they are in fact not doing the exact same thing as vLLM.

It is also a bit weird that they are not incorporating speculative decoding, that seems like a critical performance optimization, especially for decode heavy workloads.

lukebechtel · 2026-03-10T19:03:32 1773169412

Yes, speculative decoding will make both us and VLLM faster, but we believe it would be a relatively even bump on both sides, so we didn't include it in this comparison. Worth another test!

nyrikki · 2026-03-11T17:57:07 1773251827

> Can't they run an example prompt and verify they get the exact same output token probabilities for all prompts?

You don’t even get that with GPUs in general, or really floating point in general.

The Art of Computer Programming. Volume 2: Seminumerical Algorithms section 4.2.2 with explain where it loses floating addition associativity property.

Apartness relations are another possible lens.

ntonozzi · 2026-03-13T21:02:01 1773435721

Yeah you can: https://thinkingmachines.ai/blog/defeating-nondeterminism-in....

nyrikki · 2026-03-14T04:26:46 1773462406

> However, as the name “batch-invariant” suggests, the technique is currently limited to handling variations related only to the batch dimension, making it robust to continuous batching and other batch-size–related changes, but not to other forms of nondeterminism like changing the TP sizes or GPU types.

https://arxiv.org/abs/2506.09501

jeeeb · 2026-03-11T09:00:32 1773219632

> It is also a bit weird that they are not incorporating speculative decoding

Wouldn’t speculative decoding decrease overall throughput, but optimise (perceived) responsiveness?

YetAnotherNick · 2026-03-11T09:13:31 1773220411

For compute bound region(high batch size) yes, but for low batch size it could improve the throughput.

ntonozzi · 2026-03-05T21:50:14 1772747414

IMO the core of the issue is the awful Github Actions Cache design. Look at the recommendations to avoid an attack by this extremely pernicious malware proof of concept: https://github.com/AdnaneKhan/Cacheract?tab=readme-ov-file#g.... How easy is it to mess this up when designing an action?

The LLM is a cute way to carry out this vulnerability, but in fact it's very easy to get code execution and poison a cache without LLMs, for example when executing code in the context of a unit test.

crote · 2026-03-05T22:41:11 1772750471

GHA in general just isn't designed to be secure. Instead of providing solid CI/CD primitives they have normalized letting CI run arbitrary unvetted 3rd-party code - and by nature of it being CD giving it privileged access keys.

It is genuinely a wonder that we haven't seen massive supply-chain compromises yet. Imagine what kind of horror you could do by compromising "actions/cache" and using CD credentials to pivot to everyone's AWS / GCP / Azure environments!

ntonozzi · 2026-03-02T20:15:01 1772482501

Wow that is wild, that is exactly along the lines of my fantasy language. It'd be so easy to go into the deep end building tooling and improving a language like this.

fcatalan · 2026-03-02T20:26:55 1772483215

I have had to check myself a bit, too easy to fall too deep into what is essentially a practical joke

ntonozzi · 2026-02-03T22:41:44 1770158504

That argument was dead _at least_ 2 years ago, when we gave LLMs tools.

ntonozzi · 2026-01-20T23:32:18 1768951938

I've given up on soft delete -- the nail in the coffin for me was my customers' legal requirements that data is fully deleted, not archived. It never worked that well anyways. I never had a successful restore from a large set of soft-deleted rows.

zahlman · 2026-01-20T23:36:33 1768952193

> customers' legal requirements that data is fully deleted

Strange. I've only ever heard of legal requirements preventing deletion of things you'd expect could be fully deleted (in case they're needed as evidence at trial or something).

jandrewrogers · 2026-01-21T00:03:45 1768953825

While not common, regulations requiring a hard delete do exist in some fields even in the US. The ones I familiar with are effectively "anti-retention" laws that mandate data must be removed from the system after some specified period of time e.g. all data in the system is deleted no more than 90 days after insertion. This allows compliance to be automated.

The data subject to the regulation had a high potential for abuse. Automated anti-retention limits the risk and potential damage.

pessimizer · 2026-01-21T02:28:34 1768962514

You're thinking of "legal requirements" as requirements that the law insists upon rather than requirements that your legal department insists upon. You often want to delete records unrecoverably as soon as legally possible; it's likely why you wrote your data retention policy.

SchemaLoad · 2026-01-21T00:22:47 1768954967

I had an integration with a 3rd party where their legal contract required we hard delete any data from them after a year. Presumably so we couldn't build a competing product using their dataset with full history.

ntonozzi · 2026-01-20T23:42:07 1768952527

Many privacy regulations enforce full deletion of data, including GDPR: https://gdpr-info.eu/.

ntonozzi · 2026-01-12T16:16:21 1768234581

Hopefully it gets more tightly integrated.

ntonozzi · 2026-01-07T20:19:53 1767817193

Maybe the best part of this legislation will be that people will realize it's not institutional investors that are driving up home prices. No, that's far too optimistic.

tbrownaw · 2026-01-07T20:39:51 1767818391

When this doesn't make anything better, the conclusion won't be that it was a bad idea but that it somehow didn't go far enough.

ntonozzi · 2026-01-07T23:47:59 1767829679

Home affordability is getting better anyways, which is great, because we are finally having a surge in new & denser home building in popular regions and there mortgage rates are more reasonable than they were in the COVID-era.