The difference is that work done by a contracted tradesperson typically comes with some sort of guarantee, e.g. 2 years for work done in your home (up to 5 for bigger construction-type work), at least here in Germany, which you don't (need to) factor in when DIY-ing.
> that they're unable to [manage and] kill child processes they themselves spawn makes it seem like they have zero clue about what they're doing.
Yeah, at the bare minimum these projects could also use something like portless[1] which literally maps ports to human- (and language model-)readable, named .localhost URLs.
That _should_ greatly simplify matching processes to projects and vice versa, since hard-to-remember port numbers leave the equation entirely. You could even prefix the names if you've got that much going on, for the ultimate "overview": project1-db.localhost, project1-dev.localhost, etc.
Well, or just use port 0 like we've done for decades, read back which port got assigned, then use that. No more port collisions ever. I thought most people were aware of that by now, but judging from that project even existing, it seems I was wrong.
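For anyone unfamiliar, a minimal sketch of the port-0 trick in Python (the same mechanism exists in basically every socket API):

```python
import socket

# Bind to port 0 and the OS assigns any free ephemeral port,
# which we then read back instead of hard-coding one.
sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
sock.bind(("127.0.0.1", 0))      # 0 = "pick any free port for me"
port = sock.getsockname()[1]     # read back the assigned port
sock.listen()
print(f"listening on 127.0.0.1:{port}")
```

The server then advertises `port` however it likes (stdout, a pidfile, service discovery); two instances can never collide because the kernel only hands out free ports.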
That’s a little different, right? Using port 0 would imply that clients haven’t hard-coded which port they should connect to, and also that we don’t mind duplicate processes occupying other ports that are no longer in active use.
Interesting article you’ve linked. I’m not sure I agree, but it was a good read and food for thought in any case.
Work is still being done on how to bulletproof input “sanitization”. Research like [1] is what I love to discover, because it’s genuinely promising. If you can formally separate out the “decider” from the “parser” unit (in this case, by running two models), together with a small allowlisted set of tool calls, it might just be possible to get around the injection risks.
Sanitization isn’t enough. We need a way to separate code and data (not just to sanitize out instructions from data) that is deterministic. If there’s a “decide whether this input is code or data” model in the mix, you’ve already lost: that model can make a bad call, be influenced or tricked, and then you’re hosed.
At a fundamental level, having two contexts as suggested by some of the research in this area isn’t enough; errors or bad LLM judgement can still leak things back and forth between them. We need something like an SQL driver’s injection prevention: when you use it correctly, code/data confusion cannot occur since the two types of information are processed separately at the protocol level.
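To illustrate the SQL analogy above, a minimal sketch using Python's sqlite3 parameterized queries: the query text (code) and the value (data) are passed through separate channels, so the value can never be re-parsed as SQL, regardless of what it contains.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT)")

# A classic injection payload, passed as a bound parameter
# rather than spliced into the query string.
malicious = "Robert'); DROP TABLE users;--"
conn.execute("INSERT INTO users (name) VALUES (?)", (malicious,))

# The payload was stored verbatim as data; it was never executed,
# and the users table still exists.
row = conn.execute("SELECT name FROM users").fetchone()
print(row[0])
```

The separation is structural, not a judgment call: no model or heuristic decides whether the input "looks like" code. That is the property LLM pipelines currently lack.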
The linked article isn't describing a form of input sanitization, it's a complete separation between trusted and untrusted contexts. The trusted model has no access to untrusted input, and the untrusted model has no access to tools.
That’s still only as good as the ability of the trusted model to delineate instructions from data. The untrusted model will inevitably be compromised so as to pass bad data to the trusted model.
I have significant doubts that a P-LLM (as in the CaMeL paper) operating a programming-language-like instruction set with “really good checks” is sufficient to avoid this issue. If it were, the P-LLM could be replaced with a deterministic tool call.
They’d probably get the farthest, but they won’t pursue that because they don’t want to end up leaking the original data from training.
It has been shown possible to reconstruct large consecutive chunks of training data from the regular language/text portions of models [1], so it ought to be possible for their internal code, too.
Copyright for me but not for thee? :) That's a good point though. Maybe they could round-trip things? E.g., use the model trained only on internal content to generate training data (screened to remove anything they don't want leaking), then train a new model off just that?
I agree, this is inherently unsafe. The two core security issues for agents, I’d say, are LLMs not producing a “deterministic” outcome, and prompt injection.
Prompt injection is _probably_ solvable if something like [1] ever finds a mainstream implementation and adoption. But agents not being deterministic, as in “do not only what I’ve told you to do, but also how I meant it” (all while assuming perfect context retention), is a waaay bigger issue. If we ever got that, software development as a whole would be solved outright, too.
On the other hand, that one same engine would then be under near-full control of a single company (Google), with all the disadvantages a monopoly usually brings.
I'm not the founder nor Kagi employee, just a customer, but
> Can you describe or offer any insight into the "significant IP" that you need to protect and defend?
The novel IP is having implemented and still implementing the browser APIs necessary for both Firefox and Chromium extensions to work in a Safari (Webkit)-based browser. See [1] for the significant progress.
> What threats from a larger company are you primarily concerned about?
Integrating said functionality themselves to offer another viable iOS browser, of which Kagi is currently the only [2] provider (or another viable macOS/future Linux/Windows browser, although more than one already exists there).
[2] Unless the EU steps up, all iOS browsers will continue to have to be Webkit-based with minimal, lackluster extension support. Not viable for anything beyond the most basic of use cases.
Regarding your first paragraph, I've even talked with people who go out of their way to actively _avoid_ said product after encountering AI-generated advertising.
So that'll probably continue to have an effect for as long as average people with good eyes can still distinguish "AI"/generative media from "real"/traditional footage.
I have observed this as well, and we've already seen some pushback when major brands use AI in their creative. I wonder if we're entering an era where AI will actually taint a brand.