Talderigi's comments

Talderigi · 2026-04-21T15:15:07 1776784507

Curious how the semantic caching layer works.. are you embedding requests on the gateway side and doing a vector similarity lookup before proxying? And if so, how do you handle cache invalidation when the underlying model changes or gets updated?

giorgi_pro · 2026-04-21T15:43:22 1776786202

Hey, contributor here. That's right, GoModel embeds requests and does vector similarity lookup before proxying. Regarding the cache invalidation, there is no "purging" involved – the model is part of the namespace (params_hash includes the LLM model, path, guardrails hash, etc). TTL takes care of the cleanup later.

Talderigi · 2026-04-15T17:03:37 1776272617

feels like people are arguing the wrong axis tbh

- it’s not open vs closed anymore, it’s more like bug finding going a few devs poking around to basically infinite parallel scanners

- so now you don’t get a couple of thoughtful reports, you get a many edge cases and half-real junk. fixing capacity didn’t change though

- closing the repo doesn’t really save you, it just switches from white-box to black-box… and that’s getting pretty damn good anyway

real problem is: vuln discovery scaled, patching didn’t. now everything is a backlog game

Talderigi · 2026-04-14T12:40:55 1776170455

If you map Rust threads to warps, aren’t we basically turning the GPU into a very expensive CPU?

zozbot234 · 2026-04-14T12:50:40 1776171040

This blog post doesn't address how GPU "threads" can be mapped to Rust SIMD/SPMD "lanes" yet, though it hints at that. I assume that this is planned to be a topic for a future blog post.

I'd like to understand how the overall amount of "warps" to be launched on the GPU is determined. Is it fixed at shader launch, or can warps be created and destroyed on demand? If it's fixed, these are more like CPU-side "virtual processors" (in OS terminology) than true OS "threads".

hgomersall · 2026-04-14T16:15:09 1776183309

It makes sense when the inner operations are vectorisable, as in the example.

Talderigi · 2026-04-13T13:56:16 1776088576

Is Servo production-ready enough to replace or embed alongside engines like WebKit or Blink?

bastawhiz · 2026-04-13T14:11:44 1776089504

It depends on your use case. I wouldn't use it for a JS-heavy site. But if you have simple static content, it's probably enough. It's worth testing it out as a standalone app before integrating it as a library.

mayama · 2026-04-13T16:38:05 1776098285

It doesn't crash as often as it used to few years ago. JS heavy sites might not work, and layout issues too. And internet gatekeepers cloudflare turnstile doesn't work.

andriy_koval · 2026-04-13T18:11:55 1776103915

why did it crash? Rust is supposed to be memory safe?..

nablaxcroissant · 2026-04-13T19:20:12 1776108012

crashes happen for reasons besides memory safety. web-engines are crazy complicated pieces of software and crashes could happen for any number of reasons. also I would be shocked if this was written using purely safe rust

mkl · 2026-04-13T23:44:21 1776123861

The JS engine is SpiderMonkey, which is C++.

Talderigi · 2026-04-10T18:30:59 1775845859

rust fixed memory safety but left build-time trust wide open. What’s the realistic path to fixing this? sandboxed builds by default, or stricter provenance (sigstore-style) or what?

Talderigi · 2026-04-10T13:36:22 1775828182

We built systems we don’t fully understand, so naturally the next step is… immunity

avs733 · 2026-04-10T13:37:58 1775828278

From liability!

If this were to actually happen I can only imagine financial liability is the least of their concerns?

What scares me most about this is the narrowness of thought to match this fear with this response.

Talderigi · 2026-04-10T13:52:48 1775829168

fully agree, doesn’t really feel like they’re reacting to the same problem they’re describing

Talderigi · 2026-04-10T13:15:54 1775826954

open source but the off switch is centralized