> In light of the ability of recent models to accelerate their own development, ...

davedx · 2026-06-10T07:15:02 1781075702

Could this be legally construed as anti-competitive behavior?

Edit: I asked Claude. It replied:

> Consumer protection / deceptive practices. In the EU this would be a clear UCPD (Unfair Commercial Practices Directive) issue and potentially a DSA violation. In the US, FTC Act §5 prohibits "unfair or deceptive acts." Selling a product that secretly performs worse than advertised for a commercially self-serving reason, without disclosure, is textbook deception. The Samsung/Apple battery throttling cases are instructive here: Apple faced regulatory action across multiple jurisdictions specifically because users weren't told.

> Competition law. This is where "anti-competitive" gets complicated. Refusing to help competitors build competing products via your ToS is generally legal — you can decide who you license to. But covertly sabotaging output quality for a class of users while charging them full price crosses into different territory. Under EU competition law (Article 102 TFEU), if a company with dominant market position uses covert technical means to disadvantage competitors, that's closer to abusive conduct than a legitimate ToS restriction.

anon373839 · 2026-06-10T09:39:13 1781084353

Anthropic’s behavior reeks of insecurity. Imagine Google taking elaborate measures to prevent you from searching about search engine development!

j2kun · 2026-06-10T14:53:22 1781103202

Instead, Google gave up on search engine development /s

tgtweak · 2026-06-10T15:27:31 1781105251

Implying that google's "snippets" were never curated to remove anti-google facts or that they didn't curate the search results in their favor...

greenrd · 2026-06-10T08:36:45 1781080605

I think either you've prompted Claude misleadingly, or it's interpreting the law unnecessarily prissily (which is a failure mode I've noticed LLMs falling into).

This clearly is disclosed, otherwise how did we get to know about it?

Game_Ender · 2026-06-10T11:38:50 1781091530

What's not clearly disclosed is when you are being limited and what the bounds are. If you are developing ML kernels for a computational photography use case, will the safeguards misfire and sabotage or slow down your efforts? What about distributed GPU interconnect work for a national supercomputer lab used in weather simulation?

The reason they are using this shadow-ban-style technique is that they don't want users to figure out how to jailbreak their way out. Or the explicit, direct bad PR when it misfires.

kiv_apple · 2026-06-14T17:57:10 1781459830

A "shadow ban"-like mechanic in a contractual relationship is generally against the law.

You either refuse to work with the customer or do your job well (at least as well as you do other customers' tasks). "I'll accept your task, but silently and intentionally do my job badly" may violate some laws.

hashmap · 2026-06-10T13:48:19 1781099299

They do not disclose when the service is degraded, and the admission of that here seems like it would do a plaintiff's work for them.

cedws · 2026-06-09T18:19:36 1781029176

This makes me want to see China and open models succeed more than anything :)

382hi · 2026-06-09T18:21:06 1781029266

Don't worry, we will succeed :)

UncleOxidant · 2026-06-09T18:57:43 1781031463

Can we get a Qwen3.7-122B, please? Thank you.

webXL · 2026-06-09T20:50:28 1781038228

Or just any update for 122B. That size seems to be ideal for a single GB10

KerrAvon · 2026-06-09T21:05:15 1781039115

and for maxed-out M5 Macs

lacoolj · 2026-06-09T22:26:23 1781043983

Mimo has your back! 1000 t/s on 1T param model

Just need to wait for this thing to be open sourced :)

lol it won't tho...

https://mimo.xiaomi.com/blog/mimo-tilert-1000tps

miloignis · 2026-06-10T02:28:45 1781058525

What do you mean? The HF checkpoint is linked from the blog post you sent: https://huggingface.co/XiaomiMiMo/MiMo-V2.5-Pro-FP4-DFlash

jimbob45 · 2026-06-10T01:46:33 1781055993

They already have though, no? If we lost access to every model permanently besides Qwen tomorrow, would we really be limited by AI in what we could achieve in the future? Sure, it might be slower and take a little more work but it seems like the cat is already out of the bag.

celdon25 · 2026-06-09T22:55:57 1781045757

Fun fact: If you show fable this post, it will route you to 4.8 automatically.

DeathArrow · 2026-06-10T04:18:09 1781065089

In a few months they will have Fable level models costing 10 times less and with less safeguards.

adithyaharish · 2026-06-10T04:20:50 1781065250

I do agree. I still remember when Opus 4.7 was released and one prompt conversation would empty my Claude usage, but I can use it all day long to code.

melicerte · 2026-06-10T13:22:59 1781097779

Do you know that some open models developed in China are financially supported by Meta?

johnsimer · 2026-06-09T20:42:53 1781037773

Do you want anyone in the world to be able to synthesize dangerous viruses?

sneak · 2026-06-09T22:22:45 1781043765

I want everyone in the world to be able to perform unlimited cutting edge research on any topic at the maximum thinking level, instantly.

The reason we are not being attacked is not lack of technology access.

dyauspitr · 2026-06-09T22:37:54 1781044674

It is an access issue. If you could get step by step instructions on how to modify a virus so it kills all people over 6ft you bet your ass there would be people attempting it.

JumpCrisscross · 2026-06-09T23:39:57 1781048397

> It is an access issue

Column A, Column B. Building a small explosive device isn't hard. Building a million is very difficult, doing it covertly virtually impossible without the resources of a nation-state.

The problem with biologics is the self-assembly and replication machinery comes for "free." So the numpties who might otherwise blow up a trash can [1] now have a real chance of taking out a million people.

[1] https://en.wikipedia.org/wiki/2016_New_York_and_New_Jersey_b...

kiv_apple · 2026-06-14T17:10:38 1781457038

The problem with biologics is that you cannot build a virus in your garage. You need a lab. AI will give you the recipe, but you still need a lot of money and the cooperation of other people (and if you have those, you could have hired a human biologist in the pre-AI era).

Also, AI makes mistakes. If you have ever coded with an AI agent, you know the loop "write trash => compile => fix compilation errors => repeat" (if there are no compilation errors, there are definitely logic errors to be fixed). In the real world, the cost of an attempt is huge. You need a lot of money, and you risk drawing a lot of attention if you perform a long series of iterative experiments to create a working virus.

With a bomb, it means that even if you have an AI that gives you the recipe, there is a decent chance you will blow up your garage and yourself. So you probably need to set up a good experimental pipeline (a hardened lab where you can try different formulas and see what happens without being killed) if you want to go beyond the publicly known explosives available in the pre-AI era to anyone who read school or university chemistry books. And this also requires resources and draws attention.

People extrapolate programming experience (an area where experiments are cheap, cannot kill you, and provide detailed feedback on what went wrong) to real life.

gck1 · 2026-06-10T00:11:13 1781050273

They would still have to procure things that would (I hope) light up many screens before they're able to. And such numpties are probably already monitored, or in prison for some other stupid life decision.

I also would like to hope that people that are likely to do such things are probably:

A) don't know how to break even the most basic guardrails of models

B) already in glasswings project

To prove point B - Theranos existed.

JumpCrisscross · 2026-06-10T01:27:48 1781054868

> They would still have to procure things that would (I hope) light up many screens before they're able to

“Many of the largest and most responsible providers in the industry already screen and record orders voluntarily,” but there is no requirement to do so [1].

[1] https://screendna.org/

schaefer · 2026-06-09T23:57:02 1781049422

> ...you bet your ass...

Humorously, whether I choose to participate in this hypothetical or not, I am already betting my ass.

This whole situation feels like the game [1].

[1]: https://en.wikipedia.org/wiki/The_Game_(mind_game)

porksoda · 2026-06-10T07:17:15 1781075835

Why. That was just uncalled for. Sigh

sneak · 2026-06-10T14:41:11 1781102471

If that were possible, they would already be attempting it with the same level of ability as if they didn’t have access to a text file generator app. It is not about access to the information.

All of this “guardrails” handwringing is nonsense. These things output text. Are you for censorship of a book written by a biotechnology expert that gives out the exact same information?

debesyla · 2026-06-09T23:10:55 1781046655

I guess in this theoretical "AI makes weapon" scenario one could use the same AI to make defences too?

// Claude, make antiviral nanobots that defend me from 6ft virus. Make no mistakes.

dyauspitr · 2026-06-10T02:19:35 1781057975

I don’t know if you’re being silly but it is orders of magnitude easier to modify an existing virus to selectively target certain SNPs than make “antiviral nanobots”

jex_the_ape · 2026-06-10T15:54:16 1781106856

Claude, modify the existing 6ft killer virus so that it only makes my balls itch slightly for a day and gives me lifetime immunity to all further strains of the 6ft killer virus. Make no mistakes, double check so the virus causes no unforeseen complications.

iAMkenough · 2026-06-09T21:49:51 1781041791

It's inevitable. Also, it's not like I get to vet who does or doesn't have access. Blind trust in the current selection made by an unregulated corporation just makes me anxious.

Security in the form of "pay to play" is just kicking the bigger issue down the road.

jesterson · 2026-06-10T03:32:31 1781062351

Do you believe people currently possessing best models act/will act in your best interest?

orphea · 2026-06-09T21:23:08 1781040188

So, security (safety) through obscurity?

usef- · 2026-06-10T06:54:05 1781074445

The phrase "security through obscurity" isn't an argument against all information restriction.

It doesn't imply we should, for example, publish step-by-step instructions for making widespread death easier.

qrios · 2026-06-10T17:42:10 1781113330

Another „great filter“: How to handle dangerous information?

inglor_cz · 2026-06-10T11:23:58 1781090638

The argument against security through obscurity isn't that it doesn't work at all. It does to a degree, only it is not as strong as people think.

An example from the meat world: not publishing your vacation dates well in advance for the world to see somewhat reduces your chance of being burglarized. That is security by obscurity; not reliable, but not completely inefficient either.

But if you live in a fortress (security by key material), you can well declare your vacation dates without running the risk.

invalidusernam3 · 2026-06-10T11:38:35 1781091515

What about allowing people to synthesize dangerous virus protection?

digitaltrees · 2026-06-10T00:01:04 1781049664

If the tool were made available to anyone to build a virus, anyone would be able to build countermeasures. If only a select few people have access, they get to build the virus and everyone else is at a disadvantage. So, yes, I am leaning towards making these tools open rather than gated behind some priesthood and government that gets to wield exclusive power.

usef- · 2026-06-10T00:34:41 1781051681

Compare the cost/ease of attacker vs defender if one person is given a virus to unleash anywhere in the world and another person is given a vaccine to distribute to the whole world. Or compare building a large bridge to someone disabling that bridge, etc. Prevention and repair is almost always more expensive than vandalism.

I don't think there's an ideal solution here, but giving trusted people access to fix security issues before giving it to the wider public seems like a reasonable compromise. They're letting you use the model for all other uses.

sterlind · 2026-06-09T21:44:16 1781041456

you need a lot more than the nucleotide sequence to make a virus. you need the DNA or RNA to be synthesized, assembled, packaged properly. and long sequences are pretty hard to do. you need a lot of equipment, or you need to order from services. the oligo synth services can harden their KYC and/or screen for suspicious sequences.

sure, a malevolent state actor could swing it, but they could make a bioweapon without Mythos's help already.

also, vaccine production and disease surveillance have ramped up very quickly. they will ramp up further, despite political setbacks. it's a cat and mouse game that favors the defenders IMO.

but the bioterrorism narrative is useful FUD to spin open-weight models as existentially dangerous. I am far more worried about Anthropic's own goals than the goals of some crackpot in a shed.

theLiminator · 2026-06-09T21:50:52 1781041852

> it's a cat and mouse game that favors the defenders IMO

How so? I'm actually against most of the "safety-tuning" that anthropic does, but this seems fundamentally untrue, a close analogue being video game cheat development. I think in general the cheat developer has an advantage and the cheats generally proliferate for quite a while before being patched.

ceigey · 2026-06-09T23:42:20 1781048540

Video games are an interesting analogy since they often trade security for performance, trusting clients about world state quite a bit.

Finance and biology do come across as two similar high level systems. But while we can employ KYC, fraud detection, and various auditing techniques to finance, I don’t know what you do for biology. You can easily run an algorithm over every transaction a person makes in their account but there’s no equivalent for every cell, every bacteria strain, every virus in the human body.

sterlind · 2026-06-09T23:51:31 1781049091

(disclaimer: layperson remembering how the immune system works.)

the adaptive immune system effectively does KYC by checking the antigens presented on the surfaces of cells. the thymus selects for T-cells (iirc?) which don't react to a corpus of the body's own antigens, but cover a wide library of everything else. when it sees something it doesn't recognize, it reproduces, warns the rest of the immune system and marks targets. that's why our immune systems can eventually conquer almost every pathogen we encounter, if we can survive long enough for it to do its work.

but the KYC I was referring to was KYC that vendors of oligonucleotides (should) be doing, to keep people from ordering nefarious sequences.

sterlind · 2026-06-09T23:47:22 1781048842

I'm bullish on mRNA vaccine technology to release the "patches" much more quickly. there was widespread resistance to this during covid, but covid wasn't horribly lethal. if airborne Ebola spread as productively as covid, for example, I doubt there'd be many anti-vaxxers left (one way or another!) the acceleration of biology research that might accelerate pathogen development should also accelerate the development of broad-spectrum mRNA vaccines with high persistence.

also, afaik the most effective way of developing pathogens is through serial passage through humanized mice or something like that - directed evolution at a small scale, selecting for traits. AI simply isn't needed for that. I don't think information or intelligence has been the bottleneck for bioterrorism, it's motivation and resources - same as for any other kind of biology research program.

root-parent · 2026-06-09T20:51:03 1781038263

We do. It's the only way we will get our jobs back.

mips_avatar · 2026-06-09T18:01:07 1781028067

It's bad that Anthropic can determine what this means. If you're building a modern app you're likely training your own embedding models and now anthropic can just silently sabotage your training pipelines?

abixb · 2026-06-09T18:57:46 1781031466

>We estimate they will impact ~0.03% of traffic, concentrated in fewer than 0.1% of organizations

At the scale of API requests that Anthropic sees, I think the affected organization count might be substantial, and they might not be getting the full model capability that they're paying top $$$ for.

Also, wonder how they arrived at that estimation.

wongarsu · 2026-06-09T19:07:17 1781032037

One in 1000 organizations and one in 3000 requests is indeed a lot

happyopossum · 2026-06-09T20:04:48 1781035488

That’s 1 in 30,000 requests…

dragonwriter · 2026-06-09T20:50:40 1781038240

No, 0.1% is one in 1,000. 0.03% is (approximately) one in 3,000; one in 30,000 is 0.003%

ViscountPenguin · 2026-06-09T21:26:03 1781040363

You're off by an order of magnitude with those last two.

mediaman · 2026-06-09T21:42:14 1781041334

Double check your math. All of their posts in this thread are correct.

1/30,000 * 100 = .003
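
For anyone who wants to check the conversions in this subthread, here is a minimal Python sketch. The helper name `one_in_n` is made up for illustration; it simply inverts a percentage using exact fractions so that floating-point rounding does not muddy the result.

```python
from fractions import Fraction


def one_in_n(percent: str) -> Fraction:
    """Return N such that `percent` percent equals one in N (hypothetical helper)."""
    # A percentage p means p/100 of the total; inverting that gives "one in N".
    return 1 / (Fraction(percent) / 100)


# The three figures discussed in the thread.
for p in ("0.1", "0.03", "0.003"):
    print(f"{p}% is about one in {round(one_in_n(p)):,}")
```

Strictly, 0.03% comes out nearer one in 3,333, which the thread rounds to one in 3,000; one in 30,000 sits an order of magnitude lower, at about 0.003%.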

ViscountPenguin · 2026-06-10T03:05:04 1781060704

Oh, fuck

freakynit · 2026-06-10T05:38:17 1781069897

/r/TheyDidTheMath IYKYK

dotancohen · 2026-06-09T21:55:54 1781042154

If it makes you feel more comfortable, throw another significant digit at GP's decimal. Make it a 3 like the previous digit. Now multiply.

monster_truck · 2026-06-10T00:17:47 1781050667

Hey man your computer has a calculator try using it next time

roland_nilsson · 2026-06-10T06:07:32 1781071652

Can't we use Claude to figure this out

gck1 · 2026-06-10T00:15:57 1781050557

Also, aren't all Claude users in their own "organizations" in Anthropic's own terms?

DonsDiscountGas · 2026-06-09T20:31:10 1781037070

I have no idea how you came to that conclusion. Unless your training pipeline involves actively querying one of Anthropic models, no they can't. And if it does you're distilling their model.

VBprogrammer · 2026-06-09T21:00:11 1781038811

The crocodile tears of companies who've hoovered up everything possible, regardless of permissions or legality, now crying that someone else is stealing their hard work is comical.

I don't even think they believe it themselves; in reality they are just trying to spread fear, uncertainty and doubt about potentially cheaper offerings.

JumpCrisscross · 2026-06-09T23:43:30 1781048610

> crocodile tears

Not what that means.

Crocodile tears "is a colloquial term used to describe a false, insincere display of emotion" [1]. Defending yourself against an attack vector you just exploited is between savvy and hypocritical.

[1] https://en.wikipedia.org/wiki/Crocodile_tears

digitaltrees · 2026-06-10T00:05:01 1781049901

I think his use of crocodile tears is appropriate: Anthropic is feigning concern for safety when really it is anticompetitive behavior, and I think that selfish entitlement is related to the original act of intellectual property theft, using the world's training data, most of which was not public domain, to distill the wisdom for their models. So why do they get to cry about people distilling the knowledge from their models that they themselves distilled from the world's knowledge?

mediaman · 2026-06-09T21:37:19 1781041039

That is not what their policy states. It specifically says they will sabotage even non-distillation attempts, such as distributed training pipeline design. And given that they are so far very nonperformant in classification accuracy, expect it to randomly include far more topics wide of the mark.

The fun part is that you will never know if your neural net classification project is getting silently sabotaged because their classifier doesn't work!

DonsDiscountGas · 2026-06-10T02:46:50 1781059610

You could try actually reading the code that it wrote

baq · 2026-06-10T06:59:07 1781074747

Good luck understanding it and finding malevolent inefficiencies if it’s already necessarily better at optimizing training pipelines than everyone except some Anthropic and OpenAI employees. Not a new thing either, see fast16.

gck1 · 2026-06-09T22:30:14 1781044214

Opus 4.8 (or a classifier in front of it) flagged my account and refused to comply when I told it to kill the process. Reasoning summary was complete bananas.

With this in mind, I don't want the model to be proactively instructed and encouraged to sabotage without telling me.

edot · 2026-06-09T23:50:15 1781049015

Same here when I said to “nuke” a process.

mips_avatar · 2026-06-09T21:29:50 1781040590

Like if you're using claude code on a feature tangential to your training pipeline it's allowed to nerf itself and damage your AI work.

davedx · 2026-06-10T07:47:11 1781077631

Read the examples Anthropic gave in the model card. They refer to extremely broad technology used across AI and ML.

matheusmoreira · 2026-06-09T18:15:08 1781028908

Looks like Anthropic's definition of safety includes their own safety from competition.

dragonwriter · 2026-06-09T19:12:46 1781032366

AI vendors’ idea of safety has always been safety for the interests of the AI vendor in question. This is not a new development, though this may help more people realize it.

axus · 2026-06-09T18:25:36 1781029536

AI-generated competition for thee, not for me

digitaltrees · 2026-06-10T00:05:54 1781049954

ding ding ding. This should be a new measure of anticompetitive analysis in anti trust law.

SAI_Peregrinus · 2026-06-09T18:45:17 1781030717

It's always been about the safety of their valuation.

wongarsu · 2026-06-09T19:14:12 1781032452

Only since Claude 3. So a bit over two years now

digitaltrees · 2026-06-09T23:58:07 1781049487

This feels less like "we are worried about security" and more like "we are in the lead and plan to keep it that way until it's too late." In some ways it's been helpful that OpenAI and Anthropic are tipping their hands about their anticompetitive instincts and willingness to steamroll their own clients, customers, and society. But it does feel like it's too late to stop this. The advantage people get by using these tools is too tempting to resist even if it is self-defeating. It feels like watching people light their own house on fire to stay warm in the deepest, darkest days of winter.

seemaze · 2026-06-09T19:06:20 1781031980

Ah, so this is why raw Mythos was too "dangerous" to release...

digitaltrees · 2026-06-10T00:09:00 1781050140

Or, they made Mythos seem mystically powerful in advance of the IPO, and are pumping the token use count. But it worked: there is a frenzy for this release in a way that is more intense than for any previous release.

Anthropic is doing a better job with their model menu. Most people I talk to know immediately that Opus > Sonnet > Haiku but can't tell you what the rank order of OpenAI models is, when to use them, etc.

rastrojero2000 · 2026-06-09T20:51:26 1781038286

So that's a possible reason why my specific Claude Opus instance seemed to be impossibly stupid and always degenerates into doing really dumb things to my code!

Cool, good to know I can trust Anthropic.

nullbio · 2026-06-10T05:57:42 1781071062

Just so everyone is aware. Anthropic has been sabotaging AI researchers and their codebases and shadow-nerfing accounts for several years at this point. This isn't new, but they hadn't disclosed it until now. Likely because it is getting to the point where it's too noticeable, or they're concerned about it leaking from employees.

dash2 · 2026-06-10T07:16:38 1781075798

What’s your evidence for this claim?

chrisoosthuizen · 2026-06-09T20:49:12 1781038152

This feels like the start of a much bigger plan for anthropic to close off the use cases of its models and eat any of its competitors.

digitaltrees · 2026-06-10T00:11:10 1781050270

I am building a coding harness, and I see evidence of them doing this with agentic harnesses and scaffolding. It feels clear to me that as they expand into the app layer, the window for using their API to build agentic apps is closing: they will steal your ideas, implement the product and then close the gate. I am creating my own inference stack because their incentive to block competitors is becoming super clear.

hackmack10 · 2026-06-10T02:22:45 1781058165

No offense, but the sad thing is, everyone and their mother is working on this same problem. I'm also building a harness. It's feeling like, there is no moat, there is no way to get ahead, they will steal your idea one way or another, if you ever make it public.

digitaltrees · 2026-06-10T04:46:23 1781066783

No offense taken. I am not building it for fame or profit.

I built it because I wanted cursor on my phone because I have two small kids and don’t want to be chained to my desk. And it’s awesome. It’s a full ide with agent chat, terminal and file system running in a remote Linux container. I can review diffs, fully manage git and preview/serve apps. And no one can ever take it away from me :)

I am watching the way things are progressing with the AI API vendors and it feels really clear that depending on them will soon be dangerous. So I am furiously building as much of my own infrastructure as I can to capture some autonomy with these capabilities.

So I think everyone should build a harness.

hackmack10 · 2026-06-10T13:02:05 1781096525

Exactly, that is my goal and thoughts as well. I wish you the best in these crazy times. Let's ride this wave.

digitaltrees · 2026-06-11T19:22:49 1781205769

I am happy to share thoughts and collaborate if you’re interested. What are you working on specifically? My project is www.propelcode.app

blackqueeriroh · 2026-06-10T04:00:53 1781064053

What, exactly, is new about any of this?

digitaltrees · 2026-06-10T04:50:49 1781067049

When they launched, their business model was to be a pure API for intelligence. Then everyone claimed they were just commodities with no moat, and they shifted hard to being the app layer. That was the transition.

They went from selling shovels to all gold prospectors to stealing the information about the location of the gold so they could dig it out first.

We are all stupid enough to keep buying shovels from them because we think their shovels dig gold better and faster.

johnnyApplePRNG · 2026-06-09T20:54:40 1781038480

> Instead, the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning (PEFT).

Am I to understand that this is essentially their form of social-platform ghosting instead of banning?

So they're not even going to tell you that the question you're asking is against their rules, they're just going to twist up your question and/or the answer somehow such that you waste your time essentially?

It seems like I ran into this EXACT same functionality from Claude many months ago when I was trying to ask it to research on the web and help me set up the ideal llama.cpp config for local LLM inference.

Funny how lost it got through that relatively simple install when we had all of the documentation in the world (and a human dev with 20+ years experience guiding it along) to go by... and simultaneously it's debugging and building high level cryptography code in rust in the other terminal tab.

This is infuriating to learn.

digitaltrees · 2026-06-10T00:15:25 1781050525

I have encountered this too. I am building a coding harness for www.propelcode.app and it was working really well until the Claude Code leak, and then all of a sudden it seemed almost intentionally stupid or outright manipulative in guiding me down wrong paths. At this point I am using other models for anything related to the tool use design and implementation, and I bought three Mac Studios with 512 GB of RAM to run large open source models.

This experience has made me feel like we have to create a community that moves AI from the mainframe era to the PC era quickly, or we will end up serfs.

ls612 · 2026-06-09T21:58:34 1781042314

I had Claude walk me through getting local LLM models running on my Mac a month or two ago and so far as I can tell it was intentionally helpful. I even stated the reason was to have an uncensored model for myself and it had no objection. Long story short LM Studio running a Heretic Gemma 4 is doing just fine on my system now.

vorticalbox · 2026-06-09T22:15:40 1781043340

I run a few local models for different things. I find Gemma 4 great for writing but qwen better for coding.

I tried the same prompt on gemma4 and qwen 3.5 and Gemma consistently failed to call the multi line edit tool.

brewtide · 2026-06-09T22:30:13 1781044213

I've had the same bad luck with tool-calling on Gemma4. Looking around the web, we are not alone. For other tasks, it's seemingly quite quick and decent.

But it gets stuck in tool call loops, it seems like.

ls612 · 2026-06-09T22:29:54 1781044194

Oh to be clear I don't think Gemma 4 is suitable for real work. It runs at 10 tps and is somewhere between 4o and o1 in quality according to my subjective judgement. But Claude was happy to correctly tell me how to get it running and how to solve the pitfalls I encountered in that process.

Jabrov · 2026-06-09T18:01:12 1781028072

A million AI researcher voices at big tech companies suddenly cried out in terror and were suddenly silenced

notrealyme123 · 2026-06-10T13:31:06 1781098266

I am an AI researcher at a university. I tried Fable for my current project, but I feel it misunderstands me a bit too often. Now I don't know if I am using it wrong, or if Anthropic is trying to slow my research. That model is a big no-no.

hashmap · 2026-06-09T19:05:59 1781031959

3 months before asking for what to eat before a linear algebra exam trips the machine learning topic ban is my guess. I got flagged immediately asking why my JEPA thing breaks weird.

2001zhaozhao · 2026-06-09T18:24:12 1781029452

How do they detect whether an experiment being done on a smaller model is used to improve a competing frontier model, or just an innocuous hobbyist LLM experiment?

vitally3643 · 2026-06-09T18:53:58 1781031238

Given how well the cybersecurity safeguards work, they probably don't.

iririririr · 2026-06-09T18:56:48 1781031408

inferring from the surroundings, like everything else. they will probably look at which company your email belongs to, and whether you wrote "better than claude" in the readme.md

this is LLM, it's not like a science or something.

maxall4 · 2026-06-09T22:11:51 1781043111

These safeguards are ridiculously sensitive: a prompt as simple as “Why is an infinitely slow process reversible?” gets flagged as a ToS violation.

largbae · 2026-06-09T22:18:24 1781043504

Pull that ladder up behind ya, will ya son?

dboreham · 2026-06-10T01:51:13 1781056273

Makes it even more odd that we haven't seen alien spaceships.

usef- · 2026-06-09T23:19:19 1781047159

What ladder did Anthropic use?

hnav · 2026-06-10T00:04:40 1781049880

the entire internet, books, news, regardless of license.

usef- · 2026-06-10T00:20:28 1781050828

The companies using distillation are still training on all that data too, aren't they?

N_Lens · 2026-06-10T10:20:36 1781086836

And Anthropic is crying about distillation.

digitaltrees · 2026-06-10T00:16:51 1781050611

All of the api calls developers used to build agentic design patterns.

ayewo · 2026-06-11T14:15:41 1781187341

For anyone that is confused like I was, the quoted text I'm replying to was copied from page 13 of the system card [1] and not the model announcement page, which this HN discussion is linked to.

1: https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c3...

rfgplk · 2026-06-09T18:11:25 1781028685

Meaningless and easily bypassable. Will actually try coding up a tensor library with it, see if it sabotages anything.

mips_avatar · 2026-06-09T18:24:09 1781029449

They said in their terms and conditions they will silently sabotage you if you do this.

qiine · 2026-06-09T19:29:59 1781033399

easily ?

novaomnidev · 2026-06-09T23:15:36 1781046936

So Fable will intentionally lie to you and give you incorrect outputs, if it doesn’t like what you’re asking. Got it.

novaomnidev · 2026-06-09T23:19:40 1781047180

These things are like encyclopedias or dictionaries that can speak in first person… Imagine if your encyclopedia tried to hide entries from you, just absurd!

theLiminator · 2026-06-09T18:19:09 1781029149

This is pretty bullshit, now you have no idea if your output is getting silently nerfed.

thepasch · 2026-06-09T19:03:27 1781031807

Yeesh. Anthropic's paranoia about China is starting to get pathological.

rspeele · 2026-06-09T18:23:41 1781029421

It's afraid!

thothless · 2026-06-09T21:22:37 1781040157

the gall of these companies to regulate your usage of stolen knowledge is absolutely hilarious.

and they want me to pay $100+ a month to be their training?

i hope we can find morality again.

gck1 · 2026-06-09T21:41:30 1781041290

But Chinese models will poison your output if you ask them about Tiananmen Square! That's not good, so poisoning everyone's output without telling them is the only way to prevent that.

Come on guys, why can't everyone just be there for the good guy?

Sabinus · 2026-06-10T01:59:09 1781056749

You're equating a government suppressing information for social cohesion with a private company protecting their IP.

gck1 · 2026-06-10T04:30:52 1781065852

They're not merely protecting their weights.

First, they want government to get involved and regulate frontier model development - even stop it completely.

Second, poisoning output of a model configured on the computers of millions of users goes way beyond protecting IP. That's malware.

kiv_apple · 2026-06-14T17:37:53 1781458673

Protecting IP means that the model would refuse to do certain things. An example from the pre-AI era: a program asks you for a license key and refuses to start if it is wrong. But if the program deletes a random system file when you enter an invalid license key (no matter whether it is a brute-force attempt or a typing error), that is a different thing that goes well beyond IP protection.

827a · 2026-06-09T22:36:20 1781044580

This is deeply vile behavior; not remotely the actions of good people.