Hacker Timesnew | past | comments | ask | show | jobs | submit | afavour's commentslogin

I hope that’s exaggeration because being unable to operate without it means you’re going to do a terrible job of reviewing the code it’s producing.

Since the November/December Opus and Claude Code, I found I don't need to read the code any more. Architecture overview sure, and testing yes, but not reading the code directly any more.

Me (and my friends similarly) inspect code indirectly now - telling agents to write reports about certain aspects of the code and architecture etc.


I do regularly read the code that Claude outputs. And about 25% of the time the tests it writes will reimplement the code under test in the test.

Another 25% of the time the tests are wrong in some other way. Usually mocking something in a way that doesn't match reality.

And maybe 5% of the time Claude does some testing that requires a database, it will find some other database lying around and try to use that instead of what it's supposed to be doing.

And even if Claude writes a correct test, it will general have it skip the test if a dependency isn't there--no matter how fervently I tell it not to.

If you're not looking the code at all, you're building a house of cards. If you not reading the tests you're not even building you're just covering the floor in a big sloppy pile of runny shit.


> I do regularly read the code that Claude outputs

You probably could have s/Claude/Human/ in your rant and been just as accurate. I don't know how many times I've flagged these issues in code reviews. And that's only assuming the human even bothered to write tests...

What I find is that when I ask AI to write tests it writes too many, and I agree with you that a lot of them are useless. But then I just tell it that, and it agrees with me and cleans it up. Much faster feedback loop and much better final result.

I feel like people that look at a poor result and stop there and conclude it's useless have made up their mind and don't want to see the better results that are right in front of them if they just spend an extra 5 seconds trying.


How do you know whether the tests it’s spits out are bad if you don’t read the tests.

We’re not dealing AGI here. Tests aren’t strictly necessary for humans. They are for AI. AI requires guardrails to keep from spinning out. That’s essentially the entire premise of the agentic workflow.


> How do you know whether the tests it’s spits out are bad if you don’t read the tests.

I do read the tests (quickly, I admit) and so does OP:

Architecture overview sure, and testing yes, but not reading the code directly any more.

Reading that again I may have misunderstood what they meant by "testing yes", though.


I’m pretty sure they just meant they do testing not that they read the tests and that’s what everyone else who responded interpreted that as well.

You can get Claude to write good tests but based on what I’m seeing at work that’s not what’s happening. They always look plausible even when they’re wrong, so people either don’t read them, skim them very quickly, or read the first few assume the rest work and commit.

I think Claude is great for testing because setting test data and infrastructure is such a boring slog. But it almost always takes a lot of back and forth and careful handholding to get it right.


I read the tests, it also is really really good to have Claude verify that removing the changes in question break the tests. This brings the quality way way up for me.

In comparison, I see this issues in fewer than 1% of the changes I review. Because when it happens you can effectively teach people to stop doing it.

I'd understand not reading the code of the system under test, but you don't even read the tests? I'd do that if my architecture and design were very precise, but at this point I'd have spent too much time designing rather than implementing (and possibly uncovering unknown unknowns in the process).

> Me (and my friends similarly) inspect code indirectly now - telling agents to write reports about certain aspects of the code and architecture etc.

Doesn't this take longer than reading the code?

I can see how some of this is part of the future (I remember this article talking about python modules having a big docstring at the top fully describing the public functions, and the author describing how they just update this doc, then regenerate the code fully, never reading it, and I find this quite convincing), but in the end I just want the most concise language for what I'm trying to express. If I need an edge case covered, I'd rather have a very simple test making that explicit than more verbose forms. Until we have formal specifications everywhere I guess.

But maybe I'm just not picturing what you mean exactly by "reports".


I've seen the code these models produce without a human programmer going over the results with care. It's still slop. Better slop than in the past, but slop none the less. If you aren't at minimum reading the code yourself and you're shipping a significant amount of it, you're either effectively the first person to figure out the magic prompt to get the models to produce better code, or you're shipping slop. Personally, I wouldn't bet on the former.

Yeah, these models have definitely become more useful in the last months, but statements like "I don't need to read the code any more" still say more about the person writing that than about agents.

If I were you I’d very worried about getting laid off. That kind of work isn’t going to keep earning a software engineer salary.

you're a slop maker

It's not. Most developers are pretty bad at their job, and already can't review code very effectively.

They just create even more slop currently, which will be the case until someone realizes they aren't needed to produce slop at all.


Eh, I don't think Swift would ever have dethroned Python. What pain point would it practically solve? I don't use Python often but I don't hear folks complaining about it much.

I do, though, think Swift had/has(?) a chance to dethrone Rust in the non-garbage collected space. Rust is incredibly powerful but sometimes you don't really need that complexity, you just need something that can compile cross-platform and maintain great performance. Before now I've written Rust projects that heavily use Rc<> just so I don't have to spend forever thinking about lifetimes, when I do that I think "I wish I could just use Swift for this" sometimes.

You're right, though, that Swift remains Apple's language and they don't have a lot of interest in non-Apple uses of it (e.g. Swift SDK for Android was only released late last year). They're much happier to bend the language in weird ways to create things like SwiftUI.


> just need something that can compile cross-platform and maintain great performance.

I think Go has already taken that part of the cake.


Go is garbage collected, though. Rust and Swift still occupy a niche Go doesn't.

ARC is a form of garbage collection. Swift does not fare better than Go usually.

This isn't Israel the state, it's a private company that's based in Israel.

If you think for a second that the Israeli state allows this company to sell its services to anyone who is opposed to their interests, you really don't understand how defense companies and states work.

Sure, a private company that just happens to have had multiple former Mossad directors on their board.

Coincidences are funny sometimes :-)


That's like saying that Raytheon doesn't advance US interests in the world

gee I wonder what does the private company gain from defaming another country's "pro-Palestine" party.

...money?

My point is a simple one: the company was hired by someone. Was it the opposition party? To say this is entirely Israel's doing implies the Slovenian party that benefits just happens to have gotten lucky. The reality is likely considerably murkier than that.


Any company that sells this type of services exists as an extension of its parent state. Any contract it offers, especially to a political entity in another state, will be scrutinized by state authorities and allowed by them or not. Sometimes, those contracts will be forced on the company based on state-level negotiations.

Noone is saying that the party that contracted this company (if indeed it was a Slovenian party and not the Israeli state itself) for this service doesn't carry blame. But both the company itself and the state of Israel carry just as much blame for offering, permitting, and carrying out such services.

By your logic, if someone were to found a legal private paid assassin company in France, and then the opposition party in Germany hired this company to assassinate the German chancellor, you'd say that it's unfair for Germany to blame France for this.


Would Black Cube have accepted a client that was very pro-Palestinian and was trying to lure voters away from the pro-Israel party?

And to say it's not Israel's doing implies that Israel just happens to have gotten lucky.

To say that this is entirely the Slovenian party's fault implies that Israel cannot govern their own state. Both are complicit.

> implies that Israel cannot govern their own state

Or more simply that what the Israeli company did, is not illegal in Israel.


I can guarantee you that it is very illegal if you target the wrong government.

Selective prosecution is a recurring issue under Israeli jurisprudence.


Okay, sure. Is Slovenia the wrong government?

NED is not a CIA front sweatie, it's just a private institute

>NED is not a CIA front sweatie

Curious: can you show the research steps you took to reach this conclusion? Really curious how we can all easily determine which companies are and aren't CIA fronts!


Oh thats adorable

Meta spent $2bn lobbying for this ID verification stuff:

https://hackertimes.com/item?id=47361235


No, AI is truly useful in software engineering. I was a skeptic until I started using it. No, it isn’t going to solve every problem out there, but it’s a force multiplier.

You pay understanding for speed. How much this trade is acceptable is up to you and the task you have in front of you. I cannot recommend it as a general solution.

This field doesn’t do well on long-term thinking. Even if all this turns out to be a net loss, it will be reinterpreted as a win and just an opportunity for even more of the same solution. There are numerous examples of this, e.g. the OOP craze. Tech is a stock market of ideas and HN is a trading floor. The “line goes up” logic applies - not merit.

Describing OOP as a "craze" is incredibly out of touch. It's been a thing for, what, three decades?

You may not recall the crazy era of OOP where people would go bonkers with massive object trees trying to objectify everything and using operator overloading to do (dumb) things like adding a control to a window with +=.

OOP is great. "OOP is the one perfect paradigm for all coding" was the craze.

I'm sure I'm not the first person you've seen hinting at OOP (and all that came with it) having been hyped up beyond its merits.

There certainly was an OOP craze, that's not out of touch to talk about.

AbstractBeanFactoryFactoryInterfaceBeanContextFactoryBeanBean.java

That’s just falls. I’ve spent disproportionate amount on “understanding” awful tooling like Gradle and npm. There’s no value in it if you’re not an infra engineer. It would take me a couple of days to manually restructure my hobby app, now I can just say “extract this into another workspace/subproject” and be done with it in minutes. And that’s just one example.

I agree with this sentiment. I just also see AI-driven development in core business logic, where truly understanding what is going on is essential and yet completely disregarded.

If I never have to debug a gradle file ever again, it's all worth it.

You might say the same about garbage collected programming languages. It’s an acceptable tradeoff in a lot of scenarios. Same goes for AI.

According to WSJ they’re getting out of the video game entirely:

https://www.wsj.com/tech/ai/openai-set-to-discontinue-sora-v...


Can we trust the FAA's conclusion?

Its previous head had a term that didn't expire until 2028 but he resigned after pressure from Elon Musk (who didn't like that he got fined), now a Trump-friendly head has been installed. What, realistically, would be the consequences if he lied? Likely none. Government officials lying on record is an every day occurrence these days.


True! Assumptions and speculation are always better.

I’m glad we’ve made our conclusions up front before the report has even come out.

That saves me a lot of reading!


Come on, this is silly. The fact that air traffic controllers are overworked is neither an assumption nor speculation. It is very widely documented.

The only thing we know so far is from two minutes of ATC audio.

That’s literally it. Anything else is speculation and extrapolation.

But don’t let that stop you if you already know what caused the tragedy.


It does not at all mean that this controller was overworked when this crash happened; that would be failed reasoning and misuse of evidence. It just raises the question, which should be looked at.

It's scary that so many don't seem to know the difference. This is how misinformation starts and spreads.


You're 100% right, a "Trump-friendly" administrator has been "installed" so we can't trust the FAA's conclusions. The last guy quit so this guy is definitely going to lie.

What you just described is a long term staffing shortage.

I think OP's point is that sending your browsing data to a server, be it American or Chinese, isn't "no bs".

I see this recurrent feeling on HN that because the US does bad things we shouldn't care about other countries doing the same. I think we should care about all of them!


The feeling is that "no one" cares about it being sent to American servers, why should they suddenly care about it going to Chinese servers just because they're Chinese.

Not only that, USA is far more likely to send someone to kill you than china is. So between the 2 I'll take china (I'd prefer my data to not be sent to any foreign power).

Well perhaps today is a good day to die.

> That descriptions already fits the payola model.

The old payola model. This new model encompasses the old one and adds a neat layer of outright politician bribery on top.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: