You can tell this claim is false, because that level of productivity increase would be glaringly obvious to an outside observer; it wouldn't need to be self-reported.
A YC summer batch is 84 days culminating in Demo Day. So a 100x speed improvement would be like a team spending less than 1 day of coding and ending up with something that's on par with Demo Day in terms of functionality. Maybe the design would be wrong, but that wrong design would be just as fully-featured as a Demo Day app.
So if 100x were true, the partners in that video would be talking about how the new batch dynamic is "They get breakfast with a customer, learn something new, have an epiphany, and then later the same day they have their entire app rewritten based on what they learned, and that scratch-rewrite is already at a Demo Day level of functionality." The partners aren't talking about that dynamic because it's not happening. So clearly the self-reported 100x is inaccurate.
Even 10x would result in partners saying "Whoa, in this batch people have a Demo Day-quality app in production by the end of week 1 instead of week 12." The partners have a huge sample size on how much teams get done in what time period, so it would be glaringly obvious to them if this batch were shipping 10x as fast as previous batches.
That external observation would be the headline if it were what the partners were actually seeing. Since that's not the headline, it's clearly not what they're seeing, so 10x can't be the number either.
To be fair, if your benchmark is against Demo Day, at some point Amdahl's law kicks in regardless of how many multiples you have on engineering. Not sure if I believe the multiple of 10x or 100x anyway, but “number of customer feedback loops” is a better metric than “can complete one (1) Demo Day in X time”. My (non-YC) impression is that people are hitting more loops.
Also multiples “up” versus “down” are not symmetric. Airplanes are around 10x faster than cars, but that doesn’t mean I’ll be getting to work in 60 seconds.
If you can't do something extremely impressive with the equivalent of hundreds of full-time engineers, then there is something wrong with you as a founder.
Note that real engineers help you come up with new features and test your product and all that, not just add code; adding code was never the bottleneck on just about any problem ever.
So Amdahl's law applies in terms of the time to add code, not in terms of engineers. They don't do the work of 100 engineers; at best they take 100x less time to add lines of code when they know what they want to make. But that isn't particularly game-changing, as adding code is not the hard or even the time-consuming part.
I see your problem (at least according to Yegge's theory): the batch applicants are just too senior. If they were more junior, only then would they benefit from the 100x multiplier. The olds are just too far removed from the enlightened way, you see.
At this point in time, we're following, step for step, the path corporate took during the outsourcing craze. It had all the hype: hands-off, cheaper for the same work, faster to market, every other argument you've all certainly heard. Then reality hit. The whole discussion around LLM coding agents feels indistinguishable.
The reality in which people like me get to do work for US/UK companies for 4x the salary of equivalent work locally, where some of this work is actually cleaning up after folks elsewhere who, being the cheapest labor available, still got 4x their local salary for it, and the total is still 4x cheaper than what the US/UK company would pay locally? :).
(I'm only half-joking; in a previous life, I worked on a project with this exact development history.)
The outsourcing market is alive and kicking, and offers a whole spectrum of quality and price. The further east of the US you are, the easier it is to see :).
> The whole discussion around LLM coding agents feels indistinguishable.
Nah, the difference here is that in the outsourcing-to-LLMs scenario, there are no people who do the work and benefit from a favorable salary/cost-of-living ratio.
The company I worked for 15 years ago outsourced QA to save money. It was sold as paying $15 for a 4080 video card. When they opened the package, instead of a 4080, it was a brick. The salt in the wound was when they realized they overpaid 3x for a $5 brick. If it wasn't for vendor lock-in (they being the vendor), they would be dead.
The local QA person could run through 100 or so scenarios a day. The offshore people could do 2 a day. They never improved. The offshore people who are tops aren't cheap.
There's an art to outsourcing, and - even worse than with LLMs - it's not something you can ever just do and forget, because without active management, you'll eventually end up wasting money and time while getting nothing in return.
QA is a whole other story, too. Outsourcing QA is stupid, but even more stupid and short-sighted is not having QA in the first place, and that unfortunately is becoming a norm.
There's a lot of false economy going on with jobs, too. Getting rid of QA may save you salaries, but the work doesn't disappear - it just gets dumped on everyone else, and now you're distracting much more expensive engineers (software or otherwise), who do a much worse job at it (not being dedicated specialists) and cost more. On net, I doubt it ever saves companies any money, but the positives are easy to count, while the negatives are diffuse and hard to track beyond an overall feeling that "somehow, everything takes longer than it should, and comes out worse than it should, who knows why?".
>Outsourcing QA is stupid, but even more stupid and short-sighted is not having QA in the first place, and that unfortunately is becoming a norm.
Yes, they were moving in that direction. They centralized the QA team over a suite of probably 15 products, which means no one has any expertise. The QA VP would get mad when QA found bugs because "the devs were supposed to find all the bugs and QA was just supposed to certify the release." The number of people in high positions who don't understand how software dev works is mind-boggling.
They ended up firing all the devs except me and this other guy who gave zero shits and wanted to be a manager. "We" maintained 3 products. Two were pretty standard web apps but one was a full blown decision support system (rules engine) that only I knew. I quit after a few months of killing myself. They paid me a whole lot of money a few years later when they were trying to add features to get a very lucrative government contract.
Speaking from experience: I started my career 15 years ago cleaning up codebases developed by cheap outsourced developers. They were an absolute mess. Today, I work with many very talented developers overseas - people who develop code at my level of quality and above (and I am a stickler for high-quality, maintainable code).
The thing is how you do your research, what you expect to get out of it, and what you're willing to pay. There was absolutely a gold rush on bottom-dollar development by cheap overseas developers, driven by management who had no idea how software really worked and thought they could build a business on cheap offshore development. These were software farms staffed by unappreciated, undertrained people from diploma mills. I saw truly shocking things. I saw code written entirely with gotos instead of loops, because the developer had never learned how to write a while loop. The companies spent way more in the long run trying to iterate, asking for changes, and ultimately having to hire higher-quality talent for much more money.
I agree with the GP. The LLMs will get better, maybe people will learn that you need an LLM with a skilled developer, or maybe the agents will get good enough to fully drive themselves properly, or maybe just good enough that a non-technical pilot can get good work out of them. Right now, "vibe coding" is largely non-technical people making messy, unmaintainable, insecure code. Some of these are programmers and non-programmers just playing around, but some people are trying to build money-making businesses off this, and it does feel like a very similar situation.
And who cares if it is true? So far programming is one of the very few professions where a person can set themselves up for life in a relatively short period of time. When / if it is gone there will be something else. I have a few friends who switched to being handymen. They are doing great from what I see.
> So far programming is one of the very few professions where a person can set themselves up for life in a relatively short period of time.
If you specifically optimize for it. Most people don't - they specialize and expect to be in their line of work for decades.
> When / if it is gone there will be something else.
There will be something else for young people who are just starting. If you're 20 years into a career and then your line of work disappears overnight, of course you can switch to something else - and enjoy your entry-level salary while competing for jobs with people who are 20+ years younger than you and have no meaningful costs or obligations yet.
Also, it depends on what you're optimizing for. If, like many, you're looking for a job that makes sense (e.g. working to develop technologies or research that you think can change the world for the better), you're probably never going to strike that particular gold.
My father never dissuaded me from the computer science degree I went for (starting in 2007), but later on in life, he told me that, at the time, he was worried about my choice because outsourcing was all the rage and everyone was saying there'd be no programming work in the States - that it was all going to be done in India.
I think that's a great point. There is a place for outsourcing. Some projects and organizations are well suited to it. Some end up using outsourcing only to supplement parts of the work. Some can't do it at all for quality or compliance reasons.
I think we will see something similar with LLMs. There will be areas where it will deliver cheaper and faster. There will be areas where it will deliver nothing but disaster. It'll change the industry but not eat it alive. The folks talking about fully autonomous coding on the near horizon are dreaming.
I’ve seen a number of outsourcing project failures. The two things they all had in common were that the organizations in question were terrible at managing projects but they blamed the developers for management’s inability to plan or make decisions, and they were trying for unrealistic savings – it wasn’t enough to save 30-50% on salary, they wanted 90% even if that was below the market rate for those skills even in India.
The first one is definitely happening with the LLM bubble, where companies really want to pretend that the hard part of the job isn't understanding what to build and how to build it maintainably.
The second one is going to be more interesting: I expect LLMs to put downward pressure on wages in a lot of places but also for smarter companies to realize that nothing short of true AGI is going to replace the need for people who can actually understand what the customer needs. If I’m right, this will swing the pendulum back towards specialists again – the seagull guys who come in, declare that their favorite framework will solve everything, and leave are more vulnerable to being replaced by an LLM than someone who knows how to code but is also bringing actual business-relevant experience and judgement which an LLM can’t have.
> the seagull guys who come in, declare that their favorite framework will solve everything, and leave are more vulnerable to being replaced by an LLM than someone who knows how to code but is also bringing actual business-relevant experience and judgement which an LLM can’t have.
But that's just a continuous variant of the discrete-sounding claim that programming will get eaten by AI soon. After all, the "actual business-relevant experience and judgement which an LLM can’t have" is mostly not related to programming - and the better LLMs get at coding, the less value the programming parts of the skillset will have; take it to the limit, and it's just saying the managers and salespeople will stay, while software developers will be gone.
I think that’s a question of how you define jobs. For example, I’ve worked with very few managers who could document their business processes in sufficient detail to build an app. Now, is the person who does that a business analyst, architect, senior developer, etc.? Who sits down with the users, gets feedback, and understands the needs of multiple parties well enough to tell which points are traps, which should be developed in a different direction, etc.?
Basically, I’m saying people should stop expecting to get six figures for being able to run create-react-app and deploy a container. The analytical and social parts of the job are where I predict LLMs to make fewer inroads because they require non-generic understanding.
> I’m saying people should stop expecting to get six figures for being able to run create-react-app and deploy a container. The analytical and social parts of the job are where I predict LLMs to make fewer inroads because they require non-generic understanding.
That's a fair take. I do wonder though, how much will those "analytical and social parts of the job" be paying - I imagine you might no longer get six figures for that either, because high tech salaries are fueled by absurd growth of the industry, which manifests in a large part in software that's basically just {framework du jour + basic-level CRUD, that hasn't changed much in 30 years + branding}, and that kind of software I expect to get eaten by LLMs entirely.
Even with cookie-cutter app coders out of the way, the remaining software engineers might see the number of jobs implode, crashing salaries for some time, until (maybe) the growth restarts around new kinds of software, kinds that'll be in high demand and not something that can be made by a few "analytical/social" people herding LLMs. I'd normally say this won't happen, but rather that the software economy will slow down, stabilize, and get boring like everything else - but then, so much of software is driven purely by advertising, and advertising is a negative sum game, so surely they'll invent more bullshit jobs for us.
Yeah, I’m really not sure either with the general backdrop of looming American disinvestment and the entire world reconsidering reliance on American companies. I don’t think reversing the trend of consolidation is going to be enough to balance it out.
The market can stay irrational for longer than you can stay solvent.
This is not to say outsourcing can't ever work, but the situations where it does work are much rarer than what every outsourcing vendor would like you to believe.
I bet a previous client's attempt at outsourcing (well into the 6 figures now) is included in that number... yet the expensive onshore devs outsourcing was supposed to replace are still there 2 years later except now they have to also babysit the offshore idiots and fix their messes.
But hey, the vendor got paid, the idiot executive who fell for their pitch wouldn't want to lose face, so it all gets handwaved away as a continuing success and more money gets thrown into the dumpster fire.
FWIW, if you believe that long-term, the market is a good optimizing engine (I think it's a very reasonable and well-proven belief), then this is just a matter of time before things sort themselves out.
The need for companies to get more value out of less spend won't disappear, nor will the comparative advantage of companies in lower CoL areas of the globe. That's two fundamental incentives on both sides that are aligned, driving the market to find the lowest-energy path from here to there. It'll get there, even if it ends up looking strange (like, idk., maybe cutting out management intermediaries but involving a middleman acting as insurance).
That is, if LLMs won't leapfrog it all and end software dev outsourcing before it started to work well.
Even at its current size, the software outsourcing business is multiple orders of magnitude smaller than the software business itself. While there's money to be made, clearly the hype didn't live up to even remotely what it promised to be.
My first job in the industry was cleaning up a large codebase created overseas by Indian developers. Maybe the new kids today will break into the industry by cleaning up messes that have been generated by AI.
Nah, the magic/promise of AI is that it has positive chance of getting there, so you can keep feeding it dollars until it eventually gets you the thing you want, and that it's still cheaper than having people do it the old-school way.
We're not there yet, but I don't see anything preventing us from getting there in ~5 years.
(Remember: 5 years ago, SOTA in this space was letting a genetic algorithm poke at an AST and hopefully maybe arrive at a trivial program solving a small algorithmic problem.)
I wouldn't bet against it. Self-driving tech benefits directly from the outputs of sudden and continued growth of R&D in AI, fueled by hype-driven investments.
Maybe just like with self-driving cars, trying to hook mechanical precision up to messy human society is going to be fraught and lead to blowback. Meaning that once planes start falling out of the sky from vibe coding Boeing contractors there will be PR and regulatory panics that soften the hype somewhat. Or, the next time Equifax gets mass leaked and they blame their security setup on generative AI. You can’t vibe code your way out of human stupidity and the consequences of production environments.
Maybe, but the same thing could be said about JavaScript and webshit and the world is still there. We can just continue to not use YOLO technologies and culture in safety-critical applications.
> It had all the hype, hands off, cheaper for the same work, faster to market, every other argument you've all certainly heard.
Like some of the other responses, I'm baffled by your comment. Have you not seen what's happened in the past 5 years or so?
Yes, there was an outsourcing craze to India after the .com bubble burst in the early 00s, and it largely failed - the timezone gap, cultural differences, and lack of good infrastructure support sank it.
The past 2 companies I've worked for offshored the majority of their software engineering work, and there was no quality difference compared to American devs. The offshore locations were Latin America and Europe, so plenty of timezone overlap. The companies are fully remote, so what difference does it make if the dev is in your same city or a thousand miles away?
I think offshoring has absolutely put downward pressure on US dev salaries in the past couple years.
I think you're talking about a different phenomenon, having remote mixed teams is IMHO different from offshoring.
There's at least the crucial difference that you had devs in both western countries and the traditional "third world", whereas the 90s view of offshoring was throwing whole processes abroad and only keeping "heads" in-house while the remote teams/companies dealt with all the execution, making it inherently difficult to deal with production monitoring.
PS: to your point, offshoring to India has become more common, but Indian companies are also not that cheap, so we're past the initial framework. Perhaps the same way outsourcing production to China used to be about sweatshops, whereas it can now be about unrivaled expertise at a cost.
It's about those agents being (mis)used in the very specific blind faith approach of "vibe coding", not least due to the hype merchants and grifters picking up the phrase and running with it shorn of the original cautionary notes about it being useful for bringing a bit of fun back into non-serious coding.
Criticizing the idea (and conflating it with the wider field of LLM coding agents) without understanding that original context is not really any better.
Vibe-coding ≠ LLM coding agents, which - when used properly - are brilliant for use in serious code and are here to stay.
This is the go community saying a computer will never best human go players.
We already have examples of a model finding more performant sorts [0]. Given the right incentives and time, and the right system for optimizing (LLMs trained on “average code” probably aren’t it), the computer will best us at creating things for the computer.
Is “vibe coding” real today? Not in my experience, with even Claude code. My hand has to be firmly on the tiller, using my experience and skill to correct its mistakes and guide it. But I can see the current trajectory of improvement, and I’m sure it’ll get there.
> This is the go community saying a computer will never best human go players.
I don’t see this. Board games are fundamentally different from software development problems. The latter have imperfect information, unknown requirements and constraints, fuzzy success criteria, and more.
> So you’re saying you can automate the coding part by… writing the code (in an inferior language)
Not OP, but yes. That means you don't need a dev, just someone who knows how to spec correctly in English/Jira, right? Is that likely a dev who moved on to PM? Very likely in 2025.
For better or worse, the future I imagine is a Jira plugin or MCP server that can read a project, the LLM IDE client then asks questions to fill in the blanks... and out comes the app.
For many years this will require a human in the loop. But will that human need to know the intricacies of the latest frontend framework? Less and less as time goes on.
Trying to provide a programming tool for non-programmers has been a wet dream of some for a while. See SQL, 4GL, DRAKON (lol), VBA, no-code platforms (btw, what happened with that hype? How are we not all replaced already?), and the most recent, but certainly not the last: LLMs. While sometimes yielding something useful, past attempts have consistently and spectacularly failed at this objective - fundamentally because non-programmers don't want to deal with this; otherwise they would have learned some damn proper PL a long time ago, it's not THAT hard. And LLMs add quite some special spice to that. How is a vibe coder going to fix LLM output that fails? Without understanding the code, that is.
Regarding "the latest frontend framework" the whole situation is a bit mysterious to me, because somehow everyone keep spending millions of man-hours on yet another react contraption where a static HTML would be enough. From the user perspective all this stuff brings no value, 80% of frontend stuff could have been automated long time ago or just not done at all, yet we keep reinventing the wheel. I don't see how LLMs can change the situation because there was clearly no demand for improved productivity there before.
Vibe coding is 100% real. Or maybe we should call it code vibing when there is no coding ability. But I just taught 18 professionals with no coding ability to build functional software. Their minds were blown.
Not according to your very specific stakeholder demands / environment/naming/data tables/data protection requirements, otherwise you would just use a library.
Those might seem like trivial differences, but plenty of things go wrong there - enough that you can't just use a library instead of a programmer, and enough such errors that vibe coding will also cause issues.
I think the point being made is that even though the specific set of requirements may be unique on a per-stakeholder basis, all the components already exist and have been combined in many ways. So it really boils down to prompting in such a way that the right set of components are brought together in the right way. That's where the skill now lies.
> So it really boils down to prompting in such a way that the right set of components are brought together in the right way. That's where the skill now lies.
And how is this different from just calling the libraries in the right way to make it adhere to stakeholder requirements?
The statement isn't "it's impossible to get an AI to print the code for a right program", but "the work and skills you need to get an AI to print the right program are as much or more than to do it yourself." That seems to be true for all but trivial programs. Here trivial means you can download a git repo and change some variables to get that result.
> And how is this different from just calling the libraries in the right way to make it adhere to stakeholder requirements?
In that you need humans that can understand stakeholder requirements, constraints of the ___domain, and limits of existing software, so that they can write the necessary glue to make everything work.
Thing is, LLMs know more about every ___domain than any non-expert (and for most software, "___domain experts" are just non-___domain-expert programmers who self-learn enough of it to make the project work), and they can understand what stakeholders say better than other humans can, at least superficially. I expect it won't take long until LLMs can do all of this better than an average professional.
(Yes, it may take a long time before an LLM can replace a senior Googler. But that's not the point. It's enough for an LLM to replace an average code monkey churning out mobile apps for brands, to have a huge chunk of the industry disappear overnight.)
Paul Graham argued that if you act like the average startup, you'll get the same results as the average startup. And the average startup fails.
It follows that if you want to have success, you need to do something new which hasn't been done before.
> LLMs know more about every ___domain than any non-expert
As soon as you're creating something new, or working in a niche field, LLMs struggle.
So do junior developers. But they learn and get better with time. While onboarding a junior developer requires more effort than doing the work yourself, it's worth it in the long run.
IMHO, that's the largest issue LLMs have today. They can't really adapt and learn "in the field". We build a lot of workarounds with memory to circumvent that, but that too only works until the memory exceeds the context.
I've tried using ChatGPT, Copilot, custom GPT-4o models, and Cursor. The task they did best at was generating a simple landing page (though they struggled with Tailwind 4; Cursor spent almost 8 hours debugging that issue).
With tasks that require more niche ___domain knowledge, it went much worse. Cursor finished some of the tasks I gave it, but it took over 10x more time than a junior developer would've spent, and I had to constantly babysit it the entire time, providing context, prompting, writing cursor rules, prompting again, etc. The others failed entirely.
If I start working on an unfamiliar task, I read all the docs, write some notes for myself, maybe build some sample projects to test my understanding of the edge cases. Similarly, if faced with a new task, I build some small prototypes before committing to a strategy for the actual task.
Maybe ML agents would fare better with that approach, instead of today's approach of just creating a mess in the codebase like an intern.
It's a switch of focus. Instead of being occupied by the grunge work of integrating libraries and code logic from the outset, focus can now remain on the larger picture for longer, with a need to jump into raw code only for the really tricky/unique problems, if there are any. That can be a lot of overhead avoided, if done well.
This one guy made, in bolt.new, a system that would generate lyrics for a song and sync the text-to-speech to different parts of the song. Creative and interesting.
Someone else made an “exquisite corpse” drawing game.
And another, a way to annotate medical images.
I think of all these things as functional prototypes. It’s obviously not engineering. But it is pretty magical —
What were your expectations and how did it exceed them?
I have to say, on the basis of your comment I just decided to try Cursor, and I'm sorry to report it immediately disappointed me.
First thing it did was it told me it found a syntax error in code that compiles perfectly. It went ahead and added a closing brace, telling me that "I've fixed the issue by properly closing the method with its brace. The method now has valid syntax and should compile correctly".
It was never broken! This seems like such a regression of tooling; we have parsers for the purpose of finding wrong syntax, and they don't just make up things like missing braces that aren't missing, and then break your code by inserting them. This seems like developing in Kafka's nightmares.
Should I even bother continuing to evaluate Cursor, when it can take perfectly valid and correct code, and immediately make it worse? What other nightmares will it reveal to me, and do I even want to know? I'm kind of astounded at how bad that first impression was, couldn't have been worse really.
Try refactoring anything in Cursor; that's where the shit really hits the fan. I guess all those folks claiming 100x productivity or applications made solely by vibing are only building little proof-of-concept apps - something which "works" but definitely doesn't need to follow proper requirements. Can it bootstrap an application for you? For sure, but that's just half a day you'd otherwise sink into it - all the following weeks of building on that scaffolding cannot today be automated, or vibed.
How did you build that experience? It's speculative now whether people can develop equivalent experience when your job is to constantly guide an LLM to do the right thing. We can speculate that you see patterns and seek to correct them, but that kind of hands-on experience and muscle memory is threatened with atrophy (very much like relying on Stack Overflow for answers can atrophy your ability to seek core knowledge).
I wonder if where we are now is similar to the early days of compiled languages, back when people still somewhat commonly wrote assembly by hand and didn't trust compilers.
Sure, things like RollerCoaster Tycoon exist, but writing in a compiled language is so much faster, easier, and more broadly accessible than writing in assembly that compilers took over.
TIL that RollerCoaster Tycoon was written in assembly.
“[Developer Chris] Sawyer wrote 99% of the code for RollerCoaster Tycoon in x86 assembly language for the Microsoft Macro Assembler, with the remaining one percent written in C.” - Wikipedia
My worry about this approach is: there is a reasonably popular saying that writing code is hard but debugging it is twice as hard (at least), which I think is an accurate description.
LLMs will greatly increase code production, will they also increase debuggability to match?
I think probably people will also use LLMs to debug.
At this point, I think you can consider vibe-coding with an LLM to be pretty equivalent to using a fairly junior developer with access to Stack Overflow. It's going to make a lot of mistakes, it's going to make a lot of questionable decisions. Sometimes it will be able to fix its mistakes, sometimes it will spin its wheels and never fix it. It may make a big ball of mud. The entire project may fail.
Two to three years from now, who knows. I'm pretty sure you still won't see 100% reliability in translating English->code, but you also don't see that even with senior developers.
LLMs don't just make it easy to accumulate code - they make it easy to throw code away and start again. This already enables taking a different approach to debuggability - if there's a bug and it can't be trivially solved, trash that bit of the code and write it again. It may not be broadly viable just yet, but it will be if the models keep getting cheaper and better.
This is also implied in the idea of "vibe coding" - don't bother understanding the code or debugging it yourself; if it doesn't work the way you like, just say it and have the model fix it until it gets it right (or you run out of money).
> LLMs don't just make it easy to accumulate code - they make it easy to throw code away and start again.
To throw away code you have to understand it, so no, it's the opposite. Code you don't understand is the hardest to get rid of, so it stays the longest in your codebase.
> if there's a bug and it can't be trivially solved, trash that bit of the code and write it again
How do you know where the bug is if you don't understand the code? There is no known algorithm to take a bug description and return the place in the code the bug is, otherwise bug fixing would be trivial.
Edit: Not to mention that in real production systems your bugs will corrupt the database, and if you haven't set up a logging system etc you will likely not realize for a while forcing you to do a rollback to a very old state losing so much data. You wont last long doing that.
> To throw away code you have to understand it, so no, it's the opposite. Code you don't understand is the hardest to get rid of, so it stays the longest in your codebase.
No, you don't.
> How do you know where the bug is if you don't understand the code? There is no known algorithm to take a bug description and return the place in the code the bug is, otherwise bug fixing would be trivial.
Yes, there is.
At worst, the bug is somewhere in the entire project. But you probably have a more narrow idea where the bug is, or when it was introduced. "In module X", "In feature Y", "In the last N days/weeks". Not to mention, for most bugs, `git bisect` is enough to precisely narrow down the problematic change, and doing that doesn't actually require understanding anything about the code.
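As a minimal sketch of what that looks like in practice (the test command and file names are made up for illustration, not from this thread): you hand `git bisect run` a throwaway script whose exit code says "good" or "bad", and git does the binary search over history for you.

    #!/usr/bin/env python3
    # bisect_check.py - hypothetical repro script for `git bisect run`.
    # Exit code 0 means this commit is good, non-zero means it is bad;
    # that is all git bisect looks at.
    import subprocess
    import sys

    result = subprocess.run(
        # Assumed repro command; substitute whatever reproduces the bug.
        ["python", "-m", "pytest", "tests/test_checkout.py", "-q"],
        capture_output=True,
        text=True,
    )
    sys.exit(0 if result.returncode == 0 else 1)

Driven with something like `git bisect start`, `git bisect bad HEAD`, `git bisect good <last-known-good>`, then `git bisect run python bisect_check.py` - none of which requires reading the code that changed.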
It all boils down to costs. Even if it takes AI a whole day and 1000 attempts to do what would be a relatively simple fix for an experienced developer, if those 1000 attempts cost less than the developer's work-hours, the business will soon learn to prefer AI over people. If and when we get to that point, is mostly just a function of LLM performance and cost. If they get cheap enough, it'll make as much sense to have human developers fix bugs in code, as it makes sense for you to mend holes in your socks instead of buying them in bulk on-line.
> Not to mention that in real production systems your bugs will corrupt the database, and if you haven't set up a logging system etc you will likely not realize for a while forcing you to do a rollback to a very old state losing so much data.
This really depends on the kind of system you're doing, and the kind of data you're storing.
> if there's a bug and it can't be trivially solved, trash that bit of the code and write it again
When I was in undergrad, I knew a few guys who approached every problem by pasting in snippets from Stack Overflow and tutorial sites until the code “worked”. Did not end well…
Look, I understand the sentiment. I too want the code to be done properly, well-engineered and thought through. But recall, this is not our job. We are Professionals, and by modern definition, a Professional does whatever is best for the business. And the business doesn't care about the product - it cares about the product's ability to earn them money. If generating shit code, then throwing it away and generating anew at any sign of bug (or spec change) gets cheap enough, this is what the business will want to do. This is what being Professional will mean.
(You can imagine I don't hold "professionalism" in a very high regard.)
See also: basic goods in meatspace. In developed economies, people generally don't repair clothes anymore, and increasingly rarely anyone bothers with repairing appliances. It's cheaper to just throw it away and buy a new one, than to try and repair it. Hell, even construction and remodeling these days involves a lot more of "affix it here permanently; if you need to move it, just smash it and install a new one" approach.
Why wouldn't the same eventually happen with code?
This is an interesting thought experiment, but I don’t think it’s realistic. Software that “works” doesn’t necessarily work. And software that doesn’t work can cost catastrophic financial losses. Worse yet, those losses are going to be felt more painfully when they are incurred recklessly.
There is a reason that humans developed analytical problem solving as an alternative to trial and error: when you can do it, it’s more effective, and safer.
That’s not to say that disposable code doesn’t have some interesting implications! One of them is that experimentation becomes a lot cheaper, so it’s faster to navigate the search space of possible solutions to a problem. But just taking a “solution” directly from an LLM without validating its correctness is fundamentally unserious and will be punished by reality sooner or later.
Since when does any business care about "engineering for the long-term"?
We, the software engineers, care. The business doesn't. In fact, the industry has systematically been trying to beat the care out of engineers - it's unprofessional to care about the work beyond the point it stops making money for the business.
I'm not saying this is right or wrong - but this is how companies roll; if they can fix product issues by having AI throw chunks of code away and do it again, if that's reliably cheaper than having engineers do actual engineering, then that's what businesses will do.
Hopefully it won't. You don't put webshit in control of rockets or cars, and you shouldn't put vibecoded software in control of them either. Programming safety-critical systems is its own thing, and it should be resistant to LLM incursions at least as long as human sign-off is an important part of the job.
In the past, if you'd tear your shirt, you'd spend time mending it, or pay someone to do it for you. Today, you just throw it in the trash and buy a new one, as it's much cheaper and faster.
Think of any other goods we don't repair anymore. Regardless of their internal complexity and beauty of engineering, and no matter how small the defect is, if it's cheaper to replace it wholesale than to repair it, people end up replacing it.
There's no reason to believe the same won't happen to code.
That's only cheaper because we've made it someone else's problem. Someone is paying that cost - fast fashion is a real issue that causes waste, pollution, and human rights violations. Now as we face tariffs on foreign manufactured goods, things don't seem so cheap anymore. Mending socks looks a little better.
You're right, there's no reason to believe the same won't happen to code, but there's also no reason to believe it won't similarly end in all kinds of problems that come back to bite us down the line when the goodtimes are over.
o1 has lately seemed more useful for debugging than for actually writing code. With writing code, it may give me some scaffold but I have a pretty particular idea of what I want and I have to rewrite most of it.
With debugging, there have been multiple cases where I was stuck on something, I described the problem in detail and o1 gave me an insightful explanation of what I didn't understand.
It's not magic, but mostly what makes it not magic is that it can't respond to very detailed questions. But if I can get the relevant information into a small space, it can draw connections I can't.
Your example of a model finding more performant sorts did not involve the use of LLMs. I've said multiple times on HN that I believe we need to look past LLMs for the next breakthrough in AI assisted coding. I don't believe LLMs can exceed humans in quality; they can only match humans in quality and beat humans in speed.
For creating websites (not apps) it absolutely is there. This is just the first rung on the ladder though. It's not doing Linux kernel development yet, but that time will come eventually. In between are all the other rungs. AI will climb them one by one.
For me it is very much hit or miss. I use Claude Sonnet with Cline. Sometimes I'm blown away that it can create a new page with CRUD functionality, nice UI, etc. in one go. Another time it struggles to create a simple web page. Yesterday I needed a very simple landing page. My prompt was something along the lines of `Very dark grainy background with centered "name of the page" text.`. It couldn't get either the background or the centering right. Later, when it got one right, it screwed up the other. I had to give up after consuming ~$1.5 because we were not getting closer to the result.
Grok 3 has been amazing, but it keeps on wanting me to do dark mode for an app I’m creating. When I finally said yes, it wasn’t able to get dark mode to work, despite multiple tries. Was funny tbh.
Check the actual paper on the type of sorts it actually got a speedup on :-) (hint: a few percentage points on larger n, similar to what PGO might find; the big speedup is for n around 8 or so, where it basically enumerated and found a sorting network).
Well… to be frank, this is someone not reading TFA.
The article is pointing out real limitations of vibe coding today (which you appear to agree with). It doesn't suggest AI coding won't be viable in the future. You should probably update your comment to say something like, “spot on”.
Nah, the article is generalizing from a single sample of current state, ignoring the larger trajectory (that, for AI coding, went from sci-fi to reality in two years).
Sure, the tools aren't perfect, so there's some art to using them now - which the author of TFA seems to be unaware of. Take for example:
> You cannot ask these tools today to develop a performant React application. You cannot ask these tools to implement a secure user registration flow. It will choose to execute functions like is user registered on the client instead of the server.
Of course you can ask them to do it. You can literally ask them to "write better code" (yes, with this exact phrase; see [0]), and you'll get better code. More performant, or more secure - it depends on specifics of the case. Or, you can ask them specifically to focus on security or performance, and you will typically get improvements on those axes.
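To make the registration example concrete, here's a minimal sketch (hypothetical Flask route and names, not the article's code) of the property being asked for: the authorization decision runs on the server for every request, so nothing the client computes can be trusted or forged around.

    # Minimal sketch with made-up names: the server re-checks registration
    # itself instead of trusting a flag computed in the browser.
    from flask import Flask, abort, session

    app = Flask(__name__)
    app.secret_key = "dev-only-placeholder"

    REGISTERED_USERS = {"alice", "bob"}  # stand-in for a real database lookup

    def is_user_registered(user_id):
        # Assumption: a real app would query persistent storage here.
        return user_id in REGISTERED_USERS

    @app.route("/premium-content")
    def premium_content():
        user_id = session.get("user_id")
        # The check happens server-side on every request; a client-side
        # "is registered" flag could be flipped from the browser's dev tools.
        if not user_id or not is_user_registered(user_id):
            abort(403)
        return "premium content"

Whether a model produces this or the client-side version is exactly the kind of thing prompting for "security" is supposed to steer.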
That's today. Next year, people will know how to prompt the models away from most common failure modes, and the models themselves will be further trained to avoid those same failure modes. In bringing specific problems up, TFA isn't making a convincing argument against future of "vibe coding" - it's literally helping in making it happen.
> Of course you can ask them to do it. You can literally ask them to "write better code" (yes, with this exact phrase; see [0]), and you'll get better code. More performant, or more secure - it depends on specifics of the case.
That wasn't the requirement though. He said he wanted a performant or secure app, not "more performant" or "more secure". "more" is trivial, but actually getting to a good state is not.
To actually make a larger program secure or performant you need a unified higher level architecture that is adhered to everywhere, vibe coding can't get you that. It can do some micro optimizations as you said, but it can't do these macro contexts and architecture. You can ask it for suggestions for such architectures, but you can't make it implement a full scale large app with all components using it.
> Nah, the article is generalizing from a single sample of current state, ignoring the larger trajectory (that, for AI coding, went from sci-fi to reality in two years).
You seem to have missed the part of the article that clearly lays out the recent progression of AI coding. You're also refuting arguments the article supposedly makes against the future of vibe coding, but the article doesn't make any arguments about the future of vibe coding!
We all just end up talking past each other if no one is actually talking about the same thing.
Ah yes, the classic "we didn't have this at all 2 years ago, so we can expect a linear increase in capability 2 years from now! Trends will of course continue!"
I think you're wrong. I think AI is going to stagnate and only the surrounding tooling will improve, but not enough to get us to the promised land. Arguably we already are seeing that happen. To see the supposed "this is how all code is written now" world AI proponents keep declaring, we're going to need to see improvements to the current AI where they can operate on context windows two orders of magnitude bigger than they are now, while costs for doing so also drop accordingly. Maybe that can be done, I'm betting it can't.
Remember 2016/17 when we all thought we would have self-driving cars by now? People were modelling intersections without traffic lights, and talking about not needing parking space because the cars would just drive around when you weren’t in them.
So much technology these days seems to be “get it to 80% so we can demo and cash out”, but 80% isn’t just an arbitrary number - it seems to be the point at which the remainder of the work is either very hard or (I suspect) impossible.
We do have self-driving cars by now. No drivers. Easy to find in SF and many other cities. It's not yet available everywhere, but as Amara's Law describes, "we tend to overestimate the effect of a technology in the short run and underestimate the effect in the long run".
It's not available almost anywhere. I don't know exactly what the threshold should be, but it should be usable by most people in first world countries to make the claim "we have it". We don't have it.
Self-driving has been long proven to be a workable idea. Yes, there's the other 90% of the work making it work reliably and safely enough in diverse environments, but we know this can be done, it's just a matter of throwing money at it.
And it's not like there isn't a possible alternative either - we could be adapting roads to be much easier on self-driving cars. It's just that cooperation and coordination between humans is a way harder problem than self-driving, so it's easier to have a bunch of vendors solve the problems in tech the hard way, rather than to rely on the world to maintain roads properly.
My belief is that, if full self-driving isn't widely available in first world countries in 5 years, it's not going to be because of engineering problems, but rather because of legal and process issues around deploying it.
I don't think we know it can be done - has anyone demonstrated a self-driving car that works in all conditions, but is prohibitively expensive in its current form? If they had, you could extrapolate and say 'it's coming as soon as the tech gets cheaper'.
You can't adapt roads to fog or snow (unless you're going to enclose them in a tunnel). You can't adapt roads to pedestrians or bicycles (unless you prevent them from going near the road). Whatever adaptions you could make would be prohibitively expensive to roll out to all roads everywhere. In both the country I live in, and the USA, the government can barely afford to maintain the existing highway infrastructure.
Of course it will get better, and self-driving will be more widely deployed, but I don't think you'll ever get 100% coverage (assuming that's what we both agree is the goal).
I think in a way you're agreeing with my point - the last bit is just too hard, whether it's engineering, legal or political.
I'll share a new wrinkle that casts more shade on the coding LLMs.
We have a fair number of offshore resources that are used for dev. The developers are fully integrated into the team, are in all the stand-ups, and substitute for the usual role of junior programmers. They don't get the grunt work shoveled on them; they get the same work as everyone else, they're just expected to not be as fast.
In 6 months, 2 out of 4 of them have been sacked - and surprise, not because we could replace their work with LLM output, but because their use of LLMs was so unrestrained and scattershot that the pull requests they submitted had become nightmares. One thing mentioned in the article about unit test creation was something we saw as well. Perhaps this is partly due to working in an existing code base, where the LLM loses some of its advantage, and certainly some of it was cultural, in that progress was thought more important than actual manageable code. The two sacked fellows were told, literally, from my own mouth, multiple times: "You cannot just ask Copilot to write you code, paste the entire thing into Visual Studio with no thought of what has changed, with the end goal of just compiling and meeting the single set of acceptance criteria on your story. You're breaking other things and introducing bugs." It fell on deaf ears, and now they're gone. They were nice people, I didn't know how to get through to them, but they were convinced the LLMs were the way to go.
I use LLMs to help write code every day, and I wouldn't want to be without them, but I'm fairly surgical about it. Most of the time, when Copilot gives you a page of, say, React code or EF Core queries, you have to be really careful about anything you didn't explicitly ask for. Honestly, there is a time savings, but there is not a quality increase. The benefit is subverted by the time it takes to figure out how to ask correctly, the time to vet the output, and the time to fix the little tiny insidious bugs it can introduce.
So, don't go vibe coding and lose your job, is something to think about. I have to admit that it has worn me down meeting these interesting people from far-flung locations only to watch them flounder and get let go.
I've been contracting for a fairly largish company (around 300 devs split across ~40 teams). The US-based company was bought out by private equity about a year ago, and many senior/lead engineers were forced out, their work contracted out to cheaper overseas labor.
The company has been relatively ambivalent about the usage of code-assistant AI, but during PR reviews it has become very apparent - purely from the code duplication - that it's seen widespread adoption among the outsourced dev teams. Our company has a fairly large number of repositories and bespoke libs for utility-type functionality.
In the past, a programmer might have internally said to themselves, "There's no way that somebody hasn't already written this stupid function X or method Y", and they'd take a few minutes to search or reach out to see if it exists within the organization.
Instead, during some of the recent code reviews, there has been a huge uptick in core functionality that is very obviously being spit out by the LLM. At best it's just extra unnecessary code. At worst it will introduce new bugs, since our custom functions often handle business-___domain-specific edge cases that an LLM simply wouldn't know about.
Totally lines up with my experience as well. We also have the opposite problem: reams of code generated years ago during a period of unsupervised offshore work, where we're slowly paying down the debt but the LLMs will attempt to use the old code for new work. Most of it is spaghetti UI code and nearly impossible to reuse effectively, but the LLMs give it their best and we have to prompt around it.
I think people miss that "vibe coding" is a senior engineering tool. You still have to architect the project. You still have to ensure security and performance are handled (if relevant for the task). You still have to envision edge cases and usage patterns. You just don't have to type it out.
Now non-programmers can fumble forward to a working demo. But junior engineers are walking around with a loaded weapon - if they are not learning from how AI solves a problem or using it like StackOverflow to answer specific questions, they are blowing up their own careers.
The future is we are all product engineers or ___domain experts. No one is going to want an army of React engineers in 2 years.
BTW, Copilot is not the best at coding. The return grows exponentially with the quality of the LLM: bigger chunks, fewer bugs, less time checking. From my experience, LLMs do not impress on complex algorithms and shine on small utilities. They can use libs I don't even want to learn about.
I haven't really noticed that myself. I go "LLM shopping" fairly frequently, trying to find which one of the few I'm paying for gives the best result for the current problem. They all seem to have their shortfalls, although I will say Claude is better for greenfield work.
Well, you can probably try going down to simpler models to get the idea (they are almost useless). From my experience, a better model like Claude or o3 can do things that others simply cannot. At some complexity they start going in circles, making wrong decisions, forgetting things that are still in the context window. But the thing is, those complex tasks are usually the most interesting and important.
> "Vibe Coding" might get you 80% the way to a functioning concept. But to produce something reliable, secure, and worth spending money on, you’ll need experienced humans to do the hard work not possible with today’s models.
This would have been clear from Karpathy's full statement:
> It's not too bad for throwaway weekend projects, but still quite amusing.
I hate how what is effectively a stupid meme phrase became an actual term in a few days.
> "Vibe Coding" might get you 80% the way to a functioning concept. But to produce something reliable, secure, and worth spending money on, you’ll need experienced humans to do the hard work not possible with today’s models.
The problem is that 80% of the job is a proof of concept at best. 80% is effectively a QA walking into a bar[1].
And that's not even getting into the class of testers which try to order ';--drop table beers;-- beers, <script>alert(1)</script> beers or <!ENTITY q1 "&q0;"><!ENTITY q2 "&q1;&q1;&q1;">&q2; style beers
Makes me wonder how organic or astroturfed the name is. That a bunch of influencers all started using it at the same time makes me think someone is pushing it, perhaps because a focus group (or LLM) decided 'vibe' was a warm and fuzzy way of pushing the latest iteration of 'move fast and break things'.
The 80% mark means you just finished the happy flow and have written 30% of the code base. Now you need to handle the unhappy parts and write extra test code covering all those edge cases.
Precisely - now you need to finish off the remaining 70%, which is where most of the required work truly lies, with or without all the AI bollocks. Whether it takes 2 or 8 days of work to get that first 30% done doesn't make much of a difference if you need several months (or in some cases years) to get a project to a tolerable "production" state. I'd much rather spend the 8 than inherit an AI-generated codebase which will shoot itself in the head because someone scrolled over it and thought "lgtm".
The alternative to using AI to generate 80% of the project (30% of the code) isn't much better: coding the initial 30%, which is also a challenge (from 0 to 1).
So there is still a productivity gain for senior developers.
Over the last week I tried to use a combination of Claude and OpenAI o3-mini to do a direct conversion of about 500 lines of uncommented academic modeling code from Matlab to Python. I can’t stress enough how badly these models performed. Nearly every consequential line had some variety of off by one or logic error, often very subtle. I didn’t try cursor or the more agentic systems, but I would be astounded if they properly rigged up a test harness, inspected the output and were able to respond to the runtime errors. I’d be happy to share the code if anyone wants to surprise me.
This is exactly the kind of semi-mechanical, low added value work that would greatly benefit from automation, and they really fell on their faces. I really benefit from these models on greenfield tasks where I can delegate minor drudge work, but in this case I honestly think they actually increased the difficulty.
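For anyone who hasn't done this kind of port: the off-by-one class of error is easy to illustrate with a toy example (not the parent's actual model code). Matlab is 1-indexed with inclusive ranges, Python is 0-indexed with half-open ranges, so a mechanical translation quietly shifts or drops elements.

    import numpy as np

    x = np.array([10.0, 20.0, 30.0, 40.0, 50.0])

    # Matlab: x(2:4) selects elements 2, 3, 4 -> values 20, 30, 40.
    wrong = x[2:4]   # keeping the same numbers gives values 30, 40 - shifted and one short
    right = x[1:4]   # correct translation: values 20, 30, 40

    # Same trap in loops: Matlab's `for i = 1:n` visits 1..n inclusive,
    # while Python's `range(1, n)` stops at n-1, silently losing an iteration.
    n = len(x)
    matlab_iterations = list(range(1, n + 1))  # 1..5, what the Matlab loop did
    naive_port = list(range(1, n))             # 1..4, subtly wrong

Multiply that by 500 lines of index-heavy numerical code and it's easy to see how "nearly every consequential line" ends up subtly off.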
As a counter to this, I had grok build an entire set of micro services and all I had to clean up was some format strings. It blew me away. Did in an hour what should have taken a week.
Exactly. "uncommented academic modeling code from Matlab" is something very few people ever do, and the importance of that work is negligible in isolation. Meanwhile, "run of the mill micro services" are like most of what the software industry is doing these days; in aggregate, it's also how the industry delivers most of its value and how it makes most of its money.
I use cursor, and if there’s a nice interface for cursor to use grok I don’t know about it. Which leads to pain if I try to have the built in options try to fix things because they’re nowhere near as competent currently.
I've been "Vibe-TDDing" all afternoon and I'll tell you what, vibe tests are better than no tests.
And so long as you have some decent-to-solid understanding of coding and testing (this is non-trivial; I've been coding professionally for ~20 years), then you can direct the machine to put up decent guardrails first, and then you can kinda go nuts and let shit grow, prune it back, repeat.
Basically, if you know what code/tests ought to look and act like, then you can significantly reduce the negative externalities of having LLMs do your coding for you.
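(Concretely, the "guardrails first" step can be as simple as pinning down the behavior you care about in tests you write, or at least review, yourself before letting the model loose on the implementation. A toy pytest sketch; `pricing.apply_discount` is a hypothetical module/function standing in for whatever the LLM is allowed to rework:)

    # test_pricing.py
    import pytest
    from pricing import apply_discount  # hypothetical module the LLM is free to rewrite

    def test_discount_is_applied():
        assert apply_discount(100.0, 0.2) == pytest.approx(80.0)

    def test_discount_never_goes_negative():
        assert apply_discount(10.0, 1.5) == 0.0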
Yes, agreed. I just resurrected an old code base that I had never finished, as I really couldn't be bothered to test it properly. It would have been several weeks or even months doing this thankless task in my spare time, so it wasn't going to happen.
Just for fun, I asked the AI assistant in IntelliJ (free trial) to write most of the tests for me. I was actually blown away. The tests were largely really good. In many cases they were more thorough than I would have bothered with. Even when it didn't manage to write good tests on its own, the AI completion as I was writing them myself was incredibly useful. Most of the time it would predict the line I was going to write next; just press tab to accept.
I did have to review the tests, and there were a few minor mistakes I had to correct. The entire process took about 4 hours - and I was trying it out for the first time.
So this is a massive time saver for me, and lets me take on coding in the future that I would simply not have had the patience to complete.
I think it comes down to the fact that LLMs are powerful tools, but it's like a really good saw: it can be extremely helpful to a talented carpenter, while to an untrained person it's useless unless they actually learn to do the job.
> ever since I started to share how I built my SaaS using Cursor
> random thing are happening, maxed out usage on api keys, people bypassing the subscription, creating random shit on db
This idea that now everyone with little knowledge can code is absurd. For sure everyone can code: with hours of dedication, sitting down and trying things out, learning and improving from errors and past experiences. There is no other way round. I don't know what the next generation of coders is going to be like, but my advice still stands: read books and reliable sources, do your homework, and "vibe coders" will become so irrelevant that they will be extinguished by their own ignorance. Don't get fooled by number-crunching programs that seem to "program".
It makes me feel very secure in my job that so many engineers ITT are downplaying the ability and productivity of AI coding tools. You can pry cursor out of my cold dead hands. If you aren't seeing a 10x boost, then you must not have tried it lately, or haven't got the experience to prompt well.
What it excels at:
- Boilerplate code that's been written 1000x, which can sap your time and enthusiasm for the meaty problems beyond that.
- Complex DSA work. It has been demonstrated millions of times in training material.
- Simple and tedious tasks like making dummy data for tests and struct literals.
- Tightly scoped refactors.
Where does it falter?
- Mapping your product/business to the code or abstractions needed. I think this is where junior devs struggle to leverage it.
- Doing large scale multi-file refactors without proper specifics, guidance, and context. It also can't write a huge project from scratch. Humans are still needed to fit all the pieces together or provide guidance. I think this gap closes soon.
Code quality simply isn't a problem IME. If it didn't one-shot your dream abstraction, you probably weren't specific enough in the prompt. Most human-written code is also junk, so pointing out minor gaffes isn't really a dunk on AI. It's still a massive productivity booster if wielded by even a half-competent engineer.
The things you mentioned it does well on are things that help you avoid tedium, but I don't think that's what's most important to businesses. The things you mentioned it does poorly at are the things that matter most.
To pile on: if a large part of our job is purely mechanical, then there is a bigger problem with our engineering processes and AI can't fix that.
> if a large part of our job is purely mechanical, then there is a bigger problem with our engineering processes and AI can't fix that.
It is! And AI is fixing precisely that. What businesses actually care about (well, 99% of the businesses where code is written, anyway) is shipping fast and solving the immediate problem, NOT code quality and craft. It goes against what I want to believe as an engineer. Most problems are not new, they are not hard, and they are not sensitive. You do need to start with a good understanding of the business need, but it's not that the AI can't code to that. I will often stub out an abstraction, explain inputs/outputs in detail, provide sample data, etc. That's all. There are frighteningly few showstopper problems with AI coding at this point, and it's moving so quickly.
We're not at the point where non-engineers are capable engineers with AI, but if you are an engineer not using AI extensively, you are being lapped.
I don't think AI is really fixing business problems, though. I think it's only fixing developer problems. And unfortunately nobody really cares about that except for developers.
I just find it sad that instead of focusing on improving how we build things and reducing the need for so much mindless, tedious, repetitious, mechanical work, we're content to just build bad things faster with AI and call it a win.
> I just find it sad that instead of focusing on improving how we build things and reducing the need for so much mindless, tedious, repetitious, mechanical work, we're content to just build bad things faster with AI and call it a win.
The AI is doing precisely that: reducing the mindless, tedious, repetitious, mechanical work. And what "vibe coding" wants you to embrace is treating high-level code as if it were compiled assembly: an implementation detail you never want to look at or care about if you can help it.
Yes, in some sense AI isn't fixing anything, because all that "mindless, tedious, repetitious, mechanical" code still exists, it's just autogenerated. I too wish we could've first eliminated the need for it entirely. But we didn't, because most programmers and the industry at large still don't understand where the problem is in the first place. They can't see that we've long reached the Pareto frontier in our programming languages, and that we're being limited by the default paradigm of working directly on a plaintext codebase that's the single source of truth.
So yeah, in this sense, LLMs aren't fixing anything - they're just an abstraction layer on top of our exhausted coding paradigm.
> shipping fast and solving the immediate problem, NOT code quality and craft
This is also what puts many companies out of business and creates huge security issues. If AI is not fixing this but making it worse, then that's not improving software engineering.
> This is also what puts many companies out of business
Those companies you mention just overdid it. Like with everything else on the market, there's a limit to how much value/quality you can optimize away before the end result stops being fit for purpose. However, existence of this limit doesn't stop companies from racing to the very edge of it.
> and create huge security issues.
Security is mostly a solved problem.
Yes, it truly is - at least from the business point of view.
Nobody except attackers and infosec people cares about the mathematical and technical details, or whether your stack or coding practice is secure enough. Not the customers, as they neither understand any of this, nor could do anything about it even if they did. Not the companies, since they manage it at a higher level of abstraction. Whatever holes and vulnerabilities the AI coding introduces, the industry will account for it. Some headlines will be made, some stocks will move, and nothing will change.
FWIW, I don't like either of these things. I'm an engineer in my heart, so it pains me to be constantly reminded that our work is merely means to an end, and matters only to the extent it can't be substituted by some alternative.
Hot take: Vibe coding is going to be the new Excel of technical debt.
Most tech-savvy places will avoid it, and most good programmers will never encounter it. A bunch of us will make a career out of fixing the mess it makes after it explodes.
My first real job was doing just that at a broker trader which lost 10m on a trade made by an Excel spreadsheet that used a stale yahoo finance API to get exchange rates.
Not to mention the fact that those training clusters are not going to supervise themselves (or, maybe I just think that because I haven't had enough koolaid; where's the AWS Console MCP endpoint?)
Judging by the comments, most people couldn't even tell it was satire, which goes to show how absurd the hype is right now (and probably why it was buried).
He works for a company that tries to sell coding agents. He's absolutely trying to pump it up and induce FOMO. If you can't see that because he hides it behind a layer of humor, that's on you.
ThePrimeagen is doing some kind of vibe coding ad on Twitch right now, trying to build a game in 7 days. There are 10x coders in the room, and two days later they are struggling with hilarious basics like off-by-one errors while tweaking something that could be described as donkey.bas.
"ever since I started to share how I built my SaaS using Cursor"
"random thing are happening, maxed out usage on api keys, people bypassing the subscription, creating random shit on db"
"as you know, I'm not technical so this is taking me longer that usual to figure out"
That one is almost a meme on LinkedIn. For fun, I hope it is a long troll (like a long con, but for trolling).
And most of LinkedIn (as the algo shows me!) is basically this HN post in 100 words, or the polar opposite, saying how software engineers have had their chips.
I think it might be referring to a current lack of experience in programming and debugging.
The situation seems to reflect the issue that Kernighan's Law refers to, which is that debugging code is twice as hard (perhaps more?) as writing it in the first place. I imagine debugging AI-generated code might be even harder.
Doing something that's not your primary interest is one thing.
Not wanting to learn new things or understand how things work or put in any effort to succeed because you think you can just cheat your way through life is a totally different thing.
And that's what the difference between using AI programming tools responsibly and "vibe coding" is.
Agreed, but you're bunching a lot of people into the category of slacker who don't belong there.
A new tool was released that basically lets you design a GUI through guided voice prompts. The 50-year-old school teacher can "vibe code" all she wants.
The real problem arises when someone makes false claims they’re a developer when they’re just managing AI generated code. Making it your primary interest and then cheating to make it seem true.
> Or I’m in a different industry and it’s ridiculous to require years of schooling for a skill that’s not my primary interest?
It took me 6 months when I was 10 years old to become technical and create a website. If a 10-year-old can do it in 6 months, an adult can do it in 1 month if they work hard.
I feel exactly the same. On the other hand, the “speed up my work a lot” part is important, and shouldn’t be overlooked. Like, if someone reads this article and figures they don’t need to learn to use AI for their coding job, that’s the wrong conclusion to make.
Agreed, I feel that a developer who refuses to learn to work with AI tools will inevitably fall behind those that do. But I don't see AI being able to replace a developer who can work WITH AI anytime soon (at least on anything non trivial).
I feel like a lot of people are forgetting how good LLMs are at small isolated tasks because of how much better they've gotten at larger tasks. The best experiences I've had with LLMs all involve sketching out the interfaces for components I need and letting it fill in the implementation. That mentality also rewards choices that lead to good/maintainable code. You give functions good names so the AI knows what to implement. You make the code you ask it to generate as small as possible to minimize the chance of it hallucinating/going off the rails. You stub simple APIs for the same reason. And (unsurprisingly) small, well-defined functions are extremely testable! Which is a great trait to have for code that you know can very well be wrong.
In time the AI will be good enough to design whole applications in this vibe-code-y way... But all of the examples I've seen so far indicate that even the best publicly available models aren't there. It seems like every example I've seen has the developer bickering with the AI about something it just won't get right, often wasting more time than if they were slightly more hands-on. Until the tech gets over that, I'll stick to treating it as the "junior developer I give a UML diagram to so they can figure out the messy parts".
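(A concrete version of that workflow, with made-up names; the marked body is the part I'd hand to the model, and the test is the part I keep.)

    from dataclasses import dataclass

    @dataclass
    class Invoice:
        subtotal_cents: int
        country: str

    def total_with_tax(invoice: Invoice, rates: dict[str, float]) -> int:
        """Return the invoice total in cents, applying the country's tax rate.

        Unknown countries get a rate of 0. Result is rounded to the nearest cent.
        """
        # --- body below is what the model gets to fill in ---
        rate = rates.get(invoice.country, 0.0)
        return round(invoice.subtotal_cents * (1 + rate))

    def test_total_with_tax():
        assert total_with_tax(Invoice(1000, "DE"), {"DE": 0.19}) == 1190
        assert total_with_tax(Invoice(1000, "XX"), {"DE": 0.19}) == 1000

    if __name__ == "__main__":
        test_total_with_tax()
        print("ok")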
Now think where we were 5 years ago, and where we will be in the next 5-10 years.
A lot of kids are going to enroll in college to study CS, computer engineering, software engineering, etc., and will not finish their degrees for another 3-5 years. They might just find themselves redundant (in junior positions, that is).
The only defense of vibe coding I'll make is that LLMs are very good at identifying decent implementations of business logic, such as workflows I may not have otherwise considered or found on StackOverflow. That then becomes a decent starting point for future iteration, but I would never trust the "vibe" of the code itself, even if that's what all the AI hypesters are doing.
Despite developing LLMs for years I haven't actually used them much in day-to-day work, but asking Claude 3.7 Sonnet my coding questions has been a superior experience to just Googling them (particularly if there are specific functional requirements/constraints)
That assumes a static demand for development services, which has, more or less, never happened since computing became a thing.
Python and other high-level languages made a lot of development much faster, but it never led to reduced engineering needs. Cloud made deploying services massively easier, and as a result we actually have a lot more people working in infrastructure.
Faster development mostly leads to expanded economic viability for new types of software. The real question is what becomes economically feasible if development costs are halved.
"In 1865, the English economist William Stanley Jevons observed that technological improvements that increased the efficiency of coal use led to the increased consumption of coal in a wide range of industries. He argued that, contrary to common intuition, technological progress could not be relied upon to reduce fuel consumption."
This definitely feels true for tech companies, where the prospect of more productive engineers improves their bottom line. The same sort of companies that have a near-infinite appetite for talented engineers.
But there's a lot of coders in industries whose core business isn't technology. Knapheide, for example, a truck outfitting company where my brother codes. I'd imagine in those companies, being able to do the same work with fewer engineers and less cost would lead to fewer hires. Technology isn't their core product and they aren't being held back by software.
I actually think we may see the opposite happen with those kinds of companies too.
At the moment, a truck outfitting company building a customer CRM optimized for their workflow is an absurd idea: they would need a team of a dozen developers working for a year before they could even get a feel for if it was a feasible project or not.
Add LLM assistance and maybe a team of three developers could get to an initial working version in three months.
At that point, companies that had previously ruled out custom software development entirely may find that it makes sense for them - growing the demand for software engineers as a whole.
I guess I'm skeptical about how much of an impact better software could have on the business success of a truck outfitting company? But maybe I'm being overly skeptical. I bet if I really got in the weeds there I'd see tons of opportunities where better/more software could really accelerate things. Thanks for checking my pessimism!
The job loss depends on the average speed up, however. If the AI is only effective in 10% of tasks (the basic stuff), then that 3x improvement goes down to 1.3x.
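(To put rough numbers on that dilution, a time-weighted back-of-the-envelope; this is my arithmetic and my assumptions, and the exact figure obviously depends on how much of total time the "basic stuff" really takes.)

    # f = fraction of total time the AI actually helps with, s = speedup on that fraction
    def overall(f: float, s: float) -> float:
        return 1 / ((1 - f) + f / s)

    print(round(overall(0.5, 3.0), 2))  # ~1.5x if half of the time is AI-amenable
    print(round(overall(0.1, 3.0), 2))  # ~1.07x if only the basic 10% is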
That's such an economic fallacy that I'd expect the HN crowd to have understood this ages ago.
Compare the average productivity of somebody working in a car factory 80 years ago with somebody today. How many person-hours did it take then and how many does it take today to manufacture a car? Did the number of jobs between then and now shrink by that factor? To the contrary. The car industry had an incredible boom.
Efficiency increase does not imply job loss, since the market size is not static. If cost is reduced, then things which weren't viable before suddenly are, and the market size can explode. In the end you can end up with more jobs. Not always, obviously, but there are more examples than you can count which show that.
This is all broadly true, historically. Automating jobs mostly results in creating more jobs elsewhere.
But let's assume you have true, fully general AI. Further assume that it can do human-level cognition for $2/hour, and it's roughly as smart as a Stanford grad.
So once the AI takes your job, it goes on to take your new job, and the job after that, and the job after that. It is smarter and cheaper than the average human, after all.
This scenario goes one of three ways, depending on who controls the AI:
1. We all become fabulously wealthy and no longer need to work at all. (I have trouble visualizing exactly how we get this outcome.)
2. A handful of billionaires and politicians control the AI. They don't need the rest of us.
3. The AI controls itself, in which case most economic benefits and power go to the AI.
The last historical analog of this was the Neanderthals, who were unable (for whatever reason) to compete with humans.
So the most important question is: how close actually are we to this scenario? Is it impossible? A century away? Or something that will happen in the next decade?
> But let's assume you have true, fully general AI.
Very strong assumption, and a very narrow setting that is one of the counterexamples.
AI researchers in the 80s were already telling us that AI was just around the corner, 5 years away. Didn't happen. I wouldn't hold my breath this time either.
"AI" is a misnomer. LLMs are not "intelligence". They are a lossy compression algorithm of everything that was put into their training set. Pretty good at that, but that's essentially it.
This is what is really interesting: what will they do in 10 years? Will you need to learn mathematics to PhD level to produce code that an LLM cannot produce? Will we all become business analysts (will AI do that too)? I don't think BA is a step down or up; it is probably interesting, and I did think of going down that path.
People laugh at coders like we are the only manual loom operators, when everyone's job, even POTUS's, can be replaced by the AI we dream will exist.
My thoughts: buy SPX so you own a sliver of the new overlords.
Guess what happens when you can suddenly build features ahead of schedule? You can make them bulletproof instead of cutting corners.
Our code is better and more robust than it's ever been. Our rate of user-reported bugs has dropped more than 50% since we started "vibe" coding 6-ish months ago.
I'd take strong opinions for or against with a grain of salt. It's likely you're suffering a bit from Baader-Meinhof.
I could certainly see a possible reduction in engineering team size, but going from 9 to 2 makes me question how much of that reduction was a result of over-hiring in the first place.
There's def a lot of gatekeeping going on by the real coders™. The rest of us are just learning and adapting. Personally, I'm glad that it's now easy to build decent looking UIs and quickly tune SQL.
Product people love the idea of being able to fire their dev teams, but I'm not sure they understand the implications (some of which may not become clear for years).
It's interesting that you describe yourself as a developer now.
Because just three months ago, in your first post [1] to HN, you said:
> I'm somewhat non-technical but I've been using Claude to hack MVPs together for months now.
Sure: you might feel as though you have now 10x'ed yourself. But, quite honestly, when the reality is that just a few months back you self-described as "somewhat non-technical", it's clear that (a) you're at such an early stage in your learning and understanding of tech, as a developer, that it's relatively easy to experience big gains, and (b) you can't actually have much of an objective measure on this, because you are in fact quite new to the field.
I read a lot of your other comments. To me, even before I had confirmation that you were actually "somewhat non-technical", and fairly new to the field — effectively a junior developer by any real measure — this was already quite apparent to me.
Based upon having been a developer for some decades myself already: I can generally spot those that talk-the-talk — and similarly: I can generally spot those who have non-trivial / deeper experience with various fields of tech.
Powering-up with AI tooling doesn't remedy that. Even if it might seem otherwise from your "somewhat non-technical"-but-newly-empowered position.
Good luck with your coding endeavours though, and with your evangelism.
I have no doubts at all that the world is changing — including how software is developed. But I see your posts for what they are.
Yeah I was a jr developer for a year before I became a PM. That's the definition of being "somewhat non-technical" as I put it.
you've been a developer for some decades which is why your reality is threatened that your craft is increasingly becoming irrelevant so you had to snoop my profile to find some confirmation that your reality doesn't get shattered
this is nothing new of course. obnoxious neckbeard engineers who don't understand where the world is going have existed since the unix debates on irc. you'll find plenty of people who agree with you on mastodon lol.
> you've been a developer for some decades which is why your reality is threatened that your craft is increasingly becoming irrelevant so you had to snoop my profile to find some confirmation that your reality doesn't get shattered
Hahaha - no, that's really not accurate at all. On lots of levels. The ability to read another user's comments is there so that anyone who chooses can actually get a better understanding of who they're talking with, and what that person is about. One doesn't have to feel threatened at all to want to use it, one simply has to be intellectually curious, and interested to find out more...
There's no need to try and portray it as a negative, and make out there's something afoot which isn't actually taking place.
Anyone who's been here on HN for any significant amount of time knows exactly what that feature is for — as well as when it might be best to use it. And people absolutely will use it.
It helps separate the wheat from the chaff.
— Please do try and take care that your wide-of-the-mark unnecessary put-downs and name calling don't violate the HN guidelines! (Just for your own good!)
Ah right, junior dev for a year. Wow, how amazing.
Plenty of room for you to 10x many times over then.
Over the years, I’ve met plenty of folk who have dabbled with software development, before deciding it wasn’t for them - then pivoting to something less technical.
Nah, I don’t feel threatened at all by AI. My job is secure. Tools change, sure. But there’s plenty of years left in software development for sufficiently skilled humans. No matter what a junior-level dev / AI evangelist might claim.
I’ll be cleaning up and properly re-implementing the MVPs that less knowledgeable folk are throwing together, slap dash. For a long while yet. And doing other stuff that AI simply can’t do properly - and quite honestly is quite far from doing.
Your rhetoric betrays your knowledge, and your bravado and insults can’t make up for that in any way.
It’s easy to get enchanted by current generative AI, and believe it far more capable than it is. Particularly if not overly skilled in whatever ___domain. Particularly if one doesn’t have much of a grasp on how generative AI actually works. Good luck with that.
Unfortunately there are no bans for stuff like that.
But that’s why I call it out: yes, exactly, it degrades the conversation when someone is preaching about a new tech, and how it’s gonna change development, and claiming they’re a developer themselves - while not being upfront about the fact that they’ve not actually got much real-world experience as a developer at all in general.
And this kind of thing should always be called out when spotted. It’s just plain disingenuous at the end of the day.
I’ve probably been contracted to fix more broken projects (by devs who royally messed up), than the count of MVPs this person has made, or indeed the number of months they’ve been coding.
But at the end of the day, these kinds of folk simply make us more experienced folk more valuable to those that need a professional service in a bail-out scenario. I’ve got decades of real-world coding experience, and a healthy list of successfully published / deployed projects, including some fairly big clients over the years. My CV speaks volumes, particularly when contrast against someone with little experience in the field of software development. I’ve seen languages and tooling come and go. I’ve headed teams and worked solo. I’ve witnessed plenty of folk
like this in my time. It’s certainly not my first rodeo!
Unfortunate that someone chose to downvote me, as opposed to engaging me in conversation as to why my view might perhaps be incorrect or maybe shortsighted - as per the HN guidelines. But no real surprise - I guess that in itself is quite telling here.
Karma points might come and go sometimes, but whatever: I’ve been posting on HN (and other sites) for years, on and off. I’ve no need to try and portray myself as something I’m not, nor portray myself to have skills or experience that I don’t have. I generally post to share my knowledge and experience, because real-world experience adds up over time.
How ironic. I have literally made my career and fortune building the very systems that you’re yapping about.
Edit: see the other comment where you are called out for the lying fake that you are. How delightfully pathetic of you.
lol "fortune" whatever you say. that's why you're desperately snooping my other comments including that lame "gotcha" to find some confirmation I'm lying?
it's ok, all craftsmen who got automated away once thought they were special. you're not the first you won't be the last. you will likely be unemployed soon though.
Open mind here. Spill more details please. Were the 9 good, or was there dead weight? Could the 9-to-2 reduction have been done without AI anyway (because there was less work to do)?
So why wouldn't you keep them? If you're able to produce even more with AI enabled engineers, why downsize? To me, it sounds like a startup's dream to be able to output more without increasing headcount.
Trying to get my head around this. It must mean 90% of what they were doing was writing code. Like not even thinking, architecture, gathering requirements, making sure you built the right thing, etc. Just generating syntax.
In my experience it's usually a few devs doing most of that, and the rest are largely banging out features, debugging, refactoring. There's also just a lot more efficiency when you shrink the team.
> Cursor has some sort of "concise mode" (archived) that they'll turn on when there is high load where the model will still be rated at the normal price but behaves in a useless manner. This mode will omit details, drop important findings, and corrupt the output that is being produced.
This is a real problem that I have experienced on and off. It's getting to the point where everyone on my team is actively looking for alternatives. Generally, I've found Cursor works correctly after business hours. But, it's increasingly giving absolutely useless responses during business hours.
-----
That being said, I agree with many of the author's observations. However, for me, it's not really a deal-breaker. It's not much different than working with an intern or junior engineer. If you ask them to do too much all at once, they come up with bad solutions. Plus, they have a tendency to make "dumb" decisions.
For me, I've found solutions for nearly all of the listed issues. Much of it comes down to being diligent during code review (like you should be). For example, with the TypeScript issue, I come back later and have it fix it.
Specs are the one that still baffles me. It's absolutely terrible at writing proper specs. In particular, it falls into a really bad cycle whenever there are errors. I don't have a solution for this one.
> Generally, I've found Cursor works correctly after business hours.
Interestingly the times I've experienced the most weirdness were during extremely not normal business hours (from the California perspective). For 3 nights in a row last week, I found myself coding at/after 2:30am during what were apparently periods of excessive load on Claude Sonnet. When asking Cursor to do things, it would fail, tell me about the high load, and encourage me to try again soon. Well, I just kept clicking the button over and over again, thinking it would eventually be able to handle the request properly, and otherwise continue presenting the error. Not the case!
Incorrect/hilarious things Cursor/Claude did at points during those nights:
- repeat the inquiry back to me in full, then do nothing at all after that
- confidently assert it had located the bug I was looking for, then direct me towards the entire codebase
- assert that it had done what I asked, and request that I approve the changes it wanted to make to my code, which were... nothing, none whatsoever
- (possibly the most hilarious) begin to answer questions in borderline leetspeak, randomly substituting numbers in place of letters in words, before eventually devolving into total gibberish
Mostly just annoying due to the wasted time, though it's possible the entertainment value negated it. I don't expect miracles from Cursor to begin with, nor do I give it wide latitude to change very much in my projects, so the risk of damage wasn't really any worse then than at any other time. Of course, I am not a team working against deadlines on critical projects, just a guy screwing around at 2:30am.
The reference to Claude Plays Pokemon isn't applicable to the discussion of vibe coding, although the suggestion that AI agents can fix the issues with vibe coding is funny in an ironic way given the disproportionate hype around both.
The issues with Claude Plays Pokemon (an overview here: https://arstechnica.com/ai/2025/03/why-anthropics-claude-sti... ) is essentially due to the 200k context window being finite, which is why it has to use an intermediate notepad. In the case of coding assistants like Cursor, the "notepad" is self-documenting with the code itself, sometimes literally with excessive code comments. The functional constraints of code are also more defined both implicitly and optionally explicitly: For Pokemon Red, the 90's game design doesn't often give instructions on where to go for the next objective, which is why the run is effectively over after getting Lt. Surge's badge as the game becomes very nonlinear.
Although, both vibe coding and Claude Plays Pokemon rely on significant amounts of optimism about the capabilities of LLMs.
My very limited experience with LLM-assisted coding is that it depends...
For basic frameworks done in something like Python it is very good, but not perfect, yet. But the iteration cycle to get to where you want to be is still faster than doing the whole job manually and I see this as a big win.
For more esoteric, fast-changing languages/frameworks it has me chasing my tail in a chain of code updates where each fix breaks something in the n-1th or n-2th version. Sometimes it's deprecated code, or it hallucinates functions that would be valid if you were using a different language or framework. And sometimes it's simple coding errors.
But it will get better, a lot better.
The main benefit is that it will let an invested non-programmer client build a functional framework prototype, and then combine that with a list of missing features that a more skilled programmer can flesh out into a first-cut solution.
For the first time we 'might' get better requirements with an actual working model, instead of having the implementor do most of the requirements as a first pass from a high-level, hand-wavy spec. I think we're going to see some amazing tools for this.
What I don't see it doing is creating original algorithms to solve things being done for the first time.
I see statements like this a lot when talking about AI in general. People seem to think it is a foregone conclusion that no limit to LLM model improvement and capability exists. What causes you to believe this and what evidence do you have to back it up?
Because compared to a year ago, it's much better. Compared to two years ago, it almost didn't exist. Compared to three years ago, nobody was actually talking about it.
Like I get the jokes, and I totally agree that they won’t totally replace the humans. But come on, the way an average coder writes anything nowadays has dramatically changed. Especially for web and app stuff. I’ve onboarded some junior/mid-level engineers recently, and it’s such a different experience compared to 5 years ago.
Really? Opinion, based on the fact that there are basic improvements that can be implemented on what we have now, using the skills that we have now. If you don't agree, that's ok.
Now, about your comprehension skills, where is there any mention on my part of there being 'no limit'? In fact I go as far as to speculate on at least one.
I've been messing with this for a few days now, so I'm not going to claim to be any sort of expert, but as someone who has been coding for more than 20 years I do appreciate the set-it-and-forget-it nature of being able to throw Q Developer or whatever at a relatively simple problem that I'm curious about and let it crank away for half an hour while I'm working on something else. I've tried it on a couple of reasonably small and well-defined problems, mainly focusing on Python, and it works surprisingly well. It'll run the scripts and fix errors and can suggest prompt improvements. I've also tried it in a large codebase with much less success, so YMMV.
Also, it is important to be able to review the code, because it could be the case that it looks mostly correct but has some subtle errors in it that can mislead you. For example, I was trying a couple of different ways of computing some indices that involve a bunch of variables, and one way had a mask involved that made no sense. "Vibe coding" without being able to check the work of an LLM is almost certain to go poorly, IOW.
And when it finally works..? I suspect (vibe) coding can be significantly improved by a multi-step structured approach: coding, review, and testing in a loop, fully automatic, much like Chain of Thought helps with solving logical tasks. This can be done today. The main limitation here is the effective context window, i.e. a big project doesn't fit. The solution is splitting, separating, and documenting: a brief doc instead of the whole code in the prompt.
With this, developers need to level up to architects and get more ___domain knowledge.
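(A rough sketch of what such a loop could look like, strictly illustrative; `llm_generate` is a stand-in for whatever model or agent API you use, not a real library call:)

    import subprocess

    def llm_generate(spec: str, feedback: str = "") -> str:
        """Stand-in for a call to your model/agent of choice."""
        raise NotImplementedError

    def run_tests() -> tuple[bool, str]:
        proc = subprocess.run(["pytest", "-q"], capture_output=True, text=True)
        return proc.returncode == 0, proc.stdout + proc.stderr

    def build(spec: str, max_rounds: int = 5) -> bool:
        feedback = ""
        for _ in range(max_rounds):
            code = llm_generate(spec, feedback)      # 1. code against a brief doc/spec
            with open("generated.py", "w") as f:
                f.write(code)
            ok, output = run_tests()                 # 2. review/test automatically
            if ok:
                return True
            feedback = output                        # 3. feed the failures back and loop
        return False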
Arguing that these tools have flaws seems like a losing battle. Soon those flaws will be fixed[1] and you'll have to find new flaws to complain about. Eventually, hopefully, you'll realize that you just don't like feeling displaced.
[1] it's unbelievable what a difference in quality 1 year made for ChatGPT
Why are you so certain that the flaws will be fixed? Seems like there is a giant leap between a machine spewing words based on probability and an actual deep understanding of the code it's supposed to write.
A "machine spewing words based on probability" is an implementation detail. I'm not making a grandiose prediction about the future. All I'm saying is that these machines are improving super fast.
I'm also stricken by the superficiality of analysis like "oh it's just probabilities" from so many devs; might as well say "it's magnets".
In my comment I was questioning the certainty that those fundamental flaws will be fixed. I'm one of those people who don't believe that iterating over LLM will make that giant leap.
You can call it an implementation detail, but it's like this: both a wheel and a wing can take you over some distance, but the difference between them is staggering. A wheel will never send you flying (normally).
> For now, they are worth evaluating and discussing, but are not ready for us to delegate the precise task of creating reliable, secure, and scalable software that powers our society.
The good thing about vibe coding is that it hides the software development lifecycle completely from the user's perspective, inside a platform that has an integrated SDLC: from defining the idea, to ensuring visibility into changes, to a runtime where the user can see it. In my mind, modifying without a hassle in a controlled environment is what users look for. Software development assisted by AI will be a thing for engineers, but vibe coding is aimed at users outside of engineering.
I sadly see only a handful of companies being able to pull this off.
The thing is the SDLC exists to harden software to a point where you can run a business on it. It didn't appear because people were bored. This feels more like a comparison to a Figma or other prototyping tool than something that can produce quality software.
If you believe in vibe coding, surely you are holding a massive short position in every major software company, no? I mean, surely, any day now, a bored student will vibe code a full replacement for a major software package and destroy the income of the SW giants one by one, right?
Wake me up when someone vibe codes a Chrome replacement, or an iOS replacement, or MS Office...
Except we know this won't happen anytime soon because we all know vibe coding isn't very useful beyond toy projects that leverage complex libraries written by actual developers.
I mean ... most code out there is pretty bad, so LLM assistants contributing pretty bad code just keeps the mean where it is. And obviously it has to be, how can anybody expect an LLM to produce output with quality that's higher than its training input? Expecting that is appealing to magic or some consciousness that doesn't actually exist or just plain anthropomorphising.
If you are working at a place where that quality level is standard -- and let's face it, a large number of companies produce average or below-average quality code (by definition) -- then using an LLM assistant isn't that bad. At least if such an assistant doesn't have some extra flaws beyond producing the best summary of its training data, which is exactly what an LLM does. It actually justifiably replaces developers in such an average-or-below place. But if you are aiming for the top end of the quality scale then there is no way this can be achieved by LLM output. Purely on principle.
This shouldn't even be a controversial opinion. I'm quite surprised every time this is questioned or even just debated.
"Bad" is doing s lot of work in your sentence. Do you mean slow? Or buggy? Or unmaintainable? Or unextendable? Uses patterns the tech lead hates? Hard to read? High cyclomatic complexity? Doesn't meet requirements? Security issues? Uses out of date libraries? Too much reliance on 3rd party code? Too much NIH? ...
I think "one shot ready for production code" is what AI cannot do yet. Which is why I am not worried for another 12 months at least :)
Strongly agree with the article, and happy to see so many lucid people, comments and articles on HN that thoroughly deconstruct the "vibe coding" illusion.
Also, Andrej Karpathy really disappointed pushing such brittle BS as a revolution.
I wrote this because I was worried that "vibe coding" was being misinterpreted to mean "any time an LLM outputs code", as opposed to the intended definition: coding where you deliberately don't review the code and see how far you can get.
1) Cursor has been crashing several times an hour for me recently.
2) Cursor seems to ignore .cursorrules files. I'm using the json format that's supposed to let you filter on file name patterns (although how that works for cross-cutting agent stuff I don't know).
3) Cursor is obsessed with making sketchy, iffy, defensive code checking for the most recent symptom and trying to guess and shart its way out of it instead of addressing the real problem. And it's extremely hard to talk it out of doing that; I have to keep reminding it and admonishing it to cut it the fuck out, fail instead of mitigate, address the root cause not the symptoms, and stop trying to close the barn door after all the horses have escaped. It's as if it was only trained on Stack Overflow and PHP manual page discussions.
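(For what it's worth, the pattern I keep having to veto looks roughly like this; an illustrative Python sketch, not Cursor's literal output:)

    import json

    # What the agent keeps writing: swallow the symptom and guess a fallback.
    def load_config(path):
        try:
            with open(path) as f:
                return json.load(f)
        except Exception:
            return {}  # "mitigated" -- and the real error is now invisible

    # What I actually want: fail loudly so the root cause gets fixed.
    def load_config_strict(path):
        with open(path) as f:  # let FileNotFoundError / JSONDecodeError propagate
            return json.load(f)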
(3) Cursor does not do the coding, it delegates that task to whatever model you have picked. In more recent versions, it does pick the model automatically, which is not necessarily what I would prefer. Some of the models it delegates to may have been trained the way you suggested here, others are more sophisticated. The fact that none of the models is controlled by the Cursor crew quite naturally means that it will have quirks talking to newer models.
Saw this submission earlier in the day and chuckled. The whole "vibe coding" thing is hilarious, and I say this as someone who heavily leverages AI in my coding tasks.
I honestly am not sure if this ad is a joke. I assume not, which is hilarious. Put in your 12-16 hour days for hilariously bad pay, and your onboarding will be doing one of the most pathetic, deadbeat jobs possible, which is making collection calls. And your "vibe coding" is to use voice agents to...make collection calls.
Must be pretty grim pickings if this trash is getting advertised on here.
Please can everybody who dislikes LLMs stop conflating "vibe coding" (fun, see how far you can get without actually coding, but not intended for serious projects as per Karpathy's original tweet) with the grifter version that sells it without the cautionary note, or with LLM tool usage as a whole class.
They are 3 different things, and neither of the first two represents anything more than a subset of the capabilities of the last.
If you don't like LLMs that's cool, but at least take some time to understand the context here.
That one would be extra awesome because by the time anyone realized there was some subtle stats bug in the resulting kernels, it'd have vaporized $10MM in cloud training opex
> There's a trend on social media where many repeat Andrej Karpathy's words: "give in to the vibes, embrace exponentials, and forget that the code even exists." This belief — like many flawed takes humanity holds — comes from laziness, inexperience, and self-deluding imagination.
I'm going to go ahead and give the author the benefit of the doubt that they aren't literally saying Andrej Karpathy is "lazy and inexperienced", because that claim is obviously absurd.
In general though, I think the author is missing the actual point Karpathy was making! Let's look at his detailed criticisms for the typescript agent run, for example:
> Regularly clones TypeScript interfaces instead of exporting the original and importing it.
> Reinvents components all the time with the same structure without searching the code base for an existing copy of that component.
These are only problems for human codebases. You're not vibing if you are expecting agents to write code the way humans would.
Duplicating interfaces and implementations is inefficient, and would be a nightmare, in a human codebase. But, the code will still work! So if an AI agent is managing the codebase, who cares if it duplicates things all the time?
Maybe it'll see that it did that later and decide to consolidate things, maybe it won't. It doesn't affect the actual outcome of the code, unless you actually look at the code as a human, which is not "vibe coding."
> When told to fix styles with precise details, it will alter the wrong component entirely.
> When told specifically where there are many duplicated components and instructed to refactor, will only refactor the first instance of that component in the file instead of all instances in all files.
> When told to refactor code, fails to search for the breaks it caused even when told to do so.
You're thinking about the code again, gotta stop doing that if you actually want to ~vibe code~. Refactoring code isn't a thing when you're vibe coding, English is your programming language now, the Typescript (or w/e language) is the assembly. You wouldn't spend much time observing the assembly output of your compiler (especially for web dev), so why are you observing the code output of your agent?
If you don't want to vibe code, that's fine, nobody is forcing you to. But if you're going to do it, grade it on the metric that Andrej was actually claiming: that you can get working results on a lot of software projects today by telling coding agents to make some code do something, and then just keep running it with "fix this bug" until it works, and it'll often get to a working result.
He never claimed that the code outputted would be beautiful, from a human perspective, or well formatted, or well architected, or efficient.
Vibe Coding is a trigger word for devs who insist it's a pointless exercise because it doesn't do 100% of the job. Devs don't seem to realize that's not the point - the point is you can hire fewer devs if you're only worried about the remaining 20%.
Currently, AIs emulate a less skilled, junior developer. They can certainly get you up and running, but adding junior developers doesn’t speed up a lot of projects. What we are seeing is people falling into the “mythical man month” trap, where they believe that adding another coding entity will reduce the amount of work humans do, but that isn’t how most projects come out.
To put it simply, it doesn’t matter if AI does 80% of the work if that last 20% takes 5x longer. As long as you need a human in the loop who understands the code, that human is going to have to spend the normal amount of time understanding the problem.
Indeed. My roommate has just been put on a new project at his workplace. No AI involved anywhere. But he inherited a half-done project. Code is even 90% done. But he is spending so much time trying to understand all that existing code, noting down the issues it has which he'll need to fix. It's not just completing the remaining 10%. It's understanding and fixing and partially reworking the existing 90%. Which he has to do, since he'll be responsible for the thing once released. It's approaching a point where just building it from scratch on his own would have been more time efficient.
It seems to me that LLM output creates a similar situation.
Yeah but AI coding does speed up some simple tasks. Sometimes by a lot.
But we have to endure these tedious self-congratulatory "mwa ha well it's still not as good as my code" posts.
No shit. Nobody is saying AI can write a web browser or a compiler or even many far simpler things.
But it can do some very simple things like making basic websites. And sure it gets a lot of stuff wrong and you have to correct it, or fix it yourself. But it's still usually faster than doing everything manually.
This post feels like complaining about cruise control because it isn't level 5 autonomy. Nobody should use it because it doesn't do everything perfectly!
> This post feels like complaining about cruise control because it isn't level 5 autonomy.
It's nothing like that, because cruise control works reliably. There is never a situation where cruise control randomly starts going 90mph or 10mph while I have it set to 60mph. LLMs on the other hand...
This is why I disagree with people who argue (as you did) "it really does speed up simple tasks". No it doesn't, because even for simple tasks I have to check its work every time. In less than the time it takes me to do that, I could've written the code myself. So these tools slow me down, they don't speed me up.
> In less than the time it takes me to do that, I could've written the code myself.
This hasn't been my experience at all. At worst you skim the code and think "nah that's total nonsense, I'll write it myself from scratch", but that only takes a few seconds. So at worst it wastes a few seconds.
Usually though it spits out a load of stuff, which definitely requires fixing up and tweaking, but is usually way faster than doing it all.
Obviously it depends on the ___domain too. I wouldn't ask it to write a device driver or something UVM or whatever. But a website interface? Sure. "Spawn a process in C and capture its stdout"? Definitely. There's no way you are doing that faster by hand.
Honestly, I'm not sure if there is any correspondence between an AI and a particular skill level of developer. A junior developer won't know most of the things an AI does; but unlike an AI, they can be held accountable for a particular assignment. I feel like AI is more like "a skilled consultant who doesn't know that much about your situation and refuses to learn more than the bare minimum, but will spend an arbitrary amount of time on self-contained questions or tasks, without checking the output too carefully." Which is exactly as useful yet infuriating as it sounds.
Remember that 80% of your time and resources is going to be spent finishing up the last 20% of the project. If the first 80% is borked by LLM code salad, you’re going to need to spend time fixing that code and making it actually work. That might take just as much time, if not more, than only using AI as an assistant (i.e. code completion) instead of the main source of code.
I'm currently 2x to 10x as productive with Cursor. The larger the project, the lower my multiplier.
However, on small tasks and bug fixes, it often fixes the bug before I've even root caused it. It's amazing when I can focus on throwing it information about the bug then have it think in the background while I continue researching. In a surprising number of simpler cases, it one-shots the fix and eliminates any need to root cause (this is a bit easier when it's a feature you understand intimately).
Exactly. I see these threads over and over, and it's just senior devs complaining about how it's the tool's fault, and not that they haven't put the time in to learn the new tool
The cycle of tool and framework re-skilling is constant in industry, and those trying to fight the wave always lose. And this one is a tidal wave. UPDATE YOUR SKILLS FOLKS!
Most programmers are already doing a form of vibe coding when they, for example, let an ORM write their database queries for them. I think a decent number of Rails and Django devs probably would struggle to write raw SQL queries from scratch. Mostly I see vibe coding as an extension of that, and it's not necessarily a bad thing since it lets you spend more time focusing on the actual problem you're solving rather than on implementing it.
Of course, I haven't seen a single vibe-coded thing that I'd want to spend money paying for yet, but that's probably more reflective of the difficulty of making something people want than whether or not you use vibe coding to do it.
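(To make the analogy concrete, here's a sketch using SQLAlchemy, only because it runs standalone; the Django/Rails versions are morally identical. With echo=True the ORM prints the SQL it writes on your behalf.)

    from sqlalchemy import Column, Integer, String, create_engine, select
    from sqlalchemy.orm import Session, declarative_base

    Base = declarative_base()

    class Beer(Base):
        __tablename__ = "beers"
        id = Column(Integer, primary_key=True)
        name = Column(String)
        abv = Column(Integer)

    engine = create_engine("sqlite://", echo=True)  # echo=True logs the generated SQL
    Base.metadata.create_all(engine)

    with Session(engine) as session:
        session.add(Beer(name="imperial stout", abv=9))
        session.commit()
        # The dev writes this line; the ORM writes the SELECT ... WHERE ... for them.
        strong = session.execute(select(Beer).where(Beer.abv >= 6)).scalars().all()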
And some circles hand wave away all criticism of any new thing as luddism.
This article is a bit more balanced, though, and clearly isn't criticising use of AI in programming, but specifically the "Jesus take the wheel" style of vibe coding. It's the same old "if you write code as cleverly as you possibly can, you are not smart enough to debug it", but to the next level, where people are writing code that they aren't even smart enough to read.
For example, here are YC partners quoting a company in a batch claiming "100x speedup" in coding performance compared to the previous month:
https://www.youtube.com/watch?v=IACHfKmZMr8&t=1837s