LongCut logo

Steve Yegge's Vibe Coding Manifesto: Why Claude Code Isn't It & What Comes After the IDE

By Latent Space

Summary

Topics Covered

  • Senior Engineers Resist Vibe Coding Most
  • Drop IDEs by January or Become Obsolete
  • Invest 2000 Hours to Master AI Trust
  • Future IDEs Become Agent Dashboards
  • Rewriting Code Beats Refactoring It

Full Transcript

We are here live at AI Engineer Summit with Steve Yaggi, the legendary Steve Yaggi of Stevie's tech talks, Stevie's platforms, brands, and most recently source graph and AMP. Welcome. And most

recently, Vive coding, I should >> That's right, the vibe coding book.

>> Uh, so this is the big vibe coding discussion. In the pre-hat, we're

discussion. In the pre-hat, we're discussing the intersection of vibe coding and AI engineering. So, we got the kind of movement leaders of both sides here. How do you see it? It's

sides here. How do you see it? It's

absolutely a movement, right? You got to get people behind it. I mean, there's a I said at the end of my talk today that there's a huge backlash and the backlash is only just brewing now. So, you and I

are pushing forward, right, on on these waves of, you know, AI engineering is about is about building AI enabled applications and being being in AI and vibe coding is about abandoning the old

ways of producing software and embracing the new ways, right? And both of these uh are making people pretty mad, right?

>> I don't I think they're mad if they're identity is tied to the way that they work today with no changes, no room for changes.

>> Yeah. So, I'll start with my first hot take. Huh.

take. Huh.

>> Okay, let's go.

>> There is a demographic that is the most affected by that. Their identity is the most tied up with the way that they work.

>> Okay.

>> It's not junior engineers. It's not mo engineers. They're all vibe coding.

engineers. They're all vibe coding.

It's senior engineers, senior leaders, uh people who have So, basically, you can you can narrow it down to 12 to 15 years of experience. They hate vibe

coding and they hate AI and they are online going my 15 years is better than net AI. Okay, that you saw I don't know

net AI. Okay, that you saw I don't know if you saw Jordan Hubard's post from Nvidia where he just laid out some really nice advice on how to get the most out of agents as you're coding. And

this guy posted and he's like, "Yeah, you know, no, you you stick with your doing your director stuff and leave the programming to programmers, right? When

you have 15 years of experience like me, then >> then you're qualified to talk, right?"

Right. So I said something to him like, "I think you need to learn to read a clock." And he's like, "Until you have

clock." And he's like, "Until you have 15 years of experience." And I'm like, >> "Well, you got more experience than him.

>> I have 45. or should I like go to 60 before I can talk to you or should I like cut out 30 years of experience so I can be as dumb as you right those are my options and uh so I don't know I guess

I'll see him in 15 years >> I I okay I think there's one element that I'm trying to figure out of well these people have to coexist right and most companies are going to have a mix

even open AI by the way we talked about this last night at dinner guys openai has people who don't use AI to code >> they have people who don't use uh codec X they probably are using cursor or something.

>> Okay.

>> But they're not using the agentic loops, right?

>> Yeah. Yeah.

>> And uh yeah, it's uh so you know we talked to you know Andrew Glover there you know the director of dev prod and uh from what he was saying they've been planning on going >> public with this once they have more

data about it. Yeah.

>> Anecdotally they're sharing that uh >> performance >> the performance difference is like 10x any way that you measure it. So lines of code commits business impact whatever.

And it's it's so stark and pronounced that the people who aren't adopting it are now 10 times less productive at performance review time. Two people,

same title, same job, and all of a sudden one of them is 10 times as productive than the other one. What do

you do? And the answer is you panic. You

actually go to HR and you go to legal and you're like, what are our options here? Because the time is coming. Okay,

here? Because the time is coming. Okay,

here's another hot take. All right, if you're still using an IDE to develop code by >> January 1st, you're a bad engineer.

>> There's a there's a hot take for you right now. You still have a what, five,

right now. You still have a what, five, six weeks to to still be an okay engineer while you're using your IDE, but this is the time that you need to drop it and learn how agents code, okay?

Because it's a skill set. I mean, it's so complicated. We wrote this book about

so complicated. We wrote this book about it, me and Jean, Kim, cuz we were, you know, we were playing with it ourselves last year and blogging about it and talking about it, and every blog post was 30 pages.

>> It's like, what are you going to do with a 30-page blog post? That's long even for me, right? Yeah. And at some point I was just, man, like the skills that you got to learn in order to get the AI to do the things that everyone's mad because it's doing them,

>> right? Because everybody's everybody's

>> right? Because everybody's everybody's like, "Well, I tried it. I spent two hours with it and all it produced was garbage." And the answer is actually you

garbage." And the answer is actually you have to spend 200 hours with it. You

have to spend 2,000 hours with it. And

that's not actually an exaggeration.

Gene just pulled up a study that showed that you actually have to spend a year or 2,000 hours with AI before you trust it. And what does trust mean? Trust in

it. And what does trust mean? Trust in

this case specifically means before you as a user can predict what it's going to do. And if it's unpredictable, of course

do. And if it's unpredictable, of course you're going to be mad. But as soon as you've worked at it with it for a full year to where you fully understand its capabilities and its drawbacks, which

haven't really fundamentally changed, it's gotten more capable, but the edges are always the same. It hallucinates, it gets lost, it gets amnesia, dementia, it lies to you, whatever. Right? Those

skills, we've been building them for years now. everybody who's been trying

years now. everybody who's been trying to write code with AI. We've been

trying. It hasn't really worked, but it's been working better and better and better and better, and now it's reached the point where it's working a lot better than all of the other options.

Yeah. And if you haven't tried it in 2 months, you're way out of date. The

models are much better than 2 months ago. If you haven't tried it in a year,

ago. If you haven't tried it in a year, you're a dinosaur. It's just

unbelievable how bad you are. And and

and you know, you may be, look, I have friends who are much better engineers than I am. Okay? I mean world class maybe some of the best in the whole world okay have built technologies that

you've heard of and they're not using AI yet except the occasional I'll ask cursor a chat question like Wikipedia whatever okay those people are going to be the interns in a year

>> you really think so >> with all their experience have >> I have had this hypothesis that has not been really confirmed with any any anecdotal evidence at all until today when I met somebody at your conference

that told me about how he had been in this position, 12 years of experience, didn't want anything to do with AI. And

he met these two PhD students from somewhere in Europe, I forget where, and they were both just super hardcore vibe coders, you know, with with the the agents, right? And he was watching them

agents, right? And he was watching them work and they were super junior and they kind of didn't know what they were doing, but they just had no fear and all the ambition. And all they did was they

the ambition. And all they did was they just kept hammering on the thing going, "Okay, well, why did you do it that way?

Explain it to me. Okay, well, let's let's let's look at other options." And

they would just be kind of the perfect engineer with no context. The perfect no context engineer. What questions are

context engineer. What questions are they going to ask? Have you thought about scaling? Have you thought about

about scaling? Have you thought about security? How is your test coverage?

security? How is your test coverage?

Right? I mean, engineers are going to all ask the same questions, right? And

he realized that engineer in a box is not too far off from knowing the right questions to ask an LLM. And that these two students were so productive with it.

He was blown away that he was like, "Oh, no." Like that's when the right the

no." Like that's when the right the light bulb went on. I have to learn this. And now he's been doing it ever

this. And now he's been doing it ever since. Right. But it ain't easy. I I

since. Right. But it ain't easy. I I

You're not going to pick up claw code and you're not you're not going to just try it and be like it's just going to work for it. Might you might get lucky, but eventually if you don't have the right mindset, if you don't have the right attitude going in now, even with

the right attitude, how often have you swear sworn at your agents in the last two days with the actual f-word or like right >> I'm pretty polite. I say thank you and please.

>> I say thank you and please and go where did you do that? Right. And it's it's because it's because Jean and I realized this after we published the book. You

have this helper. They're very

humanlike. They come in. You have to tell them a lot of stuff and they they they need a lot of guidance. But over

time, they need less guidance. Your

prompts get shorter. Things get

streamlined. They seem to get it.

They're working. Now, if this were a human being, you would draw the conclusion it's because they understand you and they get you and they're finally part of the freaking team. Do not make that mistake with LLMs. Never make the

mistake of anthropomorphicizing an LLM like Larry Ellison, right? The LLM at any moment can stab you in the back.

Okay? It can just be like, "Yeah, we took care of that really hard problem.

Now I'm going to delete your database."

And you're just like, "No." Right? And

and it's because of that it's we call it the hot hand. You you sort of like you're like, "It's going, man. I'm

feeling good. I'm feel this thing gets me. I'm going to make it do a production

me. I'm going to make it do a production change." And that's that's how I found

change." And that's that's how I found out about this. And it's like I was like, "My script can't access prod." And

so it chose to do it in the worst imaginable way. What it did was lock out

imaginable way. What it did was lock out the entire rest of the universe including my live game and everything else and only allowed my script to access prot and it was changing password. It changed the password and I

password. It changed the password and I was like why did you change my password?

Right. And it's like oh I'm so sorry. I

I definitely shouldn't have done that.

What was it?

>> Okay. And I'm just right. This is what will happen to you if you just >> you just try to do a coding. Okay. Bad

things will happen. This is what our book is about really. Right. Well, I

mean, that's not the best ad because then what? Like

then what? Like >> you learn and then eventually you learn how the speed bumps and the corners and everything. It's like driving, right?

everything. It's like driving, right?

It's like driving. Like you you you you want to become like a NASCAR driver.

Like this is high performance stuff.

You're coding with 12 agents at a time and you're you're more ambitious than you've ever been. I was talking to a guy today who's got got way more projects going than I I've gotten. And I don't know where he gets all the time from,

but he's probably doing 10 or 12 like major projects at the same time right now. And he's just doing it all with

now. And he's just doing it all with with a gent coating, you know? So, I

mean, like, man, the the the the ad here is that you will turn into Batman, but you can't just grab the suit and put it on and be like, I'm Batman. You're just

a cosplayer. You're cosplaying at vibe coding. You got to learn how the tool

coding. You got to learn how the tool belt works. And that's going to be pain,

belt works. And that's going to be pain, suffering, and mistakes and learnings.

Now, you can get a lot of it by reading this and all of the other vibe coding books. Read the O'Reilly, watch the

books. Read the O'Reilly, watch the talk. I mean, seriously, like you should

talk. I mean, seriously, like you should like get all of the possible angles at it because it seems to land differently for different people. There will be some analogy where you finally get it. I get

it. It's like this and and it's like a 3D printer and nobody else thought it was like a 3D printer, but somehow that was the magic that made it for you, right?

>> Yeah. I would say one of the biggest surprises from the dinner yesterday was how many people all had the experience where they no longer write single lines of code like they they're really just

kind of prompting and and doing that >> single lines of code. You mean they never write any code at all?

>> They might edit but like I think when they're writing net new they always start with the prompts.

>> No editing, no touch.

>> No editing.

>> It is it is very expensive when you're like that that identifier is misspelled and it's a local. You know, you could just addit it, but it's better for you to close your IDE and probably uninstall

it. No, actually that's not true.

it. No, actually that's not true.

Somebody finally convinced me that IDE are fantastic. Intelligj in particular,

are fantastic. Intelligj in particular, keep it open. It's Gradle build and actually not for the LSP, although you can use it for that.

>> Actually, that's another good way to use the LM if you get an MCP server. But no,

it's that IntelligJ's autoindexing is so much faster and incremental rebuild is so much faster than last night. Yeah.

>> Yeah. So you all all you do is leave intelligent running but you shouldn't look in it. It's a tool for the AI now, right?

>> Amazing. One other thing that is a big part of Zonda hotics. You're saying you said cloud code is not it.

>> Cloud code ain't it.

>> Explain yourself.

>> All right.

>> Everyone here loves cloud code.

>> Everyone here loves cloud code or or AMP if you use our product which is uh is just recently leaprogged cloud code again because of Gemini 3. AMP has this cool feature where it goes to another

model. And just to pre-warm you, I also

model. And just to pre-warm you, I also want to talk about just Google in general and how this Gemini revolution has kind of changed Google's image. But

let's talk about cloud coding.

>> Sure. Cloud code's been around since March. Cloud code has been proven to

March. Cloud code has been proven to work and so but yet probably 80% of the world's 90% of the world's programmers are not using it or anything like it.

You you get certain companies where it's really taken off, you know, but uh but most aren't. The world is stuck on

most aren't. The world is stuck on cursor. The word world is stuck in 2024.

cursor. The word world is stuck in 2024.

Last year we were trying to get people to write with chat, right? And we were like we're tell and they were like no completions and we were like oh god no but it can generate the code and you

just got to paste it in and you just got to do all this stuff and they were like that sounds kind of hard and we're like but it's faster and they they wouldn't do it and then nine months later it finally percolated in and now they're all like I like cursor and it's like

that's so last year dude right like wake up and yet they haven't adopted it and so you have to at this point look at it and say why haven't they adopted it let's go look at the reasons and the

answer is it's too hard it's too hard.

You have to be able to read, man. Most

engineers, honestly, like to them, five paragraphs is an essay. Okay? And with

cloud code, you've got to read waterfalls of not just information, but also code and diffs, right? Because if

you're going to put your IDE away, you actually do have to look at the diffs.

Now, I'm going to tell you that once you get some expertise at this, you can actually tell from the shape of the diffs and the color of the diffs and the length of the diffs.

>> You can tell whether it needs a code review, whether they're doing the wrong thing, whether they seem to be writing suspiciously too much code for this problem. Right? The diffs alone, just

problem. Right? The diffs alone, just the shape of the diffs can tell you a lot about what's going on without actually reading the code. But you

should pay attention to them. Otherwise,

you'll have problems that will only crop up later, right? But yeah, I mean like uh uh uh put the IDE away. Uh, okay.

Clog Code and then get Cloud Code out and try to start using it. All right?

And you're going to find that it's Look, I've been using Cloud Code honestly 10 to 12 hours a day, literally, uh, for for months and months and months and months, and I still curse it out all the

time. I just lose my mind. I'm like, how

time. I just lose my mind. I'm like, how could you have done that when you just said, right? And it's like, it's

said, right? And it's like, it's actually been shown, it's it's starting to be shown that sometimes when you put a little pressure on them, they perform better. You can break through law jams

better. You can break through law jams that way. But anyway, look, you're going

that way. But anyway, look, you're going to run into problems and and but the thing is next year the tools will be better. Okay, if cloud code's not it,

better. Okay, if cloud code's not it, what is it? Well, we got to get back to something like an IDE, right? I mean,

that's just going to be it's got to be natural for people. You got to be able to look at it and see what's going on, not have to read. It's got to have visual indicators, right? And and yet it's not going to be an IDE because an IDE is very much focused on helping you write code and that's not what you do

anymore, right? So, what it's going to

anymore, right? So, what it's going to be is it's going to be your agent orchestration dashboard. It's you're

orchestration dashboard. It's you're going to walk in in the morning and be like, "Yo, so how are things do?" Right?

>> It's like, "Oh, that one's still running. That one's running a tool. That

running. That one's running a tool. That

one needs my input." Okay. Right. You

just go through the list. And

>> so I'm building one. You can go look.

It's supposed to be a private repo, but it's public. So I've got forks and

it's public. So I've got forks and Um, happens. But whatever, you can play

Um, happens. But whatever, you can play with it. It's called VC, Vibe Coder.

with it. It's called VC, Vibe Coder.

It's my B2 of the Vibe Coder system. And

what it does is it creates a set of uh scanned workflows that run the agents for you.

>> Yeah. I don't know if you saw anti-gravity from Google the other day.

Uh which two days ago so >> it's so fun how much stuff people are inventing that are all >> periphery. Yeah. Yeah. No. So look I

>> periphery. Yeah. Yeah. No. So look I called this I don't know I called it in March with Revenge of the Junior Developer. I did that chart and

Developer. I did that chart and everything and like uh Dario quotes it in all his customer advisory boards and everything, right? Really?

everything, right? Really?

>> Yeah. Yeah. No, it was it was really pretty impactful and and I called that what's going to happen is the that agents I even back in March I knew they were too hard. I was like, what's gonna happen is they're you can run them programmatically and 90% of the crap

that you do with them could be handled by a model often a cheaper model, right?

If it's just like if it's asking you which of these two things should I do next, they're equally important. Like

just have haiku say either one, right?

So like I called the orchestrators are coming and it's taken close and like the end of the year to get there, which is roughly where I predicted them coming.

Replet, agent 3, uh there's a bunch.

There's the conductor, there's uh DMAD came out, open source. They're all

different, you know, takes on it, right?

And uh but there there will be more coming. I guess uh Google's as well,

coming. I guess uh Google's as well, right?

>> Yes. I like this analogy that they have.

It's still pretty new, so who knows what the eventual vision is is that you just get notifications from your agents as they're working.

>> Exactly. Yeah. So, in mine in VC, there's an activity feed. That was one of the first features I added, which is like I want to go work and I just want to get notifications periodically of interesting stuff.

>> Interesting. I wonder if we'll have like social networks of agents.

>> Well, agent each other, following each other.

>> Well, so I just had three-hour coffee uh uh with uh Jeffrey Emanuel, who's who's he did the MCP agent mail. He's one of the smartest people I've ever met in my life. He's the one that wrote the

life. He's the one that wrote the article that crashed the stock market about Nvidia. Oh,

about Nvidia. Oh, >> that Jeffrey Emanuel, the one that an incredibly well-written article that said this is why it's a bubble and the whole market went and Karpathy started following >> is back up.

>> He wrote what you just said. He said it is back up. But he wrote uh agent mail which is he was just tired of having to copy stuff between his agents like you tell me what to tell this agent. And so

he made a little little like I don't know HB server that's like an inbox for them a messaging and they talk to each other now and now he goes coordinate amongst yourselves to paralyze this task this epic that I just put together or

whatever and they'll do it some people are coming at it top down and trying to build orchestrators that you know do it all for you but interestingly with beads right which is the issue tracker session thing that I that I made plus his

>> purely voded by the way >> purely vcoded yes so I mean like I get PRs every day for horrible problems that I introduced but nobody seems to because we've got stable versions now. So, beads

is like living proof that you never actually have to look at the code as long as you and other people are asking the right questions and having the AI look at the code. I get PR from people all the time where it's obvious that the AI did all of the analysis and all of

the coding and I look at it and sometimes I'll just be like, "So, my AI, what do you think of their II's PR?"

Right? And you all summarization.

>> I mean, isn't that bad? Don't you want >> it's bad if your code if look it's all about the outcome >> v is working and it's got tens of thousands of very happy people using it.

So obviously it's not bad.

>> I met one of >> there if you do this uh to your your your company's production website and bring it down then yeah it's bad >> but still be is kind of a database you know and database is one of the harder

things to make.

>> You know beads is really weird. The

architecture is really weird and the only reason it works is because it wouldn't have worked in the old days. it

would have been just too hard to manage and and and not programmatically. But

what you do is you tell the AI, go fix it all up and whenever it's corrupted or there's a merge conflict or just just fix it. And it's funny because Jeffrey

fix it. And it's funny because Jeffrey Emanuel who did the mail basically did the same thing. He has all his agents run in the same directory and they do file reservations. They're like, I need

file reservations. They're like, I need that file. Man, I used to do that

that file. Man, I used to do that Accenture in the '9s, right? I'd like

run over to a dude's cubicle and be like, I need that file. Their revision

control was so bad. So like he's got a file reservation system going. But but

it all what happened was as soon as he put it in place his his agents just started working and now he's got this little village of agents, right? And

that's that's where we're headed. So the

orchestrators are going to be about not keeping the agent on the rails but keeping all of your agents on the rails and communicating with each other. Yeah.

>> And then you hit the wall.

>> Boom.

>> Does anybody know what the wall is once you get past all this?

>> Merge.

>> Merging is the It's the wall that everyone is hitting right now. Yeah,

>> I think the company that's best poised to solve it is Graphite. I was going to go talk to them about it.

>> They'd be happy to talk to you. Yeah.

>> Yeah. I think everybody needs to solve it. And if you're at an enterprise like

it. And if you're at an enterprise like what we hear cuz Gene Kim and I talk, we talk to companies all the time. I'm a

SAS seller. So, a source graph. So, we

get to hear the the inside story from all these big companies, right? And

they're saying, "Yeah, as soon as you get to the point where like every developer is 10 times as productive, merging their code becomes this incredibly complicated problem because I you and I work at the same time for two

or three hours. We make, you know, 30,000 line change each. Mine makes it in first." H and it gets merged and then

in first." H and it gets merged and then you come along and I have literally changed our logging system and our like, you know, our architecture here and APIs that you were using. Yeah.

>> And so it's not going to be as simple.

It's not a simple let's fix the merge conflicts. It's like you're going to

conflicts. It's like you're going to have to reinvision and reimagine and reimplement your change on my change >> or rip yours out >> or rip mine out and make me do it. But

ultimately ours were just the AI doing it, right? But the the important thing

it, right? But the the important thing is that they have to be serialized. It

is a cue and that when they go in there, they have to actually like basically redo what they were doing on top of the new thing.

>> Nobody has solved this and it is a huge obstacle right now. You know what one company did? Sorry, last thing. One

company did? Sorry, last thing. One

company said, "Here's our solution. One

engineer per repo."

>> Not making that up.

>> It's a solution.

>> It's a solution for now.

>> The the classic solution for this is stack diffs, right? Merge cues, stack diffs. That

diffs. That >> I don't know about stack diffs, so I guess I'm dumb.

>> It's a It's like a Facebook concept that they're trying to bring into the white ros. GitHub is working adding it. I just

ros. GitHub is working adding it. I just

talked to Jared Palmer there. Basically,

I I'm hearing no solution yet, but you should be aware of it and design around it.

>> Yeah. I mean there's the oldfashioned way of just hammering through it really hard and >> well also you know you could just talk to the other guy and say like hey I'm doing this you know pretty deep architectural change let me go first and let's let's agree on

>> the overall pattern first.

>> So yeah I mean I've run into this situation a few times where I've actually tried to give this agent the heads up that this one's making a change that affects this one >> with the mail thing that Jeffrey did. I

think once I get it wired up cuz he doesn't use work trees and I'm going to this but once once they can actually talk to each other I think it's going to be as simple as just keep in mind that that agent's working on something that affects you. You might want to go talk

affects you. You might want to go talk to them about it.

>> Yeah. And agree on an overall like fundamental >> ifra and they're quite good at it. I

>> they just have no it's because they have no ego. They're not like oh it's got to

no ego. They're not like oh it's got to be me, right? So just whoever's first gets to be the leader.

>> Great. What do you and him disagree on?

>> Me and who?

>> Jeffrey.

>> Emanuel the guy that I just met. Well,

we uh so we foundationally fundamentally disagree that having 12 agents work in a single repo clone is a good idea.

>> So you're on the pro side.

>> I'm on the pro lots of like either get work trees with lots of branches or separate repo clones. I would imagine keep them sandbox them all in the same they're all they're literally they're

using the same git the same build. So

one of them will be like doing a build like need to run a test.

>> That's so much churn.

>> Yeah, but he has a file reservation system. So, the funny thing is, okay, I

system. So, the funny thing is, okay, I was like, "This is insanity." And he's talking me into at least acknowledging that it probably works pretty well if you're a solo dev and you're using no more than a dozen or 20 agents because

it is actually working for him. And he

uses the same principle that Beads does, which is it wouldn't have worked in the old days. It doesn't make any sense to a

old days. It doesn't make any sense to a real engineer. And yet, you tell the AI,

real engineer. And yet, you tell the AI, "If anything gets messed up, just fix it." And they will. And so, that's

it." And they will. And so, that's right. That's why his thing works

right. That's why his thing works because every once in a while the file reservation gets screwed up and they're like, "Hey, we need to resolve this."

and they figure it out.

>> Interesting. Interesting.

>> Uh I I some people have proposed that the theme of this conference next year is on multi- aents.

>> Oh yeah. I mean Yeah. Of course.

>> Yeah. Yeah. I mean AI will be about multi-agent. Look, we're in this phase

multi-agent. Look, we're in this phase still where we're we're cutting down corn with scythes with our hands. That's

what a a real programmer does these days. We're moving next year. It's very

days. We're moving next year. It's very

clear. We're moving to uh you know um these these machines that churn you know these giant just like those ones that you see on the farms today factory farms. We're going to be factory farming

coat. Okay. And that absolutely like uh

coat. Okay. And that absolutely like uh a lot of people are just so dead set against that philosophically, morally, ethically, whatever. They're just like

ethically, whatever. They're just like >> they're so used to subsistence agriculture that we're not we're not used to like the big >> John Deere. But we are we are actually moving into the John Deere era of coding.

>> That's amazing.

>> Yeah. Yeah, but the funny >> analog you actually >> and I just thought of it too. Well,

we'll have to reuse it.

>> Yeah, >> but it's been it's then it's been growing on me. It's the it's the whole it's this idea that quad code and AMP and Codeex, you know, Klein, we love them all equally. They're all equally

bad. I said in my talk today, they're

bad. I said in my talk today, they're like they're like a power saw or a power drill. A f a skilled craftsman can do a

drill. A f a skilled craftsman can do a lot of good with them and then you can also cut your foot off with them. The

same thing is true of cloud code. that

imagine a big machine, a big farming machine that knows how to run cloud code and scrub it, right? Almost it's like it's like, okay, you plan, you implement, you review, you test, right?

And you split it all up and now you got yourself factory farming, right? It

works. People are building it, it's going to happen. And what it's going to do is it's already started to unlock programming for non-programmers. And

this is completely turning companies upside down. They're starting to realize

upside down. They're starting to realize that maybe the ideal time team size is like two or three. Yeah.

>> And I mean like right the whole way that companies are run the whole governance structure is going to change because now coding is no longer the bottleneck. The

business needs to get immediately involved. The feedback back loops get

involved. The feedback back loops get faster and it's really exciting times but it's too much for a lot of people and they just they're they're like checking out or they're they're revoling online. And I predict that as this

online. And I predict that as this capabilities improve and as we get closer and closer to the factory farming of code we will see a massive backlash from the leadites. You are the one of

the few people that can ask this as a I know a a lot of people in our audience are critical of going the full hog with this.

>> Yes.

>> So a lot like they're like fine for front end fine for application code but don't touch my cloud infra don't touch my my backend my distributed microservices.

>> Definitely don't touch anything production only touch code only use these things when git is your back stop for starters. Okay, so

for starters. Okay, so >> keep prod. It's going to be real tempting to write but don't. If you have git as your back stop, why should you be worried?

>> True, except I guess people have the perception that it is less good at backend code.

>> Ah, this is the problem where everybody's bad at math. Yeah.

>> Okay. So,

>> how good was Chad GPT 3.5 at systems code?

>> Pretty bad. How long ago was that?

>> Okay. People think people think the honestly I believe that the misunderstanding here is rooted in a fundamental >> belief that the models are done getting smarter, >> right?

>> And the funny thing is they could be done getting smarter. They're not, but they could be and we would still be over the hump where we've discovered electricity and now we need to harness it. We will still get to factory farming

it. We will still get to factory farming code with today's models capabilities and we'll get there fast. We'll get

there by summer. But the models are getting smarter so fast. You know, it's really there's there's this interesting tension of, you know, like you're building tools for capabilities that the models will eventually have built into their brains. Yeah. And so you won't

their brains. Yeah. And so you won't need that capability in the tool anymore. And so there's this constant

anymore. And so there's this constant arms race and decay of your tool filling gaps for the model until the model's good enough to fill it itself and then your tool moves on. All road is becoming all code and all tools are becoming throwaway.

>> Like >> which is great because they're easier to build too.

>> Yeah. By the way, yes. So remember uh Joel Spolski, one of the greatest, you know, of our of our our generation, our time, one of the greatest writers and thinkers. He gave the best tech talk

thinkers. He gave the best tech talk I've ever seen and I want to get him to come and revive it. He gave it at Amazon 20 years ago. It's still relevant today.

>> He's invited here.

>> Great. So Joel Spolski a long time ago wrote something that was timeless until today. So it was 20 years timeless,

today. So it was 20 years timeless, which was >> never rewrite your code.

>> Never rewrite your code. And now we've discovered that it is for a larger and larger and larger class of piece of bodies of code. It is better to just start over and rewrite it from scratch than it is to try to fix it. The LLM

will do a better job. I first noticed this when I was trying to port all of my unit tests from one architecture to another. And eventually I was just oh

another. And eventually I was just oh just iteration because they're trying to fix there's a lot to keep in but instead if you say throw all the tests out and make them again it just goes and you're

done, right? And so it's like,

done, right? And so it's like, well, what about this library I gotta refactor? And so it's creeping up, but

refactor? And so it's creeping up, but we're moving into a world where uh the fastest thing to do is just build new code that does a better job of what the old code was trying to do.

>> Yeah.

>> I mean, it's like we're unlearning everything. I feel like an upside down

everything. I feel like an upside down land, but this is it's like we've entered quantum mechanics, but you have to you have to embrace this new world. I

love the energy and the credibility that you bring because a young kid could say what you're saying and not be as believable. But you're coming from the

believable. But you're coming from the perspective of you've been a huge systems programmer. You've been a game

systems programmer. You've been a game programmer. You've been everything.

programmer. You've been everything.

>> Yeah, I've done I've done assembly language for 5 years, you know. Yeah.

>> Operating systems in assembly language.

Uh and it was 8086, not even 8x86. God,

we had 8bit registers. I've done it all.

And you know, the game pro game game programming teaches you everything. And

then of course I've done platforms and Google and ads and this and that.

>> You know the agentic loop and the game programming loop share a lot in common.

>> They do resource sharing operating system loops as >> I feel like I'm building the same systems over and over again now. Yeah.

>> It's it's it's there's only uh we're cursed to reinvent the same designs in every new domain. It's a privilege too.

You know one thing I wanted to get you to comment on is Google.

>> Oh Google.

>> One of my favorite memories which is like just before you retired was talking about how Google still doesn't get it.

Uh, Google Cloud in particular, how they shut down >> the deprecation policy.

>> The deprecation policy.

>> I was so mad about that. You got to get me pretty mad to write a blog.

>> They seen Have they turned it around?

>> No. I talked to some people there and a lot of them were like, "Yeah, that's not a thing for Google." And it's funny cuz, you know, Amazon uh Amazon, not on the platform, not on the deprecation stuff, not on the important stuff. Google has

turned it around on on uh execution.

>> Yeah. They finally did the thing that they should have done, you know, 15 years ago, which is hold people accountable and it's not just engineers do whatever they want all the time, which is what it was for 20 years. It

actually worked pretty well because they had a monopoly on ads and they could afford to subsidize Google engineers doing whatever they wanted for but you know ultimately they had to do the right thing and grow up and mature as an organization and it was painful and they

lost some Google culture and it's not as fun anymore but they now execute well and they did the right thing for the company and now with Gemini you can see now they've been shifting their focus gradually towards more AI AI and now it's starting to pay off for them.

>> Yeah.

>> And maybe they're going to be the big big winners. Do you have observations of

big winners. Do you have observations of a similar kind with all the other labs?

You know, I'm just kind of curious and your takes on one of my favorite charts is is that old chart where you had Microsoft like all pointing guns at each other.

>> Yeah.

>> U Facebook, everyone's >> the first person to ask me this. I

remember that chart. That was funny.

>> Yeah.

>> I feel like someone could do that for Open.

>> They could. They could. You know, it's an interesting question. All three of those companies, Google, Anthropic, and OpenAI are unbelievably chaotic internally right now.

>> Yeah. chaos. Okay, Anthropic hides it really well.

>> They seem >> they seem like they've got their act together. So, what what what that means

together. So, what what what that means is their product managers formed a wall around that chaos and bravo anthropic product managers. But it is and it's not

product managers. But it is and it's not because Anthropic screwing up. It's

because it's an inevitable function of growing that fast. They're hiring like 100 plus people for cloud code in the next I don't know month. I mean like they're they're going wild and that's just cloud code. You're not gonna I mean I was at Google and Amazon when they

were in the big fast phases and you're just going to have chaos. You're going

to have churn. Nobody knows who to talk to what and everything's crazy.

Eventually it starts to smooth out, settle out and they'll get there, right?

Open AI is chaotic more like in a well they had a lot of exits, right? You

know, I don't know if they're as chaotic as say GitHub who lost most of their senior leadership and was just complete turmoil for years, but they're pretty chaotic at open AI, right? Uh, and then Google, you know, um, I we were just

talking to somebody today that was saying it was just still too hard to like get consensus across groups to with the Jules Jules team. They can't get it rolled out internally because Google is so siloed. It's a billion monoliths,

so siloed. It's a billion monoliths, right? Little little little apps that

right? Little little little apps that don't talk to each other that it's hard to roll anything out across Google.

>> So, uh, all three of them have execution problems right now. Uh, I think Anthropic's probably executing a little bit better than the other two, but it's real close race and uh, yeah, it'll be interesting to see and and see see if Oracle or Facebook or any of the others

can catch up, right? Meta

>> Facebook will be the most interesting thing. I mean, they'll have to do

thing. I mean, they'll have to do something huge next year.

>> Next year could be the year of open source models.

>> Yeah.

>> So, uh, look, as soon as open source models get to the point where they're as good as Cloud Sonnet 37 was, then you turn on Klein or something and you've got something that as good as Cloud Code was in March, which wasn't as good as

today. and it's not good, but it's good

today. and it's not good, but it's good enough and you're running it for free free free on your local M4 or whatever, right? So, uh uh yeah, I and and from

right? So, uh uh yeah, I and and from what I've heard, they they're seven months behind and that that gap is gradually narrowing the Frontier models, which means OSS models will be as good

as Gemini 3 next summer, >> right? So yeah, next year could very

>> right? So yeah, next year could very much be the year that means the tools are going to have to get much much better at decomposing the task and assigning them to the right model, the right size of model for cost optimization.

>> I'll represent the critical side which is that the reason they're converging is cuz they're saturating, right? There's

only there's you can only ever hit 100 and the closer you get to 100 proportionally it'll just get harder and harder, right? So obviously uh the rate

harder, right? So obviously uh the rate of change when you're lower down is is higher as compared to when you're already saturating. But that's a minor

already saturating. But that's a minor technical point.

>> Well, no, I mean it's not it's not minor at all. It's actually a foundational

at all. It's actually a foundational question which is is the line of AI uh intelligence going to go straight or is it going exponentially or is it actually starting to peak >> asmtoic? Yeah.

>> asmtoic? Yeah.

>> Yeah. And um you know from what we've heard from people who are very very close to the research we know that AI has been getting uh what is it four times smarter every 18 months for the last I don't know 30 years because of

Mo's law and they think that there's enough data left training data for two more cycles of that before they don't know what happens maybe it goes up more maybe it goes down we don't know >> human history ends

>> but two more cycles means they're going to be 16 times smarter in three years right >> well I don't even know what that means Well, I've spent a long time trying to figure out what it means, but what it means is they're going to be really,

really, really smart, and it's going to change the world, probably in a lot of good ways and a lot of bad ways. And um

yeah, >> I don't know if you had this version of this conversation. People ask me if

this conversation. People ask me if their kids should learn to code cuz >> kids should learn to vibe code.

>> You like you have the escape hatch of you can read the code if you want to.

You don't just don't need to most of the time, but you can and it's a good guard, >> right? But I don't because

>> right? But I don't because >> yeah, >> you don't have to.

>> Well, I think my take is whatever it is, you'll be better off if you do also know how to code because you can prompt better, right? You can because you can

better, right? You can because you can tell you can communicate more precise terms. Look when you when I see you say you know how to code not the syntax and stuff but you have to know like in a

language neutral way what the capabilities of languages are functions and classes and objects and I don't know monads whatever it is the whole superset you should be aware of them and then

from there up so you you've you've cut off all the syntax you don't care how to write it anymore but you care how it works so you've sort of reached the level of how a product manager thinks about things architecturally right and you need to be that product manager and

now you're starting to move your concerns up up and you need to know all the engineering stuff and uh like Jeffrey Emanuel like I was talking about he's a mathematician self-taught engineer he doesn't but he he's learned

all of the right concepts and you know you know Cloudflare does this and and Apache Cassandra does that and >> that is still technical yeah >> that doesn't go away >> you still need to learn all that right and so just because you don't have to

write code anymore doesn't mean you have to you still have to learn a massive amount of stuff to be an effective engineer in the new world because that's the level that you're interacting with the Mat.

>> Amazing. So, this has been a a great overview. I don't know if you have any

overview. I don't know if you have any other sort of rants in you that you want to sort of get out there. I I I'll leave you the floor.

>> I feel like the gossip rate has gone up like not gossip, but the the rate of exciting announcements by engineers who have discovered new things about how to be more productive with agents. Um like

for example I just found out today not not this I was I found out today about it's called code MCP or something like that where you uh yeah instead of calling >> pretty popular project >> the agents can't call MCP very effectively because they don't have any

training on tool calls but they have plenty of training on writing code so you tell them don't call the tool write code to call the tool and they do way better with it right so it's like it's all these little learnings that we're finding right are >> it's crazy that anthropic the creators

of found this >> did they >> yeah well cloud player found it first but enthropic was like yeah you guys are Yeah. Wow. That's really neat.

Yeah. Wow. That's really neat.

>> So, I think that's why I love focusing on the AI engineer because my argument is the AI engineer can uniquely take advantage of LLMs way better than everyone else.

>> That's true. So much more.

>> You could almost define an AI engineer as somebody who's mastered LLM.

>> Yeah.

>> Yeah. Um not from training, but from using >> using Yeah. Yeah.

>> I think it's it's one of these like disruptor like strategies where like it's low status. It's high status to be a researcher. It's high statist train

a researcher. It's high statist train models. You don't get any respect if

models. You don't get any respect if you're a GPT rapper. But like it's starting like people are starting to be more productive and like actually develop sincere expertise in the same

way that I think like F1 car drivers don't know how to build an F1 car, but they know they'll tell you everything about driving it to the >> and they may know they in a sense more about operating it than the people who

build it. And so they have to have that

build it. And so they have to have that conversation right?

>> Yeah. Although although if you watch the uh I think the F1 movie, you you get a little sense >> and they make all the movies. Is that

what you said? Yeah, that's good point.

It's flip-flopped.

>> Uh lovely. Well, thanks so much for having for coming on. Uh huge amount of your work. Um your energy is very

your work. Um your energy is very infectious and I hope you keep doing Stevie's tech talks.

>> I'll start them up again, man. I mean,

this energy is because of the AI and it's because of vibe coding. It's it's

addictive and fun.

>> Tech is fun again. This got boring for a little bit.

>> I know. I know. For a while it was like well source graph like index your indexes your codebase like really really well you know again it's like so so fast and I'm like well that's cool but you

know what's cooler >> yeah cool this has been

Loading...

Loading video analysis...