Full Episode: The AI Industrial Revolution

By Naval

Summary

Topics Covered

The 1000x Engineer Builds the Factory
Two Engineers Can Now Design a Jet Engine
AI Kills the 200-Page Compliance Document
Healthcare Is a Small Communist Society

Full Transcript

Welcome. You're listening to the Naval podcast, your authoritative source for new knowledge. We're trying something

new knowledge. We're trying something new today. Uh I have three frontier

new today. Uh I have three frontier founders with us. Three good-looking

guys actually, and a fourth good-looking guy, Naval. And let me just introduce

guy, Naval. And let me just introduce everybody. Gumo the G Roush. Um he's

everybody. Gumo the G Roush. Um he's

building Versel into an AI cloud for the world of agents and whatever comes after that.

Good to be here. Blake Shawl, he's building supersonic aircraft in his own factory and jet engines as well. Blake's

company, Boom Supersonic. And then Max Hodak from science. He's building a biohybrid brain interface that grows living neurons on silicon to restore

sensory functions like sight, but then eventually to explore new parts of the brain and new senses. All three of these guys are not composing their products with off-the-shelf parts. They're

building their own factories and you know we don't care as much about what they're building exactly as we do about what they're learning about how they're building. What's the new knowledge

building. What's the new knowledge they're generating? What's their alpha?

they're generating? What's their alpha?

What principles are they discovering that other founders can learn from? What

are they trying to figure out right now?

And also what are the cutting edge or crazy ideas that they haven't even talked about yet and they're still forming in their brains. Naval, do you have any reactions to any of that before I jump into Gummo? Yeah, let's just have

fun.

Yeah, you guys should just jump in.

Yeah. So, I can't remember my exact quote, by the way, but I've been really pilled uh with this idea of software factories and the job of the engineer being something that you just show up to

work. You used to used to ship the

work. You used to used to ship the output directly and everything inside the company was, you know, how good is person A at shipping output B. And now

what's happening is the way that I'm judging you as an engineer is like are you producing the factory that would produce multiplicative outputs B through Z, right? Um and that's a that's a

Z, right? Um and that's a that's a pretty significant change because basically like we used to believe and it should be somewhat controversial that there's 10x engineers like now clearly there's 100x or a thousandx engineers

and the world hasn't fully adjusted to this. I used to get flamed on Twitter

this. I used to get flamed on Twitter for saying they're 10x engineers that it flies in the face of so much like equality philosophy that everyone's equal. But the reality is when you're

equal. But the reality is when you're operating in idea domains, when you're operating intellectual domains and virtual digital domains, it's not even 10x, it's 100x or thousandx and it always has been. Satoshi Notch, you

know, the guy who invented JavaScript, the Brendan I of the world, uh John Carmarmac, I mean these are thousandx programmers. Not to even mention if you

programmers. Not to even mention if you choose the right thing to work on versus the wrong thing to work on, that's an infinity difference and it could just be not nemer, just one who had a better judgment on what to work on in the first

place. And now obviously it's less

place. And now obviously it's less controversial because of uh AI leverage.

What's controversial is that the token leaderboards, right? Like people are

leaderboards, right? Like people are still getting a little confused because now they think, well, I have a bunch of 100x engineers. Look at all these tokens

100x engineers. Look at all these tokens that I'm paying for. I'm curious if you guys have seen the same like how do you measure ROI?

It's like the old measuring lines of code, you know, token consumption with lines of code feel like similarly not direct paradigms. I mean, my observation has been that

claude or cha GPT um or GPT is about is basically as good as you are in a domain. And so, uh if you're if you're a

domain. And so, uh if you're if you're a really capable developer, then these things are really powerful. And if

you're a junior developer, then you'll kind of find it to be like more of a junior developer. Like on the one hand,

junior developer. Like on the one hand, these models are incredibly capable. On

the other hand, the feedback that you give them sporadically seems to be incredibly important. And these little

incredibly important. And these little updates seem to totally determine the types of uh performance you get out of them.

There's a new kind of support that I give which is you come to me and like you didn't get good output out of the model and I tell you what to prompt the model with. So like the idea of like the

model with. So like the idea of like the quality of the reprompting which I think you're alluding to is is extremely important.

But I mean and to be clear I think that this will become less important over time like as the models get much much smarter then you'll be able to put in less and get more out. Um but at least at this stage it really seems to kind of

reflect back the judgment that the user brings in. In my experience,

brings in. In my experience, I've kind of resisted learning all the ticks and tricks and tips like, you know, there was a, oh, use Ralph Wigum, use Open Claw, use Hermes, use this

prompt engine, use this scaffolding, plug in this piece, you know, uh, always use plan mode. I just ignored all of that. I just assumed the model's just

that. I just assumed the model's just going to get better faster than I would figure out how to use it. It would

figure out how to use me faster than I would figure out how to use it. And so

I've just been completely hamfisted with them and I get frustrated at them and just sort of I I found myself typing less and less information and doing less and less work as time goes on with the

models because I just assume I can brute force my way through it and I'll throw Codex Claw and Gemini at the same problem over and over and just waste tokens to save time. And I think no matter how expensive these models might

seem, they're still way cheaper than a human. So I would say just waste tokens,

human. So I would say just waste tokens, save time. Don't look at the tokens

save time. Don't look at the tokens either as inputs or outputs. Just look

at your time and look at the final output. And even if they're writing

output. And even if they're writing lowquality code, which I know in many cases they are, it's not necessarily production quality or scalable code.

When the time comes and I want to ship it to production, I'll just throw more tokens at it. I'll say, "Okay, now go through look at it, rewrite it, and they're just going to get better every generation." Uh, so yeah, I don't I

generation." Uh, so yeah, I don't I don't see where this necessarily stops.

As long as we have verifiable domains and solve problems, they're going to resolve those problems. That's in the unsolved problems domain where maybe you're terren tower you're at the cutting edge of creativity that you need

to be you know working very collaboratively and carefully and closely with the model but I'm not in that I'm not at that level in software engineering and gear but you're probably the most extreme software engineer in the team right like out of this set

you're probably the one who most hardcore came up from a software background like how are you finding these models at the edge of their capability? Well, there's one thing

capability? Well, there's one thing that's happened recently that uh what you're saying resonates strongly with, which is it used to be that you would

give a prompt to the model and it kind of does it like classic like next token prediction thing and it like runs away with your idea. And models now have been doing this like intuitive planning mode

without to your point not even having to plan where it comes back to you and says look what you're asking me for there's these three routes we can take. there's

this set of tradeoffs that we're going to go down. That's a moment where like you know people do the whole thing on X like oh now we have a PhD level engineer model like that's very clear that the models at some point graduated. They

used to be junior engineers now they're principal engineers because they come back to you with a set of tradeoffs and obviously sometimes they [ __ ] which is hilarious. It tells you this one is

is hilarious. It tells you this one is going to take three weeks and this many interest it make really bad predictions but clearly it's now this like I respect

the models a lot more as a as a peer like that I'm going back and forth intellectually with but there there are a lot of gaps still so like if you're really really proficient engineer or

architect you I think you're still extracting more juice so the question sort of that Max was positing of like if you're junior do you get junior back.

Well, clearly not because a junior gets more advanced knowledge in code that they would have never been able to write by themselves, but doesn't an experienced architect get 10x whereas a

junior engineer gets 2x. That's what I'm kind of trying to figure out still.

Yeah. Yeah. But I mean, I think there's there's architectural decisions. So,

when you think about the development, I'm seeing this now with some of our the junior software engineers on the team of like what is the next step in their career progression. It's going from like

career progression. It's going from like writing implementation for a feature to picking technologies like choosing between Postgress versus some other database or picking between ZMQ versus

some other message Q or like some other queuing system and those I mean the models can suggest them but that's the thing where you'll see it and you'll be like no no I want to use this other thing. That's the type of little

thing. That's the type of little feedback that I'm saying really matters in the types of output that you seem to get at this point.

Taste taste and judgment right? Taste

and judgment. That said you can ask them which one should I use and why and they know everything. they'll give you really

know everything. they'll give you really good trade-offs. That's the change I was

good trade-offs. That's the change I was saying has happened recently where you would say hey go and um put this super high cardality telemetry data into Postgress and it's like no bro like we

don't put that kind of data into Postgress like you should consider click house or Athena or whatever like that's happened to me a lot which is really

impressive um but I the thing I'm still like kind of struggling with is clearly the human is still completing the model

like at one point is it the other way about like the the human is the one sort of getting the instructions back on like go get me this API key because it's something that only you can do uh or get

me this amount of capital for my next set of investments that I need to make.

Uh you just watch like that clearly we're still not there yet. That's a

temporary aberration. Pretty soon every good SAS company or hosting provider will have a CLI and API interface that the models can meet directly. they don't

even necessarily need an API like as long as it's like textbased Unix based the agent can hack its own API and then the money part you insert crypto tokens you know put in bitcoin

put in whatever and the model goes and just pays for whatever it needs and I think like you know there people working on this but the thing I am now thinking through

is is pure software dead like is pure software engineering like an obsolete thing it's like saying speaking English right the models now speak English we had to learn code to communicate with

the models. Now the models speak English

the models. Now the models speak English and they speak fuzzy sloppy English like a human and they understand things. So

where's the moat like for a founder?

Hardware it's a boon you know like now if you had to build hardware it was hard to build a software company alongside like Patrick Cullison says software is art and it's hard to hire artists. So

now as a hardware founder great you can have really good software develop fairly quickly. Um, if you're creating models,

quickly. Um, if you're creating models, maybe that's the new software engineering, training models and tweaking models and post- trainining and fine-tuning models. But classic software

fine-tuning models. But classic software engineering is that dead is pure software investable is pure software something organize a company, a team around and try to get some leverage. Did

you guys see the uh there was an article an X by Mitchell Hashimoto called the block economy or the building block economy something like that like his argument is that the most useful thing

for agents to have now is really powerful reusable building blocks because to Max's example you wouldn't expect your clanker to reinvent a Q infrastructure system every time he

needs to send an email it needs to bring in the right building block that's right size for the task that you're asking for and say, "Well, okay, for this one, it's

B MQ." I challenge the notion that I

B MQ." I challenge the notion that I would want the agent to reinvent the entire universe from first principles in a way that's incompatible with the rest of society and civilization. Like it's

almost like reinventing highways, laws, policies, etc. just for you. Even if

there's a potential for extra optimization, extra juice that you can get out of it, there's a still a um sort of like cooperation at large scale value of saying we're both depending on

Postgress 13.2 and so that's still really really really valuable. I would

say like the category of infrastructure software and building blocks that these agents are going to use obviously embias is this what we're building seems extremely valuable and I don't see the agent anytime soon and by the way you

could even another metaphor I've been using is like agent's already been created that the models can reuse is like a token cache because you you don't want to churn through a trillion tokens to reproduce what's already existing. Uh

and so there's always starting points that the model can fork off from, but it's going to change things quite profoundly.

So these are like libraries and dependencies, but for models.

Yes. For agents specifically.

To Naval's question though, I mean I learned a program when I was really little. And I like that was the thing

little. And I like that was the thing that through all of like being a teenager and in my 20s like I get like sucked into it and just like code for like 20 hours and it was super fun and I knew all this stuff about programming

languages. I haven't written a single

languages. I haven't written a single line of code in quite a while now. And I

mean, partly that's because my job is different, but also since December, I've built a huge amount of software that I now use every day. There's all these projects that I've kind of fantasized about for years that now I'm like using

um that I've actually built and I didn't write any of that. And I just can't imagine going back to like actually writing code by hand anytime. Like I

mean, I'm unlikely to do that anyway, but just like in general, I see that I have a hard time seeing that as part of the future.

Yeah. There's something really cool is that you understand how the pieces click together. Like I feel like anyone that

together. Like I feel like anyone that understands what an API is and how data flows, inputs and outputs, performance because you kind of you have to orient

the model around like this is a certain level of expectation that I have out of this operation like that's that's always been infinitely more useful than um than writing code. Like I feel like a really

writing code. Like I feel like a really good a proficient engineering leader has been quote unquote like vibe coding through people on Slack or one-on- ones because

you're transmitting your will, your intent, your experience and you're letting others run with it. Uh it's just that now we do the same but with agents.

Uh and so I think that's why you've been successful with it. But I don't know that everyone sees the same level of success. I

success. I mean I went from not having written code in 20 years to I'm coding all the time now. but through agents and I'm building

now. but through agents and I'm building tons of software and it turns out that just understanding the basic principles of software engineering and algorithms actually gets you a long ways because the reason I stopped coding was because

I didn't have time to figure out the latest language latest architecture infrastructure pieces to plug into and I know Verscell makes it a lot easier but even then just getting started was a bare like just plugging pieces together

assembling infrastructure was just so annoying. The thing that really changed

annoying. The thing that really changed is I mean it used to be that you could build a lot like you like there's a lot that was straightforward but then you would hit some random thing and then you could spend

kind of some indefinite period of time debugging some narrow thing and now with the agents what happens is you just don't get stuck anymore which is pretty amazing or they get stuck it's removed well no I mean like relatively quickly they can find like

the right way to do things and it used to be that like I remember when their friends learned a program be like nope it's just like intrinsically frustrating like if like that's part of the feel that's how you learn and that just isn't true anymore.

Blake, how are you applying all the stuff at uh Boom Supersonic?

Yeah. What I found is it completely changes the role of software and hardware developers. The thing that we

hardware developers. The thing that we did from day one was uh try to take a lot of traditional engineering workflows and I mean hardware engineering workflows and turn them into software.

And so if you haven't been around hardware engineering, let me see if I can make this more clear. there uh

there's a lot of engineering hardware engineering that happens in Excel spreadsheets on engineers laptops in a silo and it's very complex uh

spreadsheets sometimes like VBScript code and all of this is actually software but it's it's treated as if it's not software there's no there's no source control there's no automated

testing if you want to hand something off from like an aerodynamicist to a structures engineer that's done manually with like a spreadsheet over email like it's the 1990s. It's terrible. And so we

we started building these kind of like software frameworks that can automate and make repeatable hardware engineering flows. The idea we could reduce the cost

flows. The idea we could reduce the cost of iteration. Um but it was it was

of iteration. Um but it was it was slowgoing because we could never get enough we could never like afford enough software engineers. And what we've

software engineers. And what we've gotten into is this uh mind-blowingly different model where the software engineers actually create the architectures because they understand

systems, they understand algorithms, they they understand, you know, division of concerns. Uh and then the hardware

of concerns. Uh and then the hardware engineers can vibe code their pieces because what they know about hardware engineering and the result is just like mindblowingly different productivity for

small teams. like give an example like like if you're designing a turbine blade like classically so a turbine blade starts like uh cold but when it runs it's hot so it gets bigger and so you

have to design both the aerodynamics and the structural design of the thing to work on it cold shape and this hot shape and so you have to convert between cold and hot and you convert between structures and aerodynamics and this

takes like one engineer one day for one blade for one piece of the analysis and there are like a thousand blades in a jet and and so you can't do much and we literally now with a combination of

software and hardware people created the solution you can change blade geometry you can see in real time the structures and aerodynamics results and so it allows two engineers to design an entire

jet engine which is just wildly different. One of the things you

different. One of the things you mentioned is that you have software engineers creating the tools and architectures for the rest of the engineers. That to me is the biggest u

engineers. That to me is the biggest u the cataclysm of enterprise software is that there's no like startup that builds hardware collaboration tools that can sell you anything anymore because in

internally you're just coding the right things that you need at any given time.

Even spreadsheets are kind of cooked, right? Because the reason spreadsheets

right? Because the reason spreadsheets were successful is that no one could build custom software. So the thing that approximates custom software the most is a spreadsheet with a bunch of EV script

functions. I personally have moved

functions. I personally have moved almost entirely from uh Excel to Python models where I can actually like get like believable simulations of things.

Yeah. I mean the thing that that AI hasn't come to yet that I think it it will within the next year like probably within 26 that will be very very exciting is right now it can generate software but soon it'll be it will

generate step files and PCB layouts. And

when it comes for mechanical and electrical engineering, that will be a whole other thing that we haven't seen yet. That'll be very, very cool.

yet. That'll be very, very cool.

Yeah. On the hardware side, I think it's really a boon for like all these little gadget companies and part companies that write really bad software because they can't make great software and now they're going to be able to make good

enough software or it may not even software that is a human front end. It

might just be completely agentic for an agent to access and you just talk to it through voice and control hardware. And

I this is why one of the reasons why I think for example China is big into open- source models right they're basically going all in on it because they have hardware superiority they have these very complex supply chains and

component chains and they're basically saying hey if I can just generate software on demand then I don't have this disadvantage anymore against Silicon Valley. So that's not the only

Silicon Valley. So that's not the only reason why they're doing open source. I

think they're also behind. They're

distilling models. They're catching, you know, they're collaborating on resources. But I think the Chinese

resources. But I think the Chinese government has a history of funding efforts that then sort of help their entire ecosystem along, especially in network effect businesses.

And so I think they want to like uh pull all their resources, catch up on AI, and use it to give their hardware stuff an advantage. And ironically, they're doing

advantage. And ironically, they're doing all the open source stuff because Open AI is not open. You know, Gro publishes models, but I think they're a model or two behind. Uh, Google has some local

two behind. Uh, Google has some local models, but nothing really that competitive. Anthropic, to my knowledge,

competitive. Anthropic, to my knowledge, I don't even know of any open source models from them. So, all the open source heft is coming from China. It

helps all our hardware founders, but it helps their hardware founders and factories and so on that much more. But

all all the crappy little software that goes with all the little random knickknacks and thingamajigs that you buy off of Amazon for to tinker with a lazy Saturday afternoon, that software

is getting a lot better very quickly. I

think everyone's had the wakeup call that without great frontier coding models, you don't have self-improvement.

And so imagine China as a whole not having the ability to produce frontier everything, right? It's not just

everything, right? It's not just producing software is in any piece of this hardware pipeline like Blake was saying like you need to generate software. If you fall behind on your

software. If you fall behind on your ability to generate software, you fall behind on the ability to generate everything. One thing I'm curious about

everything. One thing I'm curious about from you guys is like cuz everyone loves to talk about Chinese models like do you use Chinese models? Do you know anybody that uses Chinese models? This is an

argument I had yesterday actually which is uh one person at the table dinner was claiming that uh you know you'll just use deepseek for 97% of things because it's so cheap and if you need more

intelligence you'll just run it over and over again the same problem and you'll only use the open AI anthropic etc models for the most advanced tasks and I was kind of like I don't know I think intelligence is an unalloed good you

always want more intelligence and when these models make a mistake you don't know it and it's always cheaper than a real person and real time. So, you'll just use the most intelligent model available, which isn't great news

necessarily because it means that, you know, you're going to end up creating a monopoly or igopoly kind of situation in AI. But, uh, I always want the most

AI. But, uh, I always want the most intelligent programmer. I always want

intelligent programmer. I always want the most correct answer. I always want the best judgment. And given the amount of leverage that I'm going to pour into it through capital and code and people and, you know, marketing, I want to make

the right decision every time. And often

when between two models, let's say like I have one model that I know is a little smarter than the next one and they both give me answers. Often I actually don't know which is the correct answer, right?

So if I know one model's a little smarter, I'm going to go with that answer and eventually I'm going to stop asking the model that I think is less intelligent. But I don't know, have you

intelligent. But I don't know, have you guys found a use for the these, you know, so-called less intelligent models?

We see uses so that so we have the AI gateways uh data that basically like every application agent, etc. goes through. And so there's definitely usage

through. And so there's definitely usage of open models, but the top is like heavily dominated by the frontier intelligence. And there's a subcategory

intelligence. And there's a subcategory or there's like a caveat to that, which is that frontier intelligence at reasonable cost and performance like slaps at scale. So like people don't get

really excited about Gemini, but they put out these models that are like super smart at the right performance cost combination and for a lot of tasks other

than co coding actually interestingly enough. Uh they're the best models.

enough. Uh they're the best models.

They're like the best like industrial production models. Uh you can throw them

production models. Uh you can throw them at like support tasks or browser automation. Like I would always put a

automation. Like I would always put a Gemini model there. Uh and I would look to Chinese models for those kinds of things. But anytime I'm working to push

things. But anytime I'm working to push the frontier, you need the best possible coding model. And that's basically now

coding model. And that's basically now like two or three models and uh and the Chinese are not certainly not in it.

Hey Max, you're pushing pretty hard into vertical integration and extreme urgency. Do you want to talk about that?

urgency. Do you want to talk about that?

Yeah, I mean for many things we um you can't buy it so you got to make it somehow. Our preference would always

it somehow. Our preference would always be to buy something um like if there's a vendor that offers a service at a great price like for example like PCBs like we don't make PCBs like those are they're basically free you can buy them in

unlimited quantity from Asia but the the closer that our products get to being like a single block of coalently bonded matter the better they'll be lower power smaller higher performance last longer

and um there's just like there like the components aren't available and in order to do that type of integration be able to actually innovate beyond things just piecing together things that you can buy off the shelf which really is is very

very limiting. I guess you have to like

very limiting. I guess you have to like learn it to do it yourself and that shows up as vertical integration. So we

own a captive MEMS foundry on the east coast which we bought because there was really no other way to do the type of packaging and assembly stuff that we wanted to do and I think that all of this is going to be affected heavily by

AI over the next few years. It's not

quite there yet. In fact, ironically, one of the biggest impacts that we've seen of AI inside the companies in regulatory interactions because if we can do things like generate documentation or if we can ask like we

want to change, we want to evolve this product like there's thousands of ISO standards that might apply which ones do we have to comply with and like trace this through. This used to be like

this through. This used to be like you're like you're following a whole regulatory quality team for several months as they trace this and now the AI just kind of knows. Um, but when I think

about stuff like the the surgical program or the MEMS fab, I think ultimately the software still needs hands. Like it's going to be smarter

hands. Like it's going to be smarter than us, but if it can't make things, then like those are real real boundaries. And so we've instrumented

boundaries. And so we've instrumented our foundry as well as many other parts of the company in in ways where um as these models get better uh that should show up pretty immediately in in things

like the the cell engineering that we're doing and the material science that we're that we're developing.

It sort of makes me realize that like it's been a while since I've generated a basic legal document using a lawyer, right? I stopped asking lawyers for NDAs

right? I stopped asking lawyers for NDAs and you know agreement for this and sign that and research this and like all the basic legal tasks are gone too because you know there's the old joke that law

is like spaghetti code you know they have this very complicated code that they try to put in English and it contradicts this code over here and has to fit into that code over here and there no real APIs for it. Um but for

just like junior engineers and junior engineering I should say junior engineers basically got a promotion to senior engineers and junior engineering got taken over by agents and so the same way I think in a way the downside is you

can look at law and say you know parallegals just got fired or you could say parallegals just got promoted to senior lawyers and now they can spend their time thinking about the law. It's

actually kind of interesting to think about the parallels of how software engineering is evolving with lawyers because lawyers, you never know what they put into these documents exactly.

You just trust them. Like, hey lawyer, can you look at this document? Can you

tell me if it's legit? Can you do red lines? Whatever. Like, at the end of the

lines? Whatever. Like, at the end of the day, you're what you're valuing in the relationship with a lawyer is that is that they're a trusted authority. They

went to law school and they're putting their reputation on the line. Cool.

I think there's a parallel par parallel with like the biggest problem in software engineering today is these mountains of slob that end up as a PR and then people are say like there's all these memes on Twitter like way back in

the day we used to read every line of code of a PR. Well, in my world infrastructure I want engineers to be able to say I understand doesn't necessarily mean that you've read every

line of the of the PR. You need to be able to say I am signing off on understanding the consequences of this PR or I wrote the test harness

the simulations the proofs the type checkers etc to be able to say even without reading this I have confidence I can sign off on it's going to be safe in production and so it's it's kind of

interesting because there's a world in which we embrace that everything is going to be spaghetti code and that we don't fully understand it but we write the basically evaluator that give us

confidence and then we rely on like people uh like the infrastructure production engineers to say okay I'm fine uh sending this into prod you know at the end of the day like someone is

going to get paged if your systems go down and I think another thing that people are underestimating is that creating software is really easy 0ero to one but think about a thousand days from

now what does what does your software look like is it secure is it tested is it production grade uh is it performant and are you still motivated to invest

all of those tokens in maintaining it in prod I mean humans are becoming verifiers right and and that's kind of how we train these models with good verification data and now we need human verifiers so yeah I think a lot of the a

lot of the old function of people lawyers engineers operations people move to verifying the stack and saying yeah this is roughly correct and I I'll roughly stand behind it and I'll support you if it goes wrong

one of the things we see related to the regulatory is it massively reduces change aversion and improves iteration.

So to give you an example like let's let's say you're going to go certify an airplane. One of the zillions of things

airplane. One of the zillions of things you have to do is prove that it could withstand a lightning strike and the the regulatory documentation for the test

plan for such a thing stretches on for say 200 pages. And what you would classically do is hire a, let's be honest, not super bright engineer who's willing to be there monkey at keyboard

writing 200 pages of regulatory compliance documentation. And it takes a

compliance documentation. And it takes a couple months. And and by the way, if

couple months. And and by the way, if you change the airplane now, you want to cry because there's another like two months of rework of this like wrote kind of regulatory compliance documentation.

And what we found is you we can build a rag that will enable us to basically prompt our way through all of that work you know in let's call it minutes. The

first order effect is oh you save a lot of time. The the the second order effect

of time. The the the second order effect is if you change the specification of the airplane uh it now takes you know uh minutes not months. So you can actually be willing to change. And the third

order effect is you can now you know basically get rid of the not very great engineers and have a small number of really creative ones. They can iterate rapidly because the cost of change goes down and in a certain sense like the

entire regulatory burden which really hurts the ability to iterate drops away.

I think that this is a really undersold story in AI right now. I think the consensus in Silicon Valley is that like regulation sucks like any like we want to go faster, we want to rec realize this amazing future. We want abundance.

We want just like prosperity and stuff that slows down that future is just kind of to be avoided. And certainly I think we've overregulated. We've made it

we've overregulated. We've made it impossible to build stuff. It's just

like it's totally crazy what goes into getting building any type of thing in a lot of places either physical or otherwise. But you know like a lot of

otherwise. But you know like a lot of the regulations themselves are not the problem. Like if you've actually read a

problem. Like if you've actually read a lot of these things like like having non smog choked cities is great. Being able

to swim in like many rivers is great.

Like having like a lot of these things were progress. The problem is that it is

were progress. The problem is that it is really difficult for humans to deal with understanding and complying with this and that every time you have to exchange a letter with the government, you wait months. And if you could take a lot of

months. And if you could take a lot of the things that we've learned and kind of make them like totally frictionless, that would actually pretty cool. And I

think that um that I think is an under an underold story in in AI right now.

Yeah. until the regulators start spewing tokens back at us and then you start getting huge amounts of documents from the regulators that you have to comply and it's agent on agent wars. But

but that's basically what we have now.

Yeah. But but there is a fair fight.

Yeah.

I'd argue that's an improvement from where we are now. Like one of the terrible things right now is if you build anything physical, you have to get a building permit. It's like you're guilty until proven innocent. And the

worst thing I' that we've run into is the fire department because they have like the moral impremature of, you know, people pulling people out of burning buildings. And yet what they actually do

buildings. And yet what they actually do is just like screw with your design for buildings for months. And I, you know, if we could replace the fire marshall with with an agent that would critique

your your building plan quickly, um, even even if it's feedback was overdone, it would be massively better than the delays that exist today. When

Max was talking about this potentially being a good thing to have all this regulation, my my head went to the things that make agents successful is uh humans or other agents setting up the

right testing guard rails. Uh a lot of people are really excited about SLGOLO.

I don't know if you guys have played with that or like Ralph loops where you tell the model go do this and this is your exit criteria. Well, I'm telling Blake go make us all supersonic. your

exit criteria is that you've complied with all of these regulations. So

there's totally a world in where we say like the regulations are great. They're

like our testing, our test suite. As

long as this p passing these tests for one that's not incurring contradictions and the regulations are actually reasonable, etc. Like they're actually an awesome guard rail to have.

Otherwise, we would be shipping slop directly into into the air.

Yeah. But this is going to turn into a red queen race, right? They're going to have agents. We're going to have agents.

have agents. We're going to have agents.

I think we might have better agents, which is good, as opposed to have to do human versus human, but if anything, their cycle time, their response time may get lower. Like the app store is drowning in spam. I'm sure the patent

office right now is drowning in spam.

And so these agencies, they're going to be slow adopters of AI. They're going to get DDoSed, right, by clever entrepreneurs just overloading them with documents. It's possible that the

documents. It's possible that the approval time for this stuff might extend out as this suddenly get flooded.

It creates the opportunity to um I think really shift the model, the regulatory model. Imagine if we drove around a city

model. Imagine if we drove around a city the way we build things today. Before

you could go anywhere, you'd have to write a plan up, ship it to some regulator, you know, and your plan would have to specify we're going to take such and such a route and we're going to drive this speed limit. We're going to use our blinker and we're going to stop at every stop sign and we're never going

to run a red light, blah blah blah blah blah. And then 3 months later, you get

blah. And then 3 months later, you get back critique. It's like, well, we think

back critique. It's like, well, we think you should like drive on this other street. And eventually you get approval.

street. And eventually you get approval.

didn't go drive somewhere. It's insane.

You can never go anywhere. And yet that is absolutely the way we build physical infrastructure in this country. It's

guilty until proven innocent. And and

what we should actually do is make more of these things enforcementbased rather than preapproval based.

I mean, I don't know. I mean, I I don't want to be under too much. Like, if I ship a medical device to a lot of people, there needs to be it's like there's unknowns there. It's like we were responsible. We did clinical

were responsible. We did clinical trials. We reported all the data, but

trials. We reported all the data, but Max, this is this is why there's so little innovation in medical right now because the FDA approval process is a nightmare. Uh, in fact, the two biggest

nightmare. Uh, in fact, the two biggest advancements in tech in Silicon Valley in the last decade, AI and before that, crypto, they're both in the math domain because it's the last unregulated domain. And when they started regulating

domain. And when they started regulating frontier models and started regulating GPUs, that stops as well. You know,

Peter Teal laments about how there's no innovation in the physical domain.

what's been held back by just the huge regulatory barriers and you can always find a scare version like you vaccine or medical like famous ones, right? But the

regulations spread everywhere. The

tentacles are everywhere and there's all these different contradictory regulatory bodies. You saw how uh was it SpaceX,

bodies. You saw how uh was it SpaceX, they got sued first for for not having enough I forget what it was migrants or refugees or whatever, but they're not allowed to hire them by government regulation on the other side because

they're not citizens. This is not like logical code that has to compile in one place. These are madeup random

place. These are madeup random regulations all over the place. You

might comply with one state, you violate another state, you violate federal over here, you annoy this guy over here, that guy chooses to prosecute one out of 50 people who are his friend. It's it's

very arbitrary. It's very capricious.

And and moreover, like the idea that this makes like things safer, I think it's just a complete mythology. Like

just watch, you know, watch watch Boeing as an example. They certified the 737 Max, which had a single sensor that had complete authority over the nose up,

nose down attitude of that airplane. No

intern is dumb enough to think that's a good idea. And yet, it got all the way

good idea. And yet, it got all the way through the certification system. This

stuff doesn't actually make us safer. It

just makes us slower. Well, I mean, there's definitely dysfunction here. I

mean, I think that some of this makes us safer in the sense that the NRC makes us safer, which is that their job was to make sure that nuclear energy was safe.

They did this by permitting zero plants until I think like a year ago since the 70s. It will be perfectly safe if we

70s. It will be perfectly safe if we never build any of it. And I want to be really clear that I'm on the side of deregulation in on a lot of this. I

agree with Blake that a lot of this can be done a lot more efficiently. But I

also think it's a little too dismissive just to say it's like oh this is like the FDA or like even it's it's in the reg it's in the agencies in general. And

the problem is deeper to the degree that when the if the FDA approves 10 really important drugs, they don't get any credit for that. One patient dies and they get hauled before Congress and

yelled at. And so they have very

yelled at. And so they have very negatively biased incentives here. And I

I think the reality is is that this is reflective of the beliefs of the American people. There's this trade-off

American people. There's this trade-off here between the perception of risk taken in human subjects research and the rate at which we get new medicines. And

it is absolutely true that if we move faster on this, we would learn. It's

totally asymmetric and I think you're totally right, Max. If you approve a bad thing, your career is over. If you block a good thing, nobody notices, right? So,

it so it creates this asymmetric slowdown. And I think this is um I think

slowdown. And I think this is um I think that is the most important problem to solve in the regulatory state.

But but this is a very deep problem because it is this is where the voters are like and we we go and poll some of the stuff that we're working on in the future to understand kind of like where where the American people are on it. And

if you push too hard on this, like there are there are all kinds of ways you could work around it. You go to prosper.

There's all kinds of ways to try to go faster. But if you're seen as being a

faster. But if you're seen as being a bad actor, then you're rejected from the society that we live in. That is the thing that you need an answer for, which is deeper than just saying like, "Oh, well, we need regulatory reform."

You you have a deep point there, Max, which is uh it's the voters, right?

Yeah.

This where the citizens are. Like we

like to blame politicians. You'll see an X all the time, right? When people are like, "Oh, this politician, that politician, you a politician." like

they're elected, they're voted, majority vote, right? This is where the people

vote, right? This is where the people literally are. That's the package.

literally are. That's the package.

That's the bundle they've chosen. And

you may not like this constantiation, but if you were to remove this one, something very similar would take its place because the voters would just vote them right back in. And I think culturally it's very hard for most

people to understand what we lost, what we missed, right? So for example, like France, you know, there's a French entrepreneur on X lamenting that 57% of GDP gets sucked up by the government and so you can't create companies. But to

the average French citizen, that's not visible. They don't notice what they're

visible. They don't notice what they're missing. They just know they're slightly

missing. They just know they're slightly poorer than the US. The economist just did a little piece on economist is finally coming back around to being capitalists after 30 years. And they

just did a little piece on how the US is outstripping everybody and growing faster and getting bigger. But then they immediately turn around and say, well, it's because of the oceans, because of natural resources, everything but capitalism, right? They don't want to

capitalism, right? They don't want to say the dirty sea word. uh because you know for some reason all of these all of these uh magazines became Marxist at some point um but they can't they can't

envision or imagine what could have been if we had just been a little more lazare a little more open so I would love to see a true experiment among the 50 states you know different regulations different tax structures not

because right now the federal tax structure and federal regulations dominate everything but imagine you know you could go to some small state if you had cancer and you could try every drug that everyone was cooking up in caviat

mtor and you got to do your research and blah blah blah but this is known as the experimental zone um same way for drones same way for well aircraft is a little harder because you got to cross a lot of areas but do you think there's something magical

in there the notion of like innovation zones because we have we have a huge like nimi problem right uh but if you if you create like you know opt-in yimi zones they create some experimentation

framework and by definition it happens where people are consenting and you can try different rules or no rules or different ways of enforcing or you know innocent until proven guilty and then see what actually happens and what are

the innovation consequences and what are the safety consequences and then then the successes can spread but I mean to Naval's to Naval's point an innovation zone would not solve the problem in drug discovery um there so there was the

right to try act passed a little while ago we've had this pathway called single patient IND for a lot longer than that um the FDA like if if your doctor calls the FDA and says hey I want to give this

my patient an approved drug they give over 99% of those over like they approve over 99% of those. They can even grant them over the phone. Um the problem is that in order to dose a patient, you

still need clinical grade drug and the only entity with that is typically the IP owner who's in the middle of running a clinical trial. Like they're investing hundreds of millions of dollars into like making this thing. And the problem

is that the FDA, they'll draw an adverse inference if something bad happens to your patient who's probably really sick to begin with. And that's going to be seen as a property of the drug which is global, not related to your innovation

zone. And so there's kind of two

zone. And so there's kind of two problems. One is you need to get the IP owner to give you some of your drug which they're not going to do. And then

you need to prevent the global regulator from casting doubt on what might happen with their clinical trial if they give you some.

How would you address I mean I don't know your field. How would you address that in medicine?

Oh well I mean that in particular I mean this is just like a very inside baseball. I think the FDA has to be

baseball. I think the FDA has to be prohibited from drawing adverse inferences across different users of a capsit, for example. There's these like a bunch of specific ways that you could really accelerate innovation with a

relatively light regulatory touch by just um preventing this this kind of paranoia from driving our decisions.

Is there anything better than the FDA out there? Like what are we benchmarking

out there? Like what are we benchmarking these regulators against or is it not an interesting question because we don't have everyone follows the FDA.

So I'll give two two expansions to that.

The first is um Europe which is not really better than the FDA but they have a different system in that they've got these these notified bodies which are basically private businesses that are blessed by their host governments to

certify things whether this is trains or planes or medical devices and the notified body system uh creates slightly better incentives at the review layer because they can hire people they can grow there's competition among the

notified bodies they themselves have to be compliant with the conditions placed by their host governments for certification but it means that they can there can be many thousands more reviewers than you might have in the US.

The second thing I'll say is there actually is one approved getting paid implantable BCI today which is in China

and the CFDA is thinking for itself. And

they really do have a system that I think is going to give us a run for our money if we're not if we're not careful.

And they they they handle it very differently.

How do they handle it?

I mean the costs to bring a drug to market or a device to market are just much lower. I mean you can try things in

much lower. I mean you can try things in humans and you can try things on market like the so the problem the one of the things that I've spent a lot of time recently thinking about is like 20 years

ago we were buying far fewer laptops and phones each one was much more expensive now there's they're cheaper there's far more of them we buy more of them the total spending has gone up this is great stock prices of things like Qualcomm and

Samsung and Apple are way up everybody's happy they're using kind of the excess wealth generated by the phones and laptops to buy the phones and laptops um this doesn't happen in healthcare. In

healthcare, because you've got this reimbursement mechanism in the way where there's this kind of enterprise sale happening, the bucket of money that we use to buy healthcare is basically fixed. It is not increasing as there is

fixed. It is not increasing as there is more stuff that is producing better healthcare outcomes like we see in technological growth industries. And so

this means that the rate of spending on healthcare grows at roughly the rate of of growth of tax receipts. And so if let's say that like AI is booming and there's major advances that are happening and two years from now we're

spending 10 times as much on AI as we are now, this could be great, but if in two years we're spending 10 times as much on healthcare, this would be a catastrophe. And this is fundamentally

catastrophe. And this is fundamentally at odds with being a technological growth industry. And so as time goes on

growth industry. And so as time goes on and there's more things to spend money on that extend and improve quality of life for patients, like we can restore vision to people go blind in their 80s.

We might be able to extend life in like far past where it's been before. we can

restore capability to patients that are older and in worse condition, but like how do you pay for that? There's kind of this like omni problem in healthare which is all really related the same

problem which is just too expensive to bring these things to market and that's what China is getting at. The way out of this is not singlepayer or some revision to health to health insurance. It's to

bring down the costs so that someone can buy this with a credit card finance maybe like a car worst case. And to do that, we have to make it cheaper to bring these things to market. And

China's doing that. That that will allow them to sell these things for $10,000 on $100,000. There's no private market in

$100,000. There's no private market in healthcare. And because there's no

healthcare. And because there's no private market, what was the analogy people make sometimes? Like imagine

instead of going to restaurants and paying, you would basically go to all the restaurants and then at the end of the month, you would send all the receipts and all the bills to your insurer to the government and they would

reimburse you. Well, there'd be a line

reimburse you. Well, there'd be a line outside every good restaurant. Every bad

restaurant, you know, would be available. Um, the weights would be

available. Um, the weights would be terrible. The product wouldn't improve.

terrible. The product wouldn't improve.

You're basically running a small communist society inside a larger capitalist society. And that's what

capitalist society. And that's what we're doing in healthcare.

It's also what we're doing on roads, which is why we have traffic. Like, it's

the exact same situation on roads. It's

why there's, you know, there's no variable pricing for getting on the highway. It's why it's always clogged.

highway. It's why it's always clogged.

If you want to step on the third rail of healthcare for for a moment, think about this healthcare plan. Tell me what's wrong with it. Right? Imagine that the

first 20% of your uh annual income was your healthcare deductible. Doesn't

matter like if if you're broke and homeless, it's zero. If you're rich, you know, it's millions of dollars. Uh but

whatever your annual income is, the first 20% is your healthcare deductible.

And then the rest is paid by the government, the insurance system up to the usual caps that they have today. You

would create a private market pretty quickly. And so like in dental and

quickly. And so like in dental and plastic surgery and sort of a lot of optional medical procedures, you would actually get a competitive situation.

You get improvement. Like if you look at optometry, you know, with LASIC, you look at dental with like veneers and uh braces and all that stuff and kind of all the dental surgery stuff that they do. Or if you look at plastic surgery,

do. Or if you look at plastic surgery, like those fields do seem to be advancing because they're private payers. They have people who are, you

payers. They have people who are, you know, voting with their money. So we

need to do some equivalent of that in the normal healthare system. But people

lose their minds. They don't want to think one step ahead. They're like, "No, no, no. Well, what about the broke

no, no. Well, what about the broke person?" Well, the broke person has no

person?" Well, the broke person has no income. So, they're like, "Well, 20% is

income. So, they're like, "Well, 20% is too much for some people." Okay, you can put some deductible in there. But

generally, if you don't have some private market where people are paying a lot of the times for what are medical procedures, you're just not going to get this feedback loop that you're talking about. You're not going to get this

about. You're not going to get this ability to spend more money into the system. Right now, like very wealthy

system. Right now, like very wealthy people can't spend voluntarily into the system, but the prices aren't anywhere.

The rate cards aren't anywhere. the

system's not designed for it. It's like

if you go shopping for medical care and you want to pay out of your pocket, sometimes they'll quote you a price that's 10x what they charge the insurance company.

Have you heard Sid's story from GitLab?

Do you know Sid?

So, he was uh I mean had a massively successful IPO then was diagnosed with a rare cancer and has achieved has lived way past the prognosis, has really taken it into his

own hands. I think he went from kind of

own hands. I think he went from kind of he did frontline chemo and then there was one alternative that was available.

He exhausted it and the doctors were like, "We've got nothing for you." Since

I think like six or seven companies have come out of it, there's now 20 or 30 drugs in his escalation ladder. He's

still alive. Um years later, he's doing great. I saw him the other day and he he basically created his own like personalized medicines and uh treatment plan.

Yeah, there's there's a handful of these anecdotes that I've heard now. I it is really clear to me that at the high end if you just kind of have like you're not dealing with insurance you have the

resources you're like I want the full toolbox of modern science outcomes are possible that that like your normal like if you go and ask your doctor like oh what will happen if I do this they will just start shouting and

throwing things but it is clear that that much that like that crazy things are possible at the high end and I think that this type of like end of one medicine is actually going to end up being a really rich source of research for understanding how to build more

translatable things.

It requires a ton of agency from the patient in a moment where they're at their weakest, which is pretty uh ironic. My friend passed away from

ironic. My friend passed away from cancer and like last thing he wanted to do was research n equals one medicine uh because he was just, you know, like like

dying by the week. But this is where AI should really shine and come up with the right solutions and democratization of like what can you actually do when you find yourself in that situation. It's

kind of crazy how few people get access to this just from a knowledge perspective, not just monetarily speaking.

How much autonomous software do you guys have in your organizations that's running on its own or near autonomous and improving on its own? For us it's the a lot of the infrastructure is

already autonomous because we have uh we have this capability uh that fires off uh upon finding anomalies which I recommend everyone creates a version of this or Verscell offers a version of

this but upon anything happening that's anomalous today the mo most engineering organizations are responding to this by setting up alarms or uh like monitoring

thresholds by hand which is pretty insane but that's actually how the entire industry works you say if my error rate increases by this amount uh at this API endpoint do this so we've

actually uh automated a lot of the S sur job uh site reliability engineering so anything uh uh any metric that uh slows

down speeds up uh throughput changes whatever fires off an anomaly alert an agent investigates that an agent can decide to create an incident if the

incident is filed, people get looped in and the agent begins the process of remediation. We're doing everything

remediation. We're doing everything except for like the actual like giving the tools for the agent to like you know change prod but we're basically like serving solutions on a silver platter to

engineers and then the other thing that's working really well for us is just autonomous optimization processes um and autonomous security research. So

the other we open source this tool called deepseac it's [ __ ] incredible.

was like mythos, but you get it today.

Uh we run it against our entire monor repo using 10,000 concurrent agents in the cloud and it found basically several quarters worth of security research uh

progress was made in um basically a couple days and $14,000 worth of tokens.

So I'm talking about like months worth of red teaming, security research, entire teams of people. Uh and so we're basically now running like this periodic because the the other problem with AI is

that cyber security is becoming a nightmare. Um there's way too many

nightmare. Um there's way too many vulnerabilities, way too much work to do. There's too powerful adversaries. So

do. There's too powerful adversaries. So

you have to like basically be investing very proactively. We're running a lot of

very proactively. We're running a lot of autonomous security research. Um so S sur and then optimization work are very obvious. Uh you've probably seen on

obvious. Uh you've probably seen on Twitter there's people translating code bases from language A to language B.

Like a lot of the work that if you if you already put in the work to get a working program optimizing it or rewriting it in a native programming language or things like that is now becoming quite um quite doable uh with

with Frontier models.

I mean just for my own vibe coded app I built a uh bug reporting queue for my test flight users and they can report bugs from inside the app. It uploads the logs in a screenshot. And of course,

they use for feature requests, too. So

then I just have a simple Damon go through, compile all the bug reports. It

actually proactively analyzes and fixes them in the background. And then it ships me a test flight version to try out before I ship it to the testers. And

then for feature requests, I just have it right now compile them, but I could see an app in the future could literally be built by the users. Now, I'm not saying that's a good idea. It might be a mess, but at least it can take the bug

reports and stuff.

We should ship that, by the way, just to see what happens to the social experiment.

Yeah. Yeah. The social experiment, right? You you end up with like that

right? You you end up with like that Homer Simpson car where it's got an umbrella and like a flashlight, you know, a clown horn and so on where it's got every feature. But definitely for bug fixing, you could do that. We did in

a way a version of that experiment where uh I stopped all project work across the entire company for a week and said everybody from the receptionist to the engineers uh build whatever you think is

the most important thing to build. Uh

the only requirements you have to use AI and you have to demo it for the whole company when you're done. I expected we would get a large number of silly projects and a small number of needle movers. And what we got was a large

movers. And what we got was a large number of needle movers and a very small number of silly projects. And wow. Yes.

Yeah, that's a great experiment.

Yeah, two or three are like trajectory changing like they would absolutely change the direction of the company. But

the what this surprised me the most was literally the receptionist like the shipping and receiving associate whose job it was to like take packages off a truck and like email people when their

like stuff came into inventory uh build an automation for that and uh and that that we're actually using. The

conclusion I kind of is like, wow, like everybody has some idea of what could exist that would make the world better, but that many times their first order ideas are stupid and they don't have the they don't have the ability to project

that out and kind of see that it's stupid. But if they have the ability to

stupid. But if they have the ability to go from idea to an actual thing, if it's not working, they can react. They can

iterate. And if you give them a week, by the time they're at the end of the week, they've actually built something that makes sense.

But imagine if all work was like that.

like how can you set up a workforce that does not do the work directly? All they

do is train the agent that does the work for them. And we've done this as well

for them. And we've done this as well like you have to remind folks and you have to like create hackathons and hey let's build agents. Uh and obviously there's a lot of people there's a culture change happening like there are

a lot of people that are just coming in who intuitively know their job is to not work on the thing is to actually train the agent that works on the thing. But

I'm curious about like, you know, what what does the autonomous company of the future look like?

It could get a lot crazier. Maybe you

just turn on all cameras and the agents just watching everything that's happening. It see the shipping and

happening. It see the shipping and receiving thing is very inefficient and it creates the app presents the app.

Did you see that? Zach installed this thing into everyone's machines. He's

thinking about it. It's like we saw this too like we're we're um we're we're likely going to ship a feature into AI gateway that allows people to opt in into preserving inputs and outputs and

then you can say for all of my inputs and all my outputs can you extract the skills of the things that I like learn from my work and then dump it as skills uh so that I can even download them for

myself. But you could imagine people in

myself. But you could imagine people in in companies wanting to to share and and pull this together. It's funny because for me that's so unimaginable for my own work because my own work is not repetitive. I look for things to

repetitive. I look for things to automate. There's almost nothing left

automate. There's almost nothing left for me to automate for my own work. And

I and I hope that's where kind of everybody ends up, right? You just work in your maximum zone of creativity and interest at all times. And like if there is anything left to automate, you should automate it. Get it out of your life.

automate it. Get it out of your life.

It'll free you up to be creative and that's where you generate all the value.

But I think that's very hard to see in the job career mindset because you hire people to do the same thing over and over and that's going away and that's really scary because people like, well, what am I going to do? Well, you're

going to do creative things. You're

going to come up with new things and you don't have to come up with a new thing every day. That's impossible, right? But

every day. That's impossible, right? But

you're going to come up with a new thing once in a while that will then create something else, some point of leverage for you. But it it is it is a scary time

for you. But it it is it is a scary time for people for sure. If you've been doing the same thing over and over for 10 years and now all of a sudden it's like, well, now you're going to train an agent and automate it away, that's scary.

I think historically it was the returns were like 70% intelligence, 30% agency and now it's going to be 70% agency, 30% intelligence and that will that will

shift further as the models get better and better.

I'm actually not sure about that, Max.

I'll take the counterpoint on that. I

think it's 99% intelligence and 1% agency because then the agents will exercise the agency, right? You will literally be like, "Hey

right? You will literally be like, "Hey agent, I'm making smart decisions and thinking big thoughts. Just go implement stuff." In fact, sometimes I want to

stuff." In fact, sometimes I want to build features on apps uh that I'm flowing out of vibe coding. I'll ask the agent, "What features should I build next?" You know, go look at the logs, go

next?" You know, go look at the logs, go look at the users, what should I do?

To be clear, I'm talking about the returns to humans. um the the humans that will be best fit for the future will be the ones that are more agentic which is to say like the ones that can come in and just have the thought of like I'm going to open cloud and be like

what should I build versus watch YouTube and here's a fun experiment I'll bet you we all know a lot of people now who are coding who weren't coding before including many cases ourselves right so the number the percentage of coders in

the ecosystem has probably gone up by might be 10x right yeah it might literally be 10 times as many people are coding now than we're coding a year ago it's wild our signup numbers

are through the roof and there's this new class of people who are not engineers.

They just use the infrastructure. But I

think it might be like podcasters and YouTubers and like people posting on X.

The majority of people are still not creating code. Like I go to people and

creating code. Like I go to people and I'm like, "Oh man, vibe coding is so much fun. It's more fun than like I I

much fun. It's more fun than like I I had a little gaming group that I used to play video games and FPS's to blow off steam." I completely stopped playing.

steam." I completely stopped playing.

All that time went into vibe coding instead. It's more entertaining. you get

instead. It's more entertaining. you get

something real out of it and but the feedback loop is just as tight or even better. And I went to my other friends

better. And I went to my other friends and I was like, "Hey, you should be vibe coding instead." And they just gave me

coding instead." And they just gave me this blank look and I'm like, "No, no, you don't understand. Building things is so much easier." But I think to them it was always like some blackbox process in

the background. They never understood

the background. They never understood it. They assume maybe you were just

it. They assume maybe you were just talking to computer all along. So they

don't see what's changed. They don't

realize it's a lot easier. to them just that starting to Max's point the starting is so impossible to imagine and hard they don't do it. So we might have taken you know 0.01%

of the population writing code to maybe now it's 1% call it a 100x increase but 99% still never going to write code. So

we are in this weird space.

It's crazy. It's like it's a video game and it's a great video game but real stuff comes out.

Yeah. My fiance was up all night last night because she couldn't go to sleep because she was hacking on something and of course she wasn't writing any of the code. But it's just like it's addictive

code. But it's just like it's addictive in a way that programming hasn't been for me for like over a decade. It's

amazing because it's like a lottery for people. I think the normies normies have

people. I think the normies normies have gotten a little more into the vibe coding but through uh models that are more media models, video models for example, right? More people probably

example, right? More people probably fooled around making videos and images than they did writing code and apps. The

problem is like I don't video has its own issues, right? Maybe someday we like make me a great movie about X and I'll just spit out a good documentary, but right now they don't have the taste or the judgment. This is a bet that I have

the judgment. This is a bet that I have with uh Andre Carpathy was like what's the year that you'll be able to just dump in a book and get a movie out? I

think closer although I think he has come down substantially in timeline since we made this bet a few years ago.

By 2030 we're going to have like dozens of Lord of the Rings. Like there's going to be some fan who's like he did it wrong. I'm going to make my own take. Um

wrong. I'm going to make my own take. Um

like the famous stories or like the one of my other benchmarks for progress in AI is I'm a huge fan of of a series called The Expanse. Um there's a series there's a TV series and there's there's nine books and they've made the first six books but they haven't made the last

three books and there's meaningful divergences and I just I haven't gotten in I haven't read the books. Like I'm

looking forward to I can dump in the last three books conditioned on the TV series and be like generate the last three seasons.

This is coming.

That's a great feature. Yeah. But that's

in a way it's easy because there's already all this reference material.

When you said get me the next Lord of the Rings, I was really excited because we haven't really had a breakthrough in imagination.

Oh, we're going to see that in culture the likes of Harry Potter and Lord of the Rings. I I'm really excited about that and that will be the I agree that that will be the more exciting one.

What can humans uniquely do? This gets

back this gets to the core issue. What

are humans going to be able to uniquely do, right? And I think Max, you're an

do, right? And I think Max, you're an AGI maximalist. So for you it's nothing.

AGI maximalist. So for you it's nothing.

Agents will do everything.

I'm not like antihuman, but I just like I think it's going to be we will have to find like if your identity is how smart and creative you are, you're going to have a bad time.

Yeah. I I guess I'm still on the other side of that. I think that creativity is still the thing in the environment that surprises you. You step out of the

surprises you. You step out of the system and do something that wasn't even imaginable within the system. It's

outside of the training data. It's out

of the out of the distribution of data that was fed into the system. And I

think there'll always be room for that.

Have you noticed that every cla website looks the same? And and people like basically like dial in what a cla website looks like once you get enough generations out of the model. Like

there's a look it's this serif font.

It's brown and cream and they use monospace fonts with a certain amount of spacing. Like after a while you get this

spacing. Like after a while you get this this distribution that you say well this this is not creative. This is slop that came out of claude. It's not going to be

human versus computer. It's going to be human with computer versus just computer.

Just computer will eventually happen, but we're pretty far away.

But the computer is going to be able to produce these crazy super stimuluses that like it's going to be it's going to make the entertainment. And I mean, we kind of see a weak form of this in in

Tik Tok. Um, and so when you think about

Tik Tok. Um, and so when you think about the going like my my personal definition of art is meaningful out of distribution behavior. And so this is something that

behavior. And so this is something that kind of is surprising in some way. Feels

like you're kind of moving in the Z-axis. Like you're surprised that the

Z-axis. Like you're surprised that the thing was realized, but meaningful. Yeah.

but meaningful. Yeah.

Meaningful means that like it it somehow to me means that it somehow changes your like future trajectory through the universe. Like your life is somehow

universe. Like your life is somehow different for having thought about it and and reflected on it. Well, my

definition of art is completely different and leads to a completely different outcome. Sorry to interrupt.

different outcome. Sorry to interrupt.

No, it's interesting how just by your definition, you get to a different premise. That's the extrapolation of the

premise. That's the extrapolation of the axiom.

Yeah. I mean, one of the things I like about my definition is that it's so broad. Like, there can be like military

broad. Like, there can be like military maneuvers that you can be like, that was art.

And I think we're going to see this all the time. We're going to see move 37s

the time. We're going to see move 37s all over the place. Although, I'm

curious what your definition of art is.

I mean, I have multiple definitions, but so it's not like a concrete I haven't packaged into one thing. But I do think of art as something where you convey emotion. You convey something you felt

emotion. You convey something you felt to another person. And so you create some object or something or that that creates that that takes an emotion that you felt inside. And so to me, a

computer almost by definition is incapable of doing it. The exact same piece of art uh without intent behind it is sort of meaningless. Now you can also argue nature is art like beauty in

nature like you see a sunset, right? Not

let's say human. So that one I would call it's pure intelligence working without motive. There's beauty for

without motive. There's beauty for example in a sunset because there's an intelligence there. There's a complex

intelligence there. There's a complex system at work there and your brain recognize it and there's no motive there. So no ego gets involved. But art

there. So no ego gets involved. But art

in kind of the more human sense I think of as someone felt something and they wanted you to feel that thing or they wanted to feel that thing again or they wanted to capture the feeling they had with that thing and so they created the

thing. Attribution to who created it is

thing. Attribution to who created it is going to be really important. So like

for example a beautiful photo, right? If

a person takes the photo versus AI generates the exact same photo down to the last pixel, the person taking the photo will have more meaning for me.

I just invested in a startup that does verifiability with hardware at the station that someone some human actually took a photo which is going to have a lot of really cool use cases.

It will we will be drowned in slop. No

question.

Do you remember the control net stuff from like a year or two ago? There was

like there's one particular scene of like it was like a medieval village. It

had like a swirl in it. Do you remember?

Yeah.

That was AI generated and that was the one of the first times I looked at this and thought it was really cool. Like whether

you want to call it R.

But that one doesn't that one break your your premise because some human came up with the the training and the prompt to arrive to that really cool riddle. By

the way, it's totally possible that an AI can also do that in the future. But I

give whoever came up with that idea of the optic optical illusion control net, I give them more credit than the I think the bar is going to be raised massively. Like it's going to take more

massively. Like it's going to take more and more to surprise you. It's going to have to be more and more impressive.

Like Studio that's already happened.

Yeah. Like like OpenAI destroyed Studio Ghibli for everybody. Nobody wants to see that Studio Ghibli work ever again, right? It's been done. And so

right? It's been done. And so

Oh, that one also has I have a counter point to that one. Like have you watched real Studio Giblly? It actually looks so much [ __ ] better than the soft that open put out. Like watch it again now.

It's impressive.

Yeah. At the point where you've seen tons of Studio Ghibli things everywhere all over the internet. It is now in distribution. It's no longer surprising.

distribution. It's no longer surprising.

The art value has been around.

That's right. That's right. No, your

surprise definition still works. I just

think that that humans are the ones who can generate surprise completely out of the data distribution. And I think they can do it with intent and I do think intent matters for meaning. So to your meaning point, right, you said meaning

and surprise, right? Right. And I guess what I would say is that humans can steal the ones generate surprise out of the system. For example, let's say you

the system. For example, let's say you took an AI and you trained it to be perfect at mathematics, right? The

perfect mathematics AI and it's within the formal system of mathematics. And

then Kurt Goodle comes along and he has something completely outside of the system, right? Girdle's in completeness

system, right? Girdle's in completeness theorem. It was completely stepped out

theorem. It was completely stepped out of the system and used uh attributes of physics to basically break the system.

So that kind of thing I don't think an AI could get to. So there's always room for creativity outside. surprise and

then the meaning comes from the fact that a human was involved that they did it for a purpose and they conveyed something. So maybe I can interpret your

something. So maybe I can interpret your definition my way, but we'll see how it plays out. I'm a little more optimistic

plays out. I'm a little more optimistic about humans.

So if you train an AI model, it's trained on some data distribution. It's

trained on some tokens. It then learns some some distribution of language and the structure within that. Um, is it possible for an LLM or transformer to kind of go out of distribution, have

like a new idea that was not present in the training set somehow? Well, the

training sets are so large that it is hard to imagine ideas that are not within the training sets somewhere. But,

uh, if they exist, they probably lie in the natural domain in physics and interaction and feeling and emotions and evolution in things that the it's not subject to. So, I do think that there's

subject to. So, I do think that there's still things outside of language, but language does encapsulate a lot.

Language is a great compressor and we've got a lot of it.

But I mean you can get to these other things through selfplay that selfplay and sensors like cameras are sensors like our eyes are sensors.

Yeah. I mean I think the question is how do you what how do you go out of distribution without randomness? So in

the case of like RL you can get randomness like you can sample an action from a distribution of an action space and you can get randomness that can take you down these walks into new territory.

But I think the the real to kind of turn this around is like can humans go out of distribution? Where does any new idea

distribution? Where does any new idea come from? Are we also dependent on

come from? Are we also dependent on randomness to get us into these new territories?

We're not dependent on pure randomness like like natural selection works through pure randomness, right? Where

you just mutate a gene and then see what happens. But with humans, we seem to

happens. But with humans, we seem to have this ability to cut through infinite space and get, you know, just eliminate huge swats. And so our creativity makes sense within the larger

scheme of things. That seems to be one of our unique capabilities. And um maybe AI is starting to do at the edges as we're seeing with solving some of these math problems. But even math is a very bounded domain, but it's a big one. I'm

not I'm not saying it'll never get there. I don't have that confidence. But

there. I don't have that confidence. But

I think at least at the moment, I would say that uh truly stepping outside surprising people, that's still a domain of humans. And I think humans plus AI is

of humans. And I think humans plus AI is where it's all moving to. Like human

without AI, forget it. Pure AI, I don't think is there yet. But I think human plus AI, we're in that era. How long we stay there, I'm betting is longer than people think. I think humans will have

people think. I think humans will have an enormous amount of value. Um, in

fact, more value. All of us, everyone here, our productivity has gone through the roof. And basic economics normally

the roof. And basic economics normally says that when someone's productivity is higher, they're wealthier. They're

better off. You actually hire more of them, not less of them. Maybe some of you are not hiring junior people anymore, although I don't know if that's necessarily true. I don't think of it as

necessarily true. I don't think of it as junior versus senior. If someone is really good with AI and they're really smart and creative, I want to hire them more than ever because the leverage I'm going to get out of them is incredible.

That's a new requirement. And we're

hiring juniors and super seniors as long as they're really good with agents and really good with AI and and quick to adapt.

And a lot of them don't need to be hired anymore. They can create their own

anymore. They can create their own thing.

My my hypothesis is we end up with a larger number of smaller teams. Like the number of people required to accomplish the given task drops by a lot. And you

know people who only see first order of facts say oh my gosh all the jobs are disappear because I can do I can do a jet engine with two people. I don't need a thousand you know 998 jobs are gone.

But what it actually means is you can create a lot of different chat engines.

I think that's exactly right.

I think there will be goes back to Naval's point. I I think the thing

Naval's point. I I think the thing that's uniquely human is the creativity and what's been missing for you know a lot of people can be creative but they don't know how to turn their vision into

a real thing that's changing. So I think we're have an explosion of an of entrepreneurship explosion of founders and a very large number of very small teams because you don't need many people

to accomplish something.

Yeah. I think like look AI provided uh base level intelligence and uh domain knowledge uh and cut through all the jargon and then now agents actually

provide a lot of agency. So the main things left are creativity, taste, and yes, you need enough agency to get started, agency to stick with it, but you don't necessarily need the agency to like spend 20 years learning one thing

before you can dive into it, make a contribution. And so that barrier going

contribution. And so that barrier going down, generalists are having a field day. And at the end of the day, we're

day. And at the end of the day, we're all generalists. All of us like to think

all generalists. All of us like to think about everything. We don't like to be

about everything. We don't like to be just trapped in one thing. Like Max is here talking about consciousness and the FDA and brain science and creativity.

Like all of us are trying to think about everything all the time. And so, uh, people in Twitter who are always fond of saying like experts, credentials, sources, right? Those are the guys

sources, right? Those are the guys getting hurt because the expertise doesn't matter. You spent 5 years, 10

doesn't matter. You spent 5 years, 10 years getting a PhD in XYZ, you know, hopefully develop your creativity and your instincts and your taste and your judgment because if all it did was help you memorize a whole

bunch of things and jargon and, you know, learn some scaffolding stuff, well, AI will cut right through that.

It's like a, you know, calculator times a billion or, you know, bicycle for the mind but accelerated. So I I think it's about people with AI versus people without AI. And so the single best thing

without AI. And so the single best thing you can be doing right now for yourself is just getting really good with these tools, getting comfortable with them and always knowing the edges of the boundaries of what they're capable and what they're not capable of. And that is

a moving target.

Loading...

Loading video analysis...