Full Episode: The AI Industrial Revolution
By Naval
Summary
Topics Covered
- The 1000x Engineer Builds the Factory
- Two Engineers Can Now Design a Jet Engine
- AI Kills the 200-Page Compliance Document
- Healthcare Is a Small Communist Society
Full Transcript
Welcome. You're listening to the Naval podcast, your authoritative source for new knowledge. We're trying something
new knowledge. We're trying something new today. Uh I have three frontier
new today. Uh I have three frontier founders with us. Three good-looking
guys actually, and a fourth good-looking guy, Naval. And let me just introduce
guy, Naval. And let me just introduce everybody. Gumo the G Roush. Um he's
everybody. Gumo the G Roush. Um he's
building Versel into an AI cloud for the world of agents and whatever comes after that.
Good to be here. Blake Shawl, he's building supersonic aircraft in his own factory and jet engines as well. Blake's
company, Boom Supersonic. And then Max Hodak from science. He's building a biohybrid brain interface that grows living neurons on silicon to restore
sensory functions like sight, but then eventually to explore new parts of the brain and new senses. All three of these guys are not composing their products with off-the-shelf parts. They're
building their own factories and you know we don't care as much about what they're building exactly as we do about what they're learning about how they're building. What's the new knowledge
building. What's the new knowledge they're generating? What's their alpha?
they're generating? What's their alpha?
What principles are they discovering that other founders can learn from? What
are they trying to figure out right now?
And also what are the cutting edge or crazy ideas that they haven't even talked about yet and they're still forming in their brains. Naval, do you have any reactions to any of that before I jump into Gummo? Yeah, let's just have
fun.
Yeah, you guys should just jump in.
Yeah. So, I can't remember my exact quote, by the way, but I've been really pilled uh with this idea of software factories and the job of the engineer being something that you just show up to
work. You used to used to ship the
work. You used to used to ship the output directly and everything inside the company was, you know, how good is person A at shipping output B. And now
what's happening is the way that I'm judging you as an engineer is like are you producing the factory that would produce multiplicative outputs B through Z, right? Um and that's a that's a
Z, right? Um and that's a that's a pretty significant change because basically like we used to believe and it should be somewhat controversial that there's 10x engineers like now clearly there's 100x or a thousandx engineers
and the world hasn't fully adjusted to this. I used to get flamed on Twitter
this. I used to get flamed on Twitter for saying they're 10x engineers that it flies in the face of so much like equality philosophy that everyone's equal. But the reality is when you're
equal. But the reality is when you're operating in idea domains, when you're operating intellectual domains and virtual digital domains, it's not even 10x, it's 100x or thousandx and it always has been. Satoshi Notch, you
know, the guy who invented JavaScript, the Brendan I of the world, uh John Carmarmac, I mean these are thousandx programmers. Not to even mention if you
programmers. Not to even mention if you choose the right thing to work on versus the wrong thing to work on, that's an infinity difference and it could just be not nemer, just one who had a better judgment on what to work on in the first
place. And now obviously it's less
place. And now obviously it's less controversial because of uh AI leverage.
What's controversial is that the token leaderboards, right? Like people are
leaderboards, right? Like people are still getting a little confused because now they think, well, I have a bunch of 100x engineers. Look at all these tokens
100x engineers. Look at all these tokens that I'm paying for. I'm curious if you guys have seen the same like how do you measure ROI?
It's like the old measuring lines of code, you know, token consumption with lines of code feel like similarly not direct paradigms. I mean, my observation has been that
claude or cha GPT um or GPT is about is basically as good as you are in a domain. And so, uh if you're if you're a
domain. And so, uh if you're if you're a really capable developer, then these things are really powerful. And if
you're a junior developer, then you'll kind of find it to be like more of a junior developer. Like on the one hand,
junior developer. Like on the one hand, these models are incredibly capable. On
the other hand, the feedback that you give them sporadically seems to be incredibly important. And these little
incredibly important. And these little updates seem to totally determine the types of uh performance you get out of them.
There's a new kind of support that I give which is you come to me and like you didn't get good output out of the model and I tell you what to prompt the model with. So like the idea of like the
model with. So like the idea of like the quality of the reprompting which I think you're alluding to is is extremely important.
But I mean and to be clear I think that this will become less important over time like as the models get much much smarter then you'll be able to put in less and get more out. Um but at least at this stage it really seems to kind of
reflect back the judgment that the user brings in. In my experience,
brings in. In my experience, I've kind of resisted learning all the ticks and tricks and tips like, you know, there was a, oh, use Ralph Wigum, use Open Claw, use Hermes, use this
prompt engine, use this scaffolding, plug in this piece, you know, uh, always use plan mode. I just ignored all of that. I just assumed the model's just
that. I just assumed the model's just going to get better faster than I would figure out how to use it. It would
figure out how to use me faster than I would figure out how to use it. And so
I've just been completely hamfisted with them and I get frustrated at them and just sort of I I found myself typing less and less information and doing less and less work as time goes on with the
models because I just assume I can brute force my way through it and I'll throw Codex Claw and Gemini at the same problem over and over and just waste tokens to save time. And I think no matter how expensive these models might
seem, they're still way cheaper than a human. So I would say just waste tokens,
human. So I would say just waste tokens, save time. Don't look at the tokens
save time. Don't look at the tokens either as inputs or outputs. Just look
at your time and look at the final output. And even if they're writing
output. And even if they're writing lowquality code, which I know in many cases they are, it's not necessarily production quality or scalable code.
When the time comes and I want to ship it to production, I'll just throw more tokens at it. I'll say, "Okay, now go through look at it, rewrite it, and they're just going to get better every generation." Uh, so yeah, I don't I
generation." Uh, so yeah, I don't I don't see where this necessarily stops.
As long as we have verifiable domains and solve problems, they're going to resolve those problems. That's in the unsolved problems domain where maybe you're terren tower you're at the cutting edge of creativity that you need
to be you know working very collaboratively and carefully and closely with the model but I'm not in that I'm not at that level in software engineering and gear but you're probably the most extreme software engineer in the team right like out of this set
you're probably the one who most hardcore came up from a software background like how are you finding these models at the edge of their capability? Well, there's one thing
capability? Well, there's one thing that's happened recently that uh what you're saying resonates strongly with, which is it used to be that you would
give a prompt to the model and it kind of does it like classic like next token prediction thing and it like runs away with your idea. And models now have been doing this like intuitive planning mode
without to your point not even having to plan where it comes back to you and says look what you're asking me for there's these three routes we can take. there's
this set of tradeoffs that we're going to go down. That's a moment where like you know people do the whole thing on X like oh now we have a PhD level engineer model like that's very clear that the models at some point graduated. They
used to be junior engineers now they're principal engineers because they come back to you with a set of tradeoffs and obviously sometimes they [ __ ] which is hilarious. It tells you this one is
is hilarious. It tells you this one is going to take three weeks and this many interest it make really bad predictions but clearly it's now this like I respect
the models a lot more as a as a peer like that I'm going back and forth intellectually with but there there are a lot of gaps still so like if you're really really proficient engineer or
architect you I think you're still extracting more juice so the question sort of that Max was positing of like if you're junior do you get junior back.
Well, clearly not because a junior gets more advanced knowledge in code that they would have never been able to write by themselves, but doesn't an experienced architect get 10x whereas a
junior engineer gets 2x. That's what I'm kind of trying to figure out still.
Yeah. Yeah. But I mean, I think there's there's architectural decisions. So,
when you think about the development, I'm seeing this now with some of our the junior software engineers on the team of like what is the next step in their career progression. It's going from like
career progression. It's going from like writing implementation for a feature to picking technologies like choosing between Postgress versus some other database or picking between ZMQ versus
some other message Q or like some other queuing system and those I mean the models can suggest them but that's the thing where you'll see it and you'll be like no no I want to use this other thing. That's the type of little
thing. That's the type of little feedback that I'm saying really matters in the types of output that you seem to get at this point.
Taste taste and judgment right? Taste
and judgment. That said you can ask them which one should I use and why and they know everything. they'll give you really
know everything. they'll give you really good trade-offs. That's the change I was
good trade-offs. That's the change I was saying has happened recently where you would say hey go and um put this super high cardality telemetry data into Postgress and it's like no bro like we
don't put that kind of data into Postgress like you should consider click house or Athena or whatever like that's happened to me a lot which is really
impressive um but I the thing I'm still like kind of struggling with is clearly the human is still completing the model
like at one point is it the other way about like the the human is the one sort of getting the instructions back on like go get me this API key because it's something that only you can do uh or get
me this amount of capital for my next set of investments that I need to make.
Uh you just watch like that clearly we're still not there yet. That's a
temporary aberration. Pretty soon every good SAS company or hosting provider will have a CLI and API interface that the models can meet directly. they don't
even necessarily need an API like as long as it's like textbased Unix based the agent can hack its own API and then the money part you insert crypto tokens you know put in bitcoin
put in whatever and the model goes and just pays for whatever it needs and I think like you know there people working on this but the thing I am now thinking through
is is pure software dead like is pure software engineering like an obsolete thing it's like saying speaking English right the models now speak English we had to learn code to communicate with
the models. Now the models speak English
the models. Now the models speak English and they speak fuzzy sloppy English like a human and they understand things. So
where's the moat like for a founder?
Hardware it's a boon you know like now if you had to build hardware it was hard to build a software company alongside like Patrick Cullison says software is art and it's hard to hire artists. So
now as a hardware founder great you can have really good software develop fairly quickly. Um, if you're creating models,
quickly. Um, if you're creating models, maybe that's the new software engineering, training models and tweaking models and post- trainining and fine-tuning models. But classic software
fine-tuning models. But classic software engineering is that dead is pure software investable is pure software something organize a company, a team around and try to get some leverage. Did
you guys see the uh there was an article an X by Mitchell Hashimoto called the block economy or the building block economy something like that like his argument is that the most useful thing
for agents to have now is really powerful reusable building blocks because to Max's example you wouldn't expect your clanker to reinvent a Q infrastructure system every time he
needs to send an email it needs to bring in the right building block that's right size for the task that you're asking for and say, "Well, okay, for this one, it's
B MQ." I challenge the notion that I
B MQ." I challenge the notion that I would want the agent to reinvent the entire universe from first principles in a way that's incompatible with the rest of society and civilization. Like it's
almost like reinventing highways, laws, policies, etc. just for you. Even if
there's a potential for extra optimization, extra juice that you can get out of it, there's a still a um sort of like cooperation at large scale value of saying we're both depending on
Postgress 13.2 and so that's still really really really valuable. I would
say like the category of infrastructure software and building blocks that these agents are going to use obviously embias is this what we're building seems extremely valuable and I don't see the agent anytime soon and by the way you
could even another metaphor I've been using is like agent's already been created that the models can reuse is like a token cache because you you don't want to churn through a trillion tokens to reproduce what's already existing. Uh
and so there's always starting points that the model can fork off from, but it's going to change things quite profoundly.
So these are like libraries and dependencies, but for models.
Yes. For agents specifically.
To Naval's question though, I mean I learned a program when I was really little. And I like that was the thing
little. And I like that was the thing that through all of like being a teenager and in my 20s like I get like sucked into it and just like code for like 20 hours and it was super fun and I knew all this stuff about programming
languages. I haven't written a single
languages. I haven't written a single line of code in quite a while now. And I
mean, partly that's because my job is different, but also since December, I've built a huge amount of software that I now use every day. There's all these projects that I've kind of fantasized about for years that now I'm like using
um that I've actually built and I didn't write any of that. And I just can't imagine going back to like actually writing code by hand anytime. Like I
mean, I'm unlikely to do that anyway, but just like in general, I see that I have a hard time seeing that as part of the future.
Yeah. There's something really cool is that you understand how the pieces click together. Like I feel like anyone that
together. Like I feel like anyone that understands what an API is and how data flows, inputs and outputs, performance because you kind of you have to orient
the model around like this is a certain level of expectation that I have out of this operation like that's that's always been infinitely more useful than um than writing code. Like I feel like a really
writing code. Like I feel like a really good a proficient engineering leader has been quote unquote like vibe coding through people on Slack or one-on- ones because
you're transmitting your will, your intent, your experience and you're letting others run with it. Uh it's just that now we do the same but with agents.
Uh and so I think that's why you've been successful with it. But I don't know that everyone sees the same level of success. I
success. I mean I went from not having written code in 20 years to I'm coding all the time now. but through agents and I'm building
now. but through agents and I'm building tons of software and it turns out that just understanding the basic principles of software engineering and algorithms actually gets you a long ways because the reason I stopped coding was because
I didn't have time to figure out the latest language latest architecture infrastructure pieces to plug into and I know Verscell makes it a lot easier but even then just getting started was a bare like just plugging pieces together
assembling infrastructure was just so annoying. The thing that really changed
annoying. The thing that really changed is I mean it used to be that you could build a lot like you like there's a lot that was straightforward but then you would hit some random thing and then you could spend
kind of some indefinite period of time debugging some narrow thing and now with the agents what happens is you just don't get stuck anymore which is pretty amazing or they get stuck it's removed well no I mean like relatively quickly they can find like
the right way to do things and it used to be that like I remember when their friends learned a program be like nope it's just like intrinsically frustrating like if like that's part of the feel that's how you learn and that just isn't true anymore.
Blake, how are you applying all the stuff at uh Boom Supersonic?
Yeah. What I found is it completely changes the role of software and hardware developers. The thing that we
hardware developers. The thing that we did from day one was uh try to take a lot of traditional engineering workflows and I mean hardware engineering workflows and turn them into software.
And so if you haven't been around hardware engineering, let me see if I can make this more clear. there uh
there's a lot of engineering hardware engineering that happens in Excel spreadsheets on engineers laptops in a silo and it's very complex uh
spreadsheets sometimes like VBScript code and all of this is actually software but it's it's treated as if it's not software there's no there's no source control there's no automated
testing if you want to hand something off from like an aerodynamicist to a structures engineer that's done manually with like a spreadsheet over email like it's the 1990s. It's terrible. And so we
we started building these kind of like software frameworks that can automate and make repeatable hardware engineering flows. The idea we could reduce the cost
flows. The idea we could reduce the cost of iteration. Um but it was it was
of iteration. Um but it was it was slowgoing because we could never get enough we could never like afford enough software engineers. And what we've
software engineers. And what we've gotten into is this uh mind-blowingly different model where the software engineers actually create the architectures because they understand
systems, they understand algorithms, they they understand, you know, division of concerns. Uh and then the hardware
of concerns. Uh and then the hardware engineers can vibe code their pieces because what they know about hardware engineering and the result is just like mindblowingly different productivity for
small teams. like give an example like like if you're designing a turbine blade like classically so a turbine blade starts like uh cold but when it runs it's hot so it gets bigger and so you
have to design both the aerodynamics and the structural design of the thing to work on it cold shape and this hot shape and so you have to convert between cold and hot and you convert between structures and aerodynamics and this
takes like one engineer one day for one blade for one piece of the analysis and there are like a thousand blades in a jet and and so you can't do much and we literally now with a combination of
software and hardware people created the solution you can change blade geometry you can see in real time the structures and aerodynamics results and so it allows two engineers to design an entire
jet engine which is just wildly different. One of the things you
different. One of the things you mentioned is that you have software engineers creating the tools and architectures for the rest of the engineers. That to me is the biggest u
engineers. That to me is the biggest u the cataclysm of enterprise software is that there's no like startup that builds hardware collaboration tools that can sell you anything anymore because in
internally you're just coding the right things that you need at any given time.
Even spreadsheets are kind of cooked, right? Because the reason spreadsheets
right? Because the reason spreadsheets were successful is that no one could build custom software. So the thing that approximates custom software the most is a spreadsheet with a bunch of EV script
functions. I personally have moved
functions. I personally have moved almost entirely from uh Excel to Python models where I can actually like get like believable simulations of things.
Yeah. I mean the thing that that AI hasn't come to yet that I think it it will within the next year like probably within 26 that will be very very exciting is right now it can generate software but soon it'll be it will
generate step files and PCB layouts. And
when it comes for mechanical and electrical engineering, that will be a whole other thing that we haven't seen yet. That'll be very, very cool.
yet. That'll be very, very cool.
Yeah. On the hardware side, I think it's really a boon for like all these little gadget companies and part companies that write really bad software because they can't make great software and now they're going to be able to make good
enough software or it may not even software that is a human front end. It
might just be completely agentic for an agent to access and you just talk to it through voice and control hardware. And
I this is why one of the reasons why I think for example China is big into open- source models right they're basically going all in on it because they have hardware superiority they have these very complex supply chains and
component chains and they're basically saying hey if I can just generate software on demand then I don't have this disadvantage anymore against Silicon Valley. So that's not the only
Silicon Valley. So that's not the only reason why they're doing open source. I
think they're also behind. They're
distilling models. They're catching, you know, they're collaborating on resources. But I think the Chinese
resources. But I think the Chinese government has a history of funding efforts that then sort of help their entire ecosystem along, especially in network effect businesses.
And so I think they want to like uh pull all their resources, catch up on AI, and use it to give their hardware stuff an advantage. And ironically, they're doing
advantage. And ironically, they're doing all the open source stuff because Open AI is not open. You know, Gro publishes models, but I think they're a model or two behind. Uh, Google has some local
two behind. Uh, Google has some local models, but nothing really that competitive. Anthropic, to my knowledge,
competitive. Anthropic, to my knowledge, I don't even know of any open source models from them. So, all the open source heft is coming from China. It
helps all our hardware founders, but it helps their hardware founders and factories and so on that much more. But
all all the crappy little software that goes with all the little random knickknacks and thingamajigs that you buy off of Amazon for to tinker with a lazy Saturday afternoon, that software
is getting a lot better very quickly. I
think everyone's had the wakeup call that without great frontier coding models, you don't have self-improvement.
And so imagine China as a whole not having the ability to produce frontier everything, right? It's not just
everything, right? It's not just producing software is in any piece of this hardware pipeline like Blake was saying like you need to generate software. If you fall behind on your
software. If you fall behind on your ability to generate software, you fall behind on the ability to generate everything. One thing I'm curious about
everything. One thing I'm curious about from you guys is like cuz everyone loves to talk about Chinese models like do you use Chinese models? Do you know anybody that uses Chinese models? This is an
argument I had yesterday actually which is uh one person at the table dinner was claiming that uh you know you'll just use deepseek for 97% of things because it's so cheap and if you need more
intelligence you'll just run it over and over again the same problem and you'll only use the open AI anthropic etc models for the most advanced tasks and I was kind of like I don't know I think intelligence is an unalloed good you
always want more intelligence and when these models make a mistake you don't know it and it's always cheaper than a real person and real time. So, you'll just use the most intelligent model available, which isn't great news
necessarily because it means that, you know, you're going to end up creating a monopoly or igopoly kind of situation in AI. But, uh, I always want the most
AI. But, uh, I always want the most intelligent programmer. I always want
intelligent programmer. I always want the most correct answer. I always want the best judgment. And given the amount of leverage that I'm going to pour into it through capital and code and people and, you know, marketing, I want to make
the right decision every time. And often
when between two models, let's say like I have one model that I know is a little smarter than the next one and they both give me answers. Often I actually don't know which is the correct answer, right?
So if I know one model's a little smarter, I'm going to go with that answer and eventually I'm going to stop asking the model that I think is less intelligent. But I don't know, have you
intelligent. But I don't know, have you guys found a use for the these, you know, so-called less intelligent models?
We see uses so that so we have the AI gateways uh data that basically like every application agent, etc. goes through. And so there's definitely usage
through. And so there's definitely usage of open models, but the top is like heavily dominated by the frontier intelligence. And there's a subcategory
intelligence. And there's a subcategory or there's like a caveat to that, which is that frontier intelligence at reasonable cost and performance like slaps at scale. So like people don't get
really excited about Gemini, but they put out these models that are like super smart at the right performance cost combination and for a lot of tasks other
than co coding actually interestingly enough. Uh they're the best models.
enough. Uh they're the best models.
They're like the best like industrial production models. Uh you can throw them
production models. Uh you can throw them at like support tasks or browser automation. Like I would always put a
automation. Like I would always put a Gemini model there. Uh and I would look to Chinese models for those kinds of things. But anytime I'm working to push
things. But anytime I'm working to push the frontier, you need the best possible coding model. And that's basically now
coding model. And that's basically now like two or three models and uh and the Chinese are not certainly not in it.
Hey Max, you're pushing pretty hard into vertical integration and extreme urgency. Do you want to talk about that?
urgency. Do you want to talk about that?
Yeah, I mean for many things we um you can't buy it so you got to make it somehow. Our preference would always
it somehow. Our preference would always be to buy something um like if there's a vendor that offers a service at a great price like for example like PCBs like we don't make PCBs like those are they're basically free you can buy them in
unlimited quantity from Asia but the the closer that our products get to being like a single block of coalently bonded matter the better they'll be lower power smaller higher performance last longer
and um there's just like there like the components aren't available and in order to do that type of integration be able to actually innovate beyond things just piecing together things that you can buy off the shelf which really is is very
very limiting. I guess you have to like
very limiting. I guess you have to like learn it to do it yourself and that shows up as vertical integration. So we
own a captive MEMS foundry on the east coast which we bought because there was really no other way to do the type of packaging and assembly stuff that we wanted to do and I think that all of this is going to be affected heavily by
AI over the next few years. It's not
quite there yet. In fact, ironically, one of the biggest impacts that we've seen of AI inside the companies in regulatory interactions because if we can do things like generate documentation or if we can ask like we
want to change, we want to evolve this product like there's thousands of ISO standards that might apply which ones do we have to comply with and like trace this through. This used to be like
this through. This used to be like you're like you're following a whole regulatory quality team for several months as they trace this and now the AI just kind of knows. Um, but when I think
about stuff like the the surgical program or the MEMS fab, I think ultimately the software still needs hands. Like it's going to be smarter
hands. Like it's going to be smarter than us, but if it can't make things, then like those are real real boundaries. And so we've instrumented
boundaries. And so we've instrumented our foundry as well as many other parts of the company in in ways where um as these models get better uh that should show up pretty immediately in in things
like the the cell engineering that we're doing and the material science that we're that we're developing.
It sort of makes me realize that like it's been a while since I've generated a basic legal document using a lawyer, right? I stopped asking lawyers for NDAs
right? I stopped asking lawyers for NDAs and you know agreement for this and sign that and research this and like all the basic legal tasks are gone too because you know there's the old joke that law
is like spaghetti code you know they have this very complicated code that they try to put in English and it contradicts this code over here and has to fit into that code over here and there no real APIs for it. Um but for
just like junior engineers and junior engineering I should say junior engineers basically got a promotion to senior engineers and junior engineering got taken over by agents and so the same way I think in a way the downside is you
can look at law and say you know parallegals just got fired or you could say parallegals just got promoted to senior lawyers and now they can spend their time thinking about the law. It's
actually kind of interesting to think about the parallels of how software engineering is evolving with lawyers because lawyers, you never know what they put into these documents exactly.
You just trust them. Like, hey lawyer, can you look at this document? Can you
tell me if it's legit? Can you do red lines? Whatever. Like, at the end of the
lines? Whatever. Like, at the end of the day, you're what you're valuing in the relationship with a lawyer is that is that they're a trusted authority. They
went to law school and they're putting their reputation on the line. Cool.
I think there's a parallel par parallel with like the biggest problem in software engineering today is these mountains of slob that end up as a PR and then people are say like there's all these memes on Twitter like way back in
the day we used to read every line of code of a PR. Well, in my world infrastructure I want engineers to be able to say I understand doesn't necessarily mean that you've read every
line of the of the PR. You need to be able to say I am signing off on understanding the consequences of this PR or I wrote the test harness
the simulations the proofs the type checkers etc to be able to say even without reading this I have confidence I can sign off on it's going to be safe in production and so it's it's kind of
interesting because there's a world in which we embrace that everything is going to be spaghetti code and that we don't fully understand it but we write the basically evaluator that give us
confidence and then we rely on like people uh like the infrastructure production engineers to say okay I'm fine uh sending this into prod you know at the end of the day like someone is
going to get paged if your systems go down and I think another thing that people are underestimating is that creating software is really easy 0ero to one but think about a thousand days from
now what does what does your software look like is it secure is it tested is it production grade uh is it performant and are you still motivated to invest
all of those tokens in maintaining it in prod I mean humans are becoming verifiers right and and that's kind of how we train these models with good verification data and now we need human verifiers so yeah I think a lot of the a
lot of the old function of people lawyers engineers operations people move to verifying the stack and saying yeah this is roughly correct and I I'll roughly stand behind it and I'll support you if it goes wrong
one of the things we see related to the regulatory is it massively reduces change aversion and improves iteration.
So to give you an example like let's let's say you're going to go certify an airplane. One of the zillions of things
airplane. One of the zillions of things you have to do is prove that it could withstand a lightning strike and the the regulatory documentation for the test
plan for such a thing stretches on for say 200 pages. And what you would classically do is hire a, let's be honest, not super bright engineer who's willing to be there monkey at keyboard
writing 200 pages of regulatory compliance documentation. And it takes a
compliance documentation. And it takes a couple months. And and by the way, if
couple months. And and by the way, if you change the airplane now, you want to cry because there's another like two months of rework of this like wrote kind of regulatory compliance documentation.
And what we found is you we can build a rag that will enable us to basically prompt our way through all of that work you know in let's call it minutes. The
first order effect is oh you save a lot of time. The the the second order effect
of time. The the the second order effect is if you change the specification of the airplane uh it now takes you know uh minutes not months. So you can actually be willing to change. And the third
order effect is you can now you know basically get rid of the not very great engineers and have a small number of really creative ones. They can iterate rapidly because the cost of change goes down and in a certain sense like the
entire regulatory burden which really hurts the ability to iterate drops away.
I think that this is a really undersold story in AI right now. I think the consensus in Silicon Valley is that like regulation sucks like any like we want to go faster, we want to rec realize this amazing future. We want abundance.
We want just like prosperity and stuff that slows down that future is just kind of to be avoided. And certainly I think we've overregulated. We've made it
we've overregulated. We've made it impossible to build stuff. It's just
like it's totally crazy what goes into getting building any type of thing in a lot of places either physical or otherwise. But you know like a lot of
otherwise. But you know like a lot of the regulations themselves are not the problem. Like if you've actually read a
problem. Like if you've actually read a lot of these things like like having non smog choked cities is great. Being able
to swim in like many rivers is great.
Like having like a lot of these things were progress. The problem is that it is
were progress. The problem is that it is really difficult for humans to deal with understanding and complying with this and that every time you have to exchange a letter with the government, you wait months. And if you could take a lot of
months. And if you could take a lot of the things that we've learned and kind of make them like totally frictionless, that would actually pretty cool. And I
think that um that I think is an under an underold story in in AI right now.
Yeah. until the regulators start spewing tokens back at us and then you start getting huge amounts of documents from the regulators that you have to comply and it's agent on agent wars. But
but that's basically what we have now.
Yeah. But but there is a fair fight.
Yeah.
I'd argue that's an improvement from where we are now. Like one of the terrible things right now is if you build anything physical, you have to get a building permit. It's like you're guilty until proven innocent. And the
worst thing I' that we've run into is the fire department because they have like the moral impremature of, you know, people pulling people out of burning buildings. And yet what they actually do
buildings. And yet what they actually do is just like screw with your design for buildings for months. And I, you know, if we could replace the fire marshall with with an agent that would critique
your your building plan quickly, um, even even if it's feedback was overdone, it would be massively better than the delays that exist today. When
Max was talking about this potentially being a good thing to have all this regulation, my my head went to the things that make agents successful is uh humans or other agents setting up the
right testing guard rails. Uh a lot of people are really excited about SLGOLO.
I don't know if you guys have played with that or like Ralph loops where you tell the model go do this and this is your exit criteria. Well, I'm telling Blake go make us all supersonic. your
exit criteria is that you've complied with all of these regulations. So
there's totally a world in where we say like the regulations are great. They're
like our testing, our test suite. As
long as this p passing these tests for one that's not incurring contradictions and the regulations are actually reasonable, etc. Like they're actually an awesome guard rail to have.
Otherwise, we would be shipping slop directly into into the air.
Yeah. But this is going to turn into a red queen race, right? They're going to have agents. We're going to have agents.
have agents. We're going to have agents.
I think we might have better agents, which is good, as opposed to have to do human versus human, but if anything, their cycle time, their response time may get lower. Like the app store is drowning in spam. I'm sure the patent
office right now is drowning in spam.
And so these agencies, they're going to be slow adopters of AI. They're going to get DDoSed, right, by clever entrepreneurs just overloading them with documents. It's possible that the
documents. It's possible that the approval time for this stuff might extend out as this suddenly get flooded.
It creates the opportunity to um I think really shift the model, the regulatory model. Imagine if we drove around a city
model. Imagine if we drove around a city the way we build things today. Before
you could go anywhere, you'd have to write a plan up, ship it to some regulator, you know, and your plan would have to specify we're going to take such and such a route and we're going to drive this speed limit. We're going to use our blinker and we're going to stop at every stop sign and we're never going
to run a red light, blah blah blah blah blah. And then 3 months later, you get
blah. And then 3 months later, you get back critique. It's like, well, we think
back critique. It's like, well, we think you should like drive on this other street. And eventually you get approval.
street. And eventually you get approval.
didn't go drive somewhere. It's insane.
You can never go anywhere. And yet that is absolutely the way we build physical infrastructure in this country. It's
guilty until proven innocent. And and
what we should actually do is make more of these things enforcementbased rather than preapproval based.
I mean, I don't know. I mean, I I don't want to be under too much. Like, if I ship a medical device to a lot of people, there needs to be it's like there's unknowns there. It's like we were responsible. We did clinical
were responsible. We did clinical trials. We reported all the data, but
trials. We reported all the data, but Max, this is this is why there's so little innovation in medical right now because the FDA approval process is a nightmare. Uh, in fact, the two biggest
nightmare. Uh, in fact, the two biggest advancements in tech in Silicon Valley in the last decade, AI and before that, crypto, they're both in the math domain because it's the last unregulated domain. And when they started regulating
domain. And when they started regulating frontier models and started regulating GPUs, that stops as well. You know,
Peter Teal laments about how there's no innovation in the physical domain.
what's been held back by just the huge regulatory barriers and you can always find a scare version like you vaccine or medical like famous ones, right? But the
regulations spread everywhere. The
tentacles are everywhere and there's all these different contradictory regulatory bodies. You saw how uh was it SpaceX,
bodies. You saw how uh was it SpaceX, they got sued first for for not having enough I forget what it was migrants or refugees or whatever, but they're not allowed to hire them by government regulation on the other side because
they're not citizens. This is not like logical code that has to compile in one place. These are madeup random
place. These are madeup random regulations all over the place. You
might comply with one state, you violate another state, you violate federal over here, you annoy this guy over here, that guy chooses to prosecute one out of 50 people who are his friend. It's it's
very arbitrary. It's very capricious.
And and moreover, like the idea that this makes like things safer, I think it's just a complete mythology. Like
just watch, you know, watch watch Boeing as an example. They certified the 737 Max, which had a single sensor that had complete authority over the nose up,
nose down attitude of that airplane. No
intern is dumb enough to think that's a good idea. And yet, it got all the way
good idea. And yet, it got all the way through the certification system. This
stuff doesn't actually make us safer. It
just makes us slower. Well, I mean, there's definitely dysfunction here. I
mean, I think that some of this makes us safer in the sense that the NRC makes us safer, which is that their job was to make sure that nuclear energy was safe.
They did this by permitting zero plants until I think like a year ago since the 70s. It will be perfectly safe if we
70s. It will be perfectly safe if we never build any of it. And I want to be really clear that I'm on the side of deregulation in on a lot of this. I
agree with Blake that a lot of this can be done a lot more efficiently. But I
also think it's a little too dismissive just to say it's like oh this is like the FDA or like even it's it's in the reg it's in the agencies in general. And
the problem is deeper to the degree that when the if the FDA approves 10 really important drugs, they don't get any credit for that. One patient dies and they get hauled before Congress and
yelled at. And so they have very
yelled at. And so they have very negatively biased incentives here. And I
I think the reality is is that this is reflective of the beliefs of the American people. There's this trade-off
American people. There's this trade-off here between the perception of risk taken in human subjects research and the rate at which we get new medicines. And
it is absolutely true that if we move faster on this, we would learn. It's
totally asymmetric and I think you're totally right, Max. If you approve a bad thing, your career is over. If you block a good thing, nobody notices, right? So,
it so it creates this asymmetric slowdown. And I think this is um I think
slowdown. And I think this is um I think that is the most important problem to solve in the regulatory state.
But but this is a very deep problem because it is this is where the voters are like and we we go and poll some of the stuff that we're working on in the future to understand kind of like where where the American people are on it. And
if you push too hard on this, like there are there are all kinds of ways you could work around it. You go to prosper.
There's all kinds of ways to try to go faster. But if you're seen as being a
faster. But if you're seen as being a bad actor, then you're rejected from the society that we live in. That is the thing that you need an answer for, which is deeper than just saying like, "Oh, well, we need regulatory reform."
You you have a deep point there, Max, which is uh it's the voters, right?
Yeah.
This where the citizens are. Like we
like to blame politicians. You'll see an X all the time, right? When people are like, "Oh, this politician, that politician, you a politician." like
they're elected, they're voted, majority vote, right? This is where the people
vote, right? This is where the people literally are. That's the package.
literally are. That's the package.
That's the bundle they've chosen. And
you may not like this constantiation, but if you were to remove this one, something very similar would take its place because the voters would just vote them right back in. And I think culturally it's very hard for most
people to understand what we lost, what we missed, right? So for example, like France, you know, there's a French entrepreneur on X lamenting that 57% of GDP gets sucked up by the government and so you can't create companies. But to
the average French citizen, that's not visible. They don't notice what they're
visible. They don't notice what they're missing. They just know they're slightly
missing. They just know they're slightly poorer than the US. The economist just did a little piece on economist is finally coming back around to being capitalists after 30 years. And they
just did a little piece on how the US is outstripping everybody and growing faster and getting bigger. But then they immediately turn around and say, well, it's because of the oceans, because of natural resources, everything but capitalism, right? They don't want to
capitalism, right? They don't want to say the dirty sea word. uh because you know for some reason all of these all of these uh magazines became Marxist at some point um but they can't they can't
envision or imagine what could have been if we had just been a little more lazare a little more open so I would love to see a true experiment among the 50 states you know different regulations different tax structures not
because right now the federal tax structure and federal regulations dominate everything but imagine you know you could go to some small state if you had cancer and you could try every drug that everyone was cooking up in caviat
mtor and you got to do your research and blah blah blah but this is known as the experimental zone um same way for drones same way for well aircraft is a little harder because you got to cross a lot of areas but do you think there's something magical
in there the notion of like innovation zones because we have we have a huge like nimi problem right uh but if you if you create like you know opt-in yimi zones they create some experimentation
framework and by definition it happens where people are consenting and you can try different rules or no rules or different ways of enforcing or you know innocent until proven guilty and then see what actually happens and what are
the innovation consequences and what are the safety consequences and then then the successes can spread but I mean to Naval's to Naval's point an innovation zone would not solve the problem in drug discovery um there so there was the
right to try act passed a little while ago we've had this pathway called single patient IND for a lot longer than that um the FDA like if if your doctor calls the FDA and says hey I want to give this
my patient an approved drug they give over 99% of those over like they approve over 99% of those. They can even grant them over the phone. Um the problem is that in order to dose a patient, you
still need clinical grade drug and the only entity with that is typically the IP owner who's in the middle of running a clinical trial. Like they're investing hundreds of millions of dollars into like making this thing. And the problem
is that the FDA, they'll draw an adverse inference if something bad happens to your patient who's probably really sick to begin with. And that's going to be seen as a property of the drug which is global, not related to your innovation
zone. And so there's kind of two
zone. And so there's kind of two problems. One is you need to get the IP owner to give you some of your drug which they're not going to do. And then
you need to prevent the global regulator from casting doubt on what might happen with their clinical trial if they give you some.
How would you address I mean I don't know your field. How would you address that in medicine?
Oh well I mean that in particular I mean this is just like a very inside baseball. I think the FDA has to be
baseball. I think the FDA has to be prohibited from drawing adverse inferences across different users of a capsit, for example. There's these like a bunch of specific ways that you could really accelerate innovation with a
relatively light regulatory touch by just um preventing this this kind of paranoia from driving our decisions.
Is there anything better than the FDA out there? Like what are we benchmarking
out there? Like what are we benchmarking these regulators against or is it not an interesting question because we don't have everyone follows the FDA.
So I'll give two two expansions to that.
The first is um Europe which is not really better than the FDA but they have a different system in that they've got these these notified bodies which are basically private businesses that are blessed by their host governments to
certify things whether this is trains or planes or medical devices and the notified body system uh creates slightly better incentives at the review layer because they can hire people they can grow there's competition among the
notified bodies they themselves have to be compliant with the conditions placed by their host governments for certification but it means that they can there can be many thousands more reviewers than you might have in the US.
The second thing I'll say is there actually is one approved getting paid implantable BCI today which is in China
and the CFDA is thinking for itself. And
they really do have a system that I think is going to give us a run for our money if we're not if we're not careful.
And they they they handle it very differently.
How do they handle it?
I mean the costs to bring a drug to market or a device to market are just much lower. I mean you can try things in
much lower. I mean you can try things in humans and you can try things on market like the so the problem the one of the things that I've spent a lot of time recently thinking about is like 20 years
ago we were buying far fewer laptops and phones each one was much more expensive now there's they're cheaper there's far more of them we buy more of them the total spending has gone up this is great stock prices of things like Qualcomm and
Samsung and Apple are way up everybody's happy they're using kind of the excess wealth generated by the phones and laptops to buy the phones and laptops um this doesn't happen in healthcare. In
healthcare, because you've got this reimbursement mechanism in the way where there's this kind of enterprise sale happening, the bucket of money that we use to buy healthcare is basically fixed. It is not increasing as there is
fixed. It is not increasing as there is more stuff that is producing better healthcare outcomes like we see in technological growth industries. And so
this means that the rate of spending on healthcare grows at roughly the rate of of growth of tax receipts. And so if let's say that like AI is booming and there's major advances that are happening and two years from now we're
spending 10 times as much on AI as we are now, this could be great, but if in two years we're spending 10 times as much on healthcare, this would be a catastrophe. And this is fundamentally
catastrophe. And this is fundamentally at odds with being a technological growth industry. And so as time goes on
growth industry. And so as time goes on and there's more things to spend money on that extend and improve quality of life for patients, like we can restore vision to people go blind in their 80s.
We might be able to extend life in like far past where it's been before. we can
restore capability to patients that are older and in worse condition, but like how do you pay for that? There's kind of this like omni problem in healthare which is all really related the same
problem which is just too expensive to bring these things to market and that's what China is getting at. The way out of this is not singlepayer or some revision to health to health insurance. It's to
bring down the costs so that someone can buy this with a credit card finance maybe like a car worst case. And to do that, we have to make it cheaper to bring these things to market. And
China's doing that. That that will allow them to sell these things for $10,000 on $100,000. There's no private market in
$100,000. There's no private market in healthcare. And because there's no
healthcare. And because there's no private market, what was the analogy people make sometimes? Like imagine
instead of going to restaurants and paying, you would basically go to all the restaurants and then at the end of the month, you would send all the receipts and all the bills to your insurer to the government and they would
reimburse you. Well, there'd be a line
reimburse you. Well, there'd be a line outside every good restaurant. Every bad
restaurant, you know, would be available. Um, the weights would be
available. Um, the weights would be terrible. The product wouldn't improve.
terrible. The product wouldn't improve.
You're basically running a small communist society inside a larger capitalist society. And that's what
capitalist society. And that's what we're doing in healthcare.
It's also what we're doing on roads, which is why we have traffic. Like, it's
the exact same situation on roads. It's
why there's, you know, there's no variable pricing for getting on the highway. It's why it's always clogged.
highway. It's why it's always clogged.
If you want to step on the third rail of healthcare for for a moment, think about this healthcare plan. Tell me what's wrong with it. Right? Imagine that the
first 20% of your uh annual income was your healthcare deductible. Doesn't
matter like if if you're broke and homeless, it's zero. If you're rich, you know, it's millions of dollars. Uh but
whatever your annual income is, the first 20% is your healthcare deductible.
And then the rest is paid by the government, the insurance system up to the usual caps that they have today. You
would create a private market pretty quickly. And so like in dental and
quickly. And so like in dental and plastic surgery and sort of a lot of optional medical procedures, you would actually get a competitive situation.
You get improvement. Like if you look at optometry, you know, with LASIC, you look at dental with like veneers and uh braces and all that stuff and kind of all the dental surgery stuff that they do. Or if you look at plastic surgery,
do. Or if you look at plastic surgery, like those fields do seem to be advancing because they're private payers. They have people who are, you
payers. They have people who are, you know, voting with their money. So we
need to do some equivalent of that in the normal healthare system. But people
lose their minds. They don't want to think one step ahead. They're like, "No, no, no. Well, what about the broke
no, no. Well, what about the broke person?" Well, the broke person has no
person?" Well, the broke person has no income. So, they're like, "Well, 20% is
income. So, they're like, "Well, 20% is too much for some people." Okay, you can put some deductible in there. But
generally, if you don't have some private market where people are paying a lot of the times for what are medical procedures, you're just not going to get this feedback loop that you're talking about. You're not going to get this
about. You're not going to get this ability to spend more money into the system. Right now, like very wealthy
system. Right now, like very wealthy people can't spend voluntarily into the system, but the prices aren't anywhere.
The rate cards aren't anywhere. the
system's not designed for it. It's like
if you go shopping for medical care and you want to pay out of your pocket, sometimes they'll quote you a price that's 10x what they charge the insurance company.
Have you heard Sid's story from GitLab?
Do you know Sid?
So, he was uh I mean had a massively successful IPO then was diagnosed with a rare cancer and has achieved has lived way past the prognosis, has really taken it into his
own hands. I think he went from kind of
own hands. I think he went from kind of he did frontline chemo and then there was one alternative that was available.
He exhausted it and the doctors were like, "We've got nothing for you." Since
I think like six or seven companies have come out of it, there's now 20 or 30 drugs in his escalation ladder. He's
still alive. Um years later, he's doing great. I saw him the other day and he he basically created his own like personalized medicines and uh treatment plan.
Yeah, there's there's a handful of these anecdotes that I've heard now. I it is really clear to me that at the high end if you just kind of have like you're not dealing with insurance you have the
resources you're like I want the full toolbox of modern science outcomes are possible that that like your normal like if you go and ask your doctor like oh what will happen if I do this they will just start shouting and
throwing things but it is clear that that much that like that crazy things are possible at the high end and I think that this type of like end of one medicine is actually going to end up being a really rich source of research for understanding how to build more
translatable things.
It requires a ton of agency from the patient in a moment where they're at their weakest, which is pretty uh ironic. My friend passed away from
ironic. My friend passed away from cancer and like last thing he wanted to do was research n equals one medicine uh because he was just, you know, like like
dying by the week. But this is where AI should really shine and come up with the right solutions and democratization of like what can you actually do when you find yourself in that situation. It's
kind of crazy how few people get access to this just from a knowledge perspective, not just monetarily speaking.
How much autonomous software do you guys have in your organizations that's running on its own or near autonomous and improving on its own? For us it's the a lot of the infrastructure is
already autonomous because we have uh we have this capability uh that fires off uh upon finding anomalies which I recommend everyone creates a version of this or Verscell offers a version of
this but upon anything happening that's anomalous today the mo most engineering organizations are responding to this by setting up alarms or uh like monitoring
thresholds by hand which is pretty insane but that's actually how the entire industry works you say if my error rate increases by this amount uh at this API endpoint do this so we've
actually uh automated a lot of the S sur job uh site reliability engineering so anything uh uh any metric that uh slows
down speeds up uh throughput changes whatever fires off an anomaly alert an agent investigates that an agent can decide to create an incident if the
incident is filed, people get looped in and the agent begins the process of remediation. We're doing everything
remediation. We're doing everything except for like the actual like giving the tools for the agent to like you know change prod but we're basically like serving solutions on a silver platter to
engineers and then the other thing that's working really well for us is just autonomous optimization processes um and autonomous security research. So
the other we open source this tool called deepseac it's [ __ ] incredible.
was like mythos, but you get it today.
Uh we run it against our entire monor repo using 10,000 concurrent agents in the cloud and it found basically several quarters worth of security research uh
progress was made in um basically a couple days and $14,000 worth of tokens.
So I'm talking about like months worth of red teaming, security research, entire teams of people. Uh and so we're basically now running like this periodic because the the other problem with AI is
that cyber security is becoming a nightmare. Um there's way too many
nightmare. Um there's way too many vulnerabilities, way too much work to do. There's too powerful adversaries. So
do. There's too powerful adversaries. So
you have to like basically be investing very proactively. We're running a lot of
very proactively. We're running a lot of autonomous security research. Um so S sur and then optimization work are very obvious. Uh you've probably seen on
obvious. Uh you've probably seen on Twitter there's people translating code bases from language A to language B.
Like a lot of the work that if you if you already put in the work to get a working program optimizing it or rewriting it in a native programming language or things like that is now becoming quite um quite doable uh with
with Frontier models.
I mean just for my own vibe coded app I built a uh bug reporting queue for my test flight users and they can report bugs from inside the app. It uploads the logs in a screenshot. And of course,
they use for feature requests, too. So
then I just have a simple Damon go through, compile all the bug reports. It
actually proactively analyzes and fixes them in the background. And then it ships me a test flight version to try out before I ship it to the testers. And
then for feature requests, I just have it right now compile them, but I could see an app in the future could literally be built by the users. Now, I'm not saying that's a good idea. It might be a mess, but at least it can take the bug
reports and stuff.
We should ship that, by the way, just to see what happens to the social experiment.
Yeah. Yeah. The social experiment, right? You you end up with like that
right? You you end up with like that Homer Simpson car where it's got an umbrella and like a flashlight, you know, a clown horn and so on where it's got every feature. But definitely for bug fixing, you could do that. We did in
a way a version of that experiment where uh I stopped all project work across the entire company for a week and said everybody from the receptionist to the engineers uh build whatever you think is
the most important thing to build. Uh
the only requirements you have to use AI and you have to demo it for the whole company when you're done. I expected we would get a large number of silly projects and a small number of needle movers. And what we got was a large
movers. And what we got was a large number of needle movers and a very small number of silly projects. And wow. Yes.
Yeah, that's a great experiment.
Yeah, two or three are like trajectory changing like they would absolutely change the direction of the company. But
the what this surprised me the most was literally the receptionist like the shipping and receiving associate whose job it was to like take packages off a truck and like email people when their
like stuff came into inventory uh build an automation for that and uh and that that we're actually using. The
conclusion I kind of is like, wow, like everybody has some idea of what could exist that would make the world better, but that many times their first order ideas are stupid and they don't have the they don't have the ability to project
that out and kind of see that it's stupid. But if they have the ability to
stupid. But if they have the ability to go from idea to an actual thing, if it's not working, they can react. They can
iterate. And if you give them a week, by the time they're at the end of the week, they've actually built something that makes sense.
But imagine if all work was like that.
like how can you set up a workforce that does not do the work directly? All they
do is train the agent that does the work for them. And we've done this as well
for them. And we've done this as well like you have to remind folks and you have to like create hackathons and hey let's build agents. Uh and obviously there's a lot of people there's a culture change happening like there are
a lot of people that are just coming in who intuitively know their job is to not work on the thing is to actually train the agent that works on the thing. But
I'm curious about like, you know, what what does the autonomous company of the future look like?
It could get a lot crazier. Maybe you
just turn on all cameras and the agents just watching everything that's happening. It see the shipping and
happening. It see the shipping and receiving thing is very inefficient and it creates the app presents the app.
Did you see that? Zach installed this thing into everyone's machines. He's
thinking about it. It's like we saw this too like we're we're um we're we're likely going to ship a feature into AI gateway that allows people to opt in into preserving inputs and outputs and
then you can say for all of my inputs and all my outputs can you extract the skills of the things that I like learn from my work and then dump it as skills uh so that I can even download them for
myself. But you could imagine people in
myself. But you could imagine people in in companies wanting to to share and and pull this together. It's funny because for me that's so unimaginable for my own work because my own work is not repetitive. I look for things to
repetitive. I look for things to automate. There's almost nothing left
automate. There's almost nothing left for me to automate for my own work. And
I and I hope that's where kind of everybody ends up, right? You just work in your maximum zone of creativity and interest at all times. And like if there is anything left to automate, you should automate it. Get it out of your life.
automate it. Get it out of your life.
It'll free you up to be creative and that's where you generate all the value.
But I think that's very hard to see in the job career mindset because you hire people to do the same thing over and over and that's going away and that's really scary because people like, well, what am I going to do? Well, you're
going to do creative things. You're
going to come up with new things and you don't have to come up with a new thing every day. That's impossible, right? But
every day. That's impossible, right? But
you're going to come up with a new thing once in a while that will then create something else, some point of leverage for you. But it it is it is a scary time
for you. But it it is it is a scary time for people for sure. If you've been doing the same thing over and over for 10 years and now all of a sudden it's like, well, now you're going to train an agent and automate it away, that's scary.
I think historically it was the returns were like 70% intelligence, 30% agency and now it's going to be 70% agency, 30% intelligence and that will that will
shift further as the models get better and better.
I'm actually not sure about that, Max.
I'll take the counterpoint on that. I
think it's 99% intelligence and 1% agency because then the agents will exercise the agency, right? You will literally be like, "Hey
right? You will literally be like, "Hey agent, I'm making smart decisions and thinking big thoughts. Just go implement stuff." In fact, sometimes I want to
stuff." In fact, sometimes I want to build features on apps uh that I'm flowing out of vibe coding. I'll ask the agent, "What features should I build next?" You know, go look at the logs, go
next?" You know, go look at the logs, go look at the users, what should I do?
To be clear, I'm talking about the returns to humans. um the the humans that will be best fit for the future will be the ones that are more agentic which is to say like the ones that can come in and just have the thought of like I'm going to open cloud and be like
what should I build versus watch YouTube and here's a fun experiment I'll bet you we all know a lot of people now who are coding who weren't coding before including many cases ourselves right so the number the percentage of coders in
the ecosystem has probably gone up by might be 10x right yeah it might literally be 10 times as many people are coding now than we're coding a year ago it's wild our signup numbers
are through the roof and there's this new class of people who are not engineers.
They just use the infrastructure. But I
think it might be like podcasters and YouTubers and like people posting on X.
The majority of people are still not creating code. Like I go to people and
creating code. Like I go to people and I'm like, "Oh man, vibe coding is so much fun. It's more fun than like I I
much fun. It's more fun than like I I had a little gaming group that I used to play video games and FPS's to blow off steam." I completely stopped playing.
steam." I completely stopped playing.
All that time went into vibe coding instead. It's more entertaining. you get
instead. It's more entertaining. you get
something real out of it and but the feedback loop is just as tight or even better. And I went to my other friends
better. And I went to my other friends and I was like, "Hey, you should be vibe coding instead." And they just gave me
coding instead." And they just gave me this blank look and I'm like, "No, no, you don't understand. Building things is so much easier." But I think to them it was always like some blackbox process in
the background. They never understood
the background. They never understood it. They assume maybe you were just
it. They assume maybe you were just talking to computer all along. So they
don't see what's changed. They don't
realize it's a lot easier. to them just that starting to Max's point the starting is so impossible to imagine and hard they don't do it. So we might have taken you know 0.01%
of the population writing code to maybe now it's 1% call it a 100x increase but 99% still never going to write code. So
we are in this weird space.
It's crazy. It's like it's a video game and it's a great video game but real stuff comes out.
Yeah. My fiance was up all night last night because she couldn't go to sleep because she was hacking on something and of course she wasn't writing any of the code. But it's just like it's addictive
code. But it's just like it's addictive in a way that programming hasn't been for me for like over a decade. It's
amazing because it's like a lottery for people. I think the normies normies have
people. I think the normies normies have gotten a little more into the vibe coding but through uh models that are more media models, video models for example, right? More people probably
example, right? More people probably fooled around making videos and images than they did writing code and apps. The
problem is like I don't video has its own issues, right? Maybe someday we like make me a great movie about X and I'll just spit out a good documentary, but right now they don't have the taste or the judgment. This is a bet that I have
the judgment. This is a bet that I have with uh Andre Carpathy was like what's the year that you'll be able to just dump in a book and get a movie out? I
think closer although I think he has come down substantially in timeline since we made this bet a few years ago.
By 2030 we're going to have like dozens of Lord of the Rings. Like there's going to be some fan who's like he did it wrong. I'm going to make my own take. Um
wrong. I'm going to make my own take. Um
like the famous stories or like the one of my other benchmarks for progress in AI is I'm a huge fan of of a series called The Expanse. Um there's a series there's a TV series and there's there's nine books and they've made the first six books but they haven't made the last
three books and there's meaningful divergences and I just I haven't gotten in I haven't read the books. Like I'm
looking forward to I can dump in the last three books conditioned on the TV series and be like generate the last three seasons.
This is coming.
That's a great feature. Yeah. But that's
in a way it's easy because there's already all this reference material.
When you said get me the next Lord of the Rings, I was really excited because we haven't really had a breakthrough in imagination.
Oh, we're going to see that in culture the likes of Harry Potter and Lord of the Rings. I I'm really excited about that and that will be the I agree that that will be the more exciting one.
What can humans uniquely do? This gets
back this gets to the core issue. What
are humans going to be able to uniquely do, right? And I think Max, you're an
do, right? And I think Max, you're an AGI maximalist. So for you it's nothing.
AGI maximalist. So for you it's nothing.
Agents will do everything.
I'm not like antihuman, but I just like I think it's going to be we will have to find like if your identity is how smart and creative you are, you're going to have a bad time.
Yeah. I I guess I'm still on the other side of that. I think that creativity is still the thing in the environment that surprises you. You step out of the
surprises you. You step out of the system and do something that wasn't even imaginable within the system. It's
outside of the training data. It's out
of the out of the distribution of data that was fed into the system. And I
think there'll always be room for that.
Have you noticed that every cla website looks the same? And and people like basically like dial in what a cla website looks like once you get enough generations out of the model. Like
there's a look it's this serif font.
It's brown and cream and they use monospace fonts with a certain amount of spacing. Like after a while you get this
spacing. Like after a while you get this this distribution that you say well this this is not creative. This is slop that came out of claude. It's not going to be
human versus computer. It's going to be human with computer versus just computer.
Just computer will eventually happen, but we're pretty far away.
But the computer is going to be able to produce these crazy super stimuluses that like it's going to be it's going to make the entertainment. And I mean, we kind of see a weak form of this in in
Tik Tok. Um, and so when you think about
Tik Tok. Um, and so when you think about the going like my my personal definition of art is meaningful out of distribution behavior. And so this is something that
behavior. And so this is something that kind of is surprising in some way. Feels
like you're kind of moving in the Z-axis. Like you're surprised that the
Z-axis. Like you're surprised that the thing was realized, but meaningful. Yeah.
but meaningful. Yeah.
Meaningful means that like it it somehow to me means that it somehow changes your like future trajectory through the universe. Like your life is somehow
universe. Like your life is somehow different for having thought about it and and reflected on it. Well, my
definition of art is completely different and leads to a completely different outcome. Sorry to interrupt.
different outcome. Sorry to interrupt.
No, it's interesting how just by your definition, you get to a different premise. That's the extrapolation of the
premise. That's the extrapolation of the axiom.
Yeah. I mean, one of the things I like about my definition is that it's so broad. Like, there can be like military
broad. Like, there can be like military maneuvers that you can be like, that was art.
And I think we're going to see this all the time. We're going to see move 37s
the time. We're going to see move 37s all over the place. Although, I'm
curious what your definition of art is.
I mean, I have multiple definitions, but so it's not like a concrete I haven't packaged into one thing. But I do think of art as something where you convey emotion. You convey something you felt
emotion. You convey something you felt to another person. And so you create some object or something or that that creates that that takes an emotion that you felt inside. And so to me, a
computer almost by definition is incapable of doing it. The exact same piece of art uh without intent behind it is sort of meaningless. Now you can also argue nature is art like beauty in
nature like you see a sunset, right? Not
let's say human. So that one I would call it's pure intelligence working without motive. There's beauty for
without motive. There's beauty for example in a sunset because there's an intelligence there. There's a complex
intelligence there. There's a complex system at work there and your brain recognize it and there's no motive there. So no ego gets involved. But art
there. So no ego gets involved. But art
in kind of the more human sense I think of as someone felt something and they wanted you to feel that thing or they wanted to feel that thing again or they wanted to capture the feeling they had with that thing and so they created the
thing. Attribution to who created it is
thing. Attribution to who created it is going to be really important. So like
for example a beautiful photo, right? If
a person takes the photo versus AI generates the exact same photo down to the last pixel, the person taking the photo will have more meaning for me.
I just invested in a startup that does verifiability with hardware at the station that someone some human actually took a photo which is going to have a lot of really cool use cases.
It will we will be drowned in slop. No
question.
Do you remember the control net stuff from like a year or two ago? There was
like there's one particular scene of like it was like a medieval village. It
had like a swirl in it. Do you remember?
Yeah.
That was AI generated and that was the one of the first times I looked at this and thought it was really cool. Like whether
you want to call it R.
But that one doesn't that one break your your premise because some human came up with the the training and the prompt to arrive to that really cool riddle. By
the way, it's totally possible that an AI can also do that in the future. But I
give whoever came up with that idea of the optic optical illusion control net, I give them more credit than the I think the bar is going to be raised massively. Like it's going to take more
massively. Like it's going to take more and more to surprise you. It's going to have to be more and more impressive.
Like Studio that's already happened.
Yeah. Like like OpenAI destroyed Studio Ghibli for everybody. Nobody wants to see that Studio Ghibli work ever again, right? It's been done. And so
right? It's been done. And so
Oh, that one also has I have a counter point to that one. Like have you watched real Studio Giblly? It actually looks so much [ __ ] better than the soft that open put out. Like watch it again now.
It's impressive.
Yeah. At the point where you've seen tons of Studio Ghibli things everywhere all over the internet. It is now in distribution. It's no longer surprising.
distribution. It's no longer surprising.
The art value has been around.
That's right. That's right. No, your
surprise definition still works. I just
think that that humans are the ones who can generate surprise completely out of the data distribution. And I think they can do it with intent and I do think intent matters for meaning. So to your meaning point, right, you said meaning
and surprise, right? Right. And I guess what I would say is that humans can steal the ones generate surprise out of the system. For example, let's say you
the system. For example, let's say you took an AI and you trained it to be perfect at mathematics, right? The
perfect mathematics AI and it's within the formal system of mathematics. And
then Kurt Goodle comes along and he has something completely outside of the system, right? Girdle's in completeness
system, right? Girdle's in completeness theorem. It was completely stepped out
theorem. It was completely stepped out of the system and used uh attributes of physics to basically break the system.
So that kind of thing I don't think an AI could get to. So there's always room for creativity outside. surprise and
then the meaning comes from the fact that a human was involved that they did it for a purpose and they conveyed something. So maybe I can interpret your
something. So maybe I can interpret your definition my way, but we'll see how it plays out. I'm a little more optimistic
plays out. I'm a little more optimistic about humans.
So if you train an AI model, it's trained on some data distribution. It's
trained on some tokens. It then learns some some distribution of language and the structure within that. Um, is it possible for an LLM or transformer to kind of go out of distribution, have
like a new idea that was not present in the training set somehow? Well, the
training sets are so large that it is hard to imagine ideas that are not within the training sets somewhere. But,
uh, if they exist, they probably lie in the natural domain in physics and interaction and feeling and emotions and evolution in things that the it's not subject to. So, I do think that there's
subject to. So, I do think that there's still things outside of language, but language does encapsulate a lot.
Language is a great compressor and we've got a lot of it.
But I mean you can get to these other things through selfplay that selfplay and sensors like cameras are sensors like our eyes are sensors.
Yeah. I mean I think the question is how do you what how do you go out of distribution without randomness? So in
the case of like RL you can get randomness like you can sample an action from a distribution of an action space and you can get randomness that can take you down these walks into new territory.
But I think the the real to kind of turn this around is like can humans go out of distribution? Where does any new idea
distribution? Where does any new idea come from? Are we also dependent on
come from? Are we also dependent on randomness to get us into these new territories?
We're not dependent on pure randomness like like natural selection works through pure randomness, right? Where
you just mutate a gene and then see what happens. But with humans, we seem to
happens. But with humans, we seem to have this ability to cut through infinite space and get, you know, just eliminate huge swats. And so our creativity makes sense within the larger
scheme of things. That seems to be one of our unique capabilities. And um maybe AI is starting to do at the edges as we're seeing with solving some of these math problems. But even math is a very bounded domain, but it's a big one. I'm
not I'm not saying it'll never get there. I don't have that confidence. But
there. I don't have that confidence. But
I think at least at the moment, I would say that uh truly stepping outside surprising people, that's still a domain of humans. And I think humans plus AI is
of humans. And I think humans plus AI is where it's all moving to. Like human
without AI, forget it. Pure AI, I don't think is there yet. But I think human plus AI, we're in that era. How long we stay there, I'm betting is longer than people think. I think humans will have
people think. I think humans will have an enormous amount of value. Um, in
fact, more value. All of us, everyone here, our productivity has gone through the roof. And basic economics normally
the roof. And basic economics normally says that when someone's productivity is higher, they're wealthier. They're
better off. You actually hire more of them, not less of them. Maybe some of you are not hiring junior people anymore, although I don't know if that's necessarily true. I don't think of it as
necessarily true. I don't think of it as junior versus senior. If someone is really good with AI and they're really smart and creative, I want to hire them more than ever because the leverage I'm going to get out of them is incredible.
That's a new requirement. And we're
hiring juniors and super seniors as long as they're really good with agents and really good with AI and and quick to adapt.
And a lot of them don't need to be hired anymore. They can create their own
anymore. They can create their own thing.
My my hypothesis is we end up with a larger number of smaller teams. Like the number of people required to accomplish the given task drops by a lot. And you
know people who only see first order of facts say oh my gosh all the jobs are disappear because I can do I can do a jet engine with two people. I don't need a thousand you know 998 jobs are gone.
But what it actually means is you can create a lot of different chat engines.
I think that's exactly right.
I think there will be goes back to Naval's point. I I think the thing
Naval's point. I I think the thing that's uniquely human is the creativity and what's been missing for you know a lot of people can be creative but they don't know how to turn their vision into
a real thing that's changing. So I think we're have an explosion of an of entrepreneurship explosion of founders and a very large number of very small teams because you don't need many people
to accomplish something.
Yeah. I think like look AI provided uh base level intelligence and uh domain knowledge uh and cut through all the jargon and then now agents actually
provide a lot of agency. So the main things left are creativity, taste, and yes, you need enough agency to get started, agency to stick with it, but you don't necessarily need the agency to like spend 20 years learning one thing
before you can dive into it, make a contribution. And so that barrier going
contribution. And so that barrier going down, generalists are having a field day. And at the end of the day, we're
day. And at the end of the day, we're all generalists. All of us like to think
all generalists. All of us like to think about everything. We don't like to be
about everything. We don't like to be just trapped in one thing. Like Max is here talking about consciousness and the FDA and brain science and creativity.
Like all of us are trying to think about everything all the time. And so, uh, people in Twitter who are always fond of saying like experts, credentials, sources, right? Those are the guys
sources, right? Those are the guys getting hurt because the expertise doesn't matter. You spent 5 years, 10
doesn't matter. You spent 5 years, 10 years getting a PhD in XYZ, you know, hopefully develop your creativity and your instincts and your taste and your judgment because if all it did was help you memorize a whole
bunch of things and jargon and, you know, learn some scaffolding stuff, well, AI will cut right through that.
It's like a, you know, calculator times a billion or, you know, bicycle for the mind but accelerated. So I I think it's about people with AI versus people without AI. And so the single best thing
without AI. And so the single best thing you can be doing right now for yourself is just getting really good with these tools, getting comfortable with them and always knowing the edges of the boundaries of what they're capable and what they're not capable of. And that is
a moving target.
Loading video analysis...