Why AWS and Azure Cannot Run Autonomous AI – Ivan Burazin (Daytona)
By The MAD Podcast with Matt Turck
Summary
Topics Covered
- Agents are digital knowledge workers that need their own computers
- Cloud infrastructure was built for stateless apps—agents break that model
- Models don't learn on the job—they only have context
- Rapid response and proactive communication are the entire solution to customer support
- 60ms spin-up and 50,000 sandboxes in 70 seconds—the infrastructure arms race
Full Transcript
We are a part of this like super cycle right now and the super cycle does not last forever and so if you're going to pause by the super cycle you are steing
market like that is what you are doing I would ask claw can you go fetch the data from our bank and then it was like oh yeah just log in and give me access like login give me no I will not give you
access right away fundamentally for me that was like broke the entire thesis of it so you give it its own machine when I think about agents I think of them as
digital knowledge workers. And to do anything as a knowledge worker, you do need a computer. My argument is that every agent will need at least one sandbox, sometimes more.
Hi, I'm Matt Turk. Welcome to the Matt podcast. My guest today is Ivan Borisin,
podcast. My guest today is Ivan Borisin, CEO of Daytona, one of the most talked about startups in agent infrastructure.
If you've been hearing the word sandbox in every AI agent conversation and quietly wondering what it actually means and why it suddenly matters, this episode is for you. We go from first principles, why does an agent need a
computer at all? All the way to the deep technical end, why Daytona had to throw out Kubernetes and write their own scheduleuler, and why a global CPU shortage might be coming faster than
people think. Ivan also unpacked the
people think. Ivan also unpacked the full agent stack as he sees it, models, sandboxes tools MCP memory orchestration, and where each piece is heading. Along the way, Ivan shares some
heading. Along the way, Ivan shares some really interesting lessons on go to market and distribution for technical founders that he has learned through 16 years building developer tools startups.
Please enjoy this great conversation with Ivan from Daytona.
Hey Ian, welcome.
Great to be here.
So you have said that every agent needs its own computer. What's the simplest way of explaining that idea? Well, when
I think about agents, I think of them as digital knowledge workers. And to do anything as a knowledge worker, you do need a computer or I should say anything
sophisticated. So, you and I can be on a
sophisticated. So, you and I can be on a conversation and we can get something done, but we usually need some sort of tools in our world is usually a computer to get higher productivity um and to be
able to do things. And so, I think of that in the same lens. So, for me, agents are literally just digital knowledge workers.
Great. And so it's on computer. That's
the the whole concept of of sandbox. So
as an introduction to this whole conversation, what is a sandbox?
Absolutely. So sandbox is although they the term sandox first comes from isolation. So making sure that there's a
isolation. So making sure that there's a secure place for in this case an agent to run and do things. But on the other side it is essentially a computer. It's
a full-on computer that an agent has ability to install tools, access the web, um, run scripts, run code, whatever it needs to get its job done. So, the
shortest answer is a sandbox is essentially we call them composable computers for AI agents.
Yeah. And is in some ways like this whole open claw and Mac mini like a good analogy for what a sandbox is like the Mac Mini sort of being the sandbox.
Exactly. So I think the openclaw Mac mini thing helped a lot of people understand what we actually do. It's
like oh I get it now. I get it right. So
it needs a computer in this case was a Mac Mini to be able to do different things. So yes that helped a lot with
things. So yes that helped a lot with awareness for sure.
Yeah because uh again to unpack it like the uh open claw as as a as a frameworking system to help an agent do all sorts of things on your computer.
basically you want to be able to kill it if it goes rogue and therefore you can unplug the Mac Mini the way you could sort of kill a sandbox. Is that is that kind of so there's there's a couple things that I personally thought about
and so my open claw runs on a not a physical Mac mini essentially but a sandboxed um virtual Mac mini and the reason why I didn't so the way people
usually run these things and cloud code or openclaw is usually on their own computer because it helps them you know organize emails you know search whatever
documents they have and whatnot there there's a one there's a high security risk there because at one point we were doing our board meeting presentation and
I would ask claw can you go fetch the data from our bank and then it was like oh yeah just log in and give me access and I'm like log in give me no I will not give you
access right um and so right away fundamentally for me that was like broke the entire thesis of it so you give it its own machine I personally gave it its own like Daytona account it gave it its own phone number and the reason I have
give it own phone number is because it has to do 2FA to get into the bank. Like
there's no other way except for a 2FA with a phone number for this particular bank. And so it has to have all these
bank. And so it has to have all these things like a employee like a digital employee to be able to go and access these things. And so the risk now that
these things. And so the risk now that now that it has its own computer, it has its own account. These accounts have the limitations. So it can only it can only
limitations. So it can only it can only look at the data from my bank. It can't
spend the money from my bank. And so
except for the credit card that we did give it which has you know you know $100 a day or whatever it may be. And so the only risk you essentially have at that point if it's in a sandbox is will it take this data and now leak it somewhere
and that's something we can talk about later but essentially the worst thing can happen is that and to your point you can kill the whole machine if you need to kill machine.
Yeah. And there is a a fundamental concept of um stateful versus uh stateless. Uh can you can you unpack
stateless. Uh can you can you unpack that for us?
So that basically comes when we talk to people about what we are building they're like oh doesn't this already exist in any of the hyperscalers right?
And so the answer is no. And the answer is no is is because everything that they were built for which was deploying apps they were they were stateless. Like if
you have let's let's pick any of your websites or whatever you want web apps you do not want that to change on the fly. You if you if you are the company
fly. You if you if you are the company will pick like eBay because they're in this building so they're they came to mind. And so if you're eBay and you're
mind. And so if you're eBay and you're an engineer there and you're like, "Oh, you have this new, you know, update. A
button is here or it does this thing."
You want that state that you you don't want that to be changed on the fly, right? The database might change,
right? The database might change, information might change, but you don't want the app to change, right? And so
with those things in mind, that is how people had built the hyperscalers. Like
that is the fundamental architecture you came into and built on top of that. And
the simplest analogy I give people is let's say you're building a truck, right? You are a factory for a truck.
right? You are a factory for a truck.
There's like the the weight, the type of the engine, the chassis, um all of that is made to very slowly but surely
securely transport some sort of goods, right? On the other hand, you can have a
right? On the other hand, you can have a sports car which is still, you know, it has four wheels, it has an engine, whatever, but it's made for different things. It's made to go very fast. So
things. It's made to go very fast. So
the way the chassis is built, the way the, you know, um engine, the the weight balances is completely different. And so
you can do you can do a lot of things with both, but they're not fundamentally the same thing. And as a company trying to build out both of those, they're separate platforms completely, right?
Mhm. So sandboxes are a fundamentally new primitive. Is that is that correct?
new primitive. Is that is that correct?
Is is it like a history of sandbox or did sandbox exist before? So I I would argue I would say that that the the person that could have nudged although he said he did and someone else did but anyway there was a company called code
sandbox way back in the day when we used to compete with our company code anywhere which also was in the in the realm of like replet it's all these cloud-based IDs and so they called it code sandbox cuz I I I believe it it
sounded cute it's like a box where your code you for ID um lived and they were actually one of the firsts so the team there that actually used microVM M did
snapshotting, forking, all these things that we do use today. And so that is sort of and this is maybe a decade ago,
but the utilization or the usage from or the value that it was giving human developers was not there. There's this
whole article about like the end of local hosts and people have been talking about this for I've been talking about this for 20 years. Um and basically developers would say like you'll take my local host like out of my cold dead
body. But now that agents are here,
body. But now that agents are here, local hosts no longer actually one, you don't want that for a number of reasons.
And so we're finally getting to that. So
that technology and that thesis around sandboxes originally now seems to be coming to fruition.
Mhm. And the big acceleration around sandboxes is is agents, right? That's
thing. Absolutely. When we think about agents, so one, if you even if you do run an agent on your computer, like you want to do that, up to you. Go ahead and do that. There's a bunch of problems.
do that. There's a bunch of problems. One is let's hear in your laptop.
There's this whole thing on Twitter now where people are holding their laptops open.
There people what is that? Yeah, I saw you tweet the Well, that's because I always hold my laptop open. That's just like my vibe
laptop open. That's just like my vibe without that. Um but been doing that for
without that. Um but been doing that for a long time. But people do that because they want clawed or open claw to finish.
Yeah.
Cuz if you close it, it's Yeah.
is you terminate. Yeah. It's or pause or stop or whatever. Um so like the problem is like the continued the ability to work non-stop is not there as long as your laptop. So that's one thing. The
your laptop. So that's one thing. The
other thing is you can't do concurrency.
So you can do multiple to the amount of compute that you have in your laptop which will be quite limited. And so you might want to spin up 10 or 20 or 50 or 100 or 100,000 whatever that number might be. And so that's very hard. So
might be. And so that's very hard. So
ideally you actually really want to remove it somewhere remotely. So you can start on your laptop, you continue on your phone. It's the same computer and
your phone. It's the same computer and same agent that is doing their thing, right? And so that is a takeoff of we
right? And so that is a takeoff of we have now decided that it's absolutely okay that is no longer on our local host but again it is still a local host to that agent. So it's still a computer to
that agent. So it's still a computer to that agent what a laptop is to us.
Do all agents need a sandbox or is that a specific category?
My my argument is that every agent will need at least one sandbox sometimes more and we can get to that again. there are
places where you don't need and I analogize agents with humans for the most part and again we can have we can do productive work without computers and so there is and that was the original
sort of like chatbot where basically you would just talk to an agent and it would inference so we just think and give you value so I don't know you a lot of people use it for like emotional support or whatever for the most part it doesn't
need a computer it has enough data from you going back and forth and then it can sort of understand and give you feedback but that's a smaller subset. If we think of where all productivity gains are
among the biggest verticals, healthcare, financial services, whatever, most of that is done via a computer. And so if you want agents to do all these things, then agents will need these computers to
do.
So if you do uh tool calls, coding, like any of those actions, then you need a sandbox. If you just chat,
sandbox. If you just chat, you probably don't need it. Yeah. So
like even if if you have a chat we can get it to deal but if you chat and it has to search the web it still has to open a browser or it has to use something like you know parallel or exa or whatever to get to that that would be
a tool call. So it depends on the use case but the thing that's quite interesting about the world that we live in is one let's take a step back one I
firmly believe that in due time all the tools will be headless now will it be inside a sandbox or outside that's another thing all of them will be um and
that's the most efficient way for an agent to work but most of knowledge work is still locked into legacy apps inside of inside of Windows for the vast majority, like the absolute vast
majority. Um, so if you want an agent
majority. Um, so if you want an agent today to do a job end to end, you literally have to give it a computer.
And so an example of this is again the board report, which is like, oh, can you pull out this report and our bank has an API and it can pull it out, but it only has spend on the API. It doesn't have um
incoming revenue on the API. It's just
not exposed or through the MCP tool actually. It's
not exposed. And I'm like, then the agent was like, "Oh, I can't get to it."
I'm like, "Dude, log in, download it."
And okay, I'll go to it. And then you can see it opens up a browser, it goes download whatever. Same thing with, I
download whatever. Same thing with, I don't know, with any other data source that we have. If it can pull it from the headless, it'll do it headless. If it
cannot, then it sort of logs in and do that. And so, if we want to give them
that. And so, if we want to give them power today and get value, we have to enable them to have these tools.
You had a great tweet the other day where where you were talking about like the actual use cases. uh that you see as a provider of sandbox of agents. Do do
you want to go through this? You had a code command execution computer use browser use and RL environment infra. Do
you want to unpack that?
Sure. We basically I've made it I I think I've structured it better since that tweet which is right now we have two major use cases or two types of customers consuming Daytona in different
use cases and one is on the researcher side. it'll be you know RL eval
side. it'll be you know RL eval benchmarks and the other will be on what we call background agents or long running agents and so when you think of background agents or longrunning agents
like the the most popular those are where a human is the end consumer the human talks to a let's call it app layer service that has an agent in that and then the agent will call in the sandbox
so things will be you know think of like Harvey or perplexity or whatever as or lovable as these types of background longrunning agents and so they can both
be uh sort of headless so code and command execution and or computer browser use. So depending on what they
browser use. So depending on what they need to do and the same thing is on the researcher side where it's like RL valves and whatnot you can do RL valves for you know coding and for that it's
basically just headless like commands and command execution in there or you can actually teach it to do things in the real world and then it does have to fire up a you know Windows a Mac a Linux
sort of desktop or a browser to go through the things.
code and command execution and browser computer use are like two ways an agent can work and then it's to basically the consumption of the sandboxes to do those two different things are different.
So that's a great um sort of sandbox 101 introduction to to to the concept um help us understand uh where sandboxes
fit in the overall picture of this emerging uh agent stack that uh I think everybody's trying to figure out at the same time. So there's there's different
same time. So there's there's different components, there's uh file systems, there's orchestration, there's like what what are the different pieces?
I I try to think about everything. To
me, it's actually quite interesting where and we'll get this a bit later as well. It's like a lot of this all exists
well. It's like a lot of this all exists in real life today. And so people are like overthinking this. I'm not saying that there there's not going to be new products and solutions and technology to solve it, but it's like it's not a new
fundamental way of work. And so when you think about the agent stack itself, it's like okay, you first have the models and the models are essentially the brain
that that is what it is of equivalent to what a human's brain is sort of. So you
tell it something, it replies, it understands and whatnot. And then under that is like oh what are the tools it can do to get things done, right? And so
that can be anything from like any of the MCP or or tool calls can do. It
could be the sandbox, the computer, like whatever we as humans, we also have a bunch of tools that we use. Everything
from a hammer to a computer and everything around that, right? Um, so
all of those things exist as well. Then
there is memory that exists. Can can an agent remember these things? Similar to
like do you remember these things that are there? People talk about
are there? People talk about orchestration of agents. It is like management. Do you manage people? I
management. Do you manage people? I
manage people. Managing agents is similar is not dissimilar to managing humans. Now what tools will you use to
humans. Now what tools will you use to manage them? Will you manage them in the
manage them? Will you manage them in the same way just send a slack message and or you do linear or whatever it is we will see or it's going to be a net new app we'll figure that out but it is of that and of course it's like
observability can you see what your teammate or colleague has done and verify that that job is good right and so there's tech that's sort of how I think about that on a very sort of
higher level and then you can break down to different types of solutions people are creating for this yeah let's do some of that the things that I think about like
there's also like things that are more solved and less solved where like the models I'm not saying they're solved that they won't continue to progress.
There might be different versions of models but like it's very clear that there is some sort of brain that is there and we have the leaders and the frontiers and we have people trying to do different things and that is all said
and done. The tooling I don't know if I
and done. The tooling I don't know if I would spend too much time we that's all we're talking about now which is like the sandbox the computer the MCP tools there's like a bunch of them that that
are being there. The thing that's not solved very well is one so there's memory itself and so memory itself like how do you do this right now the best
solution is just dumping things into MD files and then you know sort of having access to that and then you know compressing that and how you're going to get that I I we talked about this the other day which was the book of why we
sleep right um and in that book of why we sleep it's basically a lot of the how you do data retention and things like that and that's sort of what we're trying to solve for agents agents
themselves into that segment and so memories there the thing that is actually quite interesting for me is actually the learning of the model one thing that I didn't understand
internalize is that models actually don't learn right now so you use a model and if even if you solve memory it has memory of things so it has context of
these things and so um it can oh here's the context and so it can have a better answer because it has the context but it doesn't actually learn on the job that
is done yesterday right and so who is solving or how do we solve that does that one does that require something like you know constant RL learning post training for this or is there something
fundamentally that we change in the model so the models can do that I don't know but it's definitely something that will that's hindering progression today and it's not absolutely clear how we get
to that point because it sort of weird to work with someone that actually doesn't get smarter, right? Like you get a new model, but like it's like, oh, it's like a new new one that comes out,
but like you do something five times and it can mess up. It literally screws up the sixth time like right. So like every single time really and so you can try to prompter better. You can have more
prompter better. You can have more context so it doesn't do it but doesn't actually learn. I think that is
actually learn. I think that is something that's going to be quite interesting to see sort of how that that sort of progresses.
Yeah. And from your vantage point, memory um is one of those markdown files or like a
combination of those versus a database.
Seems like markdown files are incredibly elegant and simple but somehow feel less robust than what a database would offer.
I mean not from a personal and I might change this from a personal perspective.
I I also would tend to agree with you that like markdown seems quite trivial for the for the problem that's there and the compression and and how you get to all these things inside of that. So the
thing that we try to do as a company providing infrastructure for these things is to make sure that we can expose the ability for the agent to
access these things in a very simple way and add to them but solving it itself is outside of our let's call it mandate and in this whole um kind of agent
harness I guess everything that you describe kind of falls under the current uh concept of of of harness. Where do sandboxes fit long
of harness. Where do sandboxes fit long term? Do do you think a lot of this gets
term? Do do you think a lot of this gets eventually built into the sandbox or does the sandbox remain the the the execution layer for it all?
Two two things. So if you look there's like the sandbox itself and again restating I'm saying it so many times in in this conversation but if we take like an average worker in Goldman for example
right you have the you have the worker the person which let's call it the model in this way you have the computer which it logs in into um and the harness is a
set of the way it shapes that model that it can and cannot do things interact things so it's almost like hands sort of to speak um of that bit bit deeper but
basically the sandbox does support it and so if you think of a computer again I'll pick on I'll pick on go Goldman for example I've never worked at Goldman but it's my just assumption I worked at bigger company so my assumption is that
when you log into that computer there is so much software in that computer that logs what you do restricts what you do make sure that you don't leak data and do these things again no prior knowledge
of Goldman just assumptions on these things and so those are the types of things that we as a sandbox provider will most certainly incorporate into
that. But that harness still has its
that. But that harness still has its function in that which it guides the model like oh I now know how to interact with this machine. Oh, I know how to do
a tool call. Oh, I know how to this is how I work with memory. This is how I change with a model. And so there's a there is value in that. And so we don't take on the harness. That is something
we pretty sure that we never do. But
there's things that we do in the sandbox that does help or support that entire system.
Great. We talked about models. Uh where
do the model providers like the big um AI frontier labs uh fit in this overall picture? OpenAI recently had an agents
picture? OpenAI recently had an agents SDK announcement. What what is that?
SDK announcement. What what is that?
Yeah. So I mean they they have a big push into this this space where like claude code and their agent SDK um has been and so it's a focus for them to
essentially catch up um on that segment of of the market there. So
there's two things right there's uh OpenAI agent SDK and then cloud manage agents or they're different. Yeah they're
they're different. Yeah they're different. Yeah unpack that for us. the
different. Yeah unpack that for us. the
anthropic managed agent. So it is a managed service where you essentially have the model, the harness and the sandbox all wrapped into one basically and so you have that all managed for you
as a service that entire stack. So that
is there. Whereas if you just have an agent SDK, you can use that and run that into any sandbox provider or your own or any machine that you want to run it and
it can connect to depending on the licensing or whatever maybe you might connect it to that model provider that gave you that harness or you might be able to interchange uh those things. So
basically the one is just a harness the other is the model the harness and the the sandbox al together.
Okay. And uh I believe Daytona was a partner to open AI for agents SDK, right? Like one of the sandbox
right? Like one of the sandbox providers. Exactly. That's part of that
providers. Exactly. That's part of that original framework. Okay. That's the
original framework. Okay. That's the
sort of like overall um landscape uh around the agent stack. We're going to go uh much deeper into sandboxes and how that works from a technical standpoint
in a minute. But as a quick detour, let's talk about um your story and uh Daytona's um story. So you this is not
your uh first venture. Talk about the the the prior thing that you did. You
alluded to some of it like uh that was a little little bit like replet.
Yeah. So we started the cloud IDE space basically in 2009 both me and co-founder and so in 2009 for those of you that were around there was like no Docker, no Kubernetes, no VS code and so we had to
build the entire stack which was something that we learned how how to do and something that we applied today. So
like it was actually very very very early. The only I say that we started it
early. The only I say that we started it because the company that started before us was Heroku which became Heroku and killed the ID which we probably should
have done sooner um as well. So but we were the only one that sort of like started and finished finished in that in that sense. Later on you had other ones
that sense. Later on you had other ones like code sandbox like replet and like others that that had had joined and um stack blitz which now bolt and whatnot.
And so Replet also now changed and bolt into their other directions and Replet is doing really well. We decided to go in a completely different direction when we decided to do their next company. So
um we had learned a lot of things on how to build this entire stack. But when we decided to kick off Daytona V1, which is different than it is today, we decided not to go app layer but just be
infrastructure probably like on the teachings of Heroku which definitely went in into that direction.
It's like oh we learned how to do all these things on the orchestration level underneath that seems to be very very valuable. So let's push on that and that
valuable. So let's push on that and that is how we kicked that off. you you
tweeted uh my first startup taught me exactly what not to do at Daytona and talk about like selling to developers new onrem solution like talk about some of we learned a lot of things we one thing
we learned is timing I never understood like timing of the market like you have to pick it's very hard to time the market but you know if you're in the right time like fairly soon um ideally you want to be just before the time but
that's like very very hard you definitely don't want to be completely wrong we were like two decades wrong um for decade and a half. So definitely
wrong on that one. The other thing is like I did not know the difference between a user and a customer. And so
selling to de Daytona now is a PLG motion 100%. The code anywhere product
motion 100%. The code anywhere product was also a PLG motion but the code anywhere product had no enterprise use case value. It was all single developer
case value. It was all single developer value and signal developers do not want to pay for these things. Whereas a
product like what we are it is a PLG motion. it gives value to a single
motion. it gives value to a single developer but a single developer probably working inside of a of a company and so they will pay with their company's card which is very very very
different but you don't know when you start out so um those are definitely things that we um understood when we were creating this company and the difference we didn't get into detail but the difference between Daytona v1 which
was managing it was an infrastructure product for human engineers in very large enterprises it was an enterprise play whereas Daytona V to the sandbox
is a we can call it a neo cloud like it's a new cloud although neo clouds are usually for GPU clouds we did understand that there was an enterprise play to be
there so everything that we had learned of like onprem multicloud different ways of u managing observability audit logs all these things that you need for
enterprises we had already either baked in or understood that that would be needed so we prepared the product for that originally.
Great. You're also pretty amazing at just distribution in general and and then building a brand and then developers uh which is fascinating, right? Because that's uh one of the
right? Because that's uh one of the typical issues with very technical ventures. The people tend to be
ventures. The people tend to be excellent and deeply thoughtful about product and technology and then distribution comes as an afterthought.
How did you become good at it and what are some lessons you can share for technical builders? I mean it's all
technical builders? I mean it's all about like I'm I probably sucked at all that like terrible terrible like I was the were so let's take there's so many
ways to say how we did. So one thing is I think I very well understand humans at scale like my like what the
market wants and or needs and or feels.
Um and so when you do that it's like I I did not know that inherently but you sort of start learning that you understand humans. I it's very hard for
understand humans. I it's very hard for like single humans not so much but more like larger massate humans aggregate humans much better um so that's one thing but the other thing is I was like
super shy geeky whatever as as a child and the the stage fright terrible terrible like pitching my first startup like I would be so nervous like I would be I had to
like talk on stage five minutes before stage I'd go to the restroom like 10 times like just like of the pressure of these things and the thing that happened
quasi randomly is with our first venture code anywhere we did pitch around all these like conferences around the world and whatnot and it was such an interesting thing that I decided in like
with one of the founders of one of these conferences at the afterparty quite you know intoxicated probably he's like oh do you want to do one in Croatia where I was living at the time and he I'm like
yeah let's go do this and so I go do I set this up this conference it's in Croatia 250 people um which was really big for us at the time. And I had hired
an MC that had bailed last minute. And
so there was no MC. I have stage right.
You're it.
I'm it. I have to go. And so you I was literally the MC for two days in a row.
And you broke straight like you break it. Like you just have have to do it. Um
it. Like you just have have to do it. Um
so what I'm trying to say this is very different from the go to market but it builds on it's like okay now that I no longer have this stage fright the you start teaching start understanding what
interactions excite people less excite people how to bring people and whatnot and it's also a problem because when we said we're going to do a conference I didn't understand what a conference was and so it's like how do you break down what is a conference okay a conference
is it's entertainment first and foremost and so you have you know the show the show are the speakers how do you get the Then you have the this for the speakers you have to sell the audience. Who is
the audience? Who's coming there? How do
you get them there? What? And then some has to pay for everything. The audience
does pay for tickets, but the vast majority is under sponsorships. And then
how do you sell that to sponsors? And so
you break that those things down and then you start understanding incentives of all these different parties and you start putting that together. And so our go to market motion when we and we did
conferences for a decade more or less.
So you sort of like you have iteration cycles of a year then we started doing two a year and we started doing three a year and then every conference got better cuz the iteration cycle was was much faster to get the the feedback loop
was much faster and then when we decided to do like Daytona and our whole go to market strategy was originally around in real life events and so we did dinners
meetups drinkups hackathons but all of them were simpler versions of conferences and mind you the conference that we had was like 4,000 people so it's like it's a fairly big. It's not
100,000, but still fairly fairly big.
And I remember telling one of my teammates that works on the conference business with me now at Daytona. I'm
like, there's no way in hell I'm ever doing a conference ever. Not doing it.
That's the most stressful job in the world. Never doing it again. Never. But
world. Never doing it again. Never. But
we'll do these little things cuz it's easy. And then people will come up to
easy. And then people will come up to us, how do you guys do these events? How
do you do these meetups? Well, when you understand how to do a big conference, like what are the interested parties?
Then you know how to do a like very small meetup. It's the same things
small meetup. It's the same things apply. It's just like quite a lot
apply. It's just like quite a lot smaller, a lot easier to do. And we
ended up doing a bunch of these. They
were all We didn't sell obviously these events do on sell, but sold out in the sense of like there's no more space in the rooms that we're doing. We now like attract like partners that you know co-sponsor these events for us. Just a
great motion for us. And then at some point my my my colleague was like we have to do a conference like no we have to do it. And so we ended up doing that in the Chase Center, but we ended up doing in the Chase Center in San
Francisco two months ago. I think you were there as well. You helped out and the the whole which was uh incredible by the way. So when you think about marketing as a you know an effort
to make a company I mean create a brand and make a company look you know big and powerful. Um that was a
powerful. Um that was a you know a masterass in how to do that.
Thank you. But all of it all the entire GTM that we have we have no other things that we do like Twitter is a go to market um we don't have we have zero sales people and whatnot but it is definitely so the way I think about this
and I stole this from or paraphrasing it from David from century which is like there's basically three ways that people pick your product and one is awareness do they even know you exist like if that
doesn't happen there's no way they can pick you two is preference which is you know the the pricing the brand, the person, the different features, whatever
like it's it's preference. And the third thing is like is there a deterministic thing that you offer that no one else has? So the easiest example for the
has? So the easiest example for the third one is like do you have fed ramp right? Do you have that certificate? If
right? Do you have that certificate? If
you have like I can only the customer can only use you and no one else. But
the two above are are quite interesting which is like how to make sure that everyone knows who you are and then you work on the um then you work on the
preference. And so for me the all I
preference. And so for me the all I don't it's not all I think about a large part of I think about is if all things were equal. So if our product is equal
were equal. So if our product is equal to everyone else's product what is the differentiator to that? And so like one can more people know about you than
others like is the branding there is the the the um the experience there. uh
experience is a nuance which p some people don't think about, some people do think about, but it's like do you prefer the feel of this product versus that product? And these are all things that
product? And these are all things that have nothing to do with the actual technical capabilities of the product.
And so my co-founder Verran, our CTO, like his job is mostly to make sure the product is better than anything else in the market. And my
mandate is to make sure that even if his if our product is equal that we can supersede that and so that's how I think about the go to market in general.
Do you think of uh customer support and customer service as all of that it's all go to market all all that is go to market and and we've seen this we've chatted about this where
we we've been getting users and customers just because we are so good at that and so all of it is an experience.
So if you think of any experience as a human you go to like store restaurant whatever it is the entire experience it is from your the way what is a brand it's the
perception of the that brand itself is just the perception and the perception is like if you go into whatever store pick your brand you want the the smell the music the people the smile that
whatever you get all of that together is the perception of that brand and so if you think of that through the lens of a product which might sound counterintuitive I don't know or nonobvious to people like I think about
that alto together let's say we were selling sneakers right like we don't sell sneakers but the the this sneaker which we won't me won't say other brands but if you do you can yeah always pivot to
GPU exactly like if you go to the store here in New York it's like a beautiful store the people are very nice every everything's aesthetically pleasing and
you just like enjoy that entire experience and it's a good sneaker right and so you have to have that altogether and so that's how I think about this as well which is you know you have to have a good product you have to have all
these things but all these other things have to coalide with that now the risk is and we've seen this with a bunch of startups like there's no sustenance so the product is not good and you have everything else and that is sort of
that's when you get into trouble but if the product is good or at least as good if it's better it's amazing and you have all these things for for me my belief is that is that is a key driver to continue growth
great uh you mentioned Twitter and X uh a minute ago. To which
extent is that part of the whole u the whole plan?
It's hard to measure. Yeah.
Like I've worked in bigger companies and if someone if I was working for my former company they're like how do we measure that? Like I have no idea that
measure that? Like I have no idea that we measure that. But what I can say is that I was never active on Twitter very I have an account since 2009 but never very active until like this holiday
season where there was like this semi viral tweet that that went out. Um,
what did you say again?
I So the the tweet was if you're taking a break these holidays, you're NGMI.
You're not going to make it.
Yeah.
Which I honestly believed.
Yeah.
Like the tweet was basically I wasn't even thinking about there's like a typo in the tweet. Like it's not people like you're wage bating. It's like no. I was
my thought is we are we are a part of this like super cycle right now and the super cycle does not last forever. And
so if you're going to pause by the super cycle, you are steing market. like that
is what you are doing right and also it's like it's very much towards you know founders and executives and tech leaders it's less about the individual person that you know that can't have an impact on
in that like people have to take their breaks and so people took that in all different ways or like two poop too it's like you should die you're a capitalist whatever all these things and the reason why it became so popular like it was
kind of popular which was some person said dude you never I've never heard of you or whatever and then I said basically that's why not to keep working and that
perfect and I'll again not I was like instant reply on that and so basically not talk too much about that is that from then I had noticed that
people even if they like your tweets or don't or are completely opposed or not has no bearing on I should say it has
that positive bearing on their thoughts of your company. So we have found that people that even are completely opposed
to the posts end up being customers assuming that they need the product line. Not everyone is there. So it was
line. Not everyone is there. So it was quite interesting to me that oh just because someone doesn't agree does not mean that they won't like that because it's all generally awareness.
And so that has been something that I've spent a bunch of time trying to catch up to you on um on the Twitter versse.
Yeah. No, that's super interesting, right? For any um AI builder or AI
right? For any um AI builder or AI founder. So the the the PLG motion
founder. So the the the PLG motion ultimately like to create inbound is is a combination of all the things. So we
talk about conferences and meetups. Um
Twitter is important. Is there anything else that people uh should know? We talk
about customer service. You mentioned no salespeople as of now.
We have no sales people at all. Yeah. So
I think like the how do you experience the the product, right? So like once you've seen the product, how do you experience the product? It's like your door to the product like the website, the login, the whatever. We can fix a lot of these things to be very clear.
I'm not saying we're the best at these, but there's that. And then what are the feature sets are in there? Can I get things easily? Our SDKs are really
things easily? Our SDKs are really really really good. Like people really like like the ergonomics of them. So
it's like really good. That part is great. And then the thing that and this
great. And then the thing that and this is even public on Twitter. All our case studies that we've done and we outsource the case studies to third parties. we're
not part of this is that we reply very very fast and so this is a very public thing and it's something just core to who I am and how I learned to work cuz
one of the let's call it first real jobs I had was a system admin so my job was like to fix the printer and computer and whatever and so one thing that I learned and I try to talk to my entire team is
like one thing you have to do is like one is the first response very fast just the first response very fast like people are calm. They know they have
are calm. They know they have transferred their problem to someone else and someone has acknowledged that.
And so when they do that, they feel rested, right? And then you just have to
rested, right? And then you just have to promise that you will solve or get back to them at x amount of time, whatever that is. And they were calm till that
that is. And they were calm till that moment. And the thing that you have to
moment. And the thing that you have to do is one either solve the problem until that given time or call them, message them, whatever interact them before that
time and state a new time. That is the solution to support. That is it. There's
nothing more than that. There's
obviously there's like key things. If
you're actually down and don't work and the person can't work, that's a completely different thing. But every
other non-critical problem and you have the most happy customers in the world because they don't have to think about their problem anymore. You think about their problem. And so you can like keep
their problem. And so you can like keep that on until you get it done. But the
key part is if I said that something's going to be fixed or deployed or done or whatever in 2 days, a day and a half later, if it's not going to be in two days, a day and a half later said, "Hey, this is going to be delayed two more
days." And they're fine. You thought
days." And they're fine. You thought
about it before they thought about it.
And it's not magical. It's not I mean, it's a magical experience, but it's not very it's not a complex thing, but I found that people don't understand that intuitively. And so, that is what I
intuitively. And so, that is what I learned back in the day, and that is what we do in the company today. And
that's why we are we have very very happy users and customers.
Okay. Fascinating. Now let's switch back to the the more technical uh stuff. So
we talked about the concept of sandboxes. Uh just walk us through what
sandboxes. Uh just walk us through what a sandbox is technically.
Yeah, there's a lot of things there's a lot of things there we can talk about.
The way I think about uh sandbox is the ergonomics of consumption of compute.
And so because there's a lot of different layers on that I basically put it into three different layers which is the infrastructure the primitive and the tooling those three things together. And
what I mean by infrastructure is like is does it spin up very very fast? Can you
spin so like we spin up in 60 milliseconds? Can you spin up a lot of
milliseconds? Can you spin up a lot of them at once so the the rate of creating them. And so we can spin up 50,000 in 70
them. And so we can spin up 50,000 in 70 seconds. Um so little less than a minute
seconds. Um so little less than a minute and a half. And then once they're up, how many can you keep running? Like we
can have we have customers that have billions a day. Um so those are like the infrastructure parts that are non-trivial.
Why does it matter uh how quickly you can initialize a new sandbox and how many again it depends on the user and the use case. So if you're like a long running
case. So if you're like a long running background agent again everyone prefers it to be fast like no one wants to wait.
Like you don't want to wait for a reply.
Like everyone wants to be fast to be very clear. So the faster the better,
very clear. So the faster the better, but generally there's an actual reason why you want it very very very fast. And
that is especially if you so for I was going to say if for a background agent a background agent might work for like 10 minutes or an hour or whatever. So the
incremental millisecond might not matter. But I still believe that from a
matter. But I still believe that from a user perspective even a second of waiting is kind of uncomfortable. You
don't want that. And so is 60 or 90 maybe less so. But there you want that one two seconds for sure under that. But
the interesting the more interesting part where that is really really important is for the researchers where when you're doing reinforcement learning you basically have a lotment of GPUs and
you want the GPU is more expensive than the CPU. So the vast majority of
the CPU. So the vast majority of sandboxes are CPU boxes to be clear. So
they're the computers that we all work on. There might be a graphics card in
on. There might be a graphics card in there but basically it's the compute the RAM the CPU um and the hard disk that's in there. And they are cheaper or less
in there. And they are cheaper or less expensive and easier to to get at least for now. We'll see for how long that
for now. We'll see for how long that lasts. Then GPUs, which means you want
lasts. Then GPUs, which means you want your GPUs always to be at maximum utilization and the CPUs can then idle if need to be idled, but you want them, you don't want the GPUs to idle. And so
what that means is you want to make sure that the CPU machines, the sandboxes spin up so fast because you don't spin them in a training run. You don't spin them all up. You spin them on like
depending on how many you have like a thousand at once. They do the task, then do the next one, the next one, the next one. And so between turning off and on
one. And so between turning off and on the CPUs, you want that time to be as short as it can so that the GPU utilization does not go down, right? And
so that's why that's very very very important. And obviously the number that
important. And obviously the number that you have concurrent so depends on so on the background agents let's say just for the sake of people know lovable. So the
amount of users that they have is astonishing. And so each user for every
astonishing. And so each user for every task and they can have multiple tasks, multiple agents, they need a sandbox.
And so imagine I don't know their user number in the millions, whatever. So you
have to have millions of sandboxes up and running. Um on the RL side, you can
and running. Um on the RL side, you can have people smaller labs that will have concurrent 5,000 or 10,000, but we have a request today for 5 million concurrent look. So 5 million at one point in time,
look. So 5 million at one point in time, right? So being able to handle those
right? So being able to handle those types of things is part of that infrastructure segment. The other two
infrastructure segment. The other two are the primitive and the tooling. And
so the primitive is essentially let's call it the computer itself, the isolation, the VM, the container, the microVM, the isolate, the whatever. We
can get into details what those are, but there's different shapes of them. And
also what feature set do these things have, right? And so for the most part an
have, right? And so for the most part an a VM that you would find in you know AWS or whatnot is usually much slower to
start much harder to get those uh spikes but also when you get a certain size they're usually that size forever and forever and then we just give one one example
whereas in Daytona you can define a size the CPU the RAM the disk and then while it's running you can resize that so if an agent gets to you know use the entire
memory or 100% CPU, the sandbox would or VM would die. In our case, we can expand that. And there's other things. It's not
that. And there's other things. It's not
for us, it's not just a Linux CPU box.
It could be a Windows, a Mac, an Android. It can be, you know, they can
Android. It can be, you know, they can have a GPU in there. It can not have a GPU depending on what you need. And so
that's the primitive. And the last thing is the tooling. The tooling is the tooling. Tooling is either there to
tooling. Tooling is either there to support the agent to be better at its job or to be guardrails on the agent to stop it from doing dumb stuff basically.
So you can think of it, we have a bunch of tools, so headless, you know, terminal and uh fi uh file operations and other things that enable it to use
less tokens, get the job done faster, but also guardrails like secrets manager and a firewall and all these other things. And so the combination of those
things. And so the combination of those three, the infrastructure, primitive, and tooling essentially creates what we call a sandbox. So what's ultimately what's the different between a sandbox
or a container VM microVM all the things all the joy so all most sandbox providers are
firecracker VM microVMs most remind people what firecracker is firecracker it's an isolation priv so it is essentially a type of VM or virtual
machine that's very stripped down it was made by the uh AWS team it's used inside of Lambda and so the idea which are like functions um and so the idea was to have
something very very fast um that can spin up and stateless that that that was the entire time and they were very very ephemeral and so they were used for you
know when your website has a large um if it's Black Friday a bunch of people hitting your website you can spin up a lot of these very very fast like handle the load and enable all these humans to
to look at the website and buy whatever they were buying, right? And so that technology is there and it's a very stripped down version of a full-on VM.
So it has less features there, but the things that it does, it does very very very very well. So it can spin them up fast. It does that because it can sort
fast. It does that because it can sort of lazy load things in into memory. You
can do point in time snapshots. You can
do all these nice things. The things
that it can't do is it can't run a let's call it sandbox now. It can't run a sandbox with a GPU. It just doesn't work. So that you can't have a
work. So that you can't have a firecracker. So if you want a GPU, you
firecracker. So if you want a GPU, you have to do something which is a cloud hypervisor or a Kimu spelled Q E M U which are two different types. Q Kimu is almost a full VM. So it has all these
different things in inside of there. Um
so you can't do that. But let's for example if you need to run we were solving this for one customer earlier today is they need a machine that also has an Android device inside of it. And
so the only way we could solve that was inside one of these key moves. Like it
can't work in a firecracker, it can't work in a container, it can't work in anything else. And so there's different
anything else. And so there's different types of there. These are like the microVMs of the world. There's also
containers. And so container, most people know Docker, which um is there.
Daytona originally started running Docker containers. We do have them. The
Docker containers. We do have them. The
problem with Docker is that they're much less secure than a VM. The thing that we do in Daytona is we harden that with
SISBOX, which is um VM like isolation that wraps around that. um it's really good for handling density of these machines. It's very fast. It allows us
machines. It's very fast. It allows us to run Docker and Docker. There's a
bunch of features that are very that it's very very good for. Um and so there's differences there. There's also
like isolates which we've talked about or like other app containers which is abstractions of these containers just you're running a single app. And so these different
single app. And so these different things all exist in the world and they can all be theoretically sandboxes with these other two things that are there.
And our original idea was to get back to the original conversation. Most sandbox
providers started as firecrackers. But
as we start seeing that agents have different needs and different use cases, they are going to need all these different shapes consumed through one
interface. And so we at Daytona now
interface. And so we at Daytona now support containers and all the microVMs depending on your use case. The user
doesn't know it's the same ergonomics.
It's like, oh, I need a Windows, it'll spin up this one. I need a Linux, I'll spin up that one. And so we will continue to add these things inside of the same infrastructure. Sometimes it'll
be slightly faster or slower, but that the concurrency will be there. All the
um the the tooling will be there. And so
everything that you would come to know and like about a Daytona will be there, but it'll be different sizes and speeds and configurations depending on your use case.
Why are snapshots and ors important?
There's a lot of reasons. Let's get into the the commercial one, which is probably the the one that people most think about, which is pausing a sandbox.
So if you are an app layer company and you have 10 million users and your 10 million users spin up let's just say 10 million sandboxes just to make the math easier. These things cost right cuz
easier. These things cost right cuz they're running they're using CPU RAM and hard disk the entire time. The most
expensive thing is the CPU second their RAM disc is almost free. It's very
inexpensive and the user sends the agent to do something. The
agent does something and now it's waiting for reply. It could be waiting for the human or it could be waiting from a service depending on what it's trying to do. You ideally don't want that sandbox to run idle because you are
now pay regardless if you're a customer of Daytona or running your own but like there is a cost to running these things.
And so what you want to do is pause that and wait for a reply either from a service or for a human and then resume and the feeling and experience is that it never turned off. So you feel like it
never turned off but it actually did um turn off. And so that one is from a from
turn off. And so that one is from a from a compute management perspective if not from a cost perspective is probably the first reason. The second the second
first reason. The second the second thing is you can enable your agent to take multiple paths. So your agent can say oh at this point in time I'll take a
snapshot and then I will either continue and be able to roll back to this point like a point in time or at this point in time I will replicate the sandbox and have two sandboxes and I can try two at the same time. Obviously it can be more
but right but those are the reasons why you would have that there.
You mentioned uh uh performance and speed um and in connection with that you said that you had to rebuild your own scheduleuler.
Y so what is it first of all what does a scheduleuler do and then how did you go about it? Every cloud that exists today,
about it? Every cloud that exists today, NeoCloud, hypers scale or whatever, they are all built on servers, right? Like
metal machines, I don't know, like historically way back in the day, we actually stack these data the these data centers. Um, me and my co-founder a long
centers. Um, me and my co-founder a long long time ago, but they're all just like servers like CPU, RAM, disc, they're computers basically. Um, and the on top
computers basically. Um, and the on top of these computers, there is a software stack that everyone has built for their own reason. And so AWS has their
own reason. And so AWS has their software stack. Um, Cloudflare has
software stack. Um, Cloudflare has theirs which they're very different.
Everyone has has their own on top of that. And so basically what you're
that. And so basically what you're trying to do when you think of these machines, these servers, basically what we do is we cut them up into small little machines and then give you that
sandbox. And so you have these small on
sandbox. And so you have these small on these big servers, you have these small little machines that can run for a minute, a second, 3 hours, whatever. And
you don't have to worry about this. And
so basically the scheduler or the orchestrator is the one that that basically says, "Oh, after you send me a request, I send you to this server and turn it on that sandbox or little
computer there and then I turn it off or I kill it or I snapshot it or it's the management of all these things there.
And most of our other companies in the space basically have off-the-shelf schedulers. So it could be like
schedulers. So it could be like Kubernetes or Nomad or what whatever.
And when we decided to build Daytona the sandbox product, we inadvertently with very naive were like oh this stuff doesn't work cuz we had worked with all we had built our own then we had used
Kubernetes like we've seen all these different things and we knew what was good and was bad or what was use case or not. None of these were made for these
not. None of these were made for these like super fast staple longunning machines and so we're like we have to we have to do this again. we have to build it again because the way we thought
about it which was not known at the time is cuz at the time sandboxes were very ephemeral similar to a lambda function and we're like no why would your sandbox
be ephemeral by default like your laptop you don't want it to die like you want it to work until it's done and it has to be very fast it has all these things and so my co-founder mostly built this sort
of initial version of our scheduler and thatuler of ours is the basis of everything we do and it gives us a lot of things like outside of just like the performance. It also enables us to do
performance. It also enables us to do things like have four different isolation provide like we are the only company right now that you as a user don't know but like we have depending on
your use case we will spin up any one of these firecracker cloud hypervisor whatever um to get the job done. So
whatever is needed we'll get there. The
other thing the benefit is we built this we have collocation providers so we have like bare metal machines so you can think of us as our own cloud but we're more and more achen to now like the
neoclouds like the base 10s and like the fireworks because for us and we've got to this question yesterday because we have a large request of number of CPUs
is like we can't get enough and so how do we solve that problem well we've already solved it where our scheduler can be attached to any CPU machine, any
server that we think is good enough and it becomes part of our cloud essentially. And so, so why I say base
essentially. And so, so why I say base 10 and fireworks is that both of them run on more than a dozen compute providers depending on what we're going to call them. And so we can do something
similar because we've created this in such a fashion. So that has given us I believe that has given us sort of um a advantage there. And is there a
advantage there. And is there a fundamental technical difference between ephemeral and long running? So if you have a an agent that runs for 24 hours
uh how does that translate in terms of sandbox requirements?
The reason most sandbox environments do not run forever or even is because it's a technical problem. If you think about servers underneath these servers also
have to be managed right and maintained.
And so if your sandbox can run forever, that means that you can never reset, you can never reboot the underlying server.
You can't update it, you can't patch it, you can't do all these things without turning off all the sandboxes. And so
the way you solve that, the easiest way to solve that is having sandboxes that have a termination time. It's like they will only last an hour, 24, it doesn't matter. secure time and then if you
matter. secure time and then if you decide that you have to do something with the underlying machine, you just become you just flag that machine as non-scheduable.
And so at some point in time, there's nothing else on that machine. You can do what you can fix it. You can like reboot it. You can do whatever you want. Easy
it. You can do whatever you want. Easy
peasy done, right? And so because historically most of the workloads were ephemeral, you didn't have to try to solve that problem cuz you didn't care.
Like most workloads like a lambda function usually runs what five minutes 10 like whatever it's not a problem right you never had that restraint or constraint now that you do to have
something that can run forever you have to be able to live migrate the sandboxes itself between the machines so that you can reboot and manage these machines and I understand that most people it's
interesting even technical consumers are like oh all comput is elastic well yes your consumption of it from us or from AWS is elastic someone actually has to like screw together a server, turn it
on, boot it, make sure it works, right?
And have enough of that there. Um, so
that is the difference from from our perspective and why most start off emerald is just it's an easier way to do it. We believe and we've seen this I
it. We believe and we've seen this I think 2.5% of our revenue now comes from or 2 and a half% of our sandboxes run longer than 24 hours, but it's a large
percent. It's like 20% of revenue. So
percent. It's like 20% of revenue. So
like it's not an insignificant part to be there just because it can run for a much much much um longer time on the user perspective. Just to finish on that
user perspective. Just to finish on that sometime users actually want it to be ephemeral one is they don't want any data retention. So they want it to run
data retention. So they want it to run and die like whatever was run inside of that they got the data they wanted and they want that data dead. And so they
flag it with us as ephemeral which means that we won't force it to die but when they kill it it's no longer in it goes into the ether. It does not exist anymore.
Listening to uh this whole uh technical deep dive part to the conversation the the thought crosses my mind that a lot of people seem to think that they can create their own sandbox and it is not
that hard but uh again listening to all of this I'm like good luck. No, so that that we've seen this multiple time and I'm sure you've seen this across other companies as well where you know everyone's like, "Oh, I can spin up a
firecracker on a machine." Yes, you can spin up one, but spin up a million is a problem. Adding all these features is a
problem. Adding all these features is a problem. Having throughputs, different
problem. Having throughputs, different use cases, they're all different problems. And what we already see is like companies that were like, "Oh, we'll do ourselves come back 3 months later, 6 months later, 8 months later."
It's like, "Oh, actually, actually now we can't do X and Y." And so there's a bunch of things on technical side which we can like address and I think we can
finish with that is that the thing that Daytona does is we if you think of most sandboxes this a performance thing most sandbox and most
VMs actually they use the CPU RAM from the server and then they use the hard disk from a network drive.
You do that for a bunch of reasons. Your
life is so much easier if you manage that. Why is it easier? Well, it's
that. Why is it easier? Well, it's
easier because if a server dies, then all the data is on a shared drive and I can reboot that server really fast without maybe even the user not noticing and there's no data loss. Maybe
something in the memory, but nothing there is is lost. So, it's very very easy to manage. Whereas what we do is we actually use the CPU, RAM and hard disk of the machine.
What which means we have so much more overhead of if that machine dies, how can we reboot it back? like you have to do backups in the in that users don't see. You have to do all these things
see. You have to do all these things that happen there. But what that means is that the the IOPS the speed which with which you can um move data to the
hard disk um in which the CPU and RA RAM interacts with from a network drive just to give people like numbers is in the hundreds of thousands. Um if it's a
local drive it's in the tens of millions. So just like it doesn't matter
millions. So just like it doesn't matter if you know what the numbers are. It's
just like extraordinarily large, right?
It's very very different. And so you have use cases that no one really cares about that speed, which is fine. We have
new customers that have a use case.
They're like, it has to be that fast, right? And so it when you get to the
right? And so it when you get to the point, it's like, oh, I can use this, but wait, now I need this performance.
Oh, now I can't do that. Right? And so
to the complexity of sandboxes if when people are starting out their use cases are like oh I just need to run code you can run that on anything it's very easy to do except scale and then when you get to scale and then you need
performance because now oh agents can do a lot of things then it starts like breaking that sort of mold and security. Yeah. And so we didn't
and security. Yeah. And so we didn't even touch the security part, but yeah.
Great. Zooming out as we uh close this conversation. Um you mentioned something
conversation. Um you mentioned something that that caught my attention. So we
have been in a very well doumented GPU shortage. Uh but it uh seems that we
shortage. Uh but it uh seems that we might be heading towards a CPU shortage.
Yeah, it might be happening. So one of your former guests and was at our conference Dylan Patel like semi analysis they wrote the report and I believe it's somewhere like October we
have no more CPUs and people hadn't thought about it like it's it's all GPUs all GPU intensive then memory but basically now now that agents need all
these computers like it is very much needed now that RL is the way that we've gotten most of the progress in models over the last 18 20 months hunts you
need all the CPU machines to do this.
Now that the GPUs are more powerful, they can do in parallel more of these CPUs, CPU boxes. And then you see this, there was a tweet that I put a while
back and it's like you should definitely invest, not financial advice, you should invest in Intel and look at their stock price, right? And so like there's
price, right? And so like there's definitely need for that and it's something that people didn't understand or intuitively that that you would need there. So something that we
think about quite a bit is like can we alluding to also how our product is created how do we make sure that we can match the demand of customers for the
CPU that they have because I don't know it goes to the extreme to where GPUs are because that is very very very extreme. Uh but it is quite highly
very extreme. Uh but it is quite highly high probability that there will be shortages of CPUs going forward.
How do you think all this uh agent stack evolves? Does it feel like we have all
evolves? Does it feel like we have all the core pieces uh together or do you think like in two years the overall architecture uh may look very different?
So we talked about the the stack there and things that have to be done. The um
the identity part does need to be solved. Again it's basically solved but
solved. Again it's basically solved but not solved. So there there's a lot of
not solved. So there there's a lot of things to do there. I'm actually also very wary or think about people think that the models that we have the technology for the models that we have
is the end state and I really really don't think that is true like I think and that would probably be the most surprising thing is like we have a new type of model that is just better or
different than what we have now that either one needs less compute or doesn't use GPUs or doesn't like I don't know whether there's things that we've seen
where, you know, we've seen things with wetwware where people can have like fake human brains and it learns how to play Doom. You have the um what was it the
Doom. You have the um what was it the fly brain that was replicated inside of a computer that then works like a fly?
Like can you replicate a human brain?
And I say this not to replicate myself, but can you a generic human brain and then it's like an it's an AI that is essentially equal to a human and AI. And
so what does that mean for like all of technology? I mean, I'm saying I'm now
technology? I mean, I'm saying I'm now saying extremities for people, not getting me wrong, I don't think that happens tomorrow or like that. I'm
completely crazy, but like I don't think that this is the end all of the state.
The question is does that happen sooner or later? Do we keep our transformers
or later? Do we keep our transformers the the the vast majority of intelligence going forward? And if it if it is, then it continues probably going directionally where it does. But is
there something that happens that changes that fundamentally? something
that I think about because then all our bets, yours and ours are like very very different right?
Okay. Well, Ivan, that was fabulous.
Thank you so much for spending time with us today.
Thank you for having me. We're great.
Hi, it's Matt Turk again. Thanks for
listening to this episode of the Mad Podcast. If you enjoyed it, we'd be very
Podcast. If you enjoyed it, we'd be very grateful if you would consider subscribing if you haven't already, or leaving a positive review or comment on whichever platform you're watching this or listening to this episode from. This
really helps us build a podcast and get great guests. Thanks and see you at the
great guests. Thanks and see you at the next episode.
Loading video analysis...