LongCut logo

Claude Code Head Boris Cherny: Insane Growth, Tokenmaxxing, AI Agents' Next Frontier

By Alex Kantrowitz

Summary

Topics Covered

  • Claude Code Has Been Writing Itself for Months
  • AI Safety Is Why Anthropic Exists

Full Transcript

I've just never seen growth this deep.

And then it just kept going more and more exponential. Quad code is 100%

more exponential. Quad code is 100% written by quad code. Co-work is 100% written by quad code. An increasing

number of features are fully written by quad code across anthropic and and products. So I want to hear your

products. So I want to hear your perspective on on token maxing and whether you think that makes up a large portion of the usage of the products that you're building.

I don't write code. I prompt Claude.

And actually nowadays, mostly what I'm doing is I have a Claude that prompts other quads. So I don't even talk to

other quads. So I don't even talk to Claude. I have a Claude that's talking

Claude. I have a Claude that's talking to my quads. Let's talk with Claude code head Boris Churnney about the product's explosive growth. What's next on the

explosive growth. What's next on the road map and whether all this is sustainable? That's coming up right

sustainable? That's coming up right after this. Welcome to Big Technology

after this. Welcome to Big Technology Podcast, a show for coolheaded and nuanced conversation of the tech world and beyond. We have a great show for you

and beyond. We have a great show for you today. Cloud code head Boris Churnney is

today. Cloud code head Boris Churnney is here with us in studio. We're going to talk all about the product, the way it's taken off, what's next on the road map, and of course whether it's sustainable.

Going to go into things like token maxing, token inefficiency, and then of course the future of knowledge work. So

no lack of topics to cover. Boris, it's

so great to see you. Welcome to the show.

Yeah, thanks for having me.

So let's talk a little bit to begin with about the growth of cloud code. Uh it's

been massive, right? I think at a recent event, Dario Amade, the CEO of Anthropic, talked about how demand for Anthropic's products has been up like 80 times uh year-over-year. I remember

speaking with him last year around this time and he was thrilled that Anthropic was at $4 billion ARR. That seems quite right now. The numbers right now say

right now. The numbers right now say maybe it's 45 billion, right? So, a 10x there, 80x demand. And the question is how fast the company uh can serve the

demand here. But talk about the portion

demand here. But talk about the portion that of demand that claude code makes up and what you've seen in terms of demand growth and the amount of people using this thing for an increasing number of people in

the world. I think the way that you use

the world. I think the way that you use agents and the way that you use AI, it's not just enthropic products, but it's quad code in particular. And you know, of course, for enthropic, there's a lot of different products. There's, you

know, there's quad code, there's quadi chat, there's quad design, there's there's co-work, there's like the API products. There's a lot of ways to

products. There's a lot of ways to experience anthropic. Um, but for a lot

experience anthropic. Um, but for a lot of people, quad code is their first introduction. And yeah, the growth has

introduction. And yeah, the growth has just been insane. It's, you know, when we first released it internally, it just skyrocketed immediately. And so before

skyrocketed immediately. And so before we even released quad code to anyone outside of anthropic, we felt that it's pretty likely that this is going to be a

hit. And around the time that we

hit. And around the time that we released uh Opus 4 and Sonnet 4, this was in May of last year, the growth just went exponential. And I've just never

went exponential. And I've just never seen growth this steep. And then it just kept going more and more exponential with uh with Opus 4.5. that was November and then 4.6 six that was February of

this year and then 4.7 it just keeps inflecting over and over and you know there's a lot of people on our team that have worked in tech for a long time and you know we worked on all sorts of

hyperrowth products like this is something you talk about in tech all the time these like unicorns and hyperrowth but even on the team we've never seen growth like this uh and so we're we're just trying to figure out how do we how

do we make it so everyone can continue to experience this uh how do we make it so we can continue growing at this pace and the pace that we uh expect in the

future which might be even steeper than it is today and uh we're learning a lot about about how to do this and uh how to how to keep scaling the services. So a

year ago it was clear that the bulk of usage of anthropics AI models was happening through the API right that would be like a company like a consulting group for instance putting it

into action at a bank and the bank using it to summarize some calculations. I'm

just throwing an example out there. um

that compared to the cloud chatbot it was far and away the API was the lion share of usage revenue all these things um does that still the case today or is cloud code overtaking that we have a mix

so you know like products play a much bigger role for anthropic than they did a year ago that's that's definitely the case uh product growth is accelerating it's growing very quickly API is also

accelerating and growing very quickly and for us we are investing in both we have to be a product company because there's kind of a lot of reasons for lab

to build products and you know this actually wasn't clear early on like very early on in anthropics history this is before I joined this was actually like an active debate should we even build products like is this actually like a

useful thing to do and it turns out it's very useful um you know for mind share but then also for safety um fundamentally we exist to study AI safety this gives us better tools to do

that we're also a small number of people and so most things in the world we will not build, right?

And so this is why we also have to provide a platform and we have managed agents and API and SDK all all these products so people can build on top and you know thousands and thousands of businesses choose to do that.

Yeah, it's it's interesting to hear you even answer the question saying that it's a mix. So I take it you're not going to share which is bigger right now.

Maybe maybe not right now.

Okay. But the fact that like it's not a clearcut the API is bigger. Um

maybe it is. But the fact that you even say it's a mix just shows the the fact that Anthropics owned and operated products are just growing massively. And

now so you know we've set the we've set the stage here that this is a thing something that's growing exponentially.

Um we've obviously we obviously have seen the anthropic revenue grow exponentially kind of alongside this product. This is a product that you

product. This is a product that you conceived of and built and run today.

Uh, I think that there's probably some people watching who are like, well, what is Claude code? Um, most of our our viewers obviously know what it is. Um,

and I was like, how do I write this like in a simple one-s sentence definition?

And I wrote that it's a way to build websites and software in plain English.

And then on the way over here, I was like, well, that kind of sells it short a little bit.

I mean, what would you describe it as?

I think that's actually a pretty good description. It's

description. It's all right. We'll take it.

all right. We'll take it.

I I think when a lot of people think about AI, they think about chat bots.

And you know, for engineers, that's what AI was, you know, maybe like a year and a half ago, before we started quad code, that's what AI was for most people. And

we realized at some point that the model was actually getting really good at coding and it's getting really good at using tools. And these are things that

using tools. And these are things that we've kind of always trained the model to do. And you know, this has kind of

to do. And you know, this has kind of been the research direction for a while.

It started to become commercially useful about a year and a half ago. And so for cloud code, we took this bet and we deviated from the way that everyone wrote code at the time because the way

that everyone in the world wrote code was using essentially a fancy text editor and we just thought maybe we can do much better than this and we could do something really really different than

what's been done before. It was very much a bet. And so we introduced, you know, quad code. And the thing that made quad code different from chat bots at the time was quad code can use tools.

And this is it. Like this is just the difference. It's with a chatbot, you're

difference. It's with a chatbot, you're going back and forth and you're talking, but an agent and cloud code is an agent.

It can use your tools, right? And can we just quickly define

right? And can we just quickly define the tools? So tools could be anything

the tools? So tools could be anything and you tell me if I'm wrong from using a browser to like logging into Cloudflare and then setting up some agent that way, right? So it becomes

less of what does this product do itself and more of like what can this product log into and then sort of do with a multiplicity of products you can use online.

That's right. It can it can connect all your different tools. It can use your browser. It can use your computer. Um,

browser. It can use your computer. Um,

even something as simple as like editing a file on your computer. You know, like a year and a half ago, there was no AI product that could actually do that. But

this is the first thing that Quad Code was able to do. It could edit a file on your desktop. If you have a bunch of

your desktop. If you have a bunch of files on your desktop, it can organize them. And so like Quad Code and Co-work

them. And so like Quad Code and Co-work have this access if if you choose to give it.

Granted.

Yeah. And and you know, it it can it can do this. And this is magical. It's this

do this. And this is magical. It's this

tiny difference completely changes the way that people can use this product.

And it totally changes what this product can do for you.

Yeah. I mean the fundamental thing I think just to drill down here is that um it seems like AI has shifted from sort of like as great at autocomplete right

because at the fundamental layer AI is just predicting what comes next. Um

predicting you know if if you're using machine learning and applying it on a large data set predicting whether you might default on your mortgage and whether a bank should grant a mortgage.

when it comes to a sentence predicting the next word with code predicting the next bit of code in a sequence right so I think that was gener gen one but what

you're talking about now is the machine is actually just able to go and after you give it this natural language prompt code itself hook into tools and then do

things for you and so correct me if I'm wrong but the the use cases here have gone from developers hooking into it and writing code with cloud code and we've seen this explosion I guess largely

driven by them but then by a secondary force by non-technical folks people like me who can build software um by directing the AI agent which is cloud

code to build a piece of workflow software for them or a website or to take control of your computer via something like cloud co-work which is sort of the um maybe I would call it the

easier sister product and saying well you have access to my uh to my browser Um, you know what type of flights I like to book? I need to be in India in a

to book? I need to be in India in a couple of weeks. Book the flight.

Yeah. Yeah. Exactly. I I I actually just used co-work to book uh a bunch of flights. I'm I'm going to be flying a

flights. I'm I'm going to be flying a bunch this month for, you know, we have like Code with Claude coming up and in London and Tokyo. And there's some other stops along the way. And I went back and forth with co-work. And I was like,

"Okay, I need to be in these uh in these places at this time." And it was five stops. It was like a lot of cities. And

stops. It was like a lot of cities. And

here's roughly the schedule. um look

through my email, look through my calendar, and just double check it, make sure I'm not missing anything. It found

actually two stops that I was missing and also a couple dates that I told it wrong. And it just found this by looking

wrong. And it just found this by looking at my email after, you know, I I asked it to do that. And then I told it to book the flights and I went and, you know, was coding on something and I was

I was just doing work and I came back an hour later and it booked eight flights and five hotels and uh one of the hotels was kind of incorrect. was in the wrong area. I asked it to rebook it and change

area. I asked it to rebook it and change it and it was done. That was it. And I

actually, this is something that I try every time with co-work with quad code.

I have these sort of like uh test cases.

So these sort of um like a common thing that I would do and I just retry it with different models and you know as the model improves. This is the best result

model improves. This is the best result I've ever gotten. And there's something about co-work combined with Opus 4.7 where it's able to do this. And I think

one of the hardest things for me has been as the model improves, you constantly have to readjust your expectations of what it can do. And if

you talk to people, especially engineers that use the model a year ago, they might and and they didn't use it since.

They might say something like, "Oh, well, you know, it's not very good at coding and you know, I don't trust it to write more than a few lines at the time at a time." Because that's what the model was a year ago. It wasn't very

good yet. And if you fast forward to

good yet. And if you fast forward to today and you sit down these people and you know they they they try the new model and as as like a lot of people have been doing an increasing number of

engineers it's just a completely different experience the the capability is completely different and I I think this is the first technology I've used like this where

every month there's a step change in what it can do and as a user of this technology it's just quite hard because you have to kind of keep retraining you have to keep retrying you always need

this like beginner mindset to to retry the technology and use it for a thing it was not good at before because the next model might might just do it perfectly right and so I think this is the vision

the way that you're outlining it is effectively previously when you would use technology you would be subject to the interface you would have a software company that built for scale um but you

would get a lot of features that maybe weren't applicable to you would have to go through all these bells and whistles whenever you were trying to book something even though you knew what you

wanted and you wouldn't have a website that would know your preferences. Now it

sort of shifts the paradigm where you have again it's it's an agent. It's

something that goes out and does things for you and can potentially shape your experience online the way that you want it. And that's that is I think what

it. And that's that is I think what people are seizing upon and that's why we're seeing why you're seeing really the explosive growth. But now I want to pressure test the thesis a little bit

and uh bring up some things that make me curious how much of this is real uh and how much of this is just um unbridled enthusiasm at the potential but maybe

stuff we should have a reality check on.

Um and the first thing is that there is such great demand. U but the question is how much of that demand is pure demand

versus demand that's gamified. Um, and

there is a practice that's going on within Silicon Valley and outside of it that's called token maxing. I'm sure

you've heard of it. It's where um, companies are have a mandate where people are supposed to use lots of AI tokens by running their AI agents as

much as they can. And then those who run the, you know, use the most tokens are like rewarded on a leader or on a leaderboard or meet a goal of AI actions that they have to take

as opposed to physical actions. So I

want to hear your perspective on on token maxing and whether you think that makes up a large portion of the usage of the products that you're building.

Yeah, I I don't think token maxing is a is a large percent. Um the the way that I would think about it is um you know before Anthropic actually I used to work at a big tech company at Facebook.

I was at Facebook which is one of the companies that's token maxing for.

That's right. That's right.

Yeah.

And one of my responsibilities was the health of all of the code across you know the across meta apps. So this is like Facebook, Instagram, you know

WhatsApp. And one of the reasons that we

WhatsApp. And one of the reasons that we care about the health of the code and this is essentially things like code quality is if the code is really high quality engineers are more productive.

And there's like a big team of people that worked on productivity. And before

models before claude you would work for a really long time and you would see maybe like a one two 3% improvement in productivity per engineer over the course of a year like something like

that. And that was like a pretty big uh

that. And that was like a pretty big uh improvement. Um and it was like very

improvement. Um and it was like very hard one. You essentially had to try a

hard one. You essentially had to try a lot of ideas and eventually you find something that improves productivity like this. And what happened with Claude

like this. And what happened with Claude is now many companies including Anthropic and all of our biggest customers are reporting gains on the order of hundreds of percentage points.

And I think the the last number that we reported is the the amount of code written per engineer at Anthropic has grown something like 250%.

since we introduced quad code and this is uh with while keeping code quality and reliability and all these things uh kind of stable. So without those things

regressing the volume of code has has grown a lot and so um this kind of productivity impact I think is just like very new and I think people are trying to figure out how do we get this there's

a lot of companies asking like how do we how do we get these kind of benefits because a lot of companies are seeing it and then some are still figuring it out and I think my advice is almost always

the same. The first thing is just give

the same. The first thing is just give everyone tokens let people experiment. I

wouldn't necessarily recommend token maxing, but I would recommend let people experiment so they don't have to ask for approval for every token. The second

thing is give people psychological safety because a lot of times when people are innovating and they're building tools that make them more productive, they're changing their own workflows to make them more productive.

They try a bunch of ideas, some of them might not work and then some of them work. So you want to give people this

work. So you want to give people this kind of psychological safety so they feel okay experimenting with it and finding these new processes.

And then um the thing that a lot of companies see is the productivity improvements and the innovations do not come from the people you expect.

Back in the old days, you know, everyone could point out like these are my most productive engineers. But I think

productive engineers. But I think nowadays a lot of the improvements are coming from people you just never would expect. It could be like an accountant

expect. It could be like an accountant somewhere in the corner of your org that just automates like accounting in a way that no engineer would have thought of.

Um, it could be some marketer automating like marketing in a way that you never would have thought of. It could have been like a new grad software engineer that just built something amazing. And

this is something that just like didn't happen before. The challenge is you

happen before. The challenge is you can't identify these engineers and these people ahead of time. You don't know who they are and it's almost always going to surprise you. And so the thing you want

surprise you. And so the thing you want to do is let people experiment, give them safety, and then once there's some kind of use case that scales up, that's when you think about optimizing it. But

you don't want to optimize ahead of time. So I don't know if doing it in a

time. So I don't know if doing it in a competitive way works for some companies with their culture um then I think that's great. If for other companies the

that's great. If for other companies the way they want to do it is just kind of create safety and create space for engineers to experiment which is what we do at Anthropic then I think that's great too. It really depends on the

great too. It really depends on the company.

Yeah. And and I'll say look I use a lot of tokens. Uh I'm in the tools all the

of tokens. Uh I'm in the tools all the time. I think Cloud Code and Cloud

time. I think Cloud Code and Cloud Co-Work have both been pretty great for my business. I'm a solo operator,

my business. I'm a solo operator, although that kind of sells it short because I have a team of people behind me um that help me um mostly in a part-time basis, but um that's for a

different show. But but I I do wonder,

different show. But but I I do wonder, you know, when I read these stories, um the large corporations are largely making up big big percentages of these

budgets and the incentives, you know, and again like I started the show saying how sustainable is this? The incentives

are are bad in some of these places.

This is from the Financial Times recently. Uh Amazon staff use AI tool

recently. Uh Amazon staff use AI tool for unnecessary tasks to inflate usage scores. Some employees said colleagues

scores. Some employees said colleagues were using the software to automate additional additional unnecessary AI activity to increase their consumption of tokens. They said the moved reflected

of tokens. They said the moved reflected pressure to adopt the technology after Amazon introduced targets for more than 80% of developers to use AI each week.

I' I gut checked this with an Amazon employee. They're like, "Yep, uh, this

employee. They're like, "Yep, uh, this is what's happening." They told me, "I triggered an automation that runs for hours and then gets deleted every day in order to meet these targets." So you

said you don't think that this token maxing stuff is is a big part of demand.

Um is there anything that you can see on your end to indicate that it's not that this is an outlier and not the rule in most places?

Yeah, this is uh I don't know how many companies are doing this token maxing thing. I've I've heard of it as a trend

thing. I've I've heard of it as a trend you know a little bit. Um if you look at Quad Codes customers uh we have just many many many customers. Um, so it's not like uh, you know, there's like one

company driving the usage. It's not like that. I I I do want to kind of step back

that. I I I do want to kind of step back a little bit and just think about like how does this kind of change happen?

Because I think the goal of what these companies are trying to do, I don't want to speak for them, and I would recommend just talking to them.

But the the goal of what they're trying to do, I think, is probably like organizational change and business process change. How do you make it so

process change. How do you make it so your company benefits from AI? And this

is often unclear. It's very dependent on the company because every company has a different business, a different culture, a different org, a different way of doing things. There there's this old uh

doing things. There there's this old uh Harvard Business Review article from the '90s, which I just love, and I I forget the title, but it but it was something like uh computers are here, why is no

one seeing the productivity impact?

And this was a big question, right? It's

like to us it's obvious computers make us more productive. This is just incredibly obvious today.

But in that '90s, this was not obvious.

And what was happening is personal computers were being adopted. They were

replacing mainframes and now they're affordable. So the average company, the

affordable. So the average company, the average startup can can buy one. You

don't have to spend, you know, millions of dollars on a mainframe anymore.

But there was this challenge and there was this paradox. Companies were

adopting it, but they were not seeing productivity improvement. What's going

productivity improvement. What's going on? And so this Harvard Business Review

on? And so this Harvard Business Review article, it made the case that in order to get a benefit from computers, you have to restructure your your your whole business process around computers. They

have to be at the center of the way that you do things. And if you still have like paper, you know, filing cabinets and you have a bunch of drawers full of stuff and it's still a paper and pen kind of physical process and there's a

computer somewhere on the periphery, you're really not going to benefit. But

if you throw away your filing cabinets, you throw away your, you know, desk drawers full of, you know, papers and you put a computer at the center of it and that's the way that you do you all your business process, then you benefit.

And there was this split between companies. Some were doing this and they

companies. Some were doing this and they were doing this fairly painful change and they benefited from it and then others didn't. And I think it's kind of

others didn't. And I think it's kind of the same thing now. A lot of companies are trying to figure out how to benefit from the productivity impacts of AI and uh there's just a lot of experimentation

and everyone is trying different approaches to to figure out how to how to benefit from it. I don't I don't think there's one right approach.

Okay. And um look, I I think that when we see something grow as fast as cloud code has grown and as fast as as fast as Anthropic has grown um

it's good to just kind of talk this stuff through and and it's good to hear your perspective. So okay, that's token

your perspective. So okay, that's token maxing. Um now tokens of course are the

maxing. Um now tokens of course are the output of the model like the words or portions of words that the model outputs and the words and portions of words that go into it, right? Um and that is how

these companies charge and the more you have the more data centers you need etc etc. um you know as these models get better they they haven't and well let me

put it to you this way sometimes I wonder whether they're as efficient as they can be um these big models can sometimes do a lot of work use a lot of

tokens even if the output is great uh people wonder well is this sort of just driving up token demand where it could have been a really easy process and um and the models are are expending many

many tokens and not getting there as as efficiently as they Let me give you an example. Um, I've been using Claude

example. Um, I've been using Claude Co-work to make PowerPoint presentations.

It's really good at it. Um, and I've been using the Opus 4.7 model and a couple of times I've said, "All right, um, you know, you're working on this uh

this ship it as a PDF and um, it just starts to lose its mind. it cycles and it uses as many tools as it possibly can

and you know it's just seems unable to ship the PDF and um and eventually I kept telling it no you're making this PowerPoint you know where it is ship it

and it goes uh I owe you an apology I went down a rabbit hole worrying about a constraint that wasn't actually blocking us the files there and then it shipped it I mean talk a

little bit about the efficiency of these models um and whether that is a legitimate worry that you know as we've seen the growth part of it is these like

loops that a model like Opus 4.7 might find itself in to do basic tasks.

Yeah, generally when we think about models there's a few different aspects of it. um one is just how intelligent is

of it. um one is just how intelligent is it another one is how fast it is and another one is how efficient it is and we generally try to move all these together between these I think we should

probably optimize for intelligence that's the most important thing so even if it's like a little bit less efficient but it's more intelligent and lets you do more things that's really useful because the efficiency optimization comes after after we make it more

intelligent then we can make it more efficient so it's sort of kind of we do one then then we do the other we've been experimenting a with like how exactly we give people control over this because we

don't always know the right default sometimes like when you're using it you know better you know better um and so one mechanism that we had for this is picking a model so you can pick you know

opus or sonnet or haiku um another mechanism that we've been experimenting with is effort opus is like the biggest sonnet middle haiku smallest that's right that's right and this is just like the size of the model

right um and then there's effort and effort is essentially how it you know I think the word is actually really uh descriptive it's how much effort do you want to put into it and uh you can set this we have

a recommended effort so you know for example to maximize intelligence for opus 4.7 you want to use extra high or maximum effort but if you wanted to use west tokens you can pick like medium or low effort and this is a control that

you have yeah I talked about this on a show recently and we had a commenter that came in and I was of the opinion that this will these you know bigger models will find a way to become more efficient

on like the export the PDF thing. Um, we

had a commenter come in that wrote, "Alex, they can't fix things like that PDF problem. It's inherent to LM

PDF problem. It's inherent to LM technology and it's the biggest barrier to useful widespread dissemination and usage of agentic AI." I think I'm going to try to translate that what they were

trying to say is we talked about predictions earlier that this is all probabilistic. It's sort of predicting

probabilistic. It's sort of predicting the next word. You don't get the same answer from an AI agent twice. And so

therefore this type of thing is a feature of the way that they work and not fixable. What do you think?

not fixable. What do you think?

No, I I don't think that's right. When

when you think about like okay, let's zoom out a little bit. Y

so engineers are the first adopters, right? Like engineers started using

right? Like engineers started using cloud code like a year and a half ago and uh you know this is before non-engineers were using agents in a meaningful way. This is you know before

meaningful way. This is you know before co-work and so on. If I think back to what cloud code was a year and a half ago, it wasn't very good. I could use it to write a little bit of code, but if I

really trust it to build an entire feature or entire product, it wouldn't turn out well. It did the same thing like it would go in spirals and the quality wasn't good or, you know, it built it and either the code was bad or

it didn't work. And at some point, it just started to get better. And as the model improved and as quad code improved, the result just got better and better and better. And so you fast

forward to today, quad code is 100% written by quad code. Co-work is 100% written by quad code. An increasing

number of features are fully written by quad code across enthropic and and products.

And this is something that we hear from customers also.

I did a I did a talk at Y Combinator, you know, the the the the um the startup incubator yesterday and I asked people to raise their hands.

you know everyone everyone's using quad code and I asked them you know raise your hand if a 100% of your code is written using quad code today about half the hands went up and then you know I asked people raise your hand if 0% of

your code is is you know written with AI there's like one hand that went up and there's a room of like a hundred people power to that person um and you know there's still room for this obviously and then everyone else was somewhere in the middle you know

it's like most of their code is written with quad code but not all of it but that's kind of the place where the model was at today it was not there a year ago a year ago it was not good enough for this and so this is exactly what you're

seeing play out with co-work right now it's still early you know we released it what like a a few months ago um it's it's going to keep improving it's going to keep getting better as the product gets better as the model gets better but

this is early days I think still everyone using co-work today is an early adopter everyone even using AI today is an early adopter there are so many people in the world and most people have

not tried AI in a meaningful sense um so there's just like there's a lot more room to improve this.

Yeah, we're hosting an event here in San Francisco on June 18th and a lot of the marketing material I've turned out with co-work. Now, I go back and forth. I

co-work. Now, I go back and forth. I

don't let it oneshot it, so I'm looking at the copy, but I do things like uh you know, upload um you know, our our you know, download statistics to sort of show the growth of the podcast. And um I

give it the names of the speakers and it like is amazing at saying building a perspectus. Here's what the event's

perspectus. Here's what the event's going to be. Who's here's who's going to be in the audience? Here's who's

speaking. Here's why you should be there. Here's how to get in touch.

there. Here's how to get in touch.

Insane. It's so good.

What What was your uh what was your feeling like the first time that you used it and the first time that you saw like the agents use your tools?

Well, I I mean obviously I've sort of enabled everything. Um, so and I think

enabled everything. Um, so and I think this is kind of an experience that many people have had where you there's a browser extension for Claude and you realize that you can only get the benefit of this or you you'll get most

benefit by letting Claude take over your browser and do things for you. Um, and

the experience is kind, it's almost the same as I had with a Whimo where um, those first couple turns I was like white knuckling and like watching and like should I approve reading everything and then you start to trust it a little

bit and you just hit approve approve approve right in the Whimo the same thing. you're like, "Okay, this looks

thing. you're like, "Okay, this looks like it's not going to kill me." And

then 5 minutes later, you're on your phone as the AI does the work. And that

was my experience with code and co-work.

Is that does that sort of track?

I mean, this is like my experience, too.

It's like it's I think it's like any technology. Um, I was watching someone

technology. Um, I was watching someone that's uh it's it's like a friend that's that's been learning to use uh co-work over time and like, you know, she she's not an engineer. And um there's this use case the other day like her there was

like a language input on on the computer where you can kind of choose between languages on a laptop and there was some issue with it and she couldn't figure out how to fix it. And so before what she would have done is go to Google and ask like hey how do I fix this you know

this issue that I'm having with my computer and this time she just like asked co-work and a co-orker was like cool let me take a look can I can I use your computer and she said yes and it took over the computer and it gets this

kind of like orange glow and you get to watch as co-work open settings and it sees what's going on with the language picker and it diagnoses it and it fixes it and um you know you're still in the

driver's seat so you you can see this happening you can monitor it. it's not

happening in the background or anything.

Um, but it's just it's magical. And I

actually did like my instinct was to open Google. And so so it's funny that

open Google. And so so it's funny that like for her, she went to using co-work for this. And this is actually something

for this. And this is actually something I feel all the time. I I think for people that have kind of grown up with these products and they've seen previous versions, they might not be as ambitious as they could, but for people that are new to the products, I often see them

using quad code and co-work for things that I wouldn't have even thought of.

And it's just like amazing. It's It's so creative. Um, and I I learn a lot every

creative. Um, and I I learn a lot every time I see it.

Yep. Now, the biggest drawback right now, I would say, and I've seen you reply to people on X about this, um, is the rate limits. Like, when I see people say, "I've given Cloud Code a shot, but

I'm I'm kind of done with it." It's

typically because they've hit their token allotment and it only works for like an hour for them.

And then they have to wait four to use it again. and they look for

it again. and they look for alternatives. Um,

alternatives. Um, what do you think the rate limits have done to the ability for your product to grow and what is the plan if there is

one to make people be able to use this without those rate limits?

There's uh this is something we're actively working on. The reality is a very small percent of people actually hit their rate limits which is surprising for surpris for pro users

it's a little bit higher for max it's actually quite low and I think the thing that you're saying when people talk about it is um there's a couple things happening one is that we actually reduce

the peak rate limits um and that's now rolled back and we've actually doubled rate limits um so we're giving people more rate limits but there was a brief period where um we reduced them and so

people were running into The second thing that's happening is cloud code is actually quite extensible.

Um, and so people can use plugins, they can use all sorts of integrations. And

some of these use tokens in a pretty inefficient way. And so thing that we've

inefficient way. And so thing that we've been working on is surfacing this to you. So users can decide uh do you want

you. So users can decide uh do you want to use this plugin or do you not? So you

can see kind of what percentage of your tokens goes to it. And then I think the third thing is there's a lot of people that have just increasingly become power users. Like first when we released quad

users. Like first when we released quad code, you know, you ran one quad at a time. Nowadays, I'm running, you know,

time. Nowadays, I'm running, you know, like on my computer, I run maybe five at a time. And then every night I run like,

a time. And then every night I run like, you know, not every night, but most nights I run like hundreds of quads at a time all in parallel. Yeah. Hundreds,

sometimes thousands. And this is something that I just like wouldn't have imagined a year ago. And obviously this uses a lot of tokens. And there's a lot of people that are figuring out these new workflows that are using a lot more

tokens. And this is sort of like at the

tokens. And this is sort of like at the edge of what you can do with a max plan.

Um, and you know, this is why you can just like pay using API also. So if you just want to have as many tokens as you need, um, you can do this too. And this

is what a lot of enterprises do.

Right now, it wasn't long ago where I'm pretty sure Dario uh, Anthropics uh, CEO was referring to OpenAI and talking about the spending on the buildout and

he and he's talked about this afterwards. He said, "I'm trying to be

afterwards. He said, "I'm trying to be disciplined in the way I spend, which is still spending many billions of dollars on data centers to enable this stuff like you're talking about and others

which we think is open AAI are yoloing, right? But um now OpenAI is is doing

right? But um now OpenAI is is doing this too with codecs and you could call it yoloing, but they have a lot of data center capacity that they've built." U

how do you think about that? because you

know when people do hit these rate limits they may just go over to codeex um it's pretty intense competition so um how do you think about that how does

anthropic think about that internally that you know I mean at least from the outside perception is that um this added discipline on data center buildouts might end up losing users in the most

important product battle that your two companies are engaged in.

Yeah. So you know f first of all our growth has never been faster than it is today. Um so you know for quad code the

today. Um so you know for quad code the growth is accelerating and I think because most people don't actually hit rate limits very often. Um it's uh uh

it's actually not not a huge issue for the people that are we are laser focused on improving the experience and so we doubled the 500 rate limits. Uh we are

announcing today that we're increasing the weekly rate limits and of course we announced the new Colossus capacity which you know we brought online to serve all these new users via Elon Musk.

Via Elon. Yeah. Because this I mean this growth is just no one no one would have predicted this. This was just beyond our

predicted this. This was just beyond our wildest forecasts. Um and so you know I

wildest forecasts. Um and so you know I think for us what matters the most is we we need to serve our users. We want to make sure our users are really happy. Um

and we're doing everything we can to to make that happen. Are you surprised by Codeex? How do you view them as a

Codeex? How do you view them as a competitor?

I think there's always, you know, there's always copycats, there's always competitors. Um, for me, it's uh it's

competitors. Um, for me, it's uh it's flattering and uh I think it just forces everyone to do better. Um, so you know I for me the thing that I care about the most is

just doing the best job that we can to serve our users and we encourage everyone on the team to you know talk to users every day and um you know just make keep making the product a little

bit better every day. So this is what I care about the most.

Okay. Um I want to take a break but we have so much more to cover. I want to talk about how this extends beyond code uh the future of the chatbot and then maybe talk a little bit about we have I mean I could go through our agenda. We

really need two hours. So, um, why don't we take a break and come back and get to as much as we can right after this. And

we're back here on Big Technology Podcast with Boris Churnney, the head of Claude Code at Anthropic. Boris, it's

great having you here. Um, like I said, I'm in your product daily, so it's really fun to speak with you about it.

Um, we talked a little bit about this, but I think one thing we should highlight is that this is really going to extend uh beyond the chatbot. We

talked about booking flights. Uh I

talked about it uh with marketing presentations uh and you know the week that we're talking you have a a new use case out where um claude co-work can be

used for small businesses including uh taking over QuickBooks and doing some bookkeeping. Um where where does this

bookkeeping. Um where where does this go? I mean what do you think the broad

go? I mean what do you think the broad roadmap where does where does the broad road map take you?

We're thinking about a few things for quad code and for co-work. There's a few few big themes. One is improving intelligence and you know I I think almost all of this is just the model as the model improves we can do more and

more ambitious work for coding it used to be writing a line of code at a time now it's building entire features or entire products for co-work used to be you know like you know it's started pretty recently but it was like you know

making a document and now it's things like booking flights combining many tools doing doing your QuickBooks um so this this frontier is improving and and moving just very very quickly We're also

thinking about how to do longer running tasks for cloud code. We recently

shipped this thing called auto mode. And

auto mode is uh essentially a replacement for permission prompts.

Before what we used to do is whenever the model uses a tool, plot would ask you is it okay if I use this tool? And

you know, usually you just say yes and you get kind of tired of saying yes kind of over and over.

Always allow. That's the button to hit.

That's right. That's right. Um, but it's actually very important for security that you're very thoughtful about this.

And the thing that we were realizing is actually instead of being thoughtful about, you know, every prompt because we're showing people so many of these dialogues, they just kind of got fatigued and they would just say yes or,

you know, always. And so auto mode is the answer. And this is a new way of

the answer. And this is a new way of routing these tool calls. And the way that it works is whenever Claude wants to use a tool, it asks another Claude,

is it safe to use this tool. Claude has

some of the context. It doesn't have all the context. And there's also a number

the context. And there's also a number of layers of safety checks. And we spent months iterating on this to make it really safe. There's thousands of

really safe. There's thousands of different benchmarks and evals that we use to make sure that this is safe. Um,

and essentially we found both in the laboratory setting and now we're finding in the wild this is safer than what we had before. So, as a user, it's a really

had before. So, as a user, it's a really nice benefit because you don't have to sit there and say yes over and over. And

actually, the result is better because if there's one unsafe command buried somewhere in this big list of things that quad asks you to do, you might have accidentally said yes. But actually, if

you ask a second quad using auto mode, it's not going to say yes. So, this is kind of one big investment.

Um, maybe the third big one is just running more quads in parallel. One of

the cool things about quad and this is something that we started to see pretty early with quad code users is actually very few people nowadays run one quad code at a time. Most people run many many quad codes, you know, ranging from,

you know, a few to thousands. And with

co-work, we're starting to see the same exact thing. As you get more comfortable

exact thing. As you get more comfortable letting co-work run, you start a task and then you start a second task and you move on and you just do more in parallel. And I think there's just a lot

parallel. And I think there's just a lot of opportunity to make this experience very nice and to make it more obvious for people. How how do you do this? When

for people. How how do you do this? When

do you do it?

Right? And and it probably extends to the way that you use a chatbot, right?

And it's interesting because Anthropics had this um kind of interesting relationship with the chatbot. Started

out as technology first, decided to build the chatbot, ship claude, and then just kind kind of moved more towards enterprise. like you looked at all the

enterprise. like you looked at all the charts and um and Claude was always at the bottom u but now you're seeing Claude's usage rise and I have a thought

and I'd love to check this by you that the future of the chatbot is is not like I'm going to give you a question and you'll give me an answer. It's I will give you a question or you know talk to

you about a problem and you the chatbot will then suggest some sort of action you can take on my behalf. Like right

now I'm talking a lot about a trip to India and what I think I'm going to get back in the future is this thing being like like what you said not having this like secondary step between having to go

there and and book the flights a more proactive chatbot that's going to say okay let me take the let me take care of this for you. Is that the right direction? Like am I thinking about

direction? Like am I thinking about that?

I could see that. I could see that.

Yeah.

Are you working on it?

Asians are the future and you know we're trying all these different experiments.

There's some stuff that we're trying that's like this. Yeah.

Okay. Um but there is a limit here, right, to what this can do. Um a funny way people have talked about the limits of the thousands of clouds that you can

run in parallel is kind of looking at who Anthropic is hiring. Um my favorite job listing on the Anthropic site is that you're hiring Salesforce

administrators. Um, you're also hiring

administrators. Um, you're also hiring consultants to help enterprises uh deploy this technology. Um, and many are viewing that as like a sort of tacit

admission that this stuff can only take you so far. Here's uh Wharton professor Ethan Malik on it. He says, "You will know that the AI labs believe in artificial super intelligence when they

disband their newly formed consulting, sorry, forward deployed engineering groups. As long as people are required

groups. As long as people are required to figure out how AI is useful and do organizational change and systems integrations, jobs seem pretty safe."

What do you think about that?

Yeah. Um, when you look at the kind of engineering that I do, I don't write code. I prompt Claude. And actually

code. I prompt Claude. And actually

nowadays, mostly what I'm doing is I have a Claude that prompts other clauds.

So I don't even talk to Claude. I have a Claude that's talking to my quads. And I

think in engineering, you've seen just this explosion in the amount of leverage that a single person has. It's about how how how big of a business can a person build? How many products can one person

build? How many products can one person support? The leverage that one engineer

support? The leverage that one engineer has now at Anthropic is just insane.

And I think we're starting to see this across other disciplines too. So we're

starting to see this with the marketers that are, you know, using cloud to do things. We're starting to see this also

things. We're starting to see this also for forward deployed engineers that are using quad code to build implementations. We're seeing this for

implementations. We're seeing this for our sales team cuz, you know, actually at anthropic, I think like half the go to market team uses quad code and the other half uses core. You know, I I think everyone's using all these

products. Um, and so the thing that

products. Um, and so the thing that we're seeing is the amount of leverage an individual has goes up and we are still bottlenecked on the number of good people. And so even if the leverage per

people. And so even if the leverage per person goes up, you still just can't hire enough good people because the demand is so insane and there's so much more to build. So that that's still the

bottleneck for us. But I would say like if people would people would argue that if this stuff was so powerful um you could say take a look at the way my

sales organizations operates and then configure Salesforce that way with a prompt. Is this and another people

prompt. Is this and another people another example people give is u I'll believe that anthropic has very powerful AI if they let it let it uh handle the

the IPO paperwork and don't hire an investment bank. Are these unfair tests?

investment bank. Are these unfair tests?

Well, we're starting to see there there's one person on the team that was using Claude to do their taxes.

Um, you know, I I would not necessarily recommend this, but I'll admit I've run my taxes through Claude and compared it against my accountant and it was pretty close.

Yeah, I did the same thing.

Folks, not suggest not saying you should do that, but it is it's an interesting use case.

That's right. But I but I think fundamentally what people are missing in this conversation is in the end it's a person that has to talk to Claude to ask Claude to do this thing. So even if

Salesforce is automatically configured and you know it's not a person pressing all the buttons it's Claude doing it.

Someone has to ask Quad to do that. And

if you have to configure Salesforce in you know a bunch of different ways it could actually be a full-time job to ask Quad to do this. And at some point Claude is going to become really good at asking Claude to do this. And that

person is going to be asking Quad that asked Quad to do this. And this this chain will just keep getting deeper. But

in the end, you still need people that that are piloting this.

But maybe they're just job is just asking one question then in the future.

Yeah. But imagine how much leverage that has asking the right question.

That's true. That's a good point. U so

you know, we talked about Salesforce, so we have to talk about the SAS apocalypse. um you have some interesting

apocalypse. um you have some interesting views on the type of software companies that will be safe as we get more automated programming and those that that might be in trouble and you talked

you've talked previously about the different modes that exist and which modes are more important and which modes are less important. Can you just share that briefly, you know, while we're talking about it? There's this really

good framework called the seven powers for talking about modes in business. You

know, there's so many of these frameworks for this, but this is my favorite. Um I actually studied

favorite. Um I actually studied economics in school. I I didn't study computer science. So this is still kind

computer science. So this is still kind of the way that I think is in terms of these kind of frameworks. And there's a lot of these different modes in business. And some companies have one

business. And some companies have one mode, some have a few modes. You know,

they have like a portfolio of modes. Uh

there's a bunch of these modes. So like

one is uh scale economies. So as you scale up your production, then there's increasing returns to scale. Uh another

one is network effects. So this is like a you know like a messaging app or something like that. The more people that are on it, the more valuable it is for any person. Um another one is uh switching cost. There's another one

switching cost. There's another one that's process power. I I think most of these modes are still going to matter and relatively some are going to increase in importance over the next year and some are going to decrease in

importance. One that I think will

importance. One that I think will increase in importance is something like network effects because it doesn't matter who's writing the code. It

doesn't matter if it's an agent at the core of your product or or something else or if there's intelligence in your product. If there's a network effect in

product. If there's a network effect in your product, that's still going to matter. Some modes get less important.

matter. Some modes get less important.

And this is for example switching costs because if you want to switch from vendor A to vendor B, you can, you know, you can just ask Quad to do that. And

Quad is going to get better and better over time at it. And so I I think as a company, a thing that you should be thinking about is what are your modes?

And I think a lot of the largest companies just have many many modes.

It's not it's not just one thing cuz the way you get to a scale and the way you build a defensible business over time is you accumulate these modes. You need a number of them. Um but yeah, I I would just think what's going to be more

valuable in here and what's less valuable? I think that um when you think

valuable? I think that um when you think about these different software companies though if you're using a cloud code u do the most all modes kind of blend away

because you could potentially be in this like one app that is interfacing with all software which means therefore there's really only one software company.

Yeah. I mean there there's just like a lot of ways that this could play out. I

think something like this is possible, but it it seems a little far-fetched to me cuz if I think about, for example, like let's say I'm using a messaging app, how do I decide which app to use? I

use the app that my friends are on that I can that I can reach. So, it doesn't matter if I can build a really awesome app for myself, which I can do today.

Like, I can build a great messaging app with quad code in like a few hours. It's

still not useful because I can't talk to my friends.

But this is the example. Exactly. You'll

have you can you can um fact check me on this. You're gonna have an agent in your

this. You're gonna have an agent in your messaging apps that's going to let you know when your friends have messaged you. I know you use cloud code on your

you. I know you use cloud code on your iPhone a lot, right? So then you will just see the notification and you'll speak back to people. All your

communication could potentially be centralized in these as long as the companies play ball.

Yeah. I mean, it could be kind of the agent in the end, but how how does the communication actually happen? So like

you know for example if you look at a messaging app like uh you know like signal there's a protocol that it uses to communicate and you know I can build an app it can maybe use that same protocol but I think it actually can't

message other people that are on signal but yeah like I can have an agent that uses my app to do to do that messaging using an existing app that that supports this.

Yeah.

So yeah it's not obvious how it's going to play out. I think today people use a mix of you know apps and and agents. Um

but you know I I do fundamentally think that a lot of these modes are actually still going to increase in value over time.

You can think of another example let's say you know like a TSMC or some kind of like chip manufacturer. If you think about um the amount of work that they put into making a process and in making

a process where the costs go down with scale, this is a fundamental economic force and there's a lot of companies that that that do this kind of thing where you know especially in manufacturing where with scale the cost

goes down. With tech companies this is

goes down. With tech companies this is the case for infrastructure. So if you build a really great infrastructure you can support more users and the marginal cost per user goes down over time. So if

you have this kind of effect, it doesn't matter if you or I can build apps.

That's still a really powerful mode. But

I I do think for sure both things are in play.

Okay, I got three more in 10 minutes.

Let's see if we can get to them all. U

Jack Clark, one of the anthropic founders, recently said I think that he believes there's like a 60% chance that these models will start improving themselves by 2028. It could be off by a

percentage or a year, but ballpark that's accurate. Um, you're in the app

that's accurate. Um, you're in the app where coding happens autonomously.

You're running this app. Um, do you agree with Jack? Seems right.

Yeah. When I look at the way that quad code is uh written, 100% of quad code is written using quad code. This has been the case since I think November of last

year, since Opus 4.5.

It's like a fast takeoff scenario then.

Do you anticipate that? I mean, it's it's it's possible and like this is this is why Anthropic exists. If you ask anyone any any engineer or any researcher why they joined Anthropic, they're going to tell you it's for AI

safety. And it's because for us when we

safety. And it's because for us when we think about the future, you know, years from now, the thing that's the most important and the thing that we want to get right, you know, for our kids is we want to make sure this thing is safe and we want to make sure it goes well

because yeah, like that is one of the possible outcomes. I think that's not

possible outcomes. I think that's not yet what we're seeing. Um, you know, right now Quad Code is writing itself, but it's still a person that's doing the prompting. Quad is starting to generate

prompting. Quad is starting to generate its own ideas for what to build next for quad code, but it's, you know, it's not always good ideas, and I still generate most of the ideas. And, you know, at some point, it's going to change. The

model's going to improve, and it's going to become more of a of a self-reinforcing loop.

Okay. I definitely want to get your thoughts on um the world model uh argument here where people who are pro-orld model says say that um a large

language model has no understanding of the consequences and you need to build a world model into it to have effective agents. Here's something from Yan Lakun.

agents. Here's something from Yan Lakun.

Uh he says you cannot build a reliable agentic system without a world model.

LLMs don't have world models. They can't

predict the consequences of their actions before taking them. According to

Yan, they just act and whatever happens next is someone else's problem. I was

speaking with Greg Brockman from OpenAI recently and he said basically he doesn't accept that argument and he thinks LLMs are the way directly these text models are the way to AGI. Uh which

side are you on? Are you a believer that that world model intelligence needs to be baked in or do you think that LLMs alone are good enough?

I would put out an offer to Yan if he wants to sit down and quad code together for an hour. I'd love to show him. And

then you guys should do that on this show.

Yeah. And then I'm curious to hear what he thinks. Maybe he'll change his mind,

he thinks. Maybe he'll change his mind, maybe he doesn't.

Right. But your perspective though, you know, I'm on I'm pretty firmly on the product side. Um so, you know, I I I don't I don't really have a have a perspective on it.

But okay, let me let me drill down a tiny bit deeper if you don't mind. um

you know, you're you're on the product side, but I've heard multiple people bring out this idea that without a conception of the way the world works, like in a world model, um a a uh LLM

just doesn't have an understanding of the way that the world works and consequences and stuff. You use co-work to book how many flights? Eight flights

in hotels. Like, you must think that it has some understanding of consequences, otherwise you wouldn't have given it your credit card, which I presume you did. So what do you think about that

did. So what do you think about that argument in particular?

I think from what I've read from folks working on on research at Anthropic, it is surprising the degree to which these models are intelligent because like you said at the beginning, the the thing that they fundamentally do is they

predict the next token.

Mhm.

And so you think like this is kind of like a stupid thing like how can this possibly lead to intelligence? But you

know we we've actually published a lot of work about how the models are able to plan, they're able to actually reason.

Um there was all these like very surprising behaviors that you actually wouldn't expect from a model that just predicts the next token. So I don't know I I wouldn't discount it.

I mean I think my favorite is when they write poetry um as they're writing the first line you can see in the model this is anthropic research that they're already thinking about the next line.

That's right.

Which is like how is that even possible?

But that's right. I mean and that's kind of

that's right. I mean and that's kind of you know how I think about it. Like if I if I were poetry that's how I would do it too. And it's it's crazy like you

it too. And it's it's crazy like you teach this thing to predict the next word and somehow if the next word is hard enough, it has to learn to really plan ahead and it has to learn how to do all of this.

Okay, last one for you. Um, sometimes I wonder when I see big tech changes uh underway and in my career covering this stuff, some have worked out and some

haven't. Um, I always have to ask myself

haven't. Um, I always have to ask myself uh how are we sure that this is the future and this is not a fever dream. Um

and I think the data indicates that this is a real thing. But I also wonder you have to sort of you have to question how much you can extrapolate towards the future in terms of how will this

continue to progress. Um the argument that this is a fever dream is that um maybe people just want simple interfaces and they don't mind tapping through

things and you know speaking in a cloud code feels a little bit too techy uh and it just won't appeal to the everyday user as much as it's really taken off

with developers. I mean how would you

with developers. I mean how would you answer that?

We had this uh we had this hackathon for Opus 4.7 recently and one of the winners was a doctor that built an app. There

was a there there was an electrician uh there was a carpenter and a lot of these people didn't have coding experience but they use quad code to build something useful. There's one person that built

useful. There's one person that built and sold a startup as a result of one of these hackathons that we put on. And

undoubtedly when we first built quad code it was for engineers and engineers kind of figured out how to use it. But very quickly people that were not engineers figured

out how to use this to build economically useful things. And actually

if you look at a lot of the usage today it's like it's not engineers and it's just so useful for people that they were going out of their way. They're jumping

through hoops. Even before co-work people were like installing quad code in a terminal. For a lot of people, this

a terminal. For a lot of people, this was their first time using a terminal.

And of course, now, you know, for quad code, we have a desktop app, we have iOS app, we have a Slack app. You know,

there's many ways to interact with it.

But people were jumping through hoops to use it because it was so useful. And so

for me as a product person, this is the ultimate market test of is this thing useful? Is are there a lot of people

useful? Is are there a lot of people that use this every day and that keep using it every day? And yeah, it's a lot of people and it just keeps growing. And

I'm just constantly surprised by the way that people use this.

Yeah, I I will say I've been surprised by the way that I found myself uh using the tools and I don't know, we we'll see what comes next. So excited to keep using it and and thrilled to have a chance to speak with you. I hope we can

do it again.

Yeah, thanks for having me on.

All right, thank you Boris. Great

speaking with you. All right, everybody.

Thank you so much for listening and watching and we'll see you next time on Big Technology Podcast.

Loading...

Loading video analysis...