LongCut logo

Cursor 2.0 is INSANE, but is it better than Claude Code?

By Alex Finn

Summary

## Key takeaways - **Cursor 2.0 shifts focus from code to agents**: Cursor 2.0 is now fully focused on agents, moving away from a traditional development environment. Users will primarily interact with agents they are managing rather than seeing files and code by default. [00:46], [03:25] - **New in-house AI model: Composer**: Cursor 2.0 introduces its own in-house coding AI model named Composer. This move aims to reduce dependency on external models like OpenAI and Anthropic, which the speaker views as crucial for Cursor's survival. [01:26], [04:31] - **Multi-agent testing for stock app**: The video demonstrates building a stock portfolio app, testing Cursor 2.0's ability to manage multiple agents simultaneously. Four agents were spun up to add features like a watchlist chart, share ownership tracking, portfolio value display, and market overview. [03:48], [12:21] - **Initial UI issues resolved by agent**: The initial build of the stock app had significant styling issues, making it look 'terrible.' However, the agent was able to identify and fix the problem by reviewing the CSS setup, ultimately resulting in a 'beautiful' and well-functioning application. [08:26], [10:35] - **Cursor 2.0's speed vs. code quality debate**: While Cursor 2.0's Composer model is exceptionally fast, the speaker emphasizes that speed is secondary to code quality. The initial styling issue highlights that fast, low-quality code is less valuable than slower, correct code. [07:09], [10:10] - **Speaker still prefers Claude Code for quality**: Despite being impressed with Cursor 2.0's speed and agent management, the speaker currently prefers Claude Code due to its higher quality code output and clearer explanations, though the gap has significantly narrowed. [17:37], [17:52]

Topics Covered

  • Cursor 2.0 shifts focus from code to agents.
  • Cursor 2.0 introduces its own AI model, Composer.
  • Benchmarks for Composer are impressive but taken with a grain of salt.
  • Cursor 2.0's agent manager UI simplifies development.
  • Managing multiple agents simultaneously is Cursor 2.0's core advantage.

Full Transcript

Cursor 2.0 just dropped and it is

radically different than the cursor

you're used to. The question is, should

you switch from clawed code and codeex

to this new cursor? I honestly think the

answer is going to surprise you. In this

video, I'm going to cover all the new

features and functionality of Cursor 2.0

so you can know what to expect, as well

as tell you if it's time to switch from

those other AI coding tools. Is Cursor

2.0 the new king in town? Let's get into

it. So, Cursor 2.0 0 just dropped like

an hour ago. The question is what's new

with it? I'm going to go over what's

new. Then I'm going to pop open Cursor

2.0 and show you how you can get the

most out of it. How you can start

building apps immediately, even if

you've never written a line of code in

your entire life. But let's start off

with what's new with Cursor 2.0 before

we jump into the platform. First of all,

it is fully focused on agents, not code.

They don't want to look like a

development environment anymore. So when

you go in, by default, you're not going

to see files. You're not going to see

code. You're just going to see agents

that you're managing. And this is pretty

big. I'm excited for you to check that

out. I actually really like this.

Second, you can manage multiple agents

at once. So, it's all about being like

the employer, managing a whole bunch of

agents that are going out and doing work

for you. That's really cool. Here's a

big one, and I actually think this is

sneaky. The biggest feature of them all.

I think this is the only way cursor is

going to survive against the clawed

codes, the codeexes of the world is they

actually developed and released their

own in-house coding AI model named

composer which I see they went with the

open AI route from naming conventions by

reusing the same name for multiple

things at the same time just to confuse

the consumer as much as humanly possible

but hey I respect them for it. So they

have a new AI model called composer even

though they have features called

composer. whatever. We'll go over that

new model as well, but it makes it so

they're less dependent on OpenAI and

Anthropic for their models, which is

really, really important if they want to

survive. We'll go over just how good

that new model is. And the last big

feature is here, you can have multiple

models working on the same problem. So,

you can spin up a task and have Claude,

Codeex, as well as Composer work on that

problem, and then you can see what the

results are and choose which result you

like the best. a little bit on

benchmarks before we go into the

platform in just a second. These are the

benchmarks Cursor put out when it comes

to Composer. Uh I don't know why they

didn't name the competing models they're

comparing it to, whatever. But they're

saying from an intelligence perspective,

it is right behind the best Frontier,

which I'm going to guess is Sonnet. And

then from a speed perspective, it is by

far the fastest. I think the fastest

Frontier is probably the Gro one. I'm

gonna guess I again I don't know why

they couldn't just name the models. Uh

here's what I'll say. I'd say take every

benchmark any company's ever given you

with a grain of salt. I think every AI

benchmark on the planet is gamed. I

think it's a totally fake stat that

every AI company tries to put out to

make their models look good. So listen,

I'm taking this benchmark with a grain

of salt because at this point benchmarks

mean pretty much nothing. But let's get

into Cursor 2.0. Let's get into the

actual platform itself and see how it

holds up and build an entire app with it

and show you how to use all the new

features. So, here is Cursor 2.0. Voila.

This is it. One thing you'll notice is

you don't see any code. You don't see

any files. You don't see anything

technical related. All you see is this

bar that'll list all your agents doing

work in a chat interface. Why are they

doing this? Because people over the last

several months have been switching to uh

Claude CLI. They've been switching to

codeci. people, they have learned that

people no longer want to look at code,

write code, manage code. They just want

to put their agents out there and get to

work. So, they are focusing a lot more

on this agent interface. So, let's start

building an app. What we're going to

build today is a stock market app. I

really like building stock market apps

as tests for these agents and for these

models because it tests design. Will the

chart look nice? It tests technical

capabilities. Can it research APIs and

pull in the right data and display the

data correctly? It tests taste. Will it

make it nice and tasteful for the users

to use the app, the user experience? So,

I like using this as a test. So, let's

do this. Let's use all the new features.

I'll go through every single new

feature. So, let's do this. Let's start

building it out. I am going to go over

here into the main chat window. We have

all our different agents here. I'm going

to use composer one. Composer one is the

new cursor model, right? So, this is the

one they developed in house, which which

I think is a really smart idea. I think

that depending on anthropic and open AI

would have been the downfall of cursor,

but I'll explain all that later. Let's

get into it, though. So, here's the

prompt I'm going to do. I want to build

a stock portfolio app. It should allow

me to type in a ticker for a stock. And

by the way, the prompts down below if

you want to follow along with me. Highly

recommend it. Then it saves the stock to

a watch list, pulls the data about the

stock from the Alpha Vantage API, really

nice free stock API, displays the chart,

and displays other helpful information.

It is critical that this is a beautiful

app that doesn't look like it's built by

AI and has nice animations and

transitions so it feels really nice to

use. Any other features you think are

interesting, please add them. I like to

add that so that the I can test the

model to see what new ideas it can come

up with and how creative it can get.

Let's do this. Let's just hit send.

Let's get it popping and let's get the

model working. It is going to be

interesting to see how this cursor

inhouse built AI model will perform. I'm

going to hit run. I'm going to allow it

to create all the folders. This is a

fresh new project we're building here.

No prior code or anything inside it.

We're building it from scratch so we can

see how this does. We're going to test

this out in the inapp browser that

cursor just released as well. I like the

sound of this feature. We'll see how

that goes. We'll see if we can actually

test the browser inside the app. I'm

going to run this and I'm going to keep

it running. Uh I do like the user

interface. Composer and cursor have

always had uh some of the best user

interfaces. Very clean and easy to read.

They also up to this point had one of

the strongest uh plan modes as well.

We're not using the plan mode for this.

We're just letting it run to see how

well it does. And once this gets built

out, what I'm going to test next is how

well can this go with multiple agents.

I'm going to spin up a bunch of agents

right after it builds out this V1. I'm

going to see are they going to conflict

with each other? Are they going to

clash? Is it going to crash? We're going

to see how managing multiple agents at

the same time in cursor is going to go

as well because that's kind of the whole

point of this cursor 2.0 is to make it

so instead of writing code, managing

code, looking at code, you're just a

manager of agents. You're just like the

CEO of this agent company, the agents

are going and working for you. So, we're

going to see how that experience is when

we run like six or seven different

agents at the same time. So, stick

around when we see that. But, I'm going

to let this run. And I'm going to let

this build out here and we're going to

see what this looks like in just a

second. And wow, this is actually fast.

Uh I have not really edited this at all.

We're probably what, a minute in here,

and it's already made,00 lines of code.

That is pretty good. That is definitely

faster than the other models. I can say

that right off the rip. This is probably

the fastest coding model. Uh it is

coding very fast. That is very

impressive. But I will say this, speed

means absolutely nothing if the code

stinks. So we're going to see if this

works. Wow. Oh, just like that, it's

telling me to npm run build. Okay, so

we're just going to let this rip. This

is This is it. This is working right

away. Let's see how this goes. Wow. It

is coding so fast that I literally uh

cannot even read what it is doing. It is

going so fast. That is extremely

impressive. I assume it's running npm

run build over and over again uh so that

it can test what it's doing. Uh I again

I can't even tell because it's moving so

fast, which I think is a good thing. I

think I like that. All right, build

succeed. I literally it's moving so fast

I can't read anything it's saying. All

right, let's see what we got here. How

to use start your server with npm

rundev. Search for stocks. Click a

result to add your watch list. I think

we probably need to add the Alpha

Vantage API somewhere. Oh, here we go.

It's down. Important. The app uses a

demo API key here. Place it in Alpha

Vantage with your Alpha Vantage API key.

Okay, so I'm going to grab that. I'll

have a link to the Alpha Vantage API

down below if you want to do this with

me. Just it's completely free. Just put

in your email, you'll get a key. All

right, so I put that in. Let's test this

out. So I'm going to open up the

terminal. I'm going to do npm rundev.

That looks like it's running. Okay,

let's see if we can do Can we use the

inapp browser here? Let's go browser

tab. We're going to enter, I assume,

local host right here. So, this is just

Google Chrome builtin. Let's see how we

go here. Okay. Uh, this looks terrible.

Let's just see. I'm going to open up in

a regular Chrome tab to see if it looks

any different. And I'm going to pull it

open. Nope. Looks exactly the same. It

looks terrible. That's one of the worst

looking apps I've ever seen in my life.

Uh, I'm going to imagine the styling is

not getting applied because there's no

way it would by default think this is

the way an app should be styled. So,

let's go back to the agent here and

let's say looks like the styling didn't

get applied and I'll send a screenshot.

Whenever you get UI issues or anything

like that, just screenshot what you see,

put it in here. I wonder would we be can

we select element toggle? It'd be cool

if I could like Oh, take Okay, take full

page screenshot. Oh, that's what I was

looking for. That's what I wanted. Okay,

cool. I like that you can just take a

full page screenshot. I like that. Let's

hit enter. That's a good feature. Thank

you, cursor. That is a good feature. But

now, more important than screenshot

features is the model actually doing

well. So, let's see how it goes here.

Let's see if we could fix this UI issue

because that is I don't even want to

test it. It's so ugly looking. It hurts

my eyeballs. Checking why Tailwind isn't

applying. review the current. Okay, I'm

going to have it review the current CSS

setup. We're using Tailwind v4, which

requires different syntax. I've seen

this a lot with all models to be honest

with you. For some reason, they all go

for Tailwind 4 and then have no idea how

to use it. So, they always have to

downgrade to three. For me, Claude Code

stopped doing this like a month ago. It

was doing this all the time and then it

stopped. Codec still does this sometimes

for me where it just it doesn't know how

to use the newest version of Tailwind

and I force it to downgrade. And now

we're doing it with Cursor's new agent.

So, we'll see if it can figure it out

here if we're going to go into a circle

here right off the rip. Here's the

thing. It is fast. It is the fastest one

out of all of them. But if it is running

lowquality code or doesn't understand

what it's doing, the speed doesn't

matter. It just doesn't matter if you do

I'd rather have the right thing done

slow than the wrong thing done very

fast. So, hopefully you can figure this

out. It's I've had to run literally 400

different commands right now to fix

styling being applied. All right, looks

like it is all set. Let's run npm rundev

and see if it was able to fix this. I'm

just going to open us this up in Oh,

there we go. Look at that. That looks

good. It fixed it. Okay, so it felt like

it was taking a little while to do it,

but it actually fixed it. That looks

beautiful. Good work. Let's uh let's

test it out. Let's uh do Apple hit. Oh,

I like that. I like that. I uh type in

the beginning of the company and it

gives me a bunch of suggestions. I've

literally never seen any coding model do

that for like build that out when I

asked them to build this app. Like this

is usually one of my first tests is to

build like a stock market app. I've

never seen them have like this autofill

suggestion before. So that is really

smart and nice. I hit it. Oo, that is

nice. That is nice. That is really well

formatted. I like the UI. It is clean.

Let's uh you know, it is nice that it's

doing it in this little tiny browser,

but I'm going to pull open regular

Chrome just so we can see it in like a

browser people would use. All right,

let's do Tesla. Oh, that that comes in.

I like the way it like slides in all the

stock slide. That's a nice attention to

detail here. The charts looking good. Is

this real time data? I think it is. I

think this is real time data. Gives all

the information on the stock. I like

that. This is nice. This This does look

Oh, I could favorite. They add favoring.

So, they did add some features. Oo, and

then the watch list. The watch list is

nice. Okay, so this is good. And it says

live data. This is good. Does it still

go with the blue and purple color scheme

that every other AI on planet Earth goes

with for some reason? Yes. the first AI

model that makes it so blue and purple

aren't the default color scheme for

everything. I will be I will use that

model forever. Be the first one. That's

all I ask. All right, so let's move on

to other features. We tested the model a

little bit there. It looked good.

Listen, it messed up at first, but it

recovered and it recovered very nicely.

Let's do this. Let's use what this was

built for, which was spinning up and

managing tons of agents at once. Let's

go into it. So, we're going to do a new

agent. We're going to keep using the

same composer one model we've been

using. We're going to have it add a new

feature. So, we're going to say have a

chart open on the main screen that shows

all the stocks on my watch list on the

chart at the same time. I'm going to hit

send on that. I'm going to spin up a new

agent. We're going to say, give me the

ability to add how many shares I own for

each stock and show me portfolio value.

I'm going to hit send on that. I'm going

to do a new one. put a section at the

top that shows my entire portfolio value

and other stats about my portfolio. I'm

going to hit send on that and I'm going

to do another agent at the same time

that does have a section at the top that

shows all the data about the stock

market today. And then we're going to

hit send on that. So, let's see how it

can do this. We're spinning up four

agents at once, working on four

different features. Let's see if it can

if it's going to clash with each other,

if they're going to make errors, if

they're going to break. Let's see.

Because if it can do this, this is

really nice, right? This is the

advantage to having a UI like Cursor 2.0

over say like a CLI is that with a CLI,

it's very hard to spin up multiple

agents like this and manage them all.

You'll have to open up multiple terminal

windows at once to do that. With here,

we have this like really nice cursor. UI

where I can see all my agents, see their

progress, click on them, and see how

it's going. Okay, looks like all of the

agents finished their work. That was

fast. That was like two minutes tops.

No, like one minute. It was like 1

minute uh to do all those at once, which

is actually spectacular. Let's go in

real quick. Put a section at the top

that shows my entire portfolio value. So

went in, did all that. All right, it did

a hundreds of lines of code changes for

each one, which is amazing. So let's go

in. Let's see what it looks like. All

right, let's see what we got here. I

refresh it. Portfolio overview, total

portfolio value, total change today, one

position. That looks nice. Market

overview shows me S&P, Dow Jones,

NASDAQ. That looks nice. Let's add other

stocks to our watch list so we can test

this out. Apple. Cool. That looks great.

Let's see if it adds it to our chart.

Boom. Both are on there. And let's see

here. Okay. Let's see if I can add how

many shares I own. Oh, add shares

buttons right there. Let's do 100. Okay,

there's the Apple one. Let's say we have

20 shares of that. All right. Nice,

nice, nice. Now, let's see if it updates

our total portfolio value. It does.

That's amazing. That works pretty

incredibly. That's pretty flawless.

That's amazing. That's really cool. I am

actually blown away by how well that

worked. Uh, and that I was able to do

that basically in one shot. We had a

little bit of the styling hiccups, but

to be quite honest with you, as I said

earlier, every AI model I've ever worked

with when it comes to coding has for

some reason attempted to install

Tailwind V4 doesn't know how to use it

and has to downgrade. So, I'm not going

to knock off points for that. I will

give it a ton of points though for

building this really nice to use stock

tracking app that works pretty well,

didn't have any issues at any point, and

is very pleasant to use. I mean, this is

very nice to look at and use. I think it

did a really, really good job here. So,

let's do this final review. Should you

be switching from clawed code and codeex

to cursor 2.0? I do like the UI. I do

like that you can spin up multiple

agents and manage them at once. I do

really like the new model. I think the

new model is really good. I mean, it's

unbelievably fast. It's the fastest

coding model I've ever used in my entire

life. And it seems to write really

really good code at that speed which is

very impressive. Cursor they had to do

this. They had to release their own

model because cursor being dependent on

open AI and anthropics models was its

death sentence. Right? Cursor and

anthropic they can manage their margins

right they have full control over the

model. Cursor they have to do whatever

margins anthropic and open AI give them.

So if they started to take over cursor,

Anthropic and OpenAI can push them out

by making it way too expensive for them

to run. So from a financial perspective,

the fact that they were able to make

their own model, very, very impressive

and it works really well. UI

perspective, I like it. I like that I

don't have to look at code. I like that

I don't have to manage code, write code.

I'm just managing agents. This is

clearly the future. This is clearly

where it's going. They're going in the

right direction by having this agent

manager. That is really nice. They need

to come out with a mobile app. There's

no cursor mobile app. The fact that I

can go on Claude's app or Codeex app and

spin up agents and spin up tasks in the

cloud whenever I want on mobile wherever

I am is a huge advantage and that's

going to be an advantage over cursor 2.0

today. Am I personally switching from

Claude code to cursor 2.0? For me,

claude code is king right now. The

answer to that is no. But the gap

between the two is a lot smaller. Before

it there was no question I'm not using

cursor 1.0 when it comes to claude code.

Claude code was so much better. This is

actually up there. This is much closer.

But at the end of the day, I still think

that the Claude Sonnet 45 model is

better for me. Speed is not quite as

important. I will take a little bit of

slowness to get better, higher quality

code. I also like the way clawed code

and sonnet 45 talk to me more in the

window. It feels it explains a lot

clearer. It explains a lot better. I

know exactly what my next steps are. I

know exactly how to test. Cursor's model

is not quite there yet when it comes to

the explanation, the feel, the taste,

but it is really impressive for what it

did. So, I am probably sticking to

clawed code for now, but cursor is a lot

closer. And I am very, very, very

impressed with what they did here. if

they add a mobile app that has an

excellent agent manager experience in it

and if they keep tinkering with this

model, which I'm sure they are. This is

day one of them releasing this model.

So, I'm sure they will work a little bit

on, you know, the communication, how it

talks to you, the feel, the vibe of the

model, I'm sure that will improve. But

if they can close the gap on those two

things, I think there's a pretty good

chance I might switch and use it. But

either way, there's a trial for Cursor

you need to test out. Test Cursor 2.0

out now. See how it feels for you. see

if you like it better in your workflow.

There's a lot of people that are cursor

heads. They're dieh hard for it. You're

going to love love love this update.

This is a really strong update. I love

the feel of it. Give it a test. Let me

know down below. Are you switching? Is

this for you? Have you tested it out

yet? Is Cursor 2.0 your new main driver?

Let me know if you learned anything at

all. Hit subscribe. Turn on

notifications. I live stream a few times

a week as well. Leave a like if you

learned anything. and I'll see you in

the next

Loading...

Loading video analysis...