Should You Use OpenAI's New Browser?
By The AI Advantage
Summary
## Key takeaways - **ChatGPT Atlas Browser Available for Free**: OpenAI has released ChatGPT Atlas, a new browser available on macOS for all plans, including the free tier, with no geographic restrictions. [00:02], [00:32] - **Browser Memories Feature Introduced**: ChatGPT Atlas introduces 'browser memories,' a unique feature that learns from your internet usage to automatically gather and save browsing context to your account. [01:25], [01:48] - **Privacy Concerns with Data Import**: When installing ChatGPT Atlas, it prompts to import bookmarks and history from Chrome, raising privacy concerns about sharing extensive browsing data with OpenAI. [02:01], [02:25] - **Agent Mode Performance vs. Claude**: While ChatGPT Atlas offers agent mode, the Claude browser extension with its Sonnet 4.5 model is found to be faster and more reliable for complex computer use tasks. [06:17], [07:29] - **E-learning Certifications at Risk**: Tools like ChatGPT Atlas and Claude's extension can automate the process of completing online courses and quizzes, potentially rendering e-learning certifications less valuable. [11:01], [12:40] - **Atlas: Nice-to-Have Features, Not Game-Changer**: Features like summarizing websites with one click are considered niche and 'nice to have,' not significant enough to alter core browsing behavior, and the browser's primary concern is privacy. [13:09], [13:26]
Topics Covered
- Is OpenAI's new browser a privacy trap for your data?
- Why unreliable AI agents create more work than they save.
- The AI model, not the interface, dictates agent performance.
- AI agents can now "farm" online certifications effortlessly, creating a mess.
- OpenAI's browser: A new interface, not a novel AI model.
Full Transcript
OpenAI just released another big thing.
They have their own browser now and it's
making real waves all across the
internet. And I really wanted to make
this video to talk about if this is any
good and worth your attention. How this
compares to some of the competition out
there cuz this product doesn't exist in
a vacuum because consumers have many
choices these days. And three, how
capable is it actually? So let's start
that discussion by summarizing what this
actually is. It's called Chachib Atlas,
a brand new browser that is as of now
available on Mac OS for all plans. Yes,
that includes the free plan and there's
no geographic restrictions here. So even
people sitting in Europe like me can try
this out right now. If you're on a Mac,
simply download this, install this,
login with your Chat GPT account, and
off you go. This is what it looks like.
It's basically a browser that has Chat
GPT as the default new tab window with a
few things that should be noted. and it
brings some unique features compared to
just using chatb in a tab as everybody
has been so far. First of all, and kind
of obviously if you're on any website,
you can just click ask chat here and you
have it right there and it has the
context of the entire site loaded into
the chat. No need to copy paste. You can
kind of just talk to it. And at this
point, I want to start pointing out how
this compares to some of the competition
cuz this is not a brand new idea. These
Chrome extensions that do this have been
around well since the advent of chat in
2022. It's a useful thing, but honestly,
copying something and moving it to a new
tab isn't that big of a pain point. So,
call this one a nice to have. Secondly,
and more interesting, so they have this
concept of browser memories. So, if I go
here to my settings and I go to
personalization, you will see a new
category here where it says reference
browser memories. If you're familiar
with chatdy memories, this is the same
idea, but from your browsing experience.
So, as it learns about your internet
usage, it adds them here. This is a
thing I don't dare to express an opinion
on yet cuz time will show how this is
actually implemented. And this is a very
unique feature that I believe we haven't
seen from any other product yet, but at
the end of the day, it's just another
way to automatically gather context and
save it to your account. Beyond that, I
should point out when you install this,
it asks you to import all of your
bookmarks and your history from Google
Chrome. And this kind of launches me
into the biggest question and concern
here, which is, okay, so you install
this and then you decide to give all of
your browser history and all of your
bookmarks to OpenAI just by importing
it. Hm. Sure, that can be useful in some
of the things, but just be aware that
that's what you're doing when you're
installing this, importing everything
over. And yes, it will open up some nice
opportunities, but you might not want to
share your entire browser history with
OpenAI. And this is kind of a sneaky way
of them getting it. So, just be aware of
that. What I could do now is something
like I did in a previous chat here and
ask something like find movies and shows
I recently viewed. Scan my recent
browsing and surface movies and shows I
viewed or searched for recently and then
use my browser history as context to
correctly identify that yeah I checked
out trailers for the bear pantheon the
Gothic one remake game trailer. That's a
good one. And then make suggestions for
what I should view next which actually
I've seen half of these. So this is
quite good yet again none of this is
really novel. We've seen this from other
AI browsers and from Google themselves
where you could use your Google Chrome
history as context in the chat. And you
might see the theme here already. It's
sort of pulling these various things
together. And that brings me to the next
big point which is the agent mode in
here. And this is the one where we'll
start getting practical. And I want to
actually start by running a rather
complicated prompt for the agent mode to
execute. It's going to be one that
includes three different applications.
Before I run this, I just want to
quickly intro agent mode. It's basically
their version of what would be called a
computer use agent, an AI that has
access to a computer, to a browser, and
then it does things, clicks buttons,
gets things done. Usually, they take the
same demo of like booking travel, which
is not a real painoint people have. I
don't understand why they always use
that as a demo, but that's what this is.
It's basically agent mode that you
already had in chat just within here.
Now, it has all of the context, the
history, maybe even the passwords that
you have in your browser if you store
them in there, which is just a more
user-friendly way to use this. But
basically, it's the same agent mode.
It's a very interesting app. But here's
a quick word from the sponsor of today's
video. Okay, so one of the things I
noticed with the new video generation
models is that while they're amazing,
most people don't actually want the new
video generation model, they want one
platform that can do it all rather than
jumping back and forth between different
tabs to make their vision come to life.
And that's why I want to show you LTX
Studio, the sponsor of today's video. So
LTX Studio is a AI powered studio built
for filmmakers, advertisers, and
creative teams. It gives you access to
cutting edge video and image generation
models, but also everything else you
need to take a project from concept to
final cut all in one place. You can
start with just a script, a concept, an
image, or even a video. And LTX gives
you professionalgrade tools to shape it
however you want. You can generate
production ready visuals, direct every
shot with camera movement controls and
key frames. It even gives you the
possibility to guide the look with
preset visual styles so your work stays
consistent. There's also a feature
called AI casting, which lets you create
and reuse characters across scenes with
custom wardrobes, so you don't lose
continuity. And when your project is
progressing and the time to polish it
comes around, you've got built-in tools
like dynamic storyboards, a timeline
editor, and even sound design. Again,
all in one place. They also just
launched a voiceover function, which is
powered by Google Gemini 2.5 Pro, and
lets you use 30 professionally designed
voices. Plus, if you're working with a
team, all of this is collaborative. You
can share projects, manage assets, and
give feedback all inside the same
workspace. So, if you want a platform
that brings the power of AI video models
together with the control and
collaboration tools you actually need,
check out LTX Studio. You can start
exploring it today by using the link at
the top of the video's description. A
big thank you to LTX Studio for
sponsoring this video. So, let's see if
this works any better in this desktop
version by trying out this test prompt.
We're in step one, we're going to be
reading in context from this document. I
just copied a guide from our community
over. This will be the context. The
content here doesn't really matter. I
just wanted to read it from that Google
doc. Then I want to write an email with
a summary of what the document includes
that will be using my Gmail account that
I'm logged into here. And then thirdly,
I want to track all the emails that have
been sent in this spreadsheet that is
open over here. It's completely empty.
And at this point, I'll just make sure
agent mode is enabled and that I'm
actually logged in with all the accounts
that I have. Otherwise, these things
wouldn't work. Okay, send this. And now
let me tell you how this actually worked
in the main competitor to this product
which is Claude's Chrome extension. If
we just briefly pull up my other
browser, it's right here installed. And
I actually keep using this thing ever
since its release. And it's so damn
good. It's better than all the
competition. I know the top comment on
the video when I made a video on that
Chrome extension was, "Hey, what about
comment?" Well, comet is good. It's the
same concept as Atlas here, right? But
it just doesn't work as reliably. And
that's a big word in this discussion
here. Reliability. Is this thing
reliable? Because of course the idea of
an agent that just controls your browser
and does the work for you is great. But
if you run a task 10 times and three
times out of that it does something
completely random, you're not going to
end up using it cuz it's going to cause
more work than it actually removes. And
that brings me to the last part of this
video, which is the various examples
that I tested in here and how they
contrast to some of the competing
products as of the features. And I
pointed out to you what it does and what
competing products exist that do this
already. Sure, this ties it all into one
package, which is kind of nice. But then
at the end of the day, if I want to get
something done like the task we just
launched, it's a real question if I
should be using this cloud's browser
extension, which has the same agentic
capabilities, or something like Google's
computer use model that they just
released two weeks ago that has beat all
the benchmarks. And let me just tell you
on these agentic tasks, I find claude to
be straight up better. And I don't think
that's because the extension has some
genius architecture running in the
background or that chatbt atlas is just
not good enough. I think it's because of
the model. Brief reminder that OpenAI
did not ship a new model here. They
shipped a new interface that contains
and uses the model they already had. And
I'm really curious to see how this task
actually performs because in the case of
Claude that uses the Sonnet 4.5 model
where one of the big headline benchmarks
and announcement was that it's so much
better at computer use. It was so smart
at doing this. If you saw that original
video, you might remember that it
actually did this task. It sent the
email and at this step right here where
I'm at right now, by the way, the
computer is doing this, not me, right?
It put in the summary of the email. It
put in all the data correctly, but it
was lacking a header. And then it
actually went back and saw that there's
no header and it deleted it, added the
head header, and then moved everything
down. Here in this case, it seemed to
have got this right on the first try.
So, that is nice. And that's actually
it. It just finished up the task. So,
let's see. It created the draft of the
email. It's right here. And it saved the
data that I asked for in here. And I
remember this prompt actually just asks
for the date and the email and not the
content of the email. So it did what I
told it correctly. But if it does this,
it could easily also add the content
over here that it wrote here. So on this
task, it did equally as well as clot.
Nice. Let me give you a few more
examples that I ran before making this
video on which it wasn't equal. And that
should give you enough of a feeling for
how this works. And that should round
out the video and give you enough
information to make a decision for
yourself if you want to give this a shot
yourself on your own machine. So, first
of all, I just gave it a picture of
Conor McGregor that was randomly on my
desktop. I think for AI news you can
use. I was doing some AI remix of it or
whatever. And I asked it to generate a
100 social media captions using Claude
AI for this image and put them into this
Google sheet. So, what I found is that
it actually did that, but it didn't use
Claude in the process. So, it just
opened up the sheet and it successfully
put them in here. And look at these
captions. They always like repeat four
times, three times, just emojis or the
hashtag varies. And it's good. You know,
they're solid captions. Then I followed
up and said, "Do 100 more by using
Claude AI." And then I provided the URL.
And this would be my tip. If you're
using this, give it the URLs and then it
will reliably use the tools. What it did
then is it opened up Claude, which I was
logged into. It wrote this prompt right
here, which I thought it was funny that
it mentioned avoid mentioning the
fighter's name. That's obviously the
chat GP restrictions at work here. And
then it just wrote 100 captions here.
The formatting is a bit off. Doesn't
really matter. It copied all over that.
made one mistake where it just kind of
copy pasted everything into one field,
but then sort of it corrected itself and
split them apart and there you go,
another 100 captions from Claude. Quick
note that I cannot withhold. I mean,
look at how much more variety you get in
the Claude captions. This is why Claude
is so damn good for writing and why
people really prefer it in many cases.
You just see that right here. But we're
judging chat GPT agent in chat GPT Atlas
right here. It did this super well. So
you can automate the usage of various
AIs and the manipulation of data with
spreadsheets, which I think is one of
the main things you will want to do with
tools like these, these computer use
agents. It's like using AIS, researching
stuff and manipulating spreadsheets, and
that's what it's really good at. This
worked well, but my tip would be be very
specific with the URLs of the tools and
something like the Claude Chrome
extension. It just pulled this up
without the URL here. It needs it. Not a
big deal. Then I did one more and this
is a tricky one. We had our quarterly
town hall in the community yesterday and
at the end of it we just got together
and tried this all together and we had
multiple AI educators and university
professors in the room and they brought
up one topic and that is that these
tools basically render things like
LinkedIn learning certifications useless
because the agent can just go through
it, take all the quizzes, watch all the
videos, pretend like somebody's taking
the course and just farm certifications
for you. So we put this to the test. We
actually made it take one of the quizzes
in our community concretely the final
quiz on the advanced LLM prompting
course and the results were very
interesting because I gave the same
question to both the claude chrome
extension and to chatpt atlas here and I
told it complete this quiz think
thoroughly I really need 100% at all
cost and here's what happened chatbt
took about 30 minutes and it still
wasn't done its method of getting all
the questions was basically scrolling
through all of this multiple times
compiling the screenshots and then very
slowly answering these questions. Claude
did the same, but it must have been like
10 or 15 times faster than Chat GBT here
at doing it. And both of them got 95%.
And at this point, I have to actually
admit my fault. I'm taking full
responsibility for this. There was a
mistake in one of the quizzes and that's
the one question both of them got wrong.
This question that is taught in the
course, both AIs actually answered
correctly, but we in our system had a
wrong answer selected. So, long story
short, basically both of the systems got
a perfect 23 out of 23, but there was a
mistake on our sides, which really
tripped GT5 up. Kept running in loops
after it was done and just couldn't
finish the task because I asked for 100%
and Claude was just like, "Hey, I did it
and this seems right to me." But even
without those loops, Claude was around
10 times faster at actually completing
the quiz the first time. So, yeah, that
should make you think about all types of
certifications that people show off on
their LinkedIn. All you need is a free
chat account in the Atlas to kind of do
that. and it's a real mess for
e-learning. So overall, what's the
verdict here? Well, it's just OpenAI's
version of something that we already
had. If I had to pick one winner, which
one of these computer use agents works
the best? Well, it would still be the
cloth browser extension with Sonnet 4.5.
That model is just the fastest and the
most reliable at all of these tasks that
I have tried so far. GBT5, this would be
a close second though, and maybe this
browser interface is actually preferable
to you. I know a lot of people used
Atlas before and really enjoyed that.
But I personally think that a lot of
these features like, you know,
summarizing a website with like one
click are a nice to have, but they're
rather niche. It's not something you're
going to be using on every site that
you're on. And honestly, even if I had
to copy this over to a new tab and
summarize it here, it's not big enough
of a pain point to change my personal
behavior and approach to browsers. And
above all, there's a real privacy
concern where you're sharing all of your
browsing data with OpenAI if you use
this and even if you import your Chrome
profile. So think about that before you
try this. And lastly, I want to point
out that this is just OpenAI's version
of the open-source Chromium, which
Google Chrome is based on. So it's
nearly identical to Chrome, but it's set
up to be AI first with interesting
features that will evolve over time like
browser memories and also you can do
custom instructions for your agent mode
here. Although I personally cannot edit
these right now. So I think this is the
first version that for the curious is
worth exploring, but I expect this to
get a whole lot better and featurerich
over time. And I also expect them to do
particular models that are really good
at these type of tasks. Interesting
release, not a game changer. All right,
that's pretty much everything I have for
today. My name is Igor and I hope you
have a wonderful
Loading video analysis...