這次AI工具的升級,連工程師都傻眼‼️ 這已經超出控制範圍,連馬桶都AI化了‼️|Google|OpenAI|Microsoft|Claude|Perplexity|Adobe|Kohler
By 小發學姐
Summary
## Key takeaways - **AI Browsers Replace Search with Tasks**: AI browsers like OpenAI's Atlas and Perplexity's Comet are shifting from keyword searches to task-oriented commands, acting as digital assistants to perform actions and summarize information, fundamentally changing how we interact with the internet. [00:35], [01:32] - **AI Video Enters Narrative Era**: AI video generation is advancing beyond visuals to understand stories, with tools like Google's Veo and OpenAI's Sora capable of generating full clips from storyboards and maintaining character consistency across scenes, revolutionizing content creation. [03:29], [05:02] - **AI Democratizes Coding and App Creation**: New AI tools like Claude Code on the Web and Google's Vibe Coding allow ordinary individuals to create digital tools and apps by simply describing their needs, lowering the barrier to entry and enabling anyone with an idea to build software. [09:23], [11:01] - **AI Agents Automate Workflows**: AI agents, exemplified by OpenAI's AgentKit and ChatGPT Apps, are evolving from chatbots to task executors that can manage repetitive daily processes, connect to external tools, and integrate various services, acting as automated assistants. [12:27], [13:04] - **Smart Toilets Signal AI's Pervasive Integration**: The development of AI-powered smart toilet analyzers by Kohler, capable of monitoring health metrics, demonstrates how AI is becoming deeply integrated into everyday life, collecting personal data in inconspicuous ways. [16:03], [16:26]
Topics Covered
- AI browsers replace search with task-oriented commands.
- AI video enters the narrative era, understanding stories and characters.
- AI programming barriers are falling for everyone, not just engineers.
- AI agents become automated secretaries for task execution.
- AI becomes invisible infrastructure, like electricity or the internet.
Full Transcript
Hello everyone, I'm Xiaofa, a senior student.
The changes in AI this year
have been so rapid it's almost breathtaking.
Browsers can now automatically move
videos generate
images from a single sentence, fix things in seconds,
and I'll even tell you
, toilets are now being equipped with AI!
In today's episode,
I'll summarize
some of the most representative recent AI developments.
But I'm not just reporting the news;
I'll also explain
how these things will affect our lives and work, and
what new opportunities they bring.
First, the AI browser era has officially begun.
AI is no longer just for chatting
; it's starting to help you browse the internet.
In the past, we used Google
by entering keywords
, but now we
directly enter commands/tasks.
OpenAI's ChatGPT Atlas
is a new browser.
When you open it, the homepage isn't Google
, but ChatGPT.
You don't enter search terms,
but tasks,
like "Find the cheapest hotel in Tokyo
near the subway for
three nights for two people.
" Atlas doesn't throw a bunch of links at you
; it directly starts searching for you
. There's also a very interesting Agent mode,
which means AI... This
truly allows you to manually operate web pages.
It can filter
click sort and
copy information
, like having a digital assistant
performing your internet browsing actions.
This is a significant shift
because search behavior
is now being replaced by task-oriented methods.
For the average user,
there will be less and less need to compare prices
and find information manually
. However,
the clarity of your instructions
will determine the quality of the results.
In the future, only
those who ask questions will receive truly useful answers.
Another player is Perplexity,
whose Comet browser has
also officially launched for free.
It doesn't replace ChatGPT
but focuses more on information search.
It automatically reads the content of the web pages you are browsing,
provides summaries and
extended information
, and even automatically finds relevant links
for research. For those writing reports or preparing scripts,
this is a magical tool.
For example,
when you're researching a company's news,
it automatically summarizes the key points
and provides sources
to help you verify the authenticity
of the information, greatly increasing its credibility.
Browsers that help you organize information
can save information workers,
students, and creators a lot of time.
It's not just faster
; it makes it easier to understand the world.
Microsoft's approach
is a bit different.
They added a
feature called Copilot mode
to their Edge browser.
It remembers what websites you've visited,
like asking for help finding the blue jacket you saw last week,
and
it can immediately retrieve it for you.
This feature seems small, right?
But it's actually crucial
because it makes memory a part of the search process . In the
past, AI... It helps you answer questions
and now it's starting to remember who you are.
For creators or online shoppers,
this will extend to personalized content recommendations
in the future.
It knows what you like to watch,
what you care about
, and even what kind of subject you
're planning to film.
Next,
let's talk about AI video.
AI video has now entered the narrative era.
From the beginning of the year to now,
the progress of AI video has been frighteningly fast.
But the recent wave of updates
has several directions that
are worth noting
because it's no longer just about visuals,
but has begun to understand stories.
Google's Veo video generation has been upgraded again.
It can automatically generate the process in between
based on the start and end you set .
In other words,
AI is no longer just randomly moving things around
, but will help you fill in the story.
Another feature
is called image fusion.
You give it characters, scenes, and clothing,
and it will generate which person is wearing which clothes
and where.
This will make brand advertisements, product displays,
and even movie clips
more controllable.
Simply put,
Veo is now
an AI that understands storyboarding.
This step is really crucial
because it makes AI... The film
is closer to the logic of realistic shooting.
For short video creators, this means that
in the future,
they can automatically generate complete clips
simply by conceiving the scene . OpenAI's Sora
continues to evolve.
The new version can generate multiple scenes at once
, like storyboards,
and the video length has also increased.
More notably,
it has added character cameos,
allowing the same character
to appear in different videos.
Whether it's your cat, a doll,
or even a virtual IP, they
can maintain a consistent appearance.
Sora is actually
paving a new path. In
the future, short film production
won't necessarily rely on a camera.
As long as you have a story
, characters, and a voice,
AI can help you shoot videos that look real
. I succeeded! Listen carefully:
you can use Sora to generate brand new characters,
new monsters, heroes, or even more
. Or you can upload videos from your camera,
edit videos of your pets, and then turn them into cameos.
Even if your pet is a giant duck,
okay, let's go again.
We're excited to see how you use characters.
You've got me hooked!
Another noteworthy feature
is the open-source model LTX2,
which can generate 4K quality
50fps videos,
even with synchronized audio and video.
Most amazingly,
it can run on a regular computer.
What does this mean? It means
AI... With the barrier to entry for video production further lowered,
even companies outside of major corporations
can create professional-grade AI visual content.
This represents a true democratization of the content ecosystem
. YouTube recently
launched a facial recognition feature
, meaning that
if someone uses AI to mimic your face
or voice in a video,
you can directly appeal for its removal.
This might sound like a defensive mechanism
but
it also sends a signal:
AI-generated content is becoming increasingly human-like,
forcing platforms to establish rules
to protect creators like us.
In the future,
this will become a new issue for creators,
with licensing and protection
becoming as important as the creativity itself.
Speaking of AI video,
AI drawing, and image editing,
we've entered an era of integration.
Have you noticed
the trend in recent months ?
AI image editing is no longer confined to a single website or model
but has permeated everything.
For example, Microsoft
launched its own image model, MAI Image 1,
which can draw people and animals.
The generated text
simulates realistic lighting and shadow
effects,
and it's quite stable.
Currently, it can be used on the LM Arena platform.
Interestingly,
Microsoft, a major shareholder of OpenAI
, still launched
its own image model, essentially creating competition within a friendly environment.
This means the AI image market will become increasingly diverse,
which is good for us users
because competition
leads to better image quality
and lower costs.
Google's Nano Banana is
now almost integrated into all Google products;
you can use it
in Photos, Lens, and even Notebook LM.
Just type out what you want to change,
and it will immediately fix it for you.
For example, change the background to a sunset or
remove passersby.
These image editing tasks can be completed with a single sentence.
This allows even those without design knowledge
to produce beautiful images.
I think
this means the technical barrier to image editing is disappearing.
In the future,
the only difference might
be aesthetics and creativity.
Adobe also held its annual MAX conference and
officially announced that Photoshop
Premiere, Illustrator, and Lightroom have all adopted AI.
Now, it's not just about using text commands to edit
images and generate visuals;
you can also freely switch between different models,
including Nano Banana, Flux One, and Topaz. Upscaler and similar
features mean that
professional creators
no longer need to wait for others to generate content.
AI is no longer just a source of inspiration
but has become part of the workflow.
Creating videos, designs, and posters
can be more than twice as fast as before
. And the good news is that
ordinary people can now also use AI to write code. In the past, when
people talked about AI programming,
their first reaction was
that it was an engineer's job
and had nothing to do with them.
But this update's
focus is no longer on increasing engineer efficiency
, but on anyone being able to command AI
to help you create digital tools.
Anthropic's new feature, Claude Code on the Web,
sounds very professional
but
its concept is actually very simple:
you can ask AI to help you modify
or create small tools
through chat.
Imagine having a super-powerful AI in your company. The assistant
not only helps you organize data and reply to emails, but
can also write simple programs
, such as creating a simple
function to automatically organize spreadsheets and send email reminders.
You just need to speak to it and
ask it to organize customer data
into a spreadsheet,
and
delete duplicates .
It will then handle it automatically in the background.
Moreover, it operates on a webpage,
so ordinary people can use it
without downloading software
or worrying about programming syntax.
Programming
is no longer just the domain of engineers.
In the future, anyone who can describe what functions they want
can create tools.
For example, I might think
, "Can I use it to organize video comments
, categorize repetitive questions,
list popular keywords,
and automatically compile a report for my reference?"
This is the direction Claude Code aims to achieve.
Google has also launched Vibe Coding in AI Studio,
which is even more impressive.
You can speak
and let AI create apps for you
—literally speaking!
For example, you can say, "I want a page
that allows people to upload photos
and generate comic-style AI versions
." It will automatically design the interface
and buttons for you.
Previously, you'd need engineers and designers to help
you with the model; now, you
can create it yourself as long as you have an idea.
It also has many built-in sample apps
like music players, chat rooms, and notebooks.
You can directly modify the text, colors
, and functions
, much like the concept of PowerPoint templates.
It even allows you to point to the screen and make changes,
such as circling a button and saying, "Change this to blue,"
and AI can immediately fix it for you.
But actually
, Google is realizing something:
in the future, building websites and apps
won't necessarily require programming skills.
You just need to clearly explain your needs,
and AI can
help you piece together the functions.
AI programming
isn't about replacing engineers
, but about enabling people with ideas but no technical skills
to implement them themselves.
This is similar to learning Photoshop and
other video editing software.
Initially, many people might not know how,
but those who learn
will have a significant advantage.
In the future, it
won't be about who can program,
but who dares to ask AI to realize their ideas.
And we must also mention the current state of AI... Agents
can truly help you with your work.
A clear trend in AI updates over the past few months
is that they're no longer just answering questions and generating images,
but starting to do tasks for you.
OpenAI's AgentKit function
is somewhat like a "self-driving workflow."
You write down the repetitive steps you take daily
—for example, someone fills out a form
, then you categorize it
, then you send a reply email, and
finally you update the form.
Previously, you had to use Zapier or other automation tools to
manually execute
this process;
now ChatGPT
does it for you.
AgentKit
allows you to turn ChatGPT into a true assistant.
It knows how to use external tools,
connect to databases,
and even call APIs for you.
In other words,
it doesn't just answer "what to do
," it does it itself. For
example,
suppose after each video shoot, you
first need to organize the names of the footage
, then upload the subtitles,
and finally send notifications to partner vendors.
All three tasks
can be handled by AgentKit;
it remembers the rules
and executes them in order.
AI is starting to truly understand processes,
not just chat .
For many small teams, work-from-home professionals
, and even students,
you finally have your own automated secretary.
AgentKit can help you get things done.
So, what about the ChatGPT apps? It
helps you connect to external plugins
, turning ChatGPT into an AI super platform.
Now, within ChatGPT, you
can directly use Canva, Spotify, Booking.com, and Figma
without opening new web pages.
You can say,
"Design a thumbnail for me,"
and it will open Canva and generate a template for you.
Or, "Check hotels in Osaka
for tomorrow and plan a three-day itinerary,"
and it will
grab data from Booking.com and
create an itinerary for you.
For the average user,
you won't necessarily need to open a bunch of websites anymore.
AI will handle everything for you in one place,
which is great news
for creators
. You can directly organize materials,
write scripts, schedule,
and design covers
within ChatGPT.
This is no longer just a chatbot
, but a work hub.
Anthropic has also launched a feature called Claude Skills,
which can be said to turn AI into your own clone.
It allows you to store your work habits,
reply styles, and
judgment logic
into a skill pack.
For example,
you can teach it how to write the opening of a video,
how to reply to a vendor,
and how to create a title.
After teaching it once, when
you ask Claude to write copy for you,
it will automatically apply your style. It
's like copying your brain into AI.
In the long run,
this will become a new personal asset
because your way of thinking and your
standard operating procedures for handling things
can be packaged up
and even shared or sold in the future. It's
not just convenient,
but symbolizes AI. It's learning to do your job,
but this time it's not taking your job
; it's giving you more time for creative, strategic, and
communication tasks—things more human-like.
The last one, which I find most outrageous
yet most realistic,
is the development of a smart toilet analyzer from
the American bathroom brand Kohler.
It's not a joke;
it can genuinely analyze your health
using sensors to observe your excrement
, assessing hydration, digestive health
, and even detecting bleeding.
Priced at approximately $599, it
's already available for pre-order.
Sounds absurd, right?
But this actually represents
AI permeating our lives in
the most inconspicuous places.
It's not just saving you time
; it's collecting your body data.
In the future, these everyday behaviors
may become part of your health data.
We often say AI is powerful,
but the real change isn't new models
or new technologies
; it's AI becoming part of our lives. It's like a background
browser, understanding what you're looking for,
automatically adding frames to videos,
fixing pictures with a single click
replying
to emails, recording information
, and even monitoring your health.
In the future, AI
won't be called AI anymore. It will become
an infrastructure like
electricity and the internet
. You won't particularly notice it,
but you'll use it every day.
For creators, office workers, and students,
this means the era of tool dividends is beginning
. Those who learn how to collaborate with AI earlier
will enter the next productivity stage sooner.
If you like this video,
remember to like, subscribe, and turn on notifications. Leave a
comment telling me
which new AI feature you most want to try.
I'll also share some practical tools and
test them
with you. See you in the next video!
Bye-bye,
guys! You should know that advertising agencies are finished.
With AI,
you can create studio-quality ads in seconds.
Loading video analysis...