Google AI Studio Tutorial - Beginner to Expert in 10 Prompts

By Santrel Media

Summary

Topics Covered

Live AI Copilot Beats Solo Work
Nano Banana Edits Photoshop-Level
System Instructions Reshape AI Persona
Vibe Code Full Apps Instantly

Full Transcript

Google AI Studio is so much more than just a basic chatbot. For example, you can vibe code an entire app in here. You

can turn text into speech. You can

generate images or videos. You can even do Photoshop level editing of images without ever opening Photoshop. And all

of this is completely free to use. So,

this is going to be a full tutorial on how to use Google AI Studio to make sure that your job doesn't get replaced by AI. And even more importantly, your job

AI. And even more importantly, your job doesn't get replaced by somebody who knows AI better than you. So this

tutorial will show you 10 incredibly powerful ways you can use Google AI Studio to improve your life for work, for travel, for anything that you could imagine. So with that being said, let's

imagine. So with that being said, let's start off by going on my laptop and you can get started for free on Google AI Studio by going to a studio.google.com.

I'll put a link in the description as well and it should be fairly easy to find this. Now, once you go to Google AI

find this. Now, once you go to Google AI Studio, you will have to sign into your Google account. I already signed into

Google account. I already signed into mine, and it looks like this. Now, don't

be alarmed if yours looks slightly different, but recently Google AI Studio did have an update. So, if you used it in the past, a lot of things have moved around. I will show you how to access

around. I will show you how to access what used to be the stream feature and different things like that later on in the video. But, let's start off with the

the video. But, let's start off with the basic layout of the land. On the bottom left, you'll see your Google account. If

you have multiple Google accounts, maybe a work one and a personal one, you can toggle between them on the bottom. Above

that, you have settings. We can dive into settings later in the video, as well as API keys. But the main thing you'll see on the left side is going to be home, chat, build, dashboard, and documentation. So, home, you're not

documentation. So, home, you're not really going to spend a lot of time here, but this is basically like the quarterback of Google AI Studio. It's

showing you some different directions you can go, some quick ways to get into specific models as well as some new features they launched. I like I like going here every now and then just to see like all right, what's the latest

stuff that has been launched? Google VO

3.1 for example to generate videos and different things like that. Now below

that chat is where you're probably going to spend most of your time unless you're vibe coding apps in which case I'll show you the build section in just a minute.

But within chat, you'll see we have this main window here with many different again suggestions. Google does make this

again suggestions. Google does make this a little bit complicated, but I'm going to demystify it for you. This may look a little bit different based on which model you're using. So Google uses

Gemini. That is their AI brain if you

Gemini. That is their AI brain if you want to think of it as that. But within

Gemini, there are many different models.

You've got some that make images, some that make videos. And it's not just one thing. So you have to select which model

thing. So you have to select which model you want to use based on what you're trying to accomplish. And you can do that on the top right. So if I click on that box right there, you'll see if we

go to all, there are a ton of models we can choose from. Each one has a little bit of text below it. So it'll tell you the other name of it. So Nano Banana is what everyone calls it, but it's also

known as Gemini 2.5 flash image. That

doesn't really matter. What really

matters is below that it tells you with a little eye right here. That's going to essentially be what the model is. Now, I

don't like just scrolling through all of them here. You could search if you

them here. You could search if you really know what you want, but typically I'll use these other tabs right here.

So, Gemini, if we just click on Gemini, usually those are going to be more chat focused ones. So, this could be for

focused ones. So, this could be for example pro is going to take more time, but it's going to give you more advanced reasoning. If you're trying to diagnose

reasoning. If you're trying to diagnose what's hurting on your foot, I mean, obviously consult a medical professional, but maybe you need some advanced reasoning to talk about what you were doing, what angle your foot was at, and and why it might be hurting. And

then the Pro model would make more sense there. The Flash model is going to be a

there. The Flash model is going to be a little bit faster. And then Flash Light is going to be even faster yet. And uh

that's going to be more beneficial if you're trying to just ask a lot of questions that are a little bit more basic. For example, if you're asking

basic. For example, if you're asking like, "What do sloths eat?" that doesn't need a huge, you know, pro model to figure that out. You could use something a little bit lighter. Then we have images. I'll come back to live in a

images. I'll come back to live in a minute. That one's really cool. But

minute. That one's really cool. But

images. This is again a little bit complicated the way Google sets this up.

But you have four right here. Imagine

four. That's going to be your main image generation. You've got Ultra, which

generation. You've got Ultra, which takes a little bit more time, but does a much better job rendering text. So if

you have an image with like signs in the background, the text will not look jumbled or should not look jumbled with Ultra. The fast model is a little bit

Ultra. The fast model is a little bit lighter, a little bit less precise, but you know, can crank out images quite quickly. 3.0, that's an older one. And

quickly. 3.0, that's an older one. And

then, of course, we have nano banana.

Now, nano banana, it's not so much for generating images. It's a lot more for

generating images. It's a lot more for editing images. And we can talk more

editing images. And we can talk more about that later in the video. We have

video generation. Here we have audio, which is going to turn text into speech.

And if I go over, that's the last one.

But live is really quite cool. So before

we go into any of the other ones, I think live's one that I want to get into as one of the 10 really cool features I wanted to show you on here. So if I click on this Gemini 2.5 flash native

audio preview, not the best name, like I said, but it gives us three options.

Now, if you saw Google AI Studio in the past and they had a little like um what they call it, live stream or stream something on the left side, that was a way you could share your screen and work

in real time with Gemini. So, this is actually easy to do right here by going to that, as I said, the live one, and then you can click on either talk and you can talk to it in real time and ask

it questions, have a conversation. Uh,

just I'll show you. Hey, Gemini, what uh what's a good recipe to make cookies?

>> And everyone seems to love them. Is that

the type of cookie you're thinking of?

>> Okay, so you can see you're able to just talk with it back and forth without having to keep asking prompts. And that

would be good if you're cooking in the kitchen and your hands are dirty. You

can just talk to it like that. the other

option. So if I go back, if I want to make a new chat, I'll just click on chat again. We can click on click on this

again. We can click on click on this model again. And now we can go to webcam

model again. And now we can go to webcam and you can show it things on im on you know actual images or the one that I think is really interesting is share screen. So if I say share screen, I can

screen. So if I say share screen, I can then select the PowerPoint presentation I'm working on. Now it's working. Hey

Gemini, I have a question. How do I change the background of this slide on PowerPoint?

>> To change the background, first go to the design tab. Then you should see an option called format background on the far right. Click on that and a panel

far right. Click on that and a panel will appear on the right side of your screen where you can choose various fill options. Does that make sense?

options. Does that make sense?

>> Yeah. What do you think of this one?

Does that really fit the vibe?

>> Well, the texture is interesting, but it might not be the best fit for a presentation about sloths. It might be a little too busy. Maybe try a solid color

or a more subtle pattern that doesn't distract from the content.

>> There you go. So, that is a really cool feature when you're trying to work with something. You have a little co-pilot.

something. You have a little co-pilot.

Essentially, if you are doing any kind of coding, if you're doing any design, I don't know, anything that you have questions about and you need some, you know, kind of more advanced help. This

is a great way to manage that by live streaming with Gemini and showing it, you know, whatever. Obviously, be

careful if you're working and showing like sensitive data. You don't

necessarily want to do that, but still a really cool feature. I wanted to show.

Next up, if I click on chat again, I can go to a different model. So this time, let's talk about generating images. So

if I go to images and let's say we want to go to Imagine 4 Ultra. And this, like I said, is going to be significantly better at showing text. So here I could say generate, show me a sloth in the DMV

working very slowly. And with Google AI Studio, you can't just hit enter.

That'll give you a new line. You have to hit control enter on Windows or command enter on Mac. And again, this is going to take a long time. Actually, that

didn't take that long at all. But on the right side, we have some settings. And

this is true with all of the models we're working with. This one is going to be how many results is it making. You

can have up to four results. You can

choose the aspect ratio, which is really beneficial if you're using chat GPT or just, you know, Gemini on its own website. You're not going to really be

website. You're not going to really be able to do that. You can also choose the resolution. So, if I want a 2K

resolution. So, if I want a 2K resolution in 16x9, let's try this.

Let's try this again. Going to generate that again. And it's going to give us

that again. And it's going to give us higher resolution, wider aspect ratio, which maybe is what I want for the PowerPoint I'm generating. It also shows you how long it takes to make this. You

can see right there. There we go. So, I

can click on this. And from here, you can copy it if you wanted to. You can

add it to Google Drive just by exporting like that or you could download it. Now,

number three, while we're talking about images, let's talk about Nano Banana. I

actually made a full video just using Nano Banana, but on the right side, you can see we're able to search for the model. Use Nano Banana. And Nano Banana,

model. Use Nano Banana. And Nano Banana, like I said, is really good for editing images. So, if I click on the plus

images. So, if I click on the plus button on the bottom, so this is normally where we type in our text. I

can click on plus. I could select something from Google Drive. I could

take a photo. There's sample media. I'm

going to upload a file. So, I'm just going to upload a profile photo I used on one of my other channels, my tech review channel. And now I can ask it,

review channel. And now I can ask it, please give me Ray-B band style glasses.

That would look good. And before I hit enter, I want to show you some settings on the right side. Everything's going to be a little bit different, but on this one, we've got temperature, which is going to be how creative it is. You

could have something more straightforward, which is exactly what you ask for, or higher creativity, which is you wanted to kind of just play around and try something a little different. Aspect radio ratio, I would

different. Aspect radio ratio, I would leave it as auto. It's going to do whatever based on the image you give it.

And you can have some other settings down here that are changed a little bit more advanced. I wouldn't worry about

more advanced. I wouldn't worry about those nearly as much. So here you can see it did a really good job of maintaining the image which looks essentially identical, but it added the glasses on there. And if I click on

that, you can see I it gave me the Rayban logo on the glasses. I don't

really want that on the bottom right. So

we are actually able to edit this image even further. So from here, so when you

even further. So from here, so when you have an output like this, you could either click on this, which reruns it.

You can click on this which you can delete it or you can branch off from here and have two different lines of conversation based on this right here.

So I'm going to try rerunning it first.

Actually no, we don't need to rerun it.

Let's go and ask it the next question.

So let's say please please remove the Ray-B band logo from the bottom of the glasses. And again, it should maintain

glasses. And again, it should maintain the glasses. It shouldn't maintain the

the glasses. It shouldn't maintain the image of me without changing that, but it should hopefully remove the logo in the bottom of the glasses. And there we go. It looks like it still has the

go. It looks like it still has the glasses. It removed the logo from the

glasses. It removed the logo from the bottom right. We still have the logo on

bottom right. We still have the logo on the other side up top. And I think that looks pretty good. That actually is really consistent. And that's what Nano

really consistent. And that's what Nano Banana is really quite good at. I think

we're on to number four right now. And

this is what Google used to call gems, but here they call them system instructions. So if I just go to, let's

instructions. So if I just go to, let's say we're going to go to all or just Gemini. Let's go with Gemini Flash. Just

Gemini. Let's go with Gemini Flash. Just

a fast model. Maybe you're asking it more textbased questions. Now I'm going to say how how do I make how do I bake cookies? But before I hit enter, I want

cookies? But before I hit enter, I want to go over to system instructions.

Within system instructions, you can create a new kind of instructions. So,

I'm going to say this one is angry football. Angry football coach. I'm just

football. Angry football coach. I'm just

going to say uh you're an angry football coach. Make everything I talk to about

coach. Make everything I talk to about about So, I'm going to say you're an angry football coach. Make everything

about football and turn it all into life lessons and never be impressed by me.

So, we're going to do that. And now, I could say, how do I bake cookies?

Control enter or command enter on Mac as I mentioned and it should think and let's see what it comes up with. Yeah,

there we go. Looks like uh it did exactly what I wanted it to do. Now,

this is kind of like a silly example here of it acting like a football coach.

And you can obviously continue this conversation and it'll keep doing that.

But you could very realistically do this for anything else. You could say, "You are my Spanish teacher and you're helping me learn a language." You could tell it, you are my supervisor and you're analyzing every, you know, all

the work I do with, you know, a very analytical eye. Or you could say, uh,

analytical eye. Or you could say, uh, you're an audience member. I'm going to practice my comedy on you. Um, let me know what the feedback is. And that way, you don't have to keep asking it every single time and say, what's the feedback? What's the feedback? Instead,

feedback? What's the feedback? Instead,

you talk to it in this certain light.

You give it the perspective that you want Gemini to have, which is, in my opinion, a really cool feature when you're trying to shape how you're using Gemini and Google AI Studio. Now, I

forget what number we're at, but I want to show you two different ways you can generate videos on here. The free way and the fast way and the much more advanced way. So, if we go to home on

advanced way. So, if we go to home on the left side, you can see VO 3.1 pops up right here. By the time you're watching this video, perhaps it's 3.2. I

can click on that. It'll bring us into VO Studio. A different way to generate

VO Studio. A different way to generate videos and work on this, but you will have to get an API key to do that. And

so, you'll have to create a new key.

That's something for another video. But

you can manage all of your API keys. If

I go back to start on the bottom right here. So, get API key, like I said, will

here. So, get API key, like I said, will bring you into this right here. And you

can, you know, set that up and purchase credits as you need to down in usage and billing because something like VO3.1 is going to use a lot of power from the back end. And Google just doesn't give

back end. And Google just doesn't give that away for free right now. But if you wanted a free video, we can go down to chat. We can select the model on the

chat. We can select the model on the right side, go to video, and select V2.

Now, V2 videos are going to look decent enough. You can see some examples right

enough. You can see some examples right here. They're not super advanced. I

here. They're not super advanced. I

wouldn't expect any text or anything like that to look good, but some basic physics, you know, does apply. And so,

let's just try it. Let's say we're going to generate a video of an aloe plant growing in the desert with a time lap time lapse. And we can say it's going to

time lapse. And we can say it's going to be a 8-second video. Sure, it's going to be 16 by9, maybe 9 by 16. Make it

vertical. Uh 24 frames per second is the aspect ratio. Um, and that's pretty much

aspect ratio. Um, and that's pretty much all we could do. You can add a negative prompt there, something you don't want it to be. But I'm going to run this. Or

you can even add an image, by the way, as a kind of a addition to this prompt.

Uh maybe, you know, the specific landscape you want it to be in. But I'm

going to run this. So it looks like it generated it. I can click on play and

generated it. I can click on play and see what it does. All right. So it looks like not quite what I wanted. Um Oh,

okay. Now it's night time. All right. So

that's not that's not what I was looking for necessarily. V2 is the older model

for necessarily. V2 is the older model and definitely not quite as good as V3.

V3 even has sound as well. Um, so that's definitely a lot more advanced, but I think that's kind of all I wanted to show you in the chat section, but that's nowhere near the extent of what Google AI Studio can do. The next tab, build,

is incredibly advanced. So, if I click on build, we can vibe code all types of things using Gemini. So, they've got a lot of kind of prompts down here. So,

you can analyze images, uh, do all this different stuff, but I'm just going to start off by describing my idea. And I'm

going to say create a create a snake game. I don't know. Let's start off with

game. I don't know. Let's start off with something really basic. just create

snake game and see what it's able to do.

And on the left side, you can see it did that. Uh, it named it all here. And from

that. Uh, it named it all here. And from

the right side, you can we're able to actually test it out. So, let's say start game. And I can use my keyboard

start game. And I can use my keyboard and let's play this. Oh, okay. Game

over. Now, we can also go more advanced and say let's let's make this you can rename it to worm game and make it full screen. Things like that. I don't know.

screen. Things like that. I don't know.

Just some kind of edits you want to make. Let's try it and see what it's

make. Let's try it and see what it's able to do. So, this is the kind of stuff that I've done a little bit in the past with like lovable or hosting or horizons or um there's a lot of them that are actually using this kind of setup where you have the chatbot on the

left side and the output on the right side and it's able to vibe code some pretty advanced things. So, this again just one of the many things you can do using Gemini but now baked in very

natively on Google AI Studio. So, that's

how you can use some of the basic features in the build section here. But

if we go to build, so I just I went back right there. We can also go to gallery

right there. We can also go to gallery and see some of the apps that have already been built on here. And we can kind of work off of those. So chat with maps live, that's a pretty cool feature

that is actually able to be used on I think what is that? The Galaxy um XR I think is what they call it. So that's

like the Apple Vision Pro essentially made by Samsung. You're able to interact with Apple Maps or Google Maps rather.

Um and that's a cool feature that uh I guess is is obviously using Gemini in the back end. So otherwise you can scroll through these, see a gallery of a ton of other ones. You've got your apps down here, the ones that you have made

or kind of experimented with. So um you can click on any one of those and experiment more with those. Now I think there were two more things I wanted to show you. One of them, as you can see

show you. One of them, as you can see right here, is the URL context tool. So

when you're actually asking questions on Gemini, so say I'm using the flash model as you can see in the top right, you can include links and ask it for certain things uh within a page, you can say use

this page for research, whether that is a scholarly article. Um you can also obviously upload a lot of files and images. But let's talk about this right

images. But let's talk about this right here. You can upload this, you can copy,

here. You can upload this, you can copy, you can copy a URL, have it, you know, use that for research and it's able to answer a question based on information only on that URL. So that's going to be

very beneficial. The next thing I want

very beneficial. The next thing I want to show you is all the different ways you can interact with Gemini. So

obviously I showed you the real-time you know screen sharing and stuff like that.

But if you are uploading you know for example multiple PDFs for research, maybe it's multiple contracts you wanted to read, maybe multiple images, you can do that down here by clicking on the

plus and adding all of those in. And

then the very last thing I wanted to show you in this video was the last model. If we go over to audio, we are

model. If we go over to audio, we are able to go from text to speech. So here

you're able to do either single or multis speaker audio and you can tell it what it's supposed to read just by adding in dialogue right here. So you

can go and generate that somewhere else and then paste it in here and it can read things. You can also choose what

read things. You can also choose what the tone is going to be. Again written

in a very natural language. So you can say read aloud in a warm welcoming tone.

You could say an angry tone. You can

choose what the speaker's voice is. You

can also name the speakers over here. So

speaker's voice, we can go with maybe uh I don't know this one right here. And

like I said, you can name them on the right side. So that's a great way you

right side. So that's a great way you can generate like an AI podcast or you can have another option would be I don't know, maybe if you're making a video and you want some voice, some voice over over like a screen recording of a

slideshow or something. That's something

you could do um you know just by putting this in here. Some people don't want to record their own voice, and this is a great way to use other voices. Or maybe

these voices are just a more natural and easy to understand and interpret voice than your own voice. So, those are the fundamentals of how to use Google AI Studio. I hope you found this video

Studio. I hope you found this video helpful, but that is not the limit of what Google AI is able to do. The next

video I highly recommend you watch is Google's Notebook LM. So, I'll have that video linked right here. And in Notebook LM, you can actually have it automatically generate an entire podcast without having to paste things in like

this. It's an incredible feature. Go on

this. It's an incredible feature. Go on

over there to learn how to do that.

Thanks for watching, guys. And I'll see you over there.

Loading...

Loading video analysis...