LongCut logo

Lee Sedol vs. AlphaGo: What Really Happened in the Match

By Go Magic

Summary

## Key takeaways - **AlphaGo's early training: Human games and prediction**: AlphaGo was initially trained on a database of human Go games, using neural networks to predict moves and game outcomes, though with only around 60% accuracy. [00:31] - **AlphaGo vs. Fan Hui: A 5-0 shockwave**: In October 2015, AlphaGo defeated professional Go player Fan Hui 5-0, marking the first time a human pro lost to a machine, though initial reactions were muted and questioned Fan Hui's professional status. [01:48] - **Early AlphaGo played human-like, made mistakes**: The AlphaGo that played Fan Hui exhibited human-like play and made visible mistakes, such as failing to see a crucial cutting point or an urgent move, which modern AI can now easily identify. [02:53], [05:09] - **Game 2's 'Shoulder Hit': A creative leap**: AlphaGo's 'shoulder hit' move in Game 2 against Lee Sedol surprised observers, including Lee Sedol himself, as it seemed to creatively tie different parts of the board together rather than relying on a database of moves. [12:59] - **Move 78: Lee Sedol's brilliant tesuji**: In Game 4, Lee Sedol played a spectacular move known as a tesuji, which, despite its brilliance, did not change the AI's winning advantage, as AlphaGo at that moment could not find the optimal response. [17:51] - **AlphaGo Zero: AI trained from scratch**: AlphaGo Zero, developed without human game data, learned from self-play, eventually surpassing all previous versions and demonstrating a potential strength of three stones over the version that played Lee Sedol. [21:34]

Topics Covered

  • Early AlphaGo: A Human-Like Machine with Flaws
  • AlphaGo's Secure Path: A Win is a Win
  • The 'Creative' AI Move That Shocked the World
  • AI's Unseen Evolution: From Human-Like to Superhuman
  • Did AI Expose Human Conventionality in Go?

Full Transcript

In this video, we're finally going to

talk about Alph Go from a Go players

perspective. So, I will show you some of

the key moments in all Alph Go matches.

And even if you've watched the Alph Go

documentary, then we will talk about

some of the things that weren't covered

there. So, here we go. As you probably

know, Alph Go was developed by Google's

subsidiary, Deep Mind. And because I'm

not really tech-savvy, I can only

attempt to explain in the simplest of

words how that wonderful AI worked. So

they started by taking a database of

human go games and they used it to train

Alph Go and they had several neural

networks working together on this. The

first neural network looked through all

the games and it learned to generalize

the patterns and predict what the next

move was going to be. The prediction

wasn't perfect, but it was good enough

for now. Another neural network was

shown many positions and then it was

told who won the game from there. And so

it learned to predict who was going to

win the game from any position that it

saw. Again, in both cases, the

predictions weren't really accurate. It

wasn't 100%. Maybe it was only 60%. Now,

armed with this knowledge, the AI would

play many, many games against itself,

only to gain new data for reinforcement

learning. Now that more games were

played, the AI would use those games to

generalize and find patterns, and then

the process would repeat. This is

probably a crazy oversimplification on

my part. So, if you're an AI developer

or you just know more about this, you

can leave your criticism in the

comments. Now, everyone talks about the

big match between Alph Go and Lisa Doll,

and we'll get to it as well, but let's

not forget that there was another match

prior to that. So, let's travel back to

October 2015. There's a professional go

player from China. His name is Funway

and he's living in France. And at that

time, he became the European Go champion

for the third time consecutively. So,

Deep Mind contacted him and offered to

play a match against their new AI.

Naturally, he was skeptical, but he

agreed to play. His skepticism quickly

gave way to surprise, then shock, then

despair. He lost five games in a row in

that match, 5 to zero. That was

astounding. This is the first match lost

by a professional goal player to a

machine. However, I remember that back

then it didn't cause any sensation yet.

Instead, this 50-0 result started some

whispering and muttering that Funway was

not maybe a real professional, that he'

lived in Europe long enough to forget

the skills of an Asian pro. I imagine

there must have been a lot of pressure

on his shoulders. Go players often say

that the AI brought a lot of innovation

into go theory in general and opening

and joking specifically. However, that

Alph Go version was still not quite the

ruthless, unbeatable machine that we

would know later. This Alph Go played

very much like a human here. Let me show

you a couple of moments from those first

games. This is the very first game in

the match against Funway. Alph Go is

playing white here. This entire opening

setup until here looks very human. These

jokies on the left were played long

before AlphaGo. At this point, it's

white's turn and AlphaGo approaches the

corner. White responds with a ka and

white slides. The most classic of all

variations and it's the one that was

criticized by AI and it got replaced by

this in modern go. However, at that

time, black maybe exchange

here

on then protects the corner and Alph

Go just plays a normal two space

extension. This simple shape that was so

incredibly common 10 years ago, you will

hardly ever see it now. Now, let's see

one more example from game four of their

match. In this game, AlphaGo is playing

black and look at the joseeki that it

chose on the left. Alph Go

approaches

[Music]

attachment. White pulls

back. White connects like

this. White protects the

corner. Black extends. White extends.

And Alpha Go jumps. And once again, this

is something you'll be likely to see in

a human game because this jump as

protection on the left is not really

necessary. There is some invasion here,

but it's not really that dangerous. So,

in a modern game, you'd be much likely

to see a tener key somewhere on the

right, for example. And while we're on

the topic of this joseki, the modern AI

would actually not even connect this way

anymore. In the modern game, you're much

more likely to see a simple solid

connection. And after white jumps, black

jumps here or here. That is the modern

approach. But okay, forget about opening

and joking. Alph Go actually made some

visible mistakes that Katago today picks

up on. Let me show you a couple. Once

again, this is game one of their match.

Alpha goes white. Fun just invaded at

the bottom here. And next, he played

here to save these stones. But actually,

as we now know, before playing here,

black could have made a very useful

exchange. I could include this example

into the useful moves in the second line

video. So instead of this, black could

push here first. This move doesn't look

very powerful, but in fact, it makes the

whole white formation weaker and exposes

this cutting point. Now, if white blocks

like this, which seems like the obvious

response, black can actually cut later.

And now white wouldn't be able to atari

because now black can escape with atari

and all of these white stones are in

danger. Alphago's mistake was that

after fun responded here it also failed

to see the importance of this move.

White should have turned here. This

makes the all the white group so much

stronger. But instead of this alpha go

played

here. Black responds. White pushes.

White

blocks. White saves the stones and black

gets a chance to play here. White can't

block. So in the game, white had to

respond like this. A great exchange for

black. Now we're still in the same game,

but let's skip a few moves forward.

Right now, Alph Go plays here to get

control of the center. But according to

Kato, this is a three-point mistake. And

the reason for that is that once again

there's an urgent move on the board that

both black and white can take. Right now

it's black's turn. So punlay could play

here which protects the corner and

threatens to save the two stones. White

would be forced to capture them and

black and now tenarchy protecting the

corner incent. This is amazing. So if we

know this instead of this move, white

could also play something like this. I

know this is a move on the first line

and I personally would never be able to

find this, but it's capturing these

stones and it's threatening the corner.

If black doesn't respond right now, this

next jump could be quite devastating.

And even if black

responds, white has moves in the future

like this one to enter black's corner

and reduce it. So, however big this move

in the center may seem, that was an

urgent point. And by the way, Funway

didn't just lose the match peacefully.

When he saw he was losing, he tried

different strategies. He even used some

of the tricks that he learned from Alph

Go itself in this very match. But

experience teaches us that changing your

own style for a particular opponent or

event is usually not the best thing to

do. And if anything, it makes you play

worse. So when we put all five games of

this match under the microscope of

modern AI, we see that the game Punway

handled best was the very first one.

This game

categ's middle game accuracy in this

game was higher than that of AlphaGo. He

played a great game and his loss was

never bigger than 10 points, usually

around five or six. For Deep Mind, it

was time for new challenges. And next,

they decided to challenge the player who

had a semi-leendary status at the time,

Lisa Doll Non Pro from South Korea, the

winner of many world championships.

Remember all those inaccuracies and

humanlike play in this match? Lisa Doll

saw the match, too. So, it's no surprise

that he couldn't help feeling a little

skeptical at that point. He said back

then that the level of AlphaGo's first

opponent wasn't really on par with his

own, and that was the main reason the

machine won so easily. In his turn, he

was confident about winning the match.

The only doubt in his mind was about the

final score. Would it be 4 to 1 or 5 to

zero in his favor? Oh well, DeepMind had

some surprises up their sleeve. They had

four or five months before the next

match to train, train, train Alph Go

more. Millions of practice games for the

machine. How strong did it become?

Nobody outside of Deep Mind knew. And

certainly not Lisa Doll. He would only

get a chance to discover the current

level of Alph Go on March 9th, 2016 on

the day of their first game. the first

game that he started to play in a

relaxed manner, not expecting that Alph

Go would start a fight in the early

opening. Let me remind you how that

happened. Alph Go is white here and I

remember that the commentators were

saying that this looks like a

comfortable game for Lisa Doll. Black

has a lot of points on the right, these

two stones, everything. And the white

stones don't seem to be making any

points. But AlphaGo had a plan here.

White caps. Black tries to get into the

center by attaching fully expecting

something like

this where white now has a cutting point

and the black stones escape into the

center. But Alph Go had a different

plan. It peeps here. White has to

connect.

Next white

pushes and cuts.

AlphaGo turns this game into a

disconnected running fight right from

the beginning. Now Lisa Doll needs to

live with those three stones at the top,

manage these two stones in the center

and also think of something for this

stone here on the upper left. At the

time, most of the commentators seem to

be saying that this looks like a

difficult fight, but this is a

comfortable game for Lisia Doll. And

they were saying that right until late

in the middle game when it became

obvious that Lisa Doll was losing. But

if we ask the modern AI about this game,

it will show you that actually right

from the opening the game was

comfortable for white for AlphaGo and

that winning gap would only increase

after this first fight. However, at the

end when it was already clear that Alph

Go was going to win the first game,

something interesting happened. Alph Go

seemed to have made a mistake. Let me

show you how that happened. At this

point in the endame, when everyone was

counting that white was a few points

ahead, here's what happened.

AlphaGo blocked here living with this

group and Lucid Doll played the

Atari. Now this seemingly starting a

co black takes and after this move

instead of playing this co Alphago just

makes second eye. The commentator said

that this was probably the first clear

mistake that AlphaGo made in this game

and it is true. White doesn't have to

protect like this. White can keep

playing this co considering how many

local co- threats white has in this

area. But there was something that they

didn't know and that was revealed a

little later about the style of AlphaGo.

AlphaGo's style was a little similar to

that of Lie Ho. That is not playing the

most aggressive moves killing the

opponent but rather trying to aim for

the most secure path to victory even if

it means playing some slack moves and

not trying to maximize the win or kill

something. So, at this point, these

slightly passive moves in the lower left

corner only meant that AlphaGo knew it

was winning this game, and it didn't

really matter if it was winning by 10 or

by three points. A win is a win. Lisa

Doll was shocked after the first game,

and he said at the press conference

after it that he believed now the

chances were 50/50. Probably at that

time, he was still underestimating

AlphaGo a little bit. Let's take a look

at game two. In this game, Alph Go is

playing black and the opening looks

fairly peaceful until now. Now Lisa doll

approaches the

corner. Black picks. White

stands. White jumps and white wants to

make this position a little higher.

So white plays here. Having played this

move, Lisa Doll left for a little smoke

break. And to his surprise, when he came

back, he saw this move played by

AlphaGo. All the commentators, everyone

watching this game live and Lisa Doll

himself were shocked when they saw this

shoulder hit on the fifth line because

we all know that a shoulder hit is good

when it's leaning on the opponent's

position and making it low. So, for

example, if white has a stone here, this

shoulder hit would be perfectly

reasonable, making white low on the

third line. But now that white is high,

this move seems to be giving white

territory on the fourth line. And yet,

just like Shuaku's famous ear redne,

this move seems to be tying the entire

black position together. It's helping

and developing the lower stones, helping

the upper stones, and also preparing to

save these stones here and start a fight

because at any moment in the future,

AlphaGo can connect here and try to

attack these white stones separately.

This move is preparing for that. Lisa

Doll said after the match that it was

this move that showed him that AlphaGo

could think and play creatively instead

of just looking for moves in a big

database. After losing the second game,

Lisa Doll was put on the verge of losing

the entire match. So, game three was

supposed to be his last stand and no

surprise there, he started a very big

fight in the opening. AlphaGo is white

in this game and it just played

here. Lisa Doll responds like this and

white responds with a very light move to

jump out of here. And would you just

look at that? Lisa doll attaches trying

to separate the white stones and start a

fight. AlphaGo parries this

way. Black

separates. White

pushes. Black saves the corner. The

black and white stones are all

disconnected and they will keep on

fighting like this for the next 50 moves

or so. And this is the result of that

fight that AlphaGo handled beautifully.

Nobody died in the process, but AlphaGo

got a big moyo at the bottom and a

comfortable lead in this game. Now we

skip a few moves forward and this is the

same game closer to the endame. Alpha

goes white and white has too many points

at the bottom. At this point, Lisa Doll

feels that if he doesn't invade this

area, if he doesn't live here, it

wouldn't be enough territory for him.

So, he makes this exchange

first. AlphaGo answers.

Then a few more exchanges in other

places. Now this atari

first white ignores and saves this

stone. And now the final

fight, an invasion. If Lisa doll lives

here, all of white's points will be gone

and AlphaGo will lose this game. So,

white needs to kill these stones. Let's

see how that's going to

work. White limits the

space. Black

attaches. White separates these

stones. Black attaches here. There's

some weaknesses.

So white

connects. Now, a tricky move to the

second line. Preparing to make shape to

the top and to play

here. Alph Go

separates. Leafy doll makes a double

tiger's mouth. White needs to block.

Black

pushes. This is white's

sente. And white

connects. Now

[Music]

push and

cut. And these white stones are also

separated from the rest of the

groups. White blocks. Black plays this

honey.

White responds

and black makes an eye and prepares for

a co. Many people believed at the time

that machines were weak at handling

co-fights because you know a co involves

a lot of factors. You need to calculate

all of the possible co- threats and

evaluate which one to respond to and

which one not to respond to. So we can

almost say that this co-ight was the

final test for AlphaGo and it did just

fine. After this co-fight, Lisa Doll

resigned the game. The score in the

match became three to zero and AlphaGo

won the match. But believe it or not,

and I think many of you know what I'm

talking about, the most spectacular move

in the match was yet to be played in

game four. In this game, AlphaGo is

playing black. And black has a scary

territory/mo at the top of the board.

Lacy Doll did a very good job at taking

a lot of points everywhere and leaving

those four stones up there lightly,

hoping to save them later in the game.

And the moment to show some miracles of

Sabaki is now AlphaGo just played here

trying to just swallow those stones in

there. White plays here threatening to

rescue

them and AlphaGo makes a very solid

shape. Connecting here would be

impossible for white. Black is too

strong around.

So, white cuts here as a

probe and black responds this way. Just

separating all the white

stones. Another useful

exchange. Black

blocks

this. Black captures that stone. And

next comes the move that only led Doll

could see.

white wedges. This move is like a bomb

explosion. It just activates all the AGI

in this shape at the same time. Of

course, now that we have superhuman AI

to tell us how things could have been

played, we know that actually black is

leading by about 10 points here and

nothing is changed by this amazing tuji.

Black should respond like this and after

this white shouldn't be able to get any

advantage out of this. But in that

particular game, at that moment, that

version of Alph Go couldn't find the

right move. It played

here. And this allows white

to play

this and then

cut. Now, there's a possible throw in

and snap back here, a cut here, this

atarian cut here, this atari. All of

those possibilities allow white to

escape with these stones through here.

Save everything. Alph Go started making

unreasonable moves, not being able to

understand how to win the game anymore.

And Lisa Doll won this game in the end.

Lisa Doll said after this game that he

had won many world championships, but he

was never congratulated and greeted this

much after winning just one game.

[Music]

[Applause]

[Music]

[Applause]

In the end, Alph Go won this match 4 to1

and Deep Mind showed the whole world

that the AI was now stronger in Go than

human players. But Deep Mind was not

going to stop there. The alpha go that

played against Lisa Doll was stronger

than the one that played against Fon

Huay. So what if we give it more time

and let it train more? In December that

same year 2016, we got an answer to this

question. A mysterious player registered

on Taien Go server and played 60 games

against all the strongest players in the

world and won all 60. Of course, this

was another version of Alph Go that

became known as Alph Go Master. Another

match between that version of Alph Go

and China's strongest player Ko was held

the following year and Ko never had a

single chance. That match was 3 to zero.

Alph Go got to superhuman level. Did

Deep Mind stop there? No, they didn't.

They released yet another version of

Alph Go in October the same year, 2017.

It was called Alph Go Zero and it was

entirely different from all previous

versions because no human games were

used for its training. It didn't start

from any human games at all. It

developed all of its skills and

strategies from scratch. So they it

started playing with itself just making

random moves at first and then little by

little it surpassed all previous

versions and then got above them. It

played a match against the previous

strongest version Alph Go Master. out of

100 games and it won 89 of them and lost

only 11. This shows that maybe we're

somewhat lucky that that Alph Go version

never played against any human player.

The results that it showed indicate that

maybe it was about three stones stronger

than Alph Go Lee, for example, which

means that if Deep Mind had waited

another year and allowed Alph Go Zero to

play that match against Lisa Doll, that

match wouldn't have been that pretty and

it wouldn't have been very close either.

Once DeepMind managed this gargantuan

task of developing a superhuman Go AI,

they published their findings and that

allowed other developers to replicate

their process and make equally strong

AI. Thanks to that, we now have several

of them. The strongest ones currently

being fine art and galaxy from China.

But at that level, it doesn't really

matter which one is the strongest. Even

Kate that is easily available for free

plays at the level that exceeds that of

any human professional player. Lisa Doll

said something interesting in one of his

interviews after the big match.

Something that struck a sad chord in my

Go player heart. People always believed,

he said, that they were creative and Go

was the perfect playground to show their

creativity. But Alph Go demonstrated

that people were actually very

conservative and conventional in their

playing and in their judgment of go

positions. From teacher to student,

knowledge would be passed on. Most of us

would just remember that this is how you

need to play here. The funny thing is

that now with AI being the new benchmark

of perfection, we've become just as

conventional except with a different set

of moves. Alph Go made a revolution in

the Go world. It changed opening theory

forever. A lot of new variations and

ideas were introduced that were

surprising in 2017, but now less than 10

years later, all of them are accepted as

the new golden standard. The changes

that the AI brought into the Go world

are so monumental and so numerous that

we'll have to take a very good look at

them in some of our future videos. Until

then, keep making creative magic on the

Goboard and don't overdo it with those

sunsign invasions. Okay. By the way, you

can also watch these lessons on our

platform

gomagic.org except there you'll watch

them with interactive quizzes right

within the lessons and practical

exercises right after them. And if you

enjoy watching these Go videos and you

don't want to miss others like this one,

go smash that like button, subscribe to

our YouTube channel, and this is Go

Magic.

[Music]

[Music]

Loading...

Loading video analysis...