Terence Tao: Hardest Problems in Mathematics, Physics & the Future of AI | Lex Fridman Podcast #472
By Lex Fridman
Summary
## Key takeaways - **Math problems hover at the edge of solvability**: The most interesting problems in mathematics are those that lie on the boundary between what is currently solvable and what is not, requiring that last 10% of insight to overcome. [01:14] - **The Kaya conjecture: A 2D puzzle with 3D implications**: The Kaya conjecture, a puzzle about efficiently turning a needle in 2D space, has surprisingly deep connections to problems in partial differential equations, number theory, and wave propagation. [01:31], [04:05] - **Navier-Stokes: The million-dollar question of fluid flow**: The Navier-Stokes equations, governing fluid dynamics, pose a million-dollar question: can fluid velocity become infinite in finite time, a phenomenon known as 'blow-up,' which remains unproven but is a key concern in physics. [06:16] - **Supercriticality: The key to chaos in fluid dynamics**: The difficulty in predicting fluid behavior, like weather, stems from 'supercriticality,' where nonlinear transport terms dominate viscosity at small scales, leading to unpredictable turbulence and potential blow-ups. [16:39], [17:31] - **Mathematics as a 'cheat code' for problem-solving**: Mathematicians can strategically 'cheat' by simplifying problems, turning off difficulties one by one, to understand complex issues, akin to using cheat codes in a video game to master its mechanics. [17:17], [17:37] - **AI as a collaborator, not a replacement, for mathematicians**: AI tools like Lean, while powerful for formalizing proofs and managing complex collaborations, are seen as assistants that augment, rather than replace, human intuition and creativity in mathematical discovery. [21:23], [21:46]
Topics Covered
- How mathematicians truly discover new connections.
- Mastering math: Cheat codes and creative analogies.
- Will AI be our future math collaborator?
- Why do "simple" math problems resist all solutions?
- Math's greatest discoveries are deeply human.
Full Transcript
The following is a conversation with
Terrence Tao. Widely considered to be
one of the greatest mathematicians in
history. Often referred to as the Mozart
of math, he won the Fields Medal and the
Breakthrough Prize in mathematics and
has contributed groundbreaking work to a
truly astonishing range of fields in
mathematics and physics.
This was a huge honor for me for many
reasons, including the humility and
kindness that Terry showed to me
throughout all our interactions. It
means the world. This is the Lex
Freedman podcast. To support it, please
check out our sponsors in the
description or at
lexfreedman.com/sponsors.
And now, dear friends, here's Terren
Tao.
What was the first really difficult
research level math problem that you
encountered? One that gave you pause
maybe. Well, I mean in your
undergraduate um education, you learn
about the really hard impossible
problems like the reman hypothesis, the
twin primes conjecture. You can make
problems arbitrarily difficult. That's
not really a problem. In fact, there's
even problems that we know to be
unsolvable. What's really interesting
are the problems just at the on the
boundary between what we can do
relatively easily and what are hopeless.
Um but what are problems where like
existing techniques can do like 90% of
the job and then you just need that
remaining 10%. Um I think as a PhD
student the CA problem certainly caught
my eye and it just got solved actually.
It's a problem I've worked on a lot in
my early research. Historically it came
from a little puzzle by the Japanese
mathematician Soji Kaya uh in like 1918
or so. Um, so the puzzle is that you you
you have um a needle um in on the plane.
Um think like like a like driving like
on on on a road something and you you
want it to execute a U-turn. You want to
turn the needle around. Um but you want
to do it in as little space as possible.
So you want to use as little area in
order to turn it around. So um but the
needle is infinitely maneuverable.
So you can imagine just spinning it
around its um as a unit needle. You can
spin it around its center. Um, and I
think, um, that gives you a disc of of
area, I think pi over four. Um, or you
can do a three-point U-turn, which is
what they we teach people in in the
driving schools to do. Uh, and that
actually takes area pi over 8. So, it's
it's a little bit more efficient than um
a rotation. And so, for a while, people
thought that was the most efficient uh
way to turn things around. But,
Mazikovich uh showed that in fact, you
could actually uh turn the needle around
using as little area as you wanted. So
0001 there was some really fancy multi-
um u back and forth U-turn thing that
you could you could do that that you
could turn a needle around and in so
doing it would pass through every
intermediate direction. Is this in the
two dimensional plane? This is in the
two dimensional plane. Yeah. So we
understand everything in two dimensions.
So the next question is what happens in
three dimensions. So suppose like the
Hubble space telescope is tube in space
and you want to observe every single
star in the universe. So you want to
rotate the telescope to reach every
single direction. And here's unrealistic
part. Suppose that space is at a
premium, which it totally is not. Uh you
want to occupy as little volume as
possible in order to rotate your your
needle around in order to see every
single star in the sky. Um how small a
volume do you need to do that? And so
you can modify basic construction. And
so if your telescope has zero thickness,
then you can use as little volume as you
need. That's a simple modification of
the two dimensional construction. But
the question is that if your telescope
is not zero thickness but but just very
very thin some thickness delta what is
the minimum volume needed to be able to
see every single direction as a function
of delta. So as delta gets smaller as
you need gets thinner the volume should
go down but but how fast does it go
down? Um and the conjecture was that it
goes down very very slowly um like
logarithmically um uh roughly speaking
and that was proved after a lot of work.
So this seems like a puzzle. Why is it
interesting? So it turns out to be
surprisingly connected to a lot of
problems in partial differential
equations, in number theory, in
geometry, comics. For example, in in
wave propagation, you splash some some
water around um you create water waves
and they they travel in various
directions. Um but waves exhibit both
both particle and wave type behavior. So
you can have what's called a wave
packet, which is like a a very localized
wave that is localized in space and
moving a certain direction in time. And
so if you plot it in both space and
time, it occupies a region which looks
like a tube. And so what can happen is
that you can have a wave which initially
is very dispersed but it all comes it
all focuses at a single point later in
time. Like you can imagine dropping a
pebble into a pond and ripples spread
out. But then if you time reverse that
that um that scenario and the equations
of wave motion are time reversible. You
can imagine ripples that are converging
um to a single point and then a big
splash occurs um maybe even a
singularity.
Um and so it's possible to do that. Uh
and geometrically what's going on is
that there's always s of light rays. Um
so like if if if this wave represents
light for example um you can imagine
this wave as a superp position of
photons um all traveling at the speed of
light. They all travel on these light
rays and they're all focusing at this
one point. So you can have a very
dispersed wave focus into a very
concentrated wave at one point in space
and time, but then it defocuses again
and it separates. But potentially if the
conjecture had a negative solution. So
what that meant is that there's there's
a very efficient way to pack um tubes
pointing different directions into a
very very narrow region of of of very
narrow volume. Then you would also be
able to create waves that start out some
there'll be some arrangement of waves
that start out very very dispersed but
they would concentrate not just at a
single point but um um there'll be a
large um there'll be a lot of
concentrations in space and time and uh
um and you could create what's called a
blowup where these waves their amplitude
becomes so great that the laws of
physics that they're governed by are no
longer wave equations but something more
complicated and nonlinear. Um and so in
mathematical physics we care a lot about
whether certain equations in in wave
equations are stable or not whether they
can create um these singularities.
There's a famous unsolved problem called
the Navia Stokes regularity problem. So
the Navia Stokes equations equations
that govern the fluid flow for
incompressible fluids like water. The
question asks if you start with a smooth
velocity field of water can it ever
concentrate so much that like the
velocity becomes infinite at some point
that's called a singularity. We don't
see that um in real life. You know, if
you splash around water on the bathtub,
it won't explode on you. Um or or have
have water leaving at the speed of
light, I think. But potentially, it is
possible. Um and in fact, in recent
years, the the consensus has has drifted
towards the uh the belief that uh that
in fact for certain very special initial
configurations of of say water that
singularities can form. But people have
not yet been able to uh to actually
establish this. The clay foundation has
these seven millennium prize problems
has a million dollar prize for solving
one of these problems that this is one
of them. Of these seven only one of them
has been solved the point conjecture by
Pelman. So the Ka conjecture is not
directly directly related to the Navis
Stokes problem but understanding it
would help us understand some aspects of
things like wave concentration which
would indirectly probably help us
understand the Navis problem better. Can
you speak to the neighbors? So the
existence and smoothness like you said
millennial prize problem right you've
made a lot of progress on this one in
2016 you published a paper finite time
blow up for an averaged threedimensional
navia stoke equation right
so we're trying to figure out if this
thing usually doesn't blow up right but
can we say for sure it never blows up
right yeah so yeah that is literally the
the million- dollar question yeah so
this is what distinguishes
mathematicians from pretty much
everybody else like it
If something holds 99.99% of the time,
um that's good enough for most, you
know, uh for for most things, but
mathematicians are one of the few people
who really care about whether every like
100% really 100% of all um situations
are covered by by um yeah, so most fluid
most of the time um water that does not
blow up. But could you design a very
special initial state that does this?
And maybe we should say that this is a
this is a set of equations that govern
in the field of fluid dynamics. Trying
to understand how fluid behaves and it's
actually turns out to be a really comp
you know fluid is yeah extremely
complicated thing to try to model. Yeah.
So it has practical importance. So this
clay price problem concerns what's
called the incompressible navio stokes
which governs things like water. There's
something called the compressible navio
stokes which governs things like air.
And that's particularly important for
weather prediction. Weather prediction
it does a lot of computational fluid
dynamics. A lot of it is actually just
trying to solve the ny stokes equations
as best they can. Um also gathering a
lot of data so that they can get they
can in initialize the equation. There's
a lot of moving parts. So it's very
important practically. Why is it
difficult to prove general things
about the set of equations like it not
not blowing up? Short answer is
Maxwell's demon. Um so exos demon is a
concept in thermodynamics like if you
have a box of two gases and oxygen and
hydrogen uh and maybe you start with all
the oxygen one side and nitrogen the
other side but there's no barrier
between them right then they will mix um
and they should stay mixed right there
there's no reason why they should unmix
but in principle because of all the
collisions between them there could be
some sort of weird conspiracy that that
um like maybe there's a microscopic
demon called Maxwell's demon that will
um every time a oxygen and nitrogen atom
collide they will bounce off in such a
way that the oxygen sort of drifts onto
one side and then goes to the other and
uh you could have an extremely
improbable configuration emerge. Uh
which we never see. Um and and we
statistically it's extremely unlikely
but mathematically it's possible that
this can happen and we can't rule it
out. Um and this is a situation that
shows up a lot in mathematics. Um a
basic example is the digits of pi
3.14159 and so forth. The digits look
like they have no pattern and we believe
they have no pattern. On the long term,
you should see as many ones and twos and
threes as fours and fives and sixes.
There should be no preference in the
digits of pi to favor let's say 7 over
8. Um, but maybe there's some demon in
the digits of pi that that like every
time you compute more digits, it sort of
biases one digit to another. Um, and
this is a conspiracy that should not
happen. There's no reason it should
happen, but um there's there's there's
no way to prove it.
uh with our current technology. Okay. So
getting back to Nabia Stokes, a fluid
has a certain amount of energy and
because a fluid is in motion, the energy
gets transported around and water is
also viscous. So if the energy is spread
out over many different locations, the
natural viscosity of the fluid will just
damp out the energy and will it will go
to zero. Um and this is what happens um
in um uh when we actually experiment
with water like you splash around there.
there's some turbulence and waves and so
forth. But eventually it it settles down
and and and the the lower the amplitude,
the smaller the velocity, the the more
calm it gets. Um but potentially there
is some sort of a demon that keeps
pushing the uh the energy of the fluid
into a smaller and smaller scale and it
will move faster and faster and at
faster speeds the effective viscosity is
relatively less. And so it could happen
that that it it creates a some sort of
um um what's called a self similar
blowup scenario where you know um the
energy of fluid starts off at some um
large scale and then it all sort of um
transfers it energy into a smaller um
region of of of the fluid which then at
a much faster rate um moves into um an
even smaller region and so forth. Um and
and each time it does this uh it takes
maybe half as as long as as the previous
one and then you you could you could
actually uh converge to all the energy
concentrating in one point in a finite
amount of time. Um and that that's uh
that scenario is called finite blow up.
Um so in practice this doesn't happen.
Um so water is what's called turbulent.
Um so it is true that um if you have a
big eddy of water it will tend to break
up into smaller eddies but it won't
transfer all the the energy from one big
eddy into one smaller eddy. It will
transfer into maybe three or four and
then those must split up into maybe
three or four small edies of their own
and so the energy gets dispersed to the
point where the viscosity can can then
keep that thing under control. Um but if
it can somehow um concentrate um all the
energy keep it all together um and do it
fast enough that the viscous effects
don't have enough time to calm
everything down then this blob can
occur. So there were papers who had
claimed that oh you just need to take
into account conservation energy and
just carefully use the viscosity and you
can keep everything under control for
not just Navia Stokes but for many many
types of equations like this and so in
the past there have been many attempts
to try to obtain what's called global
regularity for Navio Stokes which is the
opposite of final time blow up that
velocity say smooth and it all failed
there was always some sign error or some
subtle mistake and and it couldn't be
salvaged. Um so what I was interested in
doing was trying to explain why we were
not able to disprove um planet time blow
up. I couldn't do it for the actual
equations of fluids which were too
complicated. But if I could average the
equations of motion of naval basically
if if um if I could turn off certain
types of of ways in which water
interacts and only keep the ones that I
want. Um, so in particular, um, if, um,
if there's a fluid and it could transfer
energy from a large Eddie into this
small Eddie or this other small Eddie, I
would turn off the energy channel that
would transfer energy to this this one
and and direct it only into um, this
smaller Eddie while still preserving the
law of conservation of energy. So you're
trying to make it blow up. Yeah. Yeah.
So I I I basically engineer um, a blow
up by changing the laws of physics,
which is one thing that mathematicians
are allowed to do. We can change the
equation. How does that help you get
closer to the proof of something? Right?
So, it provides what's called an
obstruction in mathematics. Um, so, so
what I did was that uh basically if I
turned off the um certain parts of the
equation, so which usually when you turn
off certain interactions make it less
nonlinear, it makes it more regular and
less likely to blow up. But I found that
by turning off a very well-designed set
of of of of interactions, I could force
all the energy to blow in finite time.
So what that means is that if you wanted
to prove um global regularity for Navia
Stokes um for the actual equation you
had you must use some feature of the
true equation which which my artificial
equation um does not satisfy. So it it
rules out certain um certain approaches.
So um the thing about math is is it's
not just about finding you know taking a
technique that is going to work and
applying it but you you need to not take
the techniques that don't work. Um and
for the problems that are really hard,
often there are dozens of ways that you
might think might apply to solve the
problem. But uh it's only after a lot of
experience that you realize there's no
way that these methods are going to
work. So having these counter examples
for nearby problems um kind of rules out
um uh it saves you a lot of time because
you you're not wasting um energy on on
things that you now know cannot possibly
ever work. How deeply connected is it to
that specific problem of fluid dynamics
or just some more general intuition you
build up about mathematics? Right. Yeah.
So the key phenomenon that uh my my
technique exploits is what's called
superc criticality. So in partial
differential equations often these
equations are like a tugof-war between
different forces. So in Navia Stokes
there's the dissipation um force coming
from viscosity and it's very well
understood. It's linear. It calms things
down. If if viscosity was all there was,
then then nothing bad would ever happen.
Um but there's also transport um that
that energy from in one location of
space can get transported because the
fluid is in motion to to other
locations. Um and that's a nonlinear
effect and that causes all the all the
problems. Um so there are these two
competing terms in the Davis equation
the dissipation term and the transport
term. If the dissipation term dominates,
if it's if it's large, then basically
you get regularity. And if um if the
transport term dominates, then uh then
we don't know what's going on. It's a
very nonlinear situation. It's
unpredictable. It's turbulent. So
sometimes these forces are in balance at
small scales, but not in balance at
large scales or or vice versa. Um so
Navis Stokes is what's called
supercritical. So at at smaller and
smaller scales, the transport terms are
much stronger than the viscosity terms.
So the viscosity are the things that
calm things down. Um and so this is um
um this is why the problem is hard in
two dimensions. So the Soviet
mathematician ladish skaya she in the
60s shows in two dimensions there is no
blow up and in two dimensions the nav
equations is what's called critical the
effect of transport and the effect of
viscosity about the same strength even
at very very small scales and we have a
lot of technology to handle critical and
also subcritical equations and proof um
regularity but for superc critical
equations it was not clear what was
going on
and I did a lot of work and then there's
been a lot of follow-up showing that for
many other types of superc critical
equations you create all kinds of blow
up examples. Once the nonlinear effects
dominate the linear effects at small
scales, you can have all kinds of bad
things happen. So this is sort of one of
the main insights of this this line of
work is that superc criticality versus
criticality and subcriticality. This
this makes a big difference. I mean
that's a key qualitative feature that
distinguishes some equations for being
sort of nice and predictable and you
know like like planetary motion and I
mean there are certain equations that
that you can predict for millions of
years and or thousands at least. Again,
it's not really a problem, but but
there's a reason why we can't predict
the weather past 2 weeks into the future
because it's a super critical equation.
Lots of really strange things are going
on at very fine scales. So, whenever
there is some huge source of
nonlinearity,
yeah, that can create a huge problem for
predicting what's going to happen. Yeah.
And if the nonlinearity is somehow more
and more featured and interesting at at
small scales. Um I mean there's there's
many equations that are nonlinear but um
in in many equations you can approximate
things by the bulk. Um so for example
planetary motion you know if you want to
understand the orbit of the moon or Mars
or something you don't really need the
micro structure of like the seismology
of the moon or or like exactly how the
mass is distributed. um you just
basically you can almost approximate
these planets by point masses and just
the aggregate behavior is important um
but if you want to model a fluid um like
like the weather you can't just say in
Los Angeles the temperature is this the
wind speed is this for super critical
equations the finance confirmation is is
really important if we can just linger
on the narto's uh equations a little bit
so you've suggested maybe you can
describe it that one of the ways to uh
solve it or to negatively resolve it
would be to
sort of to construct a liquid a kind of
liquid computer, right? And then show
that the halting problem from
computation theory has consequences for
fluid dynamics. So uh show it in that
way. Can you describe this this Yeah. So
this came out of of this work of
constructing this this this average
equation that that blew up. Um so one um
as as part of how I had to do this. So
there this naive way to do it. You you
just keep pushing um um every time you
you get energy at one scale you you push
it immediately to the next scale as as
fast as possible. This is sort of the
naive way to to to to force blow up. Um
it turns out in five and high dimensions
this works. Um but in three dimensions
there was this funny phenomenon that I
discovered that if you if you keep if if
you change the laws of physics you just
always keep trying to push um the energy
into smaller smaller scales. Um what
happens is that the energy starts
getting spread out into multi many
scales at once. Um so that you you have
energy at one scale you're pushing it
into the next scale and then um as soon
as it enters that scale you also push it
to the next scale but there's still some
energy left over from the previous
scale. um you're trying to do everything
at once. Um and this spreads out the
energy too much. Um and then it turns
out that that um it makes it vulnerable
for viscosity to come in and actually
just damp out everything. So um so it
turns out this this direct bush doesn't
doesn't actually work. There was a
separate paper by some other authors
that actually showed this um in three
dimensions. Um so what I needed was to
program a delay. Um so kind of like air
locks. So um I needed an equation which
would start with a fluid doing something
at one scale. It would push this energy
into the next scale but it would stay
there until all the energy from the from
the larger scale got transferred and
only after you pushed all the energy in
then you sort of open the next gate and
and then you you push that in as well.
So um by doing that it kind of the
energy inches forward scale by scale in
such a way that it's always um localized
at one scale at a time. Um and then it
can resist the effects of viscosity
because it's not dispersed. Um so in
order to make that happen um yeah I had
to construct a rather complicated
nonlinearity. Um and it was basically
like um you know like was constructed
like electronic circuit. So I I actually
thank my wife for this because she was
trained as a electrical engineer. Um and
um you know he talked about um uh you
know he had to design circuits and so
forth. And you know if if you want a
circuit that does a certain thing like
maybe have a light that that flashes on
and then turns off and then on and then
off. You can build it from from more
primitive components you know capacitors
and resistors and so forth and you have
to build a diagram and you um and these
diagrams you can you can sort of follow
your eyeballs and say oh yeah the the
current will build up here and then it
will stop and then it will do that. So I
knew how to build the analog of basic
electronic components, you know, like
resistors and capacitors and so forth.
And and I would I would stack them
together um in in such a way that that I
would create something that would open
one gate and then there'll be a clock
that would and then once the clock hits
a certain threshold it would close it
kind of a rude Goldberg type machine but
described mathematically and this ended
up working. So what I realized is that
if you could pull the same thing off for
the actual equations. So if the
equations of water support a computation
so um like if you can imagine kind of a
steampunk but really water punk uh type
of thing where um you know so modern
computers are electronic you know they
they they're powered by by electrons
passing through very tiny wires and
interacting with other electrons and so
forth. But instead of electrons, you can
imagine these pulses of of water moving
at certain velocity and maybe it's
they're two different configurations
corresponding to a bit being up or down.
Probably if you had two of these moving
bodies of water collide, it would come
out with some new configuration which is
which would be something like an ANDgate
or orgate. you know that if the the the
output would depend in a very
predictable way on on the inputs and
like you could chain these together and
maybe create a touring machine and and
then you could you have computers which
are made completely out of water um and
if you have computers then maybe you can
do robotics so I you know hydraulics and
so forth um and so you could create some
machine which is basically a fluid
analog what's called a vonomian machine
so vonomian proposed if you want to
colonize Mars. The sheer cost of
transporting people machines to Mars is
just ridiculous. But if you could
transport one machine to Mars and this
machine had the ability to mine the
planet, create some more materials to
smelt them and build more copies of the
same machine. Um, then you could
colonize a whole planet um over time.
Um, so uh if you could build a fluid
machine, which uh yeah, so it's it's
it's a it's a robot. Okay. And what it
would do it its purpose in life, it's
programmed so that it would create a
smaller version of itself in some sort
of cold state. It wouldn't start just
yet. Once it's ready, the big robot
configuration water would transfer all
his energy into the smaller
configuration and then power down. Okay?
And then like I clean itself up. And
then what's left is this newest state
which would then turn on and do the same
thing but smaller and faster. And then
the equation has a certain scaling
symmetry. Once you do that, it can just
keep iterating. So this in principle
would create a blow up uh for the actual
Navia Stokes and this is what I managed
to accomplish for this average Navia
Stokes. So it provided the sort of road
map to solve the problem. Now this is uh
a pipe dream because uh there are so
many things that are missing for this to
actually be a reality. Um so um I I I
can't create these basic logic gates. Um
I I don't I don't have these in these
special configurations of water. Um, I
mean there's candidates there things
called vortex rings that might possibly
work but um um but also you know analog
computing is really nasty um compared to
digital computing. I mean because
there's always errors um you you have to
you have to do a lot of error correction
along the way. I don't know how to
completely power down the big machine so
that it doesn't interfere with the the
running of the smaller machine but
everything in principle can happen like
it doesn't contradict any of the laws of
physics. Um so it's sort of evidence
that this thing is possible. Um there
are other groups who are now pursuing
ways to make navis blow up which are
nowhere near as ridiculously complicated
as this. Um um they they actually are
pursuing much closer to the the direct
self similar model which can it doesn't
quite work as is but there could be some
simpler scheme than what I just
described to make this work. There is a
real leap of genius here to go from
Navia Stokes to this touring machine. So
it goes from what the self similar blob
scenario that you're trying to get the
smaller and smaller blob to now having a
liquid toying machine gets smaller and
smaller and smaller and somehow seeing
how that
could be used
to say something about a blowup. I mean
that's a big leap. So there's precedent.
I mean um so the the thing about
mathematics is that it's really good at
um spotting connections between what you
think of what you might think of as
completely different um problems. Um but
if if the mathematical form is the same
you you can you you can you can draw a
connection um so um there's a lot of
work previously on what called cellular
automator um the most famous of which is
Conway's game of life. there's this
infinite discrete grid and at any given
time the grid is either occupied by a
cell or it's empty and there's a very
simple rule that uh tells you how these
cells evolve. So sometimes cells live
and sometimes they die. Um and this um
you know um when I was a a student it
was a very popular screen saver to
actually just have these these
animations going and and they look very
chaotic. In fact they look a little bit
like turbulent float sometimes. But at
some point people discovered more and
more interesting structures within this
game of life. Um so for example they
discovered this thing called a glider.
So a glider is a very tiny configuration
of like four or five cells which evolves
and it just moves at a certain direction
and that's like this this vortex rings
this um yeah so this is an analogy the
game of life is kind of like a discrete
equation and and um the flu navis is a
continuous equation but mathematically
they have some similar features um and
um so over time people discovered more
and more interesting things you could
build within the game of life. The game
life is a very simple system. It only
has like three or four rules um to to do
it, but but you can design all kinds of
interesting configurations inside it. Um
there's something called a glider gun
that does nothing to spit out gliders
one at a one one at a time. Um and then
after a lot of effort, people managed to
to create um and gates and or gates for
gliders. Like there's this massive
ridiculous structure which if you if a
if you have a stream of gliders um
coming in here and a stream of gliders
coming in here then you may produce a
stream of gliders coming out. If so
maybe if both of of the um streams um
have gliders then there'll be an output
stream but if only one of them does then
nothing comes out. Mhm. So they could
build something like that. And once you
could build and um these basic gates
then just from software engineering you
can build almost anything. Um you can
build a touring machine. I mean it's
like an enormous steampunk type things.
They look ridiculous. But then people
also generated self-replicating objects
in the game of life. A massive machine a
bon machine which over a huge period of
time and it always look like glider guns
inside doing these very steampunk
calculations. it would create another
version of itself which could replicate.
It's so incredible. A lot of this was
like community crowdsourced by like
amateur mathematicians actually. Um so I
knew about that that that work and so
that is part of what inspired me to
propose the same thing with Navia
Stokes. Um which is a much as I said
analog is much worse than digital like
it's going to be um you can't just
directly take the constructions in the
game of life and plunk them in. But
again it just it shows it's possible.
You know, there's a kind of emergence
that happens with these cellular automa.
Local rules.
Maybe it's similar to fluids. I don't
know. But local rules operating at scale
can create these incredibly complex
dynamic structures. Do you think any of
that is amendable to mathematical
analysis?
Do we have the tools to say something
profound about that? The thing is you
can get this emerg in very complicated
structures but only with very carefully
prepared initial conditions. Yeah. So so
these these these glider guns and and
gates and and so forth machines if you
just plunk down randomly some cells and
you and you will not see any of these.
Um and that's the analogous situation
with Navia Stokes again you know that
that with with typical initial
conditions you you will not have any of
this weird computation going on. Um but
basically through engineering you know
by by by specially designing things in a
very special way you can make clever
constructions. I wonder if it's possible
to prove the sort of the negative of
like basically prove that only through
engineering can you ever create
something interesting. This this is a
recurring challenge in mathematics that
um I call it the dichotomy between
structure and randomness. That most
objects that you can generate in
mathematics are random. They look like
rand like the digits of pi. Well, we
believe is a good example. Um, but
there's a very small number of things
that have patterns. Um, but um, now you
can prove something has a pattern by
just constructing, you know, like if
something has a simple pattern and you
have a proof that it it does something
like repeat itself every so often. You
can do that. But um, and you you can
prove that that for example, you can you
can prove that most sequences of of
digits have no pattern. Um, so like if
you just pick digits randomly, there's
something called low large numbers. It
tells you you're going to get as many
ones as as twos in the long run. Um but
um we have a lot fewer tools to to to if
I give you a specific pattern like the
digits of pi how can I show that this
doesn't have some weird pattern to it.
Some other work that I spend a lot of
time on is to prove what are called
structure theorems or inverse theorems
that give tests for when something is is
very structured. So some functions are
what's called additive like if you have
a function that maps natural numbers
with natural numbers. So maybe um you
know two maps to four three maps to six
and so forth. um some functions what's
called additive which means that if you
add if you add two inputs together the
output gets gets added as well uh for
example multiplying by a constant if you
multiply a number by 10 um if you if you
multiply a plus b by 10 that's the same
as multiplying a by 10 and b by 10 and
then adding them together so some um
functions are additive some are kind of
additive but not completely additive um
so for example if I take a number n I
multiply by the square root of two and I
take the integer part of that So 10 by
square of two is like 14 point
something. So 10 up to 14. Um 20 up to
28. Um so in that case additively is
true then. So 10 + 10 is 20 and 14 + 14
is 28. But because of this rounding
sometimes there's roundoff errors and
and sometimes when you um add a plus b
this function doesn't quite give you the
sum of of the two individual outputs but
the sum plus minus one. Um so it's
almost additive but not quite additive.
Um so there's a lot of useful results in
mathematics and I've worked a lot on
developing things like this to the
effect that if if a function exhibits
some structure like this then um it's
basically there's a reason for why it's
true and the reason is because there's
there's some other nearby function which
is actually um completely structured
which is explaining this sort of partial
pattern that you have. Um and so if you
have these so inverse theorems it um it
creates this sort of dichotomy that they
either the objects that you study are
either have no structure at all or they
are somehow related to something that is
structured. Um and in either way in
either um in either case you can make
progress. Um a good example of this is
that there's this old theorem in
mathematics called sim theorem proven in
the 1970s. It concerns trying to find a
certain type of pattern in a set of
numbers. the patterns that have make
progression things like 3 five and seven
or or or 10 15 and 20 andreli
proved that um any set of of numbers
that are sufficiently big um what's
called positive density has um
arithmetic progressions in it of of any
length you wish um so for example um the
odd numbers have a set of density 1/2 um
and they contain arithmetic progressions
of any length um so in that case it's
obvious because the the odd numbers are
really really structured I can just take
11 13 15 17 I just I can I can easily
find arithmetic progressions in in in
that set. Um but um zerminism also
applies to random sets. If I take the
set of odd numbers and I flip a coin um
and for each number and I only keep the
numbers which for which I got a heads
okay so I just flip coins. I just
randomly take out half the numbers I
keep one half. So that's a set that has
no no patterns at all. But just from
random fluctuations, you will still get
a lot of um um of arithmetic
progressions in that set. Can you prove
that
there's arithmetic progressions of
arbitrary length within a random? Yes.
Um have you heard of the infinite monkey
theorem? Usually mathematicians give
boring names to theorists, but
occasionally they they give colorful
names. Yes. The popular version of the
infinite monkey theorem is that if you
have an infinite number of monkeys in a
room with each with a typewriter they
type out uh text randomly almost surely
one of them is going to generate the
entire screw of Hamlet or any other
finite string of text. Uh it will just
take some time quite a lot of time
actually but if you have an infinite
number then it happens. Um so um
basically the the if you take an
infinite string of of digits or whatever
um eventually any finite pattern you
wish will emerge. Um it may take a long
time but it will eventually happen. Um
in particular arithmetic progressions of
any length will eventually happen. Okay.
But you need that but you need an
extremely long random sequence for this
to happen. I suppose that's intuitive.
It's just infinity. Yeah. Infinity
absorbs a lot of sins. Yeah. How are we
humans supposed to deal with infinity?
Well, you can think of infinity as as as
just an abstraction of um a finite
number for which you you do not have a
bound for um that uh you know I mean so
nothing in real life is truly infinite.
Um but you know you can um you know you
can ask yourself questions like you know
what if I had as much money as I wanted
you know or what if I could go as fast
as I wanted and a way in which
mathematicians formalize that is
mathematics has found a formalism to
idealize instead of something being
extremely large or extremely small to
actually be exactly infinite or zero. Um
and often the the mathematics becomes a
lot cleaner when you do that. I mean in
physics we we joke about uh assuming
spherical cows. um you know like real
world problems have got all kinds of
real world effects but you can idealize
send certain things to infinity send
certain things to zero um and um and the
mathematics becomes a lot simpler to
work with there. I wonder how often
using infinity
uh forces us to deviate from um the
physics of reality. Yeah. So there's a
lot of pitfalls. Um so you know we we
spend a lot of time in undergraduate
math classes teaching analysis. Um and
analysis is often about how to take
limits and and and and whether you you
know so for example a plus b is always b
plus a. Um so when you have a finite
number of terms you add them you can
swap them and there there's no problem.
But when you have infinite number of
terms there these sort of shell games
you can play where you can have a series
which converges to one value but you
rearrange it and it suddenly converges
to another value. And so you can make
mistakes. You have to know what you're
doing when you allow infinity. Um you
have to introduce these epsilons and
deltas and and this there's a certain
type of way of reasoning that helps you
avoid mistakes. Um
in more recent years um people have
started taking results that are true in
infinite limits and what's called
finetizing them. Um so you know that
something's true eventually but um you
don't know when. Now give me a rate.
Okay. Okay, so it's such a if I have
don't have an infinite number of monkeys
but but a large finite number of
monkeys, how long do I have to wait for
H to come out? Um and that's a more
quantitative question. Um and this is
something that you can you can um attack
by purely finite methods and you can use
your finite intuition. Um and in this
case it turns out to be exponential in
the length of the text that you're
you're trying to generate. Um so um and
so this is why you never see the monkeys
create Hamilton. you can maybe see them
create a four-letter word, but nothing
that big. And so I personally find once
you finitize an infinite statement, it's
it does become much more intuitive and
it's no longer so so weird. Um so even
if you're working with infinity, it's
good to finitize so that you can have
some intuition. Yeah. The downside is
that the finite groups are just much
much messier and and uh yeah. So so the
infinite ones are found first usually
like decades earlier and then later on
people finize them. So since we
mentioned a lot of math and a lot of
physics uh what is the difference
between mathematics and physics as
disciplines as ways of understanding of
seeing the world maybe we can throw in
engineering in there you mentioned your
wife is an engineer give it new
perspective on circuits right so this
different way of looking at the world
given that you've done mathematical
physics so you you've you've worn all
the hats right so I think science in
general is interaction between three
things um there's the real world um
there's is what we observe of the
reward, our observations and then our
mental models as to how we think the
world works. Um so um we can't directly
access reality. Okay. Uh all we have are
the observations which are incomplete
and they they have errors. Um and um
there are many many cases where we would
um uh we want to know for example what
is the weather like tomorrow and we
don't yet have the observation we'd like
to a prediction. Um and then we have
these simplified models sometimes making
unrealistic assumptions you know
spherical cow type things. Those are the
mathematical models. Mathematics is
concerned with the models. Science
collects the observations and it
proposes the models that might explain
these observations. What mathematics
does we we stay within the model and we
ask what are the consequences of that
model? what observations would what
predictions would the model make of the
of future observations um or past
observations does it fit observed data
um so there's definitely a symbiosis um
it's ma I guess mathematics is is
unusual among other disciplines is that
we start from hypothesis like the axims
of a model and ask what conclusions come
up from that that model um in almost any
other discipline uh you start with the
conclusions you know I want to do this I
want to build a bridge, you know, I I
want to to make money. I want to do
this. Okay. And then you you you find
the path to get there. Um
a lot there there's a lot less sort of
speculation about suppose I did this,
what would happen? Um you know, planning
and and and modeling um uh speculative
fiction maybe is one other place. Uh but
uh that's about it actually. Most of
things we do in life is conclusions
driven including physics and science.
You I mean they want to know you know
where is this asteroid going to go? What
was what what is the weather going to be
tomorrow? Um but um Bathe also has this
other direction of of going from the uh
the axioms. What do you think there is
this tension in physics between theory
and experiment? Mhm. What do you think
is the more powerful way of discovering
truly novel ideas about reality? Well,
you need both top down and bottom up. Um
yeah, it's it's a real interaction
between all these things. So over time
the observations and the theory and the
modeling should both get closer to
reality. But initially and it is I mean
this is um this is always the case. You
know they're always far apart to begin
with. Um but you need one to figure out
where to push the other you know. So um
if your model is predicting anomalies um
that are not picked up by experiment
that tells experimenters where to look
you know um to to to to find more data
to refine the models. Um yeah so it it
it goes it goes back and forth. Um
within mathematics itself there's
there's also a theory and experimental
component. It's just that until very
recently theory has dominated almost
completely like 99% of mathematics is
theoretical mathematics and there's a
very tiny amount of experimental
mathematics. Um I mean people do do it
you know like if they want to study
prime numbers or whatever they can just
generate large data sets and with a so
once we had computers um we be to do it
a little bit. Um although even before
well like Gaus for example he discovered
he conjectured the most basic theorem in
in number theory to call the prime
number theorem which predicts how many
primes that up to a million up to a
trillion. It's not an obvious question
and basically what he did was that he
computed I mean mostly um by himself but
also hired human computers um people who
whose professional job it was to do
arithmetic um to compute the first
100,000 tribes or something and made
tables and made a prediction um that was
an early example of experimental
mathematics
um but until very recently it was not um
yeah I mean theoretical mathematics was
just much more successful I mean because
doing complicated mathematical
computations is uh was just not not
feasible until very recently. Uh and
even nowadays, you know, even though we
have powerful computers, only some
mathematical things can be um explored
numerically. There's something called
the comatorial explosion. If you want us
to study, for example, Zodius the you
want to study all possible subsets of
the numbers 1 to a,000. There's only
1,000 numbers. How bad could it be? It
turns out the number of different
subsets of of 1 to a,000 is 2 to the^
1,000 which is way bigger than than that
any computer can currently can can in
fact anybody ever will ever um
enumerate. Um so you have you have to be
um there are certain math problems that
very quickly become just intractable to
attack by direct brute force
computation. Uh chess is another um
famous example. The number of chess
positions uh we can't get a computer to
fully explore.
But now we have AI um um we have tools
to explore this space not with 100%
guarantees of success but with
experiment you know so like um we can
empirically solve chess now for example
we have we have very very good AIs that
that can you know they don't explore
every single position in in the game
tree but they have found some very good
approximation um and people are using
actually these chess engines to make uh
to do experimental chess um that they're
revisiting old chess theories about, oh,
you know, when you this type of opening,
you know, this is a good, this is a good
type of move, this is not, and they can
use these chess engines to actually
refine in some case overturn um um
conventional wisdom about chess. And I
do hope that uh that mathematics will
will have a larger experimental
component in the future perhaps powered
by AI. We'll of course talk about that
but in the case of chess and there's a
similar thing in mathematics that I
don't believe it's providing a kind of
formal explanation of the different
positions. It's just saying which
position is better or not that you can
intuit it as a human being and then from
that we humans can construct a theory of
the matter. You've mentioned the Plato's
cave allegory. Mhm. So in case people
don't know, it's where people are
observing shadows of reality, not
reality itself, and they believe what
they're observing to be reality. Is that
in some sense what mathematicians and
maybe all humans are doing is um looking
at shadows
of reality? Is it possible for us to
truly access
reality? Well, there these three
onlogical things. there's actual
reality, there's our observations and
our our models. Um, and technically they
are distinct and I think they will
always be distinct. Um, but they can get
closer um over time. Um, you know, so um
and the process of getting closer often
means that you you have to discard your
initial intuitions. Um so um like
astronomy provides great examples you
know like you know like you an initial
model of the world is is flat because it
looks flat you know and um and that it's
and it's big you know and the rest of
the universe the skies is not you know
like the sun for example looks really
tiny um and so you start off with a
model which is actually really far from
reality um but it fits kind of the
observations that you have um you know
so you know so things look good you know
but but over time as you make more and
more observations bring it closer to to
reality Okay. Um the model gets dragged
along with it and so over time we had to
realize that the earth was round that it
spins. It goes around the solar system.
Solar system goes around the galaxy and
so on and so forth. And the guys
universe is expanding the expansion
itself expanding accelerating and in
fact very recently in this year. So this
uh even the acceleration of the universe
itself is this evidence that this
non-constant and uh the explanation
behind why that is it's catching up. Um
it's catching up. I mean it's still you
know the dark matter or dark energy this
this kind of thing. We have we have a
model that sort of explains that fits
the data really well. It just has a few
parameters that um you have to specify.
Um but so you know people say that's
fudge factors you know with with enough
fudge factors you can explain anything.
Um but uh the mathematical point of the
model is that um you want to have fewer
parameters in your model than data
points in your observational set. So if
you have a model with 10 parameters that
explains 10 10 observations that is a
completely useless model. It's what's
called overfitted. But like if you have
a model with you know two parameters and
it explains a trillion observations
which is basically uh so yeah the the
the dark matter model I think has like
14 parameters and it explains pabytes of
data um that that that the astronomers
have. Um you can think of of a theory
like one way to think about um physical
math theory theory is it's a compression
of of the universe um and data
compression. So you know you have these
pabytes of observations you'd like to
compress it to a model which you can
describe in five pages and specify a
certain number of parameters and if it
can fit to reasonable accuracy you know
almost all of your observations. I mean
the more compression that you make the
better your theory. In fact, one of the
great surprises of our universe and of
everything in it is that it's
compressible at all. It's the
unreasonable effectiveness of
mathematics. Yeah, Einstein had a quote
like that. The the most incomprehensible
thing about the universe is that it is
comprehensible, right? And not just
comprehensible. You can do an equation
like E= MC². There is actually a some
mathematical possible explanation for
that. Um, so there's this phenomenon in
mathematics called universality. So many
complex systems at the macro scale are
coming out of lots of tiny interactions
at the macro scale and normally because
of the common form of explosion you
would think that uh the macros scale
equations must be like infinitely
exponentially more complicated than than
the uh the microscale ones and they are
if you want to solve them completely
exactly like if you want to model um all
the atoms in a box of of air that's like
Avagadro's number is humongous right
there's a huge number of particles if
you actually have to track each one
it'll be ridiculous. this but certain
laws emerge at the microscopic scale
that almost don't depend on what's going
on at the micros scale or only depend on
a very small number of parameters. So if
you want to model a gas um of you know
quintilion particles in a box you just
need to know it temperature and pressure
and volume and a few parameters like
five or six and it models almost
everything you need need to know about
these 10 to 23 or whatever particles. Um
so we we have um we we don't understand
universality anywhere near as we would
like mathematically but there are much
simpler toy models where we do um have a
good understanding of why univers
universality occurs. Um um most basic
one is is the central limit theorem that
explains why the bell curve shows up
everywhere in nature that so many things
are distributed by what's called a
Gaussian distribution famous bell curve.
There's now even a meme with this curve
and even the meme applies broadly
universality to the meme. Yeah. Yes, you
can go meta if you like. But there are
many many processes for example you can
take lots and lots of independent um
random variables and average them
together um uh in in various ways. you
take a simple average or more
complicated average and we can prove in
various cases that that these these bell
curves these gaussians emerge and it is
a satisfying satisfying explanation. Um
sometimes they don't. Um so so if you
have many different inputs and they're
all correlated in some systemic way then
you can get something very far from a
bow curve show up. Uh and this is also
important to know when this system
fails. So universality is not a 100%
reliable thing to rely on that um um the
global financial crisis was a a famous
example of this. Uh people thought that
uh um mortgage defaults um had this sort
of um Gaussian type behavior that that
if you if you ask if a population of of
of uh you know 100,000 Americans with
mortgages ask what what proportion of
them would default on the mortgages. Um
if everything was decorated it would be
an asset bell curve and and like you can
you can manage risk with options and
derivatives and so forth and um and it
there's a very beautiful theory um but
if there are systemic shocks in the
economy uh that can push everybody to
default at the same time that's very
non-gian behavior um and uh this wasn't
fully accounted for in 2008
now I think there's some more awareness
that this is systemic risk is actually a
much bigger issue and uh just because
the model is pretty uh and nice uh it
may not match reality. Right. So, so the
mathematics of working out what models
do is really important. Um, but um also
the science of validating when the
models fit reality and when they don't.
Um, I mean that you need both. Um, and
but mathematics can help because it it
can for example these central limit
theorems it tells you that if you have
certain aums like like non-correlation
that if all the inputs were not
correlated to each other um then you
have this kind of behavior things are
fine. it it tells you where to look for
weaknesses in the model. So if you have
a mathematical understanding of central
limit theorem and someone proposes use
these Gaussian copy or whatever to to
model um default risk um if you're
mathematically um trained you would say
okay but what if this systemic
correlation between all your inputs and
so then then you can ask the economists
you know how how how much of a risk is
that um and then you can you can you can
go look for that. So there's always this
this this synergy between science and
and mathematics. A little bit on the
topic of universality. Mhm.
You're known and celebrated for working
across an incredible breadth of
mathematics reminiscent of Hilbert a
century ago. In fact, the great Fields
Medal winning mathematician Tim Gow has
said that you are the closest thing we
get to Hilbert.
He's a colleague of yours. Oh yeah. Good
friend. But anyway, so you are known for
this ability to go both deep and broad
in mathematics. So you're the perfect
person to ask, do you think there are
threads that connect all the disparate
areas of mathematics? Is there a kind of
deep underlying structure
uh to all of mathematics? There's
certainly a lot of connecting threads.
Um and a lot of the progress of
mathematics has can be represented by
taking by stories of two fields of
mathematics that were previously not
connected and finding connections. Um an
ancient example is um geometry and
number theory you know. So so in the
times of the ancient Greeks these were
considered different subjects. Um I mean
mathematicians worked on both. You know
you could work both on on geometry most
famously but also on numbers. Um but
they were not really considered related.
Um I mean a little bit like you know you
could say that that this length was five
times this length because you could take
five copies of this length and so forth.
But it wasn't until Deart who really
realized that who developed analytic
geometry that you can you can
parameterize the plane a geometric
object by um by two real numbers. Every
point can be and so geometric problems
can be turned into into problems about
numbers. Um and the the today this feels
almost trivial like like there's there's
there's no content to this like of
course uh you you know um a plane is xx
and y and because that's what we teach
and it's internalized. Um but it was an
important development that these these
two fields were unified. Um and this
process has just gone on throughout
mathematics over and over again. algebra
and geometry were separated and now we
have a student algebraic geometry that
connects them and over and over again
and that's certainly the type of
mathematics that that I enjoy the most.
So I think there's sort of different
styles to being a mathematician. I think
hedgehogs and fox a fox knows many
things a little bit but a hedgehog knows
one thing very very well. Um and in
mathematics there's definitely both
hedgehogs and foxes. Um and then there's
people who are kind of uh who can play
both roles. Um and I think like ideal
collaboration between mathematicians
involves a very you need some diversity
like um a fox working with many
hedgehogs or or vice versa. So yeah but
I identify mostly as a fox certainly I I
like uh arbitrage somehow you like like
um learning how one field works learning
the tricks of that field and then going
to another field which people don't
think is related but I can I can adapt
the tricks. So see the connections
between the fields. Yeah. So there are
other mathematicians who are far deeper
than I am. Like who really they're
really hedgehogs. They they know
everything about one field and they're
much faster and and and more effective
in that field. But I can I can give them
these extra tools. I mean you said that
you can be both the hedgehog and and the
fox depending on the context depending
on the collaboration. So what can you if
it's at all possible speak to the
difference between those two ways of
thinking about a problem? say you're
encountering a new problem, you know,
searching for the connections versus
like very singular focus. I'm much more
comfortable with with the uh the uh the
fox paradigm. Yeah. So, um yeah, I I
like looking for analogies, narratives.
Um I I spend a lot of time if there's a
result I see in one field and I like the
result, it's a cool result, but I don't
like the proof. like it uses types of
mathematics that I'm not super familiar
with. Um I often try to reprove it
myself using the tools that I favor. Um
often my proof is worse. Um but um by
the exercise of doing so um I can say oh
now I can see what the other proof was
trying to do. Um and from that I can get
some understanding of of the tools that
are used in in that field. So it's very
exploratory, very doing crazy things in
crazy fields and like reinventing the
wheel a lot. Yeah. Whereas the hedgehog
style is uh I think much more scholarly,
you know, you you you're very knowledge
based. You you you you stay up to speed
on like all the developments in this
field. You you know all the history. Um
you have a very good understanding of of
exactly the strengths and weaknesses of
of each particular uh technique. Um
yeah uh I think you you rely a lot more
on sort of calculation than sort of
trying to find narratives. Um so yeah I
mean I can do that too but uh there are
other people who are extremely good at
that. Let's step back and uh
uh maybe look at the the a bit of a
romanticized version of mathematics.
Mhm. So, uh I think you've said that
early on in your life, uh math was more
like a puzzle solving activity when you
were uh young. When did you first
encounter a problem or proof where you
realize math can have a kind of elegance
and beauty to it?
That's a good question. Um when I came
to graduate school uh in Princeton, um
so John Conway was there at the time. He
he passed away a few years ago. But uh I
remember one of the very first research
talks I I went to was a talk by Conway
on what he called extreme proof. So
Conway had just had this this amazing
way of of thinking about all kinds of
things in in a way that you would
normally think of. So um he thought of
proofs themselves as occupying some sort
of space, you know. So, so um if you
want to prove something, let's say that
there's infinitely many primes, okay,
you avoid different proofs, but you
could you could rank them in different
axes like some proofs are elegant, some
are long, some proofs are are um
elementary and so forth. Um and so
there's this cloud. So the space of all
proofs itself has some sort of shape. Um
and so he was interested in in extreme
points of this shape like out of all all
these proofs what is one that is the
shortest at the the extent of every
everything else or or the most
elementary or or whatever. Um and so he
gave some examples of well-known
theorems and then he would give what he
thought was was the extreme proof um in
these different aspects. Um and I I just
found that really eye opening um that
that um you know it's not just getting a
proof for a result was interesting but
but once you have that proof you know
trying to to uh to optimize it in
various ways. Um that that proof um uh
proofing itself had some craftsmanship
to it. Um it it certainly informed my
writing style. Um but you know like when
you do your your math assignments and as
undergraduate your homework and so
forth, you you're sort of encouraged to
just write down any proof that works,
okay, and hand it in and get a get as
long as it gets a tick mark, you you
move on. Um but if you want your your
results to actually be influential and
be read by people, um it can't just be
correct. It should also um be a pleasure
to read, you know, um motivated um be
adaptable to to generalize to other um
things. Um it's the same in many other
disciplines like like coding. It's a
there's a lot of analogies between math
and coding. I like analogies if you
haven't noticed. Um but um you know like
you can code something spaghetti code
that works for a certain task and it's
quick and dirty and it works. But uh
there's lots of good principles for for
um writing code well so that other
people can use it build upon it and so
on and has fewer bugs and whatever. Um
and there's similar things with mathemat
mathematics. So yeah the first of all
there's so many beautiful things there
and and is one of the great minds uh in
mathematics ever and computer science.
Uh just even considering the space of
proofs. Yeah. and saying, "Okay, what
does this space look like and what are
the extremes?" Uh, like you mentioned,
coding as an analogy is interesting
because there's also this activity
called the code golf. Oh, yeah. Yeah.
Yeah. Which I also find beautiful and
fun where people use different
programming languages to try to write
the shortest possible program that
accomplishes a particular tasks. Then I
believe there's even competitions on
this. Yeah. And uh it's also a nice way
to stress test not just the
sort of the programs or in this case the
proofs but also the different languages
maybe that's the different notation or
whatever to use to to accomplish a
different task. Yeah, you learn a lot. I
mean it may seem like a frivolous
exercise but it can generate all these
insights which if you didn't have this
artificial um objective to to to pursue
you might not see. What to you is the
most beautiful or elegant equation in
mathematics? I mean one of the things
that people often look to in in beauty
is the simplicity. So if you look at E=
MC² so when when a few concepts come
together that's why the oiler identity
is often considered uh the most
beautiful equation in mathematics. Do
you do you find beauty in that one and
the oil identity? Yeah. Well, as I said,
I mean, what I find most appealing is is
connections between different things
that um so the if ei= minus one um so
yeah people oh uses all the fundamental
constants okay that that's I mean that's
cute um but but to me so the exponential
function was interested by oil to
measure exponential growth you know so
compound interest or decay anything
which is continuously growing
continuously decreasing growth and decay
or dilation or contraction is modeled by
the exponential function Um whereas pi
uh comes around from circles and
rotation right if you want to rotate a
needle for example 180° you need to
rotate by pi radians and i complex
numbers represents the swing between
imagine axis of a 90° rotation so a
change in direction so the x function
represents growth and decay in the
direction where you really are um when
you stick an i in the exponential it now
it's it's instead of motion in the same
direction as your current position it's
the motion has right angles to
composition. So rotation um and then so
e e pi equ= minus 1 tells you that if
you rotate for time pi you end up at the
other direction. So it unifies geometry
through dilation and exponential growth
or dynamics through this act of of
complexification rotation by by i. So it
connects together all these tools
mathematics. Yeah. Yeah. dynamic
structure and complex and complex and um
the complex numbers they all considered
almost yeah they were all next door
neighbors in mathematics because of this
identity. Do do you think the thing you
mentioned is cute the the the collision
of notations from these disperate
fields?
Um it's just a frivolous side effect or
do you think there is legitimate like
value in when the notation all the our
old friends come together
night? Well, it's it's it's confirmation
that you have the right concepts. Um so
when you first study anything um you you
have to measure things and give them
names. Um and initially sometimes your
because your your model is again too far
off from reality you give the wrong
things the best names and you only find
out later what's what's really important
physicists can do this sometimes I mean
but it turns out okay so actually with
physics okay so E= MC² okay so one of
the the big things was the E right so
when when Aristotle first came up with
his laws of of motion and then and then
um Galileo or Newton and so forth you
know they saw the things they could they
could measure they could measure mass
and acceleration and force and so forth
and so Newtonian mechanics for example
F= ma was the famous Newton second law
of motion so those were the the primary
objects so they gave them the central
building in the theory it was only later
after people started analyzing these
equations that there always seemed to be
these quantities that were conserved um
so momentum and energy um uh and it's
not obvious that things happen energy
like it's not something you can directly
measure the same way you can measure
mass and and and velocity so forth but
over time people realize is that this
was actually a really fundamental
concept. Hamilton eventually in 19th
century reformulated Newton's laws of
physics into what's called Hamiltonian
mechanics where the energy which is now
called the Hamiltonian was the dominant
object once you know how to measure the
Hamiltonian of any system. You can
describe completely the dynamics like
what happens to to all the states like
it's um it it really was a central actor
which was not obvious initially. Um and
this uh helped actually uh this change
of perspective really helped when
quantum mechanics came along. Uh because
um the early physicists who studied
quantum mechanics, they had a lot of
trouble trying to adapt their Newtonian
thinking because everything was a
particle and so forth to to to quantum
mechanics, you know, because I think
because it was a wave. It just looked
really really weird. Um like you ask
what is the quantum version of F equals
MA? And it's really really hard to to
give an answer to that. Um but it turns
out that the Hamiltonian which was so um
secretly behind the scenes in classical
mechanics also is the key uh object in
um um in quantum mechanics that there's
there's also an object called
Hamiltonian. It's a different type of
object. It's what's called an operator
rather than than a function. But um and
um but again once you specify it you
specify the entire dynamics. So there's
something called Shingers equation that
tells you exactly how quantum systems
evolve once you have a Hamiltonian. So
side by side they look completely
different objects you know like so one
involves particles one involves waves
and so forth but with this centrality
you could start actually transferring a
lot of intuition and facts from
classical mechanics to quantum
mechanics.
For example, in classical mechanics,
there's this thing called ner's theorem.
Every time there's a symmetry in a
physical system, there is a conservation
law. So the laws of physics are
translation invariant. Like if I move 10
steps to the left, I experience the same
laws of physics as if I was here. And
that corresponds to conservation
momentum. Um if I turn around by by some
angle again, I experience the same laws
of physics. This corresponds to
conservation angular momentum. If I wait
for 10 minutes, um I still have the same
laws of physics. Um so this time
translation variance. this corresponds
to the low conservation of energy. Um,
so there's this fundamental connection
between symmetry and conservation. Um,
and that's also true in quantum
mechanics. Even though the equations are
completely different, but because
they're both coming from the
Hamiltonian, the Hamiltonian controls
everything. Um, every time the
Hamiltonian has a symmetry, the
equations will will have a conservation
law. Um, so it's it's it's it's once you
have the right language, it actually
makes things um a lot a lot cleaner. One
of the problems why we can't unify
quantum mechanics and general relativity
yet we haven't figured out what the
fundamental objects are like for example
we have to give up the notion of space
and time being these almost uklidian
type spaces and there has to be um you
know and you know we kind of know that
at very tiny scales um there's going to
be quite fluctuations of space
space-time foam um and trying to to use
cartigian coord xyz is going to be it's
it's just it's it's a non-starter but we
don't know how to what to replace it
with um We don't actually have the
mathematical um um concepts the analog
Hamiltonian that sort of organized
everything. Does your gut say that there
is a theory of everything. So this is
even possible to unify to find this
language that unifies general relativity
and quantum mechanics. I believe so. I
mean the history of physics has been out
of unification much like mathematics um
over the years. You know electricity and
magnetism were separate theories and
then Maxwell unified them. you know,
Newton unified the the motions of the
heavens with the motions on of objects
on the earth and so forth. So, it should
happen. It's just that the um u again to
go back to this model of the
observations and and theory. Part of our
problem is that physics is a victim's
own success that our two big theories of
of of physics general relativity and
quantum mechanics are so are so good now
that together they cover 99.9% of sort
of all the observations we can make. Um,
and you have to like either go to
extremely insane particle accelerations
or or the early universe or or or things
that are really hard to measure um in
order to get any deviation from either
of these two theories to the point where
you can actually figure out how to how
to combine them together. Um, but I have
faith that we, you know, we've we've
been doing this for centuries and we've
made progress before. There's no reason
why we should stop. Do you think it will
be a mathematician that develops uh
theory of everything? What often happens
is that when the physicists need uh um
some of mathematics, there's often some
precursor that the mathematicians um
worked out earlier. So when Einstein
started realizing that space was curved,
he went to some mathematician and asked
is there is there some theory of curved
space that the mathematicians already
came up with that could be useful and he
said oh yeah there's I think Reman came
up with something um and so yeah Reman
had developed remmaning geometry um
which is precisely you know a theory of
spaces that occurred in various general
ways which turned out to be almost
exactly what was needed um for
Einstein's theory. This is going back to
Dwick's unreasonable effectiveness of
mathematics. I think the theories that
work well to explain the universe tend
to also involve the same mathematical
objects that work well to solve
mathematical problems. Ultimately,
they're just sort of both ways of
organizing data um in in in useful ways.
It just feels like you might need to go
some weird land that's very hard to to
intuit it like you know you have like
string theory. Yeah, that that's that
was that was a leading candidate for
many decades. It's I think it's slowly
falling out of fashion because it's it's
not matching experiment. So one of the
big challenges of course like you said
is experiment is very tough. Yes.
Because of the how effective both
theories are. But the other is like just
you know you're talking about you're not
just deviating from spaceime. You're
going into like some crazy number of
dimensions. You're doing all kinds of
weird stuff that to us we've gone so far
from this flat earth that we started at
like now we're just it's it's very hard
to use our limited ape descendants of uh
uh cognition to intuitit what that
reality really is like. This is why
analogies are so important, you know. I
mean, so yeah, the round earth is not
intuitive because we're stuck on it, but
you know, but you know, but round
objects in general, we have pretty good
intuition over uh and we have intuition
about light works and so forth. And like
it's it's actually a good exercise to
actually work out how eclipses and
phases of of the sun and the moon and so
forth can be really easily explained by
by by by round earth and round moon, you
know, um and models. Um and and you can
just take you know a basketball and a
golf ball and and and a light source and
actually do these things yourself. Um so
the intuition is there. Um but yeah you
have to transfer it. That is a big leap
intellectually for us to go from flat to
round earth because you know our life is
mostly lived in flat land. Yeah. To load
that information and we all like take it
for granted. We take so many things for
granted because science has established
a lot of evidence for this kind of
thing. But you know, we're on a round
rock. Yeah. Flying through space. Yeah.
Yeah. And it's a big leap and you have
to take a chain of those leaps the more
and more and more we progress. Right.
Yeah. So modern science is maybe again a
victim of its own success is that you
know in order to be more accurate it has
to to move further and further away from
your initial intuition. And so um for
someone who hasn't gone through the
whole process of science education it
looks more more suspicious because of
that. So, you know, we we need we need
more grounding. I mean, I I think um I
mean, you know, there are there are
scientists who do excellent outreach. Um
but there's this there this there's
there there's lots of science things
that you can do at home. There's lots of
YouTube videos. I did a YouTube video
recent of Grant Sanderson. We talked
about this earlier that uh you know how
the ancient Greeks were able to measure
things like the distance to the moon,
distance to the earth, and you know,
using techniques that you you could also
replicate yourself. Um it doesn't all
have to be like fancy space telescopes
and and very intimidating mathematics.
Yeah, that's uh I highly recommend that.
I believe you give a lecture and you
also did an incredible video with Grant.
It's a beautiful experience to try to
put yourself in the mind of a person
from that time. Mhm. Shrouded in
mystery, right? You know, you're like on
this planet, you don't know the shape of
it, the size of it. You see some stars,
you see some you see some things and you
try to like localize yourself in this
world. Yeah. Yeah. And try to make some
kind of general statements about
distance to places. Change your
perspective is really important. You say
travel bordens the mind. This is
intellectual travel. You know put
yourself in the mind of the ancient
Greeks or or some other person some
other time period. Make hypothesis
spherical cows whatever you know
speculate. Um and you know this is this
is what mathematicians do and some what
artists do actually. It's just
incredible that given the extreme
constraints, you could still say very
powerful things. That's why it's
inspiring looking back in history. How
much can be figured out right when you
don't have much to figure out stuff like
if you propose axioms then the
mathematics lets you follow those a to
their conclusions and sometimes you can
get quite a quite a long way from you
know initial hypothesis. If we can stay
in the land of the weird, you mentioned
general relativity. You've uh you've
contributed uh to the mathematical
understanding of Einstein's field
equations. Can you explain this work and
from a sort of mathematical standpoint
uh what aspects of general relativity
are intriguing to you, challenging to
you? I have worked on some equations.
There's something called the the wave
maps equation or the sigma field model
which is not quite the equation of
space-time gravity itself but of certain
fields that might exist on top of
spaceime. Um so Einstein's equations of
relativity just describes space and time
itself. Um but then there's other fields
that live on top of that. There's the
electromagnetic field. Um there's
control fields and there's this whole
hierarchy of different equations of
which Einstein is considered one of the
most nonlinear and difficult. But
relatively low in the hierarchy was this
thing called the wave maps equation. So
it's a wave which at any given point uh
is fixed to be like on a sphere. Um so
uh I can think of a bunch of arrows in
space and time and and the arrows
pointing in in different directions. Um
but they propagate like waves. If you
wiggle an arrow it was it will propagate
and make all the arrows move kind of
like sheets of wheat in the wheat field.
And I was interested in the global
regularity problem again for this
question like is it possible for for all
the energy here to collect at a point.
So the equation I considered was
actually what's called a critical
equation where it's actually the
behavior at all scales is roughly the
same. Um and I was able barely to show
that um that you couldn't actually force
a scenario where all the energy
concentrated at one point that the
energy had to disperse a little bit and
the moment it dis little bit it it would
it would stay regular. Yeah. This was
back in 2000. That was part of why I got
interested in narrows afterwards
actually. Yeah. So I developed some
techniques to um solve that problem. So
part of it is it was um this problem is
really nonlinear uh because of the
curvature of the sphere. Um this there
was a certain nonlinear effect which was
a non-perturbative effect. It was when
you sort of looked at it normally it
looked larger than the linear effects of
the wave equation. Um and so it was hard
to to keep things under control even
when the energy was small. But I
developed what's called a gauge
transformation. So the equation is kind
of like an evolution of of of heaves of
wheat and and they're all bending back
and forth and so there's a lot of
motion. Um but like if you imagine like
stabilizing the flow by attaching little
cameras at different points in space
which are trying to move in a way that
captures most of the motion and under
this stabilized flow the flow becomes a
lot more linear. I discovered a way to
transform the the equation to reduce the
amount of of nonlinear effects. Um and
then I was able to to to to solve the
equation. I found this transformation
while visiting my aunt in Australia and
I was trying to understand the dynamics
of all these fields and I I couldn't do
it with pen and paper. Um and I had not
enough facility of computers to do any
computer simulations. So I ended up
closing my eyes being on on the floor
and just imagining myself to actually be
this vector field and rolling around to
try to to see how to change coordinates
in such a way that somehow things in all
directions would behave in a reasonably
linear fashion. And yeah, my aunt walked
in on me while I was doing that and she
was asking what do I what am I doing
doing this? It's complicated is the
answer. Yeah. Yeah. And you know, okay,
fine. You know, you're a young man. I
don't ask questions. I I I have to ask
about the you know um how do you
approach solving difficult problems?
What if it's possible
to go inside your mind when you're
thinking? Are you visualizing
in your mind the mathematical objects
symbols maybe what are you visualizing
in your mind usually when you're
thinking um a lot of pen and paper one
thing you pick up as a mathematician is
sort of uh I call it cheating
strategically um so u the the beauty of
mathematics is that is that you get to
change the rule change the problem
change the rules as you wish this you
don't get to do this for any other field
like you know if if you're an engineer
and someone says build a bridge over
this this You can't say I want to build
this up bridge over here instead or I
want to build out of paper in instead of
steel. Um but a mathematician you can
you can do whatever you want. Um
it's it's like trying to solve a
computer game where you can there's
unlimited cheat codes available. Uh and
so you know you you can you can set
this. So there's a dimension that's too
large. I'll set it to one. I'd solve the
one dimension problem first. So there's
a main term and an error term. I'm going
to make a spherical car assumption. I'll
assume the error term is zero. And so
the way you should solve these problems
is is not in sort of this iron man mode
where you make things maximally
difficult. Um but actually the way you
should you should approach any
reasonable math problem is that you if
if there are 10 things that are making
your life difficult. Find a version of
the problem that turns off nine of the
difficulties but only keeps one of them.
Um and so that um and then that just so
you you you install nine cheats. Okay.
You install 10 cheats then then the game
is trivial. You saw nine cheats, you
solve one problem that that that teaches
you how how to deal with that particular
difficulty and then you turn that one
off and you turn someone else something
else else on and then you solve that one
and after you you know how to solve the
10 problems 10 difficulties separately
then you have to start merging them a
few at a time. Um I I as a kid I watched
a lot of these Hong Kong action movies.
Um it's from a culture. Um and uh one
thing is that every time there was a
fight scene, you know, so maybe the the
hero will get swarmed by a hundred bad
guy goons or whatever. But it would
always be choreographed so that he'd
always be only fighting one person at a
time and then he would defeat that
person and move on and and because of
that he could he could defeat all of
them, right? But whereas if they had
fought a bit more intelligently and just
swarmed the guy at once, uh it would
make for much much worse um cinema, but
uh but they would win. Are you usually
uh pen and paper? Are you working uh
with computer and latte? I'm mostly pen
and paper actually. So in my office, I
have four giant blackboards. Um and
sometimes I just have to write
everything I know about the problem on
the four blackboards and then sit my
couch and just sort of see the whole
thing. Is it all symbols like notation
or is there some drawings? Oh, there's a
lot of drawing and a lot of bespoke
doodles that that only make sense to me.
Um I mean and and the beauty of
blackboard is you erase and it's it's
very organic thing. Um I'm beginning to
use more and more computers. Um partly
because AI makes it much easier to do
simple coding things that you know if I
wanted to plot a function before which
is moderately complicated as some
iteration or something you know I'd have
to to remember how to set up a Python
program and and and and and how does a
for loop work and and and debug it and
it would take two hours and so forth and
and now I can do it in 10 15 minutes is
much um yeah I'm using more and more uh
computers to do simple explorations.
Let's talk about AI a little bit if we
could. So um maybe a good entry point is
just talking about computer assisted
proofs in general. Can you describe the
lean formal proof programming language
and how it can help as a proof assistant
and maybe how you started using it and
how uh it has helped you. So um we is a
computer language um much like sort of
standard languages like Python and C and
so forth except that in most languages
the focus is on producing executable
code. Lines of code do things you know
they they flip bits or or they make a
robot move or or they they deliver you
text on the internet or something. Um so
lean is a language that can also do
that. Uh it can also be run as a
standard traditional language but it can
also produce certificates. So a software
like like Python might do a computation
and give you that the answer is seven.
Okay, that does a sum of 3+ 4 is equal
to 7 but uh lean can produce not just
the answer but but a proof that how it
got the the answer of seven as 3+ 4 and
all the steps involved in in so it
creates these more complicated objects
not just statements but statements with
proofs attached to them. um and um every
line of code is just a way of p piecing
together previous statements to to
create new ones. So the idea is not new.
These things are are called proof
assistants and so they provide languages
for which you you can create quite
complicated um intricate mathematical
proofs and um they produce these
certificates that that give a 100% um
guarantee that your arguments are
correct if you trust the compiler of but
they made the compiler really small and
you can there are several different
compilers available for the same for um
can you give people some intuition about
the the difference between writing on
pen and paper versus using lean
programming language How hard is it to
formalize
statement? So lean a lot of
mathematicians were involved in the
design of lean. So it's it's designed so
that individual lines of code resemble
individual lines of mathematical
argument like you might want to
introduce a variable. You want want to
prove a contradiction. You you um there
are various standard things that you can
do and and it's it's written so ideally
it should like a one correspondence. In
practice, it isn't because lean is like
explaining a proof to an extremely
pedantic colleague who will will point
out okay did you really mean this like
what what happens if this is zero? Okay.
Um did you how do you justify this? Um
so lean has a lot of automation in it um
to try to to uh to be less annoying. Um
so for example um every mathematical
object has to come with a type like if I
if I talk about X is X a real number or
um a natural number or or a function or
something um if you write things
informally um it's up in terms of
context you say you know um clearly x is
equal to let x be the sum of y and z and
y and z were already real numbers so x
should also be a real number um so lean
can do a lot of that um but every so
often it it says wait a minute can you
tell me more about what this object is
uh what type of object it is. You see,
you have to think more um at a
philosophical level. Well, not just sort
of computations you're doing, but sort
of what each object actually um is in
some sense. Is he using something like
LLMs to do uh the type inference or like
you mention the real number? It's it's
using much more traditional what's
called good old fashioned AI. Yeah, you
can represent all these things as trees
and there's always algorithm to match
one tree to another tree. So it's
actually doable to figure out if
something is a a real number or a
natural number. Yeah. Every object sort
of comes with a history of where it came
from and you can you can kind of trace.
Oh, I see. Um yeah, so it's it's
designed for reliability. So uh modern
AIs are not used in it's a disjoint
technology. People are beginning to use
AIS on top of lean. So when a
mathematician tries to program um a
proof in lean um often there's a step
okay now I want to use um the
fundamental thing of calculus say okay
to do the next step so the lean
developers have built this this massive
project called methal liib a collection
of tens of thousands of useful facts
about mathematical objects and somewhere
in there is the fundamental theme of
calculus but you need to find it so a
lot the bottleneck now is actually lema
search you know there's a tool that that
you know is in there somewhere and you
need to find it um and so you can there
are various search engines specialized
for math loop that you can do um but
there's now these large language models
that you can say um I need the
fundamental calculus at this point and
it say okay uh um uh for example um when
I code I have GitHub copilot installed
as a plugin to my IDE and it scans my
text and it sees what I need says you
know I might even type here okay now I
need to use the final thing with
calculus okay and then it might suggest
okay try this and like maybe 25% of the
time it works exactly and then another
10 15% of the time it doesn't quite work
but it it's close enough that I can say
oh if I just change it here and here it
it will work and then like half the time
it gives me complete rubbish um so but
people are beginning to use AI a little
bit on top um mostly on the level of
basically fancy autocomplete um but uh
you can type half of one line of a proof
and it will find it will tell you yeah
but a fancy especially fancy with the
sort of capital letter F is uh uh
removes some of the friction
mathematician might feel when they move
from pen and paper to formalizing. Yes.
Yeah. So, right now I estimate that the
effort time and effort taken to
formalize a proof is about 10 times the
amount taken to to write it out. Yeah.
So, it's doable, but uh you don't it's
it's annoying. But doesn't it like kill
the whole vibe of being a mathematician?
Yeah. So, I mean having a pedantic
coworker, right? Yeah. If if that was
the only aspect of it. Okay. But um
Okay. there there are some there's some
case it was actually more pleasant to do
things formally. So there was there was
a theorem I formalized and there was a
certain constant 12 um that that came
out at um in the final statement and so
this 12 had to be carried all through
the proof um and like everything had to
be checked that it goes all the all
these other numbers had to be consistent
with this final number 12 and so we
wrote a paper through this theorem with
this number 12 and then a few weeks
later someone said oh we can actually
improve this 12 to an 11 by reworking
some of these steps and when this
happens with pen and paper um like every
time you change a parameter you have to
check line by line that every single
line of your proof still works and there
can be subtle things that you didn't
quite realize. Some properties on the
number 12 that you didn't even realize
that you were taking advantage of. So a
proof can break down at a subtle place.
Um so we had formalized the proof with
this constant 12 and then when this this
new paper came out uh we said okay let's
so that took like 3 weeks to formalize
and and like 20 people to formalize this
this this original proof. I said oh but
now now let's let's um uh uh let's
update the 12 to 11. And what you can do
with lean is that you just in your
headline theorem you you change a 12 to
11. You run the compiler and like of the
thousands of lines of code you have 90%
of them still work and there's a couple
that are lined in red. Now I can't
justify this these steps but it it
immediately isolates which steps you
need to change but you can skip over
everything which which works just fine.
Um, and if you program things correctly,
um, with sort of good programming
practices, most of your lines will not
be read. Um, and there'll just be a few
places where you, I mean, if if you
don't hard code your constants, but you
sort of, uh, um, um, you use smart
tactics and so forth. Yeah, you can
localize um, the things you need to
change to to a very small um, period of
time. So like within a day or two, we
had updated our proof to this is very
quick process. You um, you make a
change, there are 10 things now that
don't work. for each one you make a
change and now there's five more things
that don't work but but the process
converges much more smoothly than with
pen and paper. So that's for writing are
you able to read it like if somebody
else sends a proof are you able to like
how what's what's the uh versus paper
and yeah so the proofs are longer but
each individual piece is easier to read.
So, um, if you take a math paper and you
jump to page 27 and you look at
paragraph 6 and you have a line of of of
text of math, I often can't read it
immediately because it assumes various
definitions which I have to to go back
and and maybe 10 pages earlier this was
defined and this um the proof is
scattered all over the place and you
basically are forced to read fairly
sequentially. Um, it's it's not like say
a novel where like you know in theory
you could you open up a novel halfway
through and start reading. there's a lot
of context. But when a proven lean, if
you put your cursor on a line of code,
every single object there, you can hover
over it and it would it would say what
it is, where it came from, where stuff
is justified. You can trace things back
much easier than sort of flipping
through a math paper. So, one thing that
lean really enables is actually
collaborating on proofs at a really
atomic scale that you really couldn't do
in the past. So traditionally with pen
and paper um when you want to
collaborate with another mathematician
um either you do it as a blackboard
where you um you can really interact but
if you're doing it sort of by email or
something um basically yeah you have to
segment it say I'm going to I'm going to
finish section three you do section four
but uh you can't really sort of work on
the same thing collaboratively at the
same time but with lean you can be
trying to formalize some portion of the
proof and say I got stuck at line 67
here I need to prove this thing but it
it doesn't quite work here is like the
three lines of code I'm having trouble
with. Um, but because all the context is
there, someone else can say, "Oh, okay.
I recognize what you need to do. You
need to to apply this trick or this tool
and you can do extremely atomic level
conversations. So, because of lean, I
can collaborate, you know, with dozens
of people across the world, most of whom
I don't have never met in person. Um,
and I may not know actually even whether
they're um how reliable they are in in
in their um um in in the process, but
lean gives me a certificate of of of
trust. Um, so I can do I can do
trustless mathematics. So there's so
many interesting questions there's. So
one, you're you're known for being a
great collaborator. So what is the right
way to approach
solving a difficult problem in
mathematics? When you're collaborating,
are you doing a divide and conquer type
of thing or are you brains are you
focusing on a particular part and you're
brainstorming? There's always a
brainstorming process first. Yeah. So
math research projects sort of by their
nature when you start you don't really
know how to do the problem. Um it's not
like an engineering project where
somehow the theory has been established
for decades and it's it's implementation
is the main difficulty. You have to
figure out even what is the right path.
So so this is what I said about about
cheating first you know um it's like um
to go back to the bridge building
analogy you know so first assume you
have infinite budget and and like
unlimited amounts of of of workforce and
so forth. Now can you can you build this
bridge? Okay. Okay. now have infinite
budget but only finite workforce right
now can you do that and so forth um so
uh I mean of course you know no engineer
can actually do this like I say they
have fixed requirements yes there's this
sort of jam sessions always at the
beginning where you try all kinds of
crazy things and you you make all these
assumptions that are unrealistic but you
plan to fix later um and you try to see
if there's even some skeleton of an
approach that might work um and then
hopefully that breaks up the problem
into smaller sub problems which you
don't know how to do but then you uh you
focus on on sub ones and sometimes
different collaborators are better at at
working on on certain things. Um so one
of my themes I'm known for is a theorem
of Ben Green which called the green
tower theorem. Um it's a statement that
the primes contain arithmetic
progressions of any length. So it was a
modification of this theoret
and the way we collaborated was that Ben
had already proven a similar result for
progressions of length three. Um he
showed that sets like the primes contain
lots and lots of progressions of length
three. Um even and even um subsets of
the prime certain subsets do um but his
techniques only worked for um for length
three progressions. They didn't work for
longer progressions. Um but I had these
techniques coming from agotic theory
which is something that I had been
playing with and and uh I knew better
than Ben at the time. Um and so um if I
could justify certain randomness
properties of some set relating to
primes like there there's a certain
technical condition which if I could
have it if if Ben could supply me this
fact I could I could conclude the
theorem but I what I asked was a really
difficult question in number theory
which um he said there's no way we can
prove this can so he said can you prove
your part of the theorem using a weaker
hypothesis that I have a chance to prove
it and he proposed something which he
could prove but it was too weak for me I
can't use this. Um, so there's this
there was this conversation going back
and forth. Um, so different cheats to
Yeah. Yeah. I want to cheat more, he
wants to cheat less. But eventually we
found a a a a a a a a a a a a a a a a a
a a a a a a a a a a a a a a a a a a a a
a a a a a a a a a a a a a a a a a a a a
a a a a a a a a a a a a a a a a a a a a
a a a a a a a a a a a a a a a a a a a a
a a a a a a a a a a a a a a a a a a a a
a a a a a a a a a a a a a a a a a a a a
a a a a a a a a a a a a a a a a a a a a
a a a a a a a a a a property which a he
could prove and b I could use um and
then we we could prove our view and um
yeah so there's there's a there all
kinds of dynamics you know I mean it's
every every um collaboration has a has a
has some story no two are the same. And
then on on the flip side of that like
you mentioned with lean programming now
that's almost like a different story
because you can do you can create I
think you've mentioned a kind of a
blueprint
right for a problem and then you can
really do a divide and conquer with lean
where you're working on separate parts
right and they're using the computer
system proof checker essentially to make
sure that everything is correct along
the way. Yeah. So it makes everything
compatible and uh yeah and trustable. Um
yeah so currently only a few
mathematical projects can be cut up in
this way at the current state of the art
most of the lean activity is on
formalizing boos that have already been
proven by humans a math paper basically
is a boop a blueprint in a sense it is
taking a a difficult statement like big
theorem and breaking up into 100 little
lemas um but often not all written with
enough detail that each one can be sort
of directly formalized. A blueprint is
like a really pedantically written
version of a paper where every step is
explained as to as much detail as as as
possible and trying to make each step
kind of self-contained um and or
depending on only a very specific number
of of previous statements that been
proven so that each node of this
blueprint graph that gets generated can
be tackled independently of of the
others and you don't even need to know
how the whole thing works. Um so it's
like a modern supply chain you know like
if you want to create an iPhone or or
some other complicated object um no one
person can can build up um a single
object but you can a specialist who who
just if they're given some widgets from
some other company they can combine them
together to form a slightly bigger
widget. I think that's a really exciting
possibility because you can have if you
can find problems that could be
broken down this way then you can have
you know thousands of contributors right
distributed. So I told you before about
the split between theoretical and
experimental mathematics and right now
most mathematics is theoretical and when
you type it it's experimental. I think
the platform that lean and and other
software tools so um GitHub and things
like that um allow they will allow
experimental mathematics to be to scale
up um to a much greater degree than we
can do now. So right now if you want to
um um do any mathematical exploration of
some mathematical pattern or something
you need some code to write out the
pattern and I mean sometimes there are
some computer algebra packages that help
but often it's just one mathematician
coding lots and lots of Python or
whatever and because coding is such an
errorprone activity it's not practical
to allow other people to collaborate
with you on writing modules for your
code because if one of the modules has a
bug in it the whole thing is unreliable.
Um, so it's these are uh so you get
these bespoke uh spaghetti code that
written by not not professional
programmers but by mathematicians you
know and they're clunky and and and slow
and um and so because of that it's it's
hard to to really massproduce
experimental results um but um yeah but
I think with lean I mean so I'm already
starting some projects where we are not
just experimenting with data but
experimenting with proofs. So I have
this project called the equation
theories project. Basically we generated
about 22 million little problems in
abstract algebra. Maybe should back up
and tell you what what the project is.
Okay. So abstract algebra studies
operations like multiplication and
addition and the abstract properties.
Okay. So multiplication for example is
commutive. X * Y is always Y * X at
least for numbers. Um and it's also
associative. X * Y * Z is the same as X
* Y * Z. Um so um these operations obey
some laws that don't obey others. For
example, x * x is not always equal to x.
So that law is not always true. So given
any any operation, it obeys some laws
and not others. Um, and so we generated
about 4,000 of these possible laws of
algebra that certain operations can
satisfy. And our question is which laws
imply which other ones? Um, so for
example, does commutivity imply
associativity? And the answer is no
because it turns out you can describe an
operation which obeys the commitive law
but doesn't obey the associative law. So
by producing an example you can you can
show that commitivity does not imply
associativity but some other laws do
imply other laws by substitution and so
forth and you can write down some some
algebraic proof. So we look at all the
pairs between these 4,000 laws and this
22 million of these pairs and for each
pair we ask does this law imply this um
law? If so give a give u give a proof.
If not give a counter example. Mhm. Um
so 22 million problems each one of which
you could give to like an undergraduate
algebra student and they had a decent
chance of solving the problem. Although
there are a few of these 22 million
there like 100 or so that are really
quite hard. Okay. But a lot are easy and
the project was just to to work out to
determine the entire graph like like
which ones imply which other ones.
That's an incredible project by the way.
Such a good idea. Such a good test of
the very thing we've been talking about
at a scale that's remarkable. Yeah. So
it would not have been feasible. Yeah, I
mean the state-of-the-art in the
literature was like, you know, 15
equations and sort of how they apply.
That's sort of at the limit of what a
human repentant paper can do. So, so you
need to scale it up. So, you need to
crowdsource, but you also need to trust
all the um I mean no one person can
check 22 million of these proofs. You
needed to be computerized and so it only
became possible with with lean. Um we
were hoping to use a lot of AI as well.
Um so the project is almost complete. Um
so of these 22 million all but two had
been settled. Um wow and uh well
actually and of those two we have a pen
and paper proof of the two uh and we
we're formalizing it. In fact I was this
morning I was working on finishing it.
Um so we're almost done on this um
incredible is yeah fantastic. How many
people were able to get about 50 um
which in mathematics is is considered a
huge number. It's a huge number. That's
crazy. Yeah. So we kind of have a paper
with 50 authors uh and a big appendex of
who contribute to what. Here's an
interesting question. Now to maybe speak
even more generally about it. When you
have this pool of people,
is there a way to uh organize the
contributions by level of expertise of
the people of the contributors? Now
okay, uh I'm asking you a lot of pthead
questions here, but I I'm imagining a
bunch of humans and maybe in the future
some AIS. Can there be like an ELO
rating type of situation where
like a gamification of this? The beauty
of of these lean projects is is that
automatically you get all this data, you
know, so like like everything has to be
uploaded for this GitHub and GitHub
tracks who contributed what. Um so you
could generate statistics from at any at
any later point in time. You can say oh
this person contributed this many this
many lines of code or whatever. I mean
these are very crude metrics. Um I would
I would definitely not want this to
become like you know part of your tenure
review or something. Uh um but um I mean
I think already in in in enterprise
computing right people do use some of
these metrics as part of of the
assessment of of performance of a of an
employee. Um again this is a direction
which is a bit scary for academics to go
down. We we don't like metrics so much
and yet academics use metrics they just
use old ones. Number of papers. Yeah.
Yeah. It's true. It's true that Yeah. I
mean um it feels like this is a metric
while flawed is is going in the more in
the right direction. Right. Yeah. It's
an interesting at least it's a very
interesting metric. Yeah. I think it's
interesting to study. I mean I I think
you can you can do studies of of whether
these are better predictors. Um there's
this problem called good heart's law. If
a statistic is actually used to
incentivize performance, it becomes
gained. Um and then it is no longer a
useful measure. Oh, humans always. Yeah.
Yeah. I know. It's rational. So what
we've done for this project is is
self-report. So um there are actually
standard categories um from the sciences
of what types of contributions people
give. So there's there's concept and
validation and resources and and and and
coding and so forth. Um, so we we we
there's a standard list of troll or so
categories. Um, and we just ask each
contributor to there's a big matrix of
all the of all the authors in all the
categories just to tick the boxes where
they think that they contributed. Um,
and just give a rough idea you know like
oh so you did some coding and and uh and
you provided some compute but you didn't
do any of the pen and paper verification
or whatever. And I think that that works
out traditionally mathematicians just
order alphabetically by surname. So we
don't have this tradition as in the
sciences of you know lead author and
second author and so forth like which
we're proud of you know we make all the
authors equal status but it doesn't
quite scale to this size so a decade ago
I was involved in these things called
polymath projects it was the crowd
sourcing mathematics but without the
lean component so it was limited by you
needed a human moderator to actually
check that all the contributions coming
in were actually valid and and this was
a huge bottleneck actually um but still
we had projects that were you know 10
author
or so. But we had decided at the time um
not to try to decide who did what um but
to have a single pseudonym. So we
created this fictional character called
DHJ Polymath in the spirit of Bwaki.
Baki is is the pseudonym for a famous
group of mathematicians in the 20th
century. But um and so the paper was a
authored under the pseudonym. So none of
us got the author credit. Um this
actually turned out to be not so great
for a couple of reasons. So, so one is
that if you actually wanted to be
considered for tenure or whatever, you
could not use this paper in your uh uh
as your submitted as one of your
publications because it wasn't you
didn't have the formal author credit. Um
um but the other thing that we've
recognized much later is that when
people referred to these projects, they
naturally refer to the most famous
person who was involved in the project.
Oh, so this was Tim Gow's P project.
This was ter project and not mention the
the other 19 or whatever people that
were involved. Yeah. So we're trying
something different this time around
where we have everyone's an author. Um
but we will have an an appendix with
this matrix and we'll see how that
works. I mean uh so both projects are
incredible just the fact that you're
involved in such huge collaborations.
But I think I saw a talk from Kevin
Buzzard about uh the lean programming
language just a few years ago and he was
saying that uh this might be the future
of mathematics. And so it's also
exciting that you're embracing uh one of
the greatest mathematicians in in the
world embracing this
what seems like the paving of the future
of mathematics. Um so I have to ask you
here about
the integration of AI into this whole
process. So deep mind's alpha proof was
trained using reinforcement learning on
both failed and successful formal lean
proofs of IMO problems. So this is sort
of highlevel high school oh very high
level yes very high level high school
level mathematics problems. What do you
think about the system and maybe what is
the gap between this system that is able
to prove the high school level problems
uh versus gradual level uh problems.
Yeah, the difficulty increases
exponentially with the the number of
steps involved in the proof. It's a
commentatorial explosion, right? So the
thing with large language models is is
that they make mistakes. And so if a
proof has got 20 steps and your model
has a 10% failure rate um at each step
um of of going in the wrong direction
like u it's just extremely unlikely to
actually um reach the end. Actually uh
just to take a small tangent here is how
hard is the problem of mapping from
natural language to the formal program?
Oh yeah it's extremely hard actually. Um
natural language you know it's very
fault tolerant. Um like you can make a
few minor grammatical errors and a
speaker in the second language can get
some idea of what you're saying. Um yeah
but but formal language yeah you if you
get one little thing wrong um like the
whole thing is is is nonsense. um even
formal to formal is is is very hard.
There there are different incompatible
um uh proofist languages. Uh there's
lean but also coaul and Isabel and so
forth and actually even converting from
a formal language to formal language um
is is an unsolved basically unsolved
problem. That is fascinating. Okay. So
uh but once you have an informal
language
they're using um their RL train model.
So some something akin to alpha zero
that they used to go to then try to come
up with poos they also have a model I
believe it's a separate model for
geometric problems so what impresses you
about the system and um what do you
think is the gap yeah we talked earlier
about things that are amazing over time
become kind of normalized um so yeah now
somehow it's oh of course geometry is a
silver problem right that's true that's
true I mean it's still beautiful yeah
these are great works it shows what's
possible I mean um it's it um the
approach doesn't scale currently is yeah
3 days of Google's survey server time to
solve one high school math problem. This
is not a scalable uh prospect. Um
especially with the exponential increase
in um as as the complexity um increases.
We should mention that they got a silver
medal performance the equivalent of I
mean yeah equivalent of a silver so
first of all they took way more time
than was allotted um and they had this
assistance where where the humans
started helped by by formalizing um but
uh also they they're giving us those
full marks for the solution which I
guess is formally verified. So I guess
that that's that's fair. Um yeah um
there there are efforts there was there
will be a proposal at some point to
actually have an an AI math olympiate
where at the same time as the human
contestants get the the actual Olympia
um problems AIS will also be given the
same problems with the same time period
um and the outputs will have to be
graded by the same judges um um and
which means that will have be written in
natural language rather than formal
language. Oh I hope that happens. I hope
that this IMO it happens. I hope I hope
next one it won't happen this IMO the
performance is not good enough in in the
time period and and uh um but there are
smaller competitions um there are
competitions where the the answer is a
is a number rather than a long form
proof um and that's that's um AI are
actually a lot better at um problems
where there's a specific numerical
answer um because it's it's easy to to
to uh to reinforce do reinforcement
learning on it. Yeah, you got the right
answer, you got the wrong answer. It's
it's a very clear signal. But a long
form proof either has to be formal and
then the lean can give it a thumbs up,
thumbs down, or it's informal. Um, but
then you need a human to grade it to
tell uh and if you're trying to do
billions of of reinforcement learning um
you know um um runs, you're not you
can't hire enough humans to uh to grade
those. um it's already hard enough for
for the last language to do
reinforcement learning on on just the
regular text that that people get. But
now if you actually hire people not just
give thumbs up, thumbs down, but
actually check the the output
mathematically. Yeah, that's too
expensive. So if we uh just explore this
possible future,
what what what is the thing that humans
do that's most special in um in
mathematics? So that you could see AI
uh not cracking for a while. So
inventing new theories. So coming up
with new conjectures versus uh proving
the conjectures,
right? Building new abstractions, new
representations, maybe uh an AI turn
style with seeing new connections
between disparate fields. It's a good
question. Um I think the nature of what
mathematicians do over time has changed
a lot. um you know um so a thousand
years ago mathematicians had to compute
the date of Easter uh and there was
really complicated uh calculations you
know but it's all automated been
automated for centuries we don't need
that anymore you know they used to
navigate to do spherical navigation
spherical trigonometry to navigate how
to get from from um the old world to the
new or very complicated calculations
again we've been automated um you know
even a lot of undergraduate mathematics
even before AI um like wolf from alpha
for example It's not a language model,
but it can solve a lot of undergraduate
level math tasks. So on the
computational side, verifying routine
things like having a a problem and um
and say here's a problem in partial
equations. Could you solve it using any
of the 20 standard techniques? Um and
they say yes, I've tried all 20 and here
are the 100 different permutations and
and here's my results. Um and that type
of thing I think it will work very well.
um type of scaling to once you solve one
problem to to make the AI attack 100
adjacent problems. Um the things that
humans do still Yeah. So so where the AI
really struggles right now um is knowing
when it's made a wrong turn. Um that it
can say, "Oh, I'm going to solve this
problem. I'm going to split up this
problem into um into these two cases.
I'm going to try this technique." And um
sometimes if you're lucky and it's a
simple problem, it's the right technique
and you solve the problem and sometimes
it it will get it will have a problem it
would propose an approach which is just
complete nonsense. Um and but like it
looks like a proof. Um so this is one
annoying thing about LM generated
mathematics. So um yeah we we we've had
human generated mathematics as very low
quality um uh like you know submissions
people who don't have the formal
training and so forth. But if a human
proof is bad, you can tell it's bad
pretty quickly. It makes really basic
mistakes. But the AI generated proofs,
they can look superficially flawless. Uh
and that's partly because that's what
the reinforcement learning has actually
trained them to do, right? To to make
things to to produce text that looks
like um what is correct, which for many
applications is good enough. Um uh so
the errors often really subtle and then
when you spot them, they're really
stupid. Um like you know like no human
would have actually made that mistake.
Yeah, it's actually really frustrating
in the programming context because I I
program a lot and yeah, when a human
makes when lowquality code, there's
something called code smell, right? You
can you can tell you can tell
immediately like, okay, there's signs.
But with with a generate code of and
then you're right eventually you find an
obvious dumb thing that just looks like
good code. Yeah. So, um it's very tricky
to and frustrating for some reason to
Yeah. to work. Yeah. So the sense of
smell. Okay, there you go. This is this
is one thing that humans have. Um and
there's a metaphorical mathematical
smell that uh this we it's not clear how
to get the AI to duplicate that
eventually. Um I mean so the way um
Alpha Zero and so forth make progress on
go and and chess and so forth is is in
some sense they have developed a sense
of smell for go and chess positions you
know that that this position is good for
white is good for black. um they can't
initiate why. Um but just having that
that sense of smell lets them
strategize. So if AIs gain that ability
to sort of a sense of viability of
certain proof strategies say so so you
can say I'm going to try to break up
this problem into two small subtasks and
they can say well this looks good two
tasks look like they're simpler tasks
than than your main task and they still
got a good chance of being true. Um so
this is good to try or no you've you
made the problem worse because each of
the two sub problems is actually harder
than your original problem which is
actually what normally happens if you
try a random uh thing to try normally
actually it's very easy to transform a
problem into even harder problem. Mhm.
Very rarely do you problem transport a
simpler problem. Um yeah so if they can
pick up a sense of smell then they could
maybe start competing with human level
mathematicians. So, this is a hard
question, but not competing, but
collaborating. Yeah. If Okay,
hypothetical.
If I gave you an oracle
that was able to do some aspect of what
you do, and you could just collaborate
with it. Yeah. Yeah. What would that
oracle What would you like that oracle
to be able to do? Would you like it to
uh maybe be a verifier? Like check Mhm.
Do the codes like you're Yes. uh
professor to this is the correct this is
a good this is a promising fruitful
direction. Yeah. Yeah. Yeah. Or or would
you like it to
uh generate possible proofs and then you
see which one is the right one? Um or
would you like it to maybe generate
different representation different
totally different ways of seeing this
problem? Yeah, I think all of the above.
Um a lot of it is we don't know how to
use these tools because it's a paradigm
that is not um yeah we have not had in
the past systems that are competent
enough to understand complex
instructions. Mhm. Um that can work at
massive scale but are also unreliable.
Uh like it's it's an interesting uh bit
unreliable in subtle ways while we while
providing sufficiently good output. Um
it's a interesting combination. um you
know I mean you have you have like
graduate students that you work with who
kind of like this but not at scale um
you know and and and we have previous
software tools that um can work at scale
but but very narrow um so we have to
figure out how to how to use um I mean
um so Tim C actually imagine he actually
foresaw like in in 2000 he was
envisioning what mathematics would look
like in in actually two and a half
decades
and that's funny yeah He he wrote in his
in in his article like a a a
hypothetical conversation between a
mathematical assistant of the future um
and himself you know trying to solve a
problem and they would have have a
conversation that sometimes the human
would would propose an idea and the AI
would would evaluate it and sometimes
the AI would propose an idea um and u
and sometimes that computation was
required and a would just go and say
okay I've checked the 100 cases needed
here or um the first you you said this
is true for all n I've checked for n up
to 100 um and it looks good so far or
hang on there's a problem at n equals 46
you so just a free form conversation
where you don't know in advance where
things are going to go but just based on
on I think ideas get proposed on both
sides calculations get proposed on both
sides I've had conversations with AI
where I say okay let's we're going to
collaborate to solve this math problem
and it's a problem that I already know
the solution to so I I try to prompt it
okay so here's the problem I suggest
using this tool and then you'll find
this this lovely argument using a
totally different tool which eventually
goes you know, into the weeds and say,
"No, no, no. If I using this, okay, and
it might start using this and then it'll
go back to the tool that I wanted to to
before." Um, and like you have to keep
railroading it um onto the path you
want. And like I I could eventually
force it to give the proof I wanted. Um,
but it was like hurting cats um like and
the amount of personal effort I had to
take to not just sort of prompt it, but
also check it output because it like a
lot of what it looked like was going to
work. I know there's a problem on online
17 and basically arguing with it. um
like it was more exhausting than doing
it unassisted. So like it but that's the
current state of the art. I wonder if
there's there's a phase shift that
happens to where it's no longer feels
like hurting cats and
maybe it'll surprise us how quickly that
comes. I I believe so. Um so in
formalization I I mentioned before that
it takes 10 times longer to formalize a
proof than to write it by hand with
these modern AI tools is and also just
better tooling um the lean um um
developers are doing a great job adding
more and more features and making it
user friendly. It's going up from 9 to 8
to 7. Okay, no big deal. But one day it
will drop below one. Um and that's a
phase shift because suddenly um it makes
sense when you write a paper to to write
it in lean first or through a
conversation with AI who is generally um
on the fly with you and it becomes
natural for journals to accept you know
maybe they'll offer expedite refereeing
you know if if a paper has already been
formalized in in lean um they'll just
ask the referee to comment on on the
significance of the results and how it
connects to literature and not worry so
much about the correctness.
um because that's been certified. Um
papers are getting longer and longer in
mathematics and actually it's harder and
harder to get good refereeing for um the
really long ones unless they're really
important. It is actually an issue which
and the formalization is coming in at
just the right time for this to be and
the easier and easier to guess because
of the tooling and all the other factors
then you're going to see much more like
math lib will grow potentially
exponentially. It's a it's a it's a
virtuous uh cycle. Okay. I mean one
facet of this type that happened in the
past was the adoption of latte. So so
latte is this type seting language that
all mians use now. So in the past people
use all kinds of word processors and
typewriters and whatever but at some
point latte became easier to use than
all other competitors and that people
just switched you know within a few
years like it was just a dramatic um pay
shift. It's a wild out there question,
but what
what year how far away are we from
a uh AI system being a collaborator
on a proof that wins the Fields medal.
So that level. Okay. Um well, it depends
on the level of collaboration. I mean,
no, like it deserves to be to get the
Fields Medal. like so half and half
already like I I can imagine if it was a
winning paper having some AI systems in
writing it you know uh just you know
like the order complete alone is already
I I use it like it speeds up my my own
writing um um like you know you you can
have a theorem you have a proof and the
proof has three cases and I I write down
the proof of the first case and the
autocomplete just suggests all right now
now here's how the proof of second case
could work and like it was exactly
correct that was great saved me like 5
10 minutes of uh of typing but in that
case The AI system doesn't get the
Fields medal. No. Uh
are we talking 20 years, 50 years, 100
years? What do you think? Okay. So I I
gave a prediction in print. So by 2026,
which is now next year, um there will be
math collaborations, you know, where the
AI, so not Fields Medal winning, but but
like actual research level math like
published ideas that in part generated
by AI. Um maybe not the ideas but at
least uh some of the computations um um
the verifications. Yeah. I mean has that
already happened? Has that already
happened? Yeah. There are there are
problems that were solved uh by a
complicated process conversing with AI
to propose things and the human goes and
tries it and the contract doesn't work
but it might propose a different idea.
Um it it's it's hard to disentangle
exactly. Um there are certainly math
results which could only have been
accomplished because there was a math
method human mathematician and an AI
involved. Um but it's hard to sort of
disentangle credit. Um
I mean these tools they they do not uh
replicate all the skills needed to do
mathematics but they can replicate sort
of some non-trivial percentage of them
you know 30 40%. they can fill in gaps.
Um, you know, so, uh, coding is is is a
is a good example, you know. So, I I um
um it's annoying for me to code in
Python. I'm not I'm not a native um I'm
not a professional um programmer. Um,
but um the with AI that the the friction
cost of of doing it is is is much
reduced. Uh so it it fills in that gap
for me. Um
AI is getting quite good at literature
review. Um I mean there's still a
problem with um hallucinating you know
the references that don't exist. Um but
this I think is a civil war problem if
you train in the right way and so forth
you can you can and um and verify um you
know using the internet um you know um
you should in a few years get to the
point where you you have a a lema that
you need and uh we say has anyone proven
this lema before and it will do
basically a fancy web search AI
assistant and say yeah yeah there are
these six papers where something similar
has happened and I mean it you can ask
it right now and it'll give you six
papers of which maybe one is is
legitimate and relevant. One exists but
is not relevant and four are
hallucinated. Um it has a non-zero
success rate right now, but uh it's
there's so much garbage. Uh so much the
signal to noise ratio is so poor that
it's it's um it's most helpful when you
already somewhat know the literature. Um
and you just need to be prompted to be
reminded of a paper that was already
subconsciously in your memory versus
helping you discover new you were not
even aware of but is the correct
citation. Yeah, that's yeah, that it can
sometimes do. But but when it does, it's
it's buried in in a list of options for
which the other that are bad. Yeah. I
mean, being able to automatically
generate a related work section that is
correct. Yeah. That's actually a
beautiful thing that might be another
phase shift because it assigns credit
correctly. Yeah. It does. It breaks you
out of the silos of Yeah. Yeah. Yeah.
thought, you know. Yeah. No, there's a
big hump to overcome right now. I mean,
it's it's like self-driving cars, you
know. the the safety margin has to be
really high for it to be um uh to be
feasible. So yeah, so there's a last
mile problem um with a lot of AI
applications um that uh you know they
can develop tools that work 20% 80% of
the time but it's still not good enough
um and in fact even worse than good some
ways. I mean another way of asking the
Fields metal question is what year do
you think you'll wake up and be like
real surprised? you read the headline,
the news of something happened that AI
did like you know real breakthrough
something it doesn't you know like feels
metal even hypothesis it could be like
really just
this alpha zero moment with go that kind
of thing right um yeah this this decade
I can I can see it like making a
conjecture
between two unrelated two two things
that people thought was unrelated oh
interesting generating a conjecture
that's a beautiful conjecture Yeah. And
and actually has a real chance of being
correct and and and meaningful and um
because that's actually kind of doable I
suppose but the word of the data is
Yeah. No, that would be truly amazing.
Um the current models struggle a lot. I
mean so um a version of this is um I
mean the physicists have a dream of
getting the AI to discover new new laws
of physics. Um you know the the dream is
you just feed it all this data. Okay.
and and this is here's a new patent that
we didn't see before but it actually
even struggle the current state of the
art even struggles to discover old laws
of physics um from the data uh or if it
does there's a big concern contamination
that that it did it only because like
somewhere in this training data it some
new um you know boils law or whatever
ball you're trying to to to reconstruct
um part of it is that we don't have the
right type of training data for this um
yeah so for laws of physics like we we
don't have like a million different
universes with a million infant laws of
nature. Um
and um like a lot of what we're missing
in math is actually the negative space
of so we have published things of things
that people have been able to prove um
and conjectures that ended up being
verified um or maybe counter examples
produced but um we don't have data on on
things that were proposed and they're
kind of a good thing to try but then
people quickly realized that it was the
wrong conjecture and then they they said
oh but we we should actually change um
our claim to modify it in this way to
actually make it more plausible. Um
there's this there's a trial and error
process which is a real integral part of
human mathematical discovery which we
don't record cuz it's embarrassing. Uh
we make mistakes and and we only like to
publish our wins. Um and uh the AI has
no access to this data to train on. Um I
sometimes joke that basically AI has to
go through um grad school and actually
you know go to grad courses, do the
assignments, go to office hours, make
mistakes, um get advice on how to
correct the mistakes and learn from
that. Let me uh ask you if I may about
uh Gregori Pearlman. Mhm. You mentioned
that you try to be careful in your work
and not let a problem completely consume
you. just you really fall in love with
the problem and really cannot rest until
you solve it. But you also hasted to add
that sometimes this approach actually
can be very successful. An example you
gave is Gregoria Pearlman who proved the
point conjecture and did so by working
alone for 7 years with basically little
contact with the outside world. Can you
explain this one millennial prize
problem that's been solved point
conjecture and maybe speak to the
journey that Gagora Pearlman's been on.
All right. So it's it's a question about
curb spaces. Earth is a good example. So
you can think of a 2D surface in being
round could maybe be a Taurus with a
hole in it or it can have many holes and
there there are many different
topologies up priori that that a surface
could have. um even if you assume that
it's it's bounded and and uh and smooth
and so forth. So we have figured out how
to classify surfaces as a first
approximation everything is determined
by something called the genus how many
holes it has. So a sphere has genus 0 a
donut has genus one and so forth and one
way you can tell these surfaces apart
probably the sphere has which is called
simply connected if you take any closed
loop on the sphere like a big closed
little rope you can contract it to a
point and while staying on the surface
and the sphere has this property but a
taurus doesn't if on a taurus and you
take a rope that goes around say the the
outer diameter taurus there's no way it
can't get through the hole there's no
way to to contract it to a point so it
turns out that the this the sphere is
the only surface with this property of
contractability up to like continuous
deformationations of the sphere. So um
things that I want to call topologically
um equivalent of the sphere. So point
asked the same question in higher
dimensions. Um so this it becomes hard
to visualize because um surface you can
think of as embedded in three dimensions
but a curved free space we don't have
good intuition of 4D space to to to live
and and there are also 3D spaces that
can't even fit into four dimensions. you
need five or six or or higher. But
anyway, uh mathematically you can still
pose this question that if you have a
bounded threedimensional space now which
is also has this simply connected
property that every loop can be
contracted. Can you turn it into a
threedimensional version of a sphere?
And so this is the point conjecture.
Weirdly in higher dimensions four and
five it was actually easier. So uh it
was solved first in higher dimensions.
There's somehow more room to do the
deformation. It's easier to to to move
things around to a sphere. But three was
really hard. So people tried many
approaches. There sort of commentary
approaches where you chop up the the
surface into little triangles or or
tetrahedra and you you just try to argue
based on how the faces interact each
other. Um there were um algebraic
approaches. There's there's various
algebraic objects like things called the
fundamental group that you can attach to
these homology and coology and and and
all these very fancy tools. Um they also
didn't quite work. Um but Richard
Hamilton's proposed a um partial
differential equations approach. So you
take um you take so the problem is that
you so you have this object which is so
secretly is a sphere but it's given to
you in a in a really um in in a weird
way. So like like think of a ball that's
been kind of crumpled up and twisted and
it's not obvious that it's a ball. Um
but um like if you if you have some sort
of surface which is which is a deformed
sphere, you could um u you could for
example think of it as a surface of a
balloon. You could try to inflate it.
You you blow it up. Um and naturally as
you fill it with air um the the wrinkles
will sort of smooth out and it will turn
into um um a nice round sphere. Um
unless of course it was a Taurus or
something in which case it would get
stuck at some point like if you instead
of Taurus it would there'll be a point
in the middle when the inner ring
shrinks to zero you get you get a
singularity and you can't blow up any
further. You can't flow any further. So
he created this flow which is called
Richie flow which is a way of taking an
arbitrary surface or or space and
smoothing it out to make it rounder and
rounder to make it look like a sphere.
And he wanted to show that that either
uh this process would give you a sphere
or it would create a singularity. Um
actually very much like how PDS either
they have global regularity or finite
blow like basically it's almost exactly
the same thing. It's all connected. Um
and so and and he showed that for two
dimensions two dimensional services
surfaces um if you started simply
connected no singularities ever formed
um you never ran into trouble and you
could flow and it would give you a
sphere and it so he he got a new proof
of the two dimensional result but by the
way that's a beautiful explanation of
reach flow and its application in this
context how difficult is the mathematics
here like for the 2D case is it yeah
these are quite sophisticated equations
on par with the Einstein equations
slightly simpler but um Um yeah but but
they were considered hard nonlinear
equations to solve um and there's lots
of special tricks in 2D that that that
helped but in 3D the problem was that uh
this equation was actually super
critical the same problems as Nabia
Stokes as you blow up um maybe the
curvature could get constraint in finer
smaller smaller regions and it um it
looked more and more nonlinear and
things just look worse and worse and
there could be all kinds of
singularities that showed up. um some
singularities um like if there's these
things called neck pinches where where
the uh the surface sort of creates
behaves like like a like a a barbell and
it it pinches at a point. Some some
singularities are simple enough that you
can sort of see what to do next. You
just make a snip and then you can turn
one surface into two and evol them
separately. But there was there was a
the prospect that there's some really
nasty like knotted singularities showed
up that you you couldn't see how to um
resolve in any way that you couldn't do
any surgery to. Um so you need to
classify all the singularities like what
are all the possible ways that things
can go wrong. Um so what Pearlman did
was first of all he he made the problem
he turned the problem a super critical
problem to a critical problem. Um I said
before about how um the invention of the
of of energy the Hamiltonian like really
clarified um Newtonian mechanics. Um uh
so he introduced something which is now
called permanence reduced volume and
permanence entropy. He introduced new
quantities kind of like energy that look
the same at every single scale and
turned the problem into a critical one
where the nonlinearities actually
suddenly looked a lot less scary than
they did before. Um and then he had to
solve he still had to analyze the
singularities of this critical problem.
uh and that itself was a problem similar
to this wake up thing I worked on
actually um so on the on the level of
difficulty of that. So he managed to
classify all the singularities of this
problem and show how to apply surgery to
each of these and through that was able
to to resolve the point Cray conjecture.
um quite like a lot of really ambitious
steps um and like like nothing that a
large language model today for example
could I mean um at best uh I could
imagine model proposing this idea as one
of hundreds of different things to try
um but the other 99 would be complete
dead ends but you'd only find out after
months of work he must have had some
sense that this was the right track to
pursue because you know I it takes years
to get them from A to B so you've done
like you said Actually you see even
strictly mathematically but more broadly
in terms of the process he's done
similarly difficult
things what what can you infer from the
process he was going through because he
was doing it alone what are some low
points in a process like that when you
start to like you've mentioned hardship
like uh AI doesn't know when it's
failing what happens to you you're
sitting in your office when you realize
the thing you did for the last few days
maybe weeks weeks. Yeah. Is a failure.
Well, for me, I switch to different
problem. Uh so, uh as said, I'm I'm a
fox. I'm not a hedgehog. But you
legitimately that is a break that you
can take is is to step away and look at
a different problem. Yeah, you can
modify the problem too. Um I mean um
yeah, you can ask some cheat if if
there's a specific thing that's blocking
you that this um some bad case keeps
showing up that that that for which your
tool doesn't work, you can just assume
by fiat this this bad case doesn't
occur. So you you do some magical
thinking um for the but but but
strategically okay for the point to see
if the rest of the argument goes through
um if there's multiple problems uh with
with with your approach then maybe you
just give up okay but if this is the
only problem that you know but
everything else checks out then it's
still worth fighting um so yeah you have
to do some some sort of forward
reconnaissance sometimes to uh you know
and that is sometimes productive to
assume like okay we'll figure it out oh
yeah yeah eventually um Sometimes
actually it's even productive to make
mistakes. So um one of the I mean um
there was a project which actually u we
won some prizes for actually
four other people. Um we worked on this
PD problem again actually this blow of
regularity type problem. Um and it was
considered very hard. Um Sean Bain who
was another field methodist who worked
on a special case of this but he could
not solve the general case. Um and we
worked on this problem for two months
and we found we thought we solved it. We
we had this this cute argument that if
everything fit and we were excited uh we
were planning celebrationally um to all
get together and have champagne or
something. Um and we started writing it
up. Um and one of one of us, not me
actually, but another co-author said,
"Oh, um in this in this lema here, we um
we have to estimate these 13 terms that
that show up in this expansion." And we
estimate 12 of them, but in our notes, I
can't find the estimation of the 13th.
Can you can someone supply that? And I
said, "Sure, I'll look at this." and
actually yeah we didn't cover we
completely omitted this term and this
term turned out to be worse than the
other 12 terms put together um in fact
we could not estimate this term um and
we tried for a few more months and all
different permutations and there was
always this one thing one term that we
could not control um and so like um this
was very frustrating um but because we
had already invested months and months
of effort into this already um we stuck
at this we we tried increasingly
desperate things and and crazy things um
and after two is we found an approach
which was actually somewhat different by
quite a bit from our initial um strategy
which did actually didn't generate these
problematic terms and and and actually
solve the problem. So we we solve a
problem after 2 years but if we hadn't
had that initial false dawn of nearly
solving a problem we would have given up
by month two or something and and worked
on an easier problem. Um yeah if we had
known it would take two years not sure
we would have started the project. Yeah
sometimes actually having the incorrect
you know it's like Columbus New
incorrect version of measurement of the
size of the earth. He thought he was
going to find a new trade route to India
or at least that was how he sold it in
his perspectus. I mean it could be that
he actually secretly knew but just on
the psychological element.
Do you have like emotional or
like self-doubt that just overwhelms you
moments like that? You know, because
this stuff it feels like math is is so
engrossing
that like it can break you when you like
invest so much yourself in the problem
and then it turns out wrong. You could
start to
similar way chess has broken some
people. Yeah. Um I I think different
mathematicians have different levels of
emotional investment in what they do. I
mean I think for some people it's just a
job. you know you you have a problem and
if it doesn't work out you you you go on
the next one. Um yeah so the fact that
you can always move on to another
problem um it reduces the emotional
connection. I mean
there are cases you know so there are
certain problems that are what I call
back diseases where where where just
latch on to that one problem and they
spend years and years thinking about
nothing but that one problem and um you
know maybe their career suffers and so
forth but okay this big win this will
you know once I once I finish this
problem I will make up for all the years
of of of lost opportunity but that's
that's I mean occasionally occasionally
it works But I I um I really don't
recommend it for people without the the
right fortitude. Yeah. So I I've never
been super invested in any one problem.
Um one thing that helps is that we don't
need to call our problems in advance. Uh
um well uh when we do grant proposals we
s say we we will study this set of
problems. But even then we don't promise
definitely by 5 years I will supply a
proof of all these things. you know, or
um you promise to make some progress or
discover some interesting phenomena. Uh
and maybe you don't solve the problem,
but you find some related problem that
you you can say something new about and
that's that's a much more feasible task.
But I'm sure for you there's problems
like this. You have you have
um made so much progress towards the
hardest problems in the history of
mathematics. So is there is there a
problem that just haunts you? It sits
there in the dark corners, you know,
twin prime conjecture, reman hypothesis,
global conjecture. Twin prime that
sounds again. So, I mean, the problem is
like a reman hypothesis, those are so
far out of reach. Why do you think so?
Yeah. there's no even viable strate like
even if I activate all my all the cheats
that I know of in this problem like it
there's just still no way to get me to
be um like it's um I think it needs a
breakthrough in another area of
mathematics to happen first and for
someone to recognize that it that would
be a useful thing to transport into this
problem. So we we should maybe step back
for a little bit and just talk about
prime numbers. Okay. So they're often
referred to as the atoms of mathematics.
Can you just speak to the structure that
these uh atoms the natural numbers have
two basic operations attached to them?
Addition and multiplication. Um so if
you want to generate the natural
numbers, you can do one of two things.
You can just start with one and add one
to itself over and over again and that
generates you the natural numbers. So
additively they're very easy to generate
1 2 3 4 5. Or you can take the prime if
you want to generate multiplicatively
you can take all the prime numbers 2 3
57 and multiply them all together. um
and together that gives you all the the
natural numbers except maybe for one. So
there these two separate ways of
thinking about the natural numbers from
an additive point of view and point of
view. Um and separately they're not so
bad. Um so like any question about that
only was addition is relatively easy to
solve and any question that only was
multiplication is easy to solve. Um but
what has been frustrating is that you
combine the two together. Um and
suddenly you get this extremely rich I
mean we know that there are statements
in number theory that are actually as
undecidable. There are certain polomials
in some number of variables. You know is
there a solution in the natural numbers
and the answer depends on on an
undecidable statement um like like
whether um the aims of of mathematics
are consistent or not. Um
but um yeah but even this the simplest
problems that combine something
multiplicative such as the primes with
something additive such as shifting by
two uh separately we understand both of
them well but if you ask when you shift
the prime by two do you can you get a
how often can you get another prime we
it's been amazingly hard to relate the
two and we should say that the twin
prime conjecture is just that it posits
that there are infinitely many pairs of
prime numbers that differ by do. Yes.
Now the interesting thing is that you
have been very successful at pushing
forward the field in answering these
complicated questions uh of this variety
like you mentioned the green tile
theorem. It proves that prime numbers
contain arithmetic progressions of any
length, right? Which is mind-blowing
that you can prove something like that,
right? Yeah. So, what we've realized
because of this this this type of of
research is that there's different
patterns have different levels of uh
indestructibility. Um so, so what makes
the twin prime problem hard is that if
you take all the primes in the world,
you know, 3, 5, 7, 11, so forth, there
are some twins in there. 11 and 13 is a
twin prime pair of twin primes and so
forth. But you could easily if you
wanted to um redact the primes to get
rid of to get rid of the um these twins
like the twins they show up and they're
infinitely many of them but they're
actually reasonably sparse. Um not there
there's not I mean initially there's
quite a few but once you got to the
millions the trillions they become rarer
and rarer and you could actually just
you know if if someone was given access
to the database of primes you just edit
out a a few primes here and there they
could make the trim pan conjure false by
just removing like 01% of the primes. or
something um just well well chosen to to
um to do this. And so you could present
a censored database of the primes which
passes all of the statistical tests of
the primes. You know that it it obeys
things like the paralle theorem and and
other texts about the primes but doesn't
contain any true primes anymore. Um and
this is a real obstacle for the twin
prime conjecture. It means that any
proof strategy to actually find twin
primes in the ecto primes must fail when
applied to these slightly edited primes.
And so it must be some very um subtle
delicate feature of the primes that you
can't just get from like like aggregate
statistical analysis. Okay. So that's
all yeah on the other hand progressions
has turned out to be much more robust.
um like you can take the primes and you
can eliminate 99% of the primes actually
you know and you can take take any 99%
you want and uh it turns out and another
thing we prove is that you still get
arithmetic progressions um arithmetic
progressions are much you know they're
like cockroaches of arbitrary length yes
that's crazy I mean so so this for for
people who don't know arithmetic
progressions is a sequence of numbers
that differ by some fixed amount yeah
but it's again like it's infinite monkey
type phenomenon for any fixed length of
your set. You don't get arbitrary as
progressions. You only get quite short
progressions. But you're saying twin
prime is not an infinite monkey
phenomena. I mean, it's a very subtle
monkey. It's still an infinite monkey
phenomen. Yeah. If the primes were
really genuinely random, if the primes
were generated by monkeys, um then yes,
in fact, the infinite monkey theorem
would Oh, but you're saying that twin
prime is it doesn't you can't use the
same tools like the it doesn't appear
random almost. Well, we don't know.
Yeah, we we we we believe the prior
behave like a random set. And so the
reason why we care about the trim
conjecture is is a test case for whether
we can genuinely confidently say with
with 0% chance of error that the primes
behave like a random set. Okay. Random.
Yeah. Random versions of the primes we
know contain twins. Um at least with
with 100% probability or probably
tending to 100% as you go out further
and further. Um yeah. So the primes we
believe that they're random. Um the
reason why arithmetic progressions are
indestructible is that regardless of
whether you looks random or looks um
structured like periodic in both cases
um arithmetic regressions appear but for
different reasons. Um and this is
basically all the ways in which the
there are many proofs of of these sort
of arithmetic region epithems and
they're all proven by some sort of
dichotomy where your set is either
structured or random and in both cases
you can say something and then you put
the two together. Um but in twin primes
if if the primes are random then you're
happy you win. But if your primes are
structured they could be structured in
in a specific way that eliminates the
twin the twins. Uh and we can't rule out
that one conspiracy and yet you were
able to make a as I understand progress
on the Kupal version. Right. Yeah. So um
the one funny thing about conspiracies
is that any one conspiracy theory is
really hard to disprove that you know if
if you believe the water is won by
lizards you say here's some evidence
that that it it's not run by lizards
well that that evidence was planted by
the lizards. Yeah. Right. You may have
encountered this kind of phenomen. Yeah.
So like like um a pure like there's
there's almost no way to um definitively
rule out a con and the same is true in
mathematics that a con is to solely
devote devoted to learning twin primes
you know like it would you have to also
infiltrate other areas of mathematics to
sort of but but like it could be made
consistent at least as far as we know
but there's a weird phenomenon that you
can make one um one conspiracy rule out
other conspiracies so you know if the if
the world is is run by lizardist it
can't also be run by Right.
Right. So one unreasonable thing is is
is is hard to dispute but but more than
one there are there are tools. Um so
yeah so for example we we know there's
infinitely many primes that are um no
two which are um so there infinite pair
of primes which differ by at most um 246
actually is is a is the current. So
there's like a bound yes on the right.
So like there's twin primes this thing
called cousin primes that differ by by
four. Um there's called sexy primes that
differ by six. Uh, what are sexy primes?
Primes that differ by six. The name is
much less the concept is much less
exciting than the name suggests. Got it.
Um, so you can make a conspiracy rule
out one of these, but like once you have
like 50 of them, it turns out that you
can't rule out all of them at once. It
just it requires too much energy somehow
in this conspiracy space. How do you do
the bound part? How do you how do you
develop a bound for the difference
between the prize that okay so um that
there's an infinite number of so it's
ultimately based on what's called the
pigeon hole principle um so the pigeon
hole principle uh it's a statement that
if you have a number of pigeons and they
all have to go into into pigeon holes
and you have more pigeons than pigeon
holes then one of the pigeon holes has
to have at least two pigeons in so there
has to be two pigeons that that are
close together. So for instance if you
have 100 numbers and they all range from
one to a thousand um two of them have to
be at most 10 apart. Mhm. because you
can divide up the numbers one to 100
into 100 pigeon holes. Let's let's say
you have if you have 101 numbers 100 one
numbers then two of them have to be
distance less than 10 apart because two
of them have to belong to the same
pigeon hole. So it's a basic um basic
feature of uh a basic principle in
mathematics. Um so it doesn't quite work
with the primes directly because the
primes get sparer and sparser as you go
out that fewer and fewer numbers are
prime. But it turns out that there's a
way to assign weights to the to to
numbers like um so there are numbers
that are kind of almost prime but
they're not they they don't have no
factors at all other than themselves in
one but they have very few factors. Um
and it turns out that we understand
almost primes a lot better than primes.
Um and so for example it was known for a
long time that there were twin almost
primes. This has been worked out. So
almost primes are something we can't
understand. So you can actually restrict
attention to a a suitable set of almost
primes and uh whereas the primes are
very sparse overall relative to the
almost primes actually are much less
sparse. They make um you can set up a
set of almost primes where the primes
have density like say 1%. Um and that
gives you a shot at proving by applying
some sort of original principle that
that those pairs of primes are just only
100 100 apart. But in order to with the
twin pan conjecture you need to get the
density of primes inside the also size
up to up to a first of 50%. Um once you
get up to 50% you will get twin primes.
But uh unfortunately there are barriers.
Um we know that that no matter what kind
of good set of almost primes you pick
the density primes can never get above
50%. It's called the parody barrier. Um
and I would love to find yes. So one of
my long-term dreams is to find a way to
breach that barrier because it would
open up not only the trip conjecture the
go back conjecture and many other
problems in number theory are currently
blocked because our current techniques
would require improve going beyond this
theoretical um parody barriers. It's
like it's like pulling past the speed of
light. Yeah. So we just say a twin prime
conjecture one of the biggest problems
in the history of mathematics go by
conjecture also um they feel like
nextdoor neighbors. Uh has there been
days when you felt you saw the path? Oh
yeah. Um um yeah uh sometimes you try
something and it it works super well. Um
you you again again the sense of methac
smell uh we talked about earlier uh you
learn from experience when things are
going too well
because there are certain difficulties
that you sort of have to encounter. Um
um I think the way a colleague might put
it is that um you know like if if you
are on the streets in New York and you
put in a blindfold and you put in a car
and and um after some hours um you the
blindfold's off and you're in Beijing.
Um you know I mean that was too easy
somehow like like there was no ocean
being crossed. Even if you don't know
exactly what how what what was done
you're suspecting that that something
wasn't right. But is that still in the
back of your head to do you return to
these to the prime do you return to the
prime numbers every once in a while to
see yeah when I have nothing better to
do which is less and less tired which is
I get busy with so many things these
days but yeah when I have free time and
I'm not and I'm too frustrated to to
work on my sort of real research
projects and I also don't want to do my
administrative stuff I don't want to do
some errands for my family um I can play
with these these things um for fun uh
and usually you get nowhere Yeah, you
have you have to learn to just say okay
fine once again nothing happened I I
will move on. Um yeah very occasionally
one of these problems I actually solved
or sometimes as you say you think you
solved it and then you're euphoric for
maybe 15 minutes and then you think I
should check this because this is too
easy too good to be true and it usually
is. What's your gut say about when these
problems would be uh solved when prime
and go back? Prime I think we'll keep
getting keep getting more partial
results. Um
it does need at least one this parody
barrier is is the biggest remaining
obstacle. Um there are simpler versions
of the conjecture where we are getting
really close. Um so I think we will in
10 years we will have many more much
closer results. May not have the whole
thing. Um yeah so trens is somewhat
close reman hypothesis I have no I mean
it has to happen by accident I think so
the reman hypothesis is a kind of more
general conjecture about the
distribution of prime numbers right yeah
it's it's states are sort of viewed
multiplicatively like for questions only
involving multiplication no addition the
primes really do behave as randomly as
as you could hope so there's a
phenomenon in probability called square
root cancellation that um you know like
if you want to poll say America upon on
on some issue. Um, and you you ask one
or two voters and you may have sampled a
bad sample and then you get you get a
really imprecise um measurement of of
the full average, but if you sample more
and more people, the accuracy gets
better and better and it actually
improves like the square root of the
number of people you you sample. So
yeah, if you sample a thousand people,
you can get like a 2 3% margin of error.
So in the same sense if you measure the
primes in a certain multiplicative sense
there's a certain type of statistic you
can measure and it's called the reman's
data function and it fluctuates up and
down but in some sense um as you keep
averaging more and more if you sample
and more and more the fluctuation should
go down as if they were random and
there's a very precise way to quantify
that and the reman hypothesis is a very
elegant way that captures this but um as
with many others in mathematics we have
very few tools to show that something
really genuinely behaves like really
random And this is actually not just a
little bit random but it's it's asking
that it behaves as random as it actually
random set this this square root
cancellation and we know actually
because of things related to the parity
problem actually that most of us usual
techniques cannot hope to settle this
question. Um the proof has to come out
of left field. Um
yeah but uh what that is yeah no one has
any serious proposal. Um yeah and and
there's there's various ways to sort of
as I said you can modify the primes a
little bit and you can destroy the human
hypothesis. Um so like it has to be very
delicate. You can't apply something that
has huge margins of error. It has to
just barely work. Um and like um there's
like all these pits pitfalls that you
have like dodge very adeptly. The prime
numbers are just fascinating. Yeah.
Yeah. What what to you is um most
mysterious about the prime numbers.
So that's a good question. So like
conjecturally we have a good model of
them. I mean like as I said I mean they
have certain patterns like the primes
are usually odd for instance but apart
from this of obvious patterns they
behave very randomly and just assuming
that they behave so there's something
called the crema random model of the
primes that that after a certain point
primes just behave like a random set. Um
and there's various slight modifications
this model but this has been a very good
model. It matches the numeric. It tells
us what to predict. Like I can tell you
with complete certainty the truth is
true. Uh the random model gives
overwhelming odds it is true. I just
can't prove it. Most of our mathematics
is optimized for solving things with
patterns in them. Um and the primes have
this anti-attern um as do almost
everything really. But we can't prove
that. Yeah. I guess it's not mysterious
that the prize be kind of random because
there no reason for them to be um uh to
have any kind of secret pattern but what
is mysterious is what is the mechanism
that really forces the randomness to
happen. Uh and this is just absent.
Another incredibly surprisingly
difficult problem is the colots's
conjecture. Oh yes. simple to state,
beautiful to visualize in its simplicity
and yet extremely
uh difficult to solve and yet you have
been able to make progress. Uh Paular
said about the coloss conjecture that
mathematics may not be ready for such
problems. Others have stated that it is
an extraordinarily difficult problem
completely out of reach this is in 2010
out of reach of present- day mathematics
and yet you have made some progress. Why
is it so difficult to make? Can you
actually even explain what it is? Oh,
yeah. So, it's it's it's a problem that
you can explain. Um yeah, it um it helps
with some um visual aids, but yeah, so
you take any natural number like say 13.
And you apply the the following
procedure to it. So, if it's even, you
divide it by two and if it's odd, you
multiply by three and add one. So, even
numbers get smaller, odd numbers get
bigger. So, 13 will become 40 because 13
* 3 is 39. Add one, you get 40. So, it's
a simple process for odd numbers and
even numbers. They're both very easy
operations. And then you put it
together. It's still reasonably simple.
Um, but then you ask what happens when
you iterate it. You take the output that
you just got and feed it back in. So, 13
becomes 40. 40 is now even divide by 2
is 20. 20 is still even divide by 10 2
10 5 and then 5 * 3 + 1 is 16. And then
8 4 2 1. So, uh, and then from 1 it goes
1 4 2 1 421. It cycles forever. So this
sequence I just described um yeah 13 40
20 10 so these are also called hailstone
sequences because there's an
oversimplified model of of hailstone
formation yeah which is not actually
quite correct but it's so somehow taught
to high school students as a first
approximation is that um like a a little
nugget of ice gets gets an ice crystal
forms in a cloud and it it goes up and
down because of the wind and sometimes
when it's cold it get acquires a bit
more mass and maybe it melts a little
bit and this process of going up down
creates this s of partially melted ice
which event hell stone and eventually it
falls out the earth. So the conjecture
is that no matter how high you start up
like you take a number which is in the
millions or billions you go this process
that that goes up if you're odd and down
if you're even eventually um goes down
to to earth all the time no matter where
you start with this very simple
algorithm you end up at one and you
might climb for a while right yeah so
yeah if you plot it um these sequences
they look like brownie in motion um they
look like the stock market you know they
just go up and down in a in a seemingly
random pattern and in Usually that's
what happens that that if you plug in a
random number, you can actually prove at
least initially that it would look like
um random walk. Um and that's actually a
random walk with a downward drift. Um
it's like if you're always gambling on
on roulette at at the casino with odds
slightly weighted against you. So
sometimes you you win, sometimes you
lose, but over in the long run you lose
a bit more than you win. Um and so
normally your wallet will hit will go to
zero um if you just keep playing over
and over again. So statistically it
makes sense. Yes. So, so the result that
I I proved roughly speaking such that
that statistically like 99% of all
inputs would would drift down to maybe
not all the way to one, but to be much
much smaller than what you started. So,
it's it's like if I told you that if you
go to a casino, most of the time you end
up if you keep playing for long enough,
you end up with a smaller amount in your
wallet than when you started. That's
kind of like the what the result that I
proved. So why is that result like can
you continue down that thread
to prove the full conjecture? Well, the
problem is that um my I I used arguments
from probability theory um and there's
always this exceptional event. So you
know, so in probability we have this
this law of large numbers um which tells
you things like if you play a casino
with a um a game at a casino with a
losing um expectation over time you are
guaranteed or almost surely with
probably probability as close to 100% as
you wish you're guaranteed to lose
money. But there's always this
exceptional outlier. Like it is
mathematically possible that even in
when the game is is the odds are not in
your favor, you could just keep winning
slightly more often than you lose. Very
much like how in Navia Stokes there
could be, you know, um most of the time
um your waves can disperse. There could
be just one outlier choice of initial
conditions that would lead you to blow
up. And there could be one outlier
choice of um um special number that they
stick in that shoots off infinity while
all other numbers crash to earth uh
crash to one. Um in fact um there's some
mathematicians um who Alex Kovvich for
instance who've proposed that um that
actually um these collat uh iterations
are like the similar automator um
actually if you look at what they happen
on in binary they do actually look a
little bit like like these game of life
type patterns. Um and in an analogy to
how the game of life can create these
these massive like self-plicating
objects and so forth possibly you could
create some sort of heavier than air
flying machine a number which is
actually encoding this machine which is
just whose job it is is to encode is to
create a version of itself which which
is larger heavier than air machine
encoded in a number that flies forever.
Yeah. So Conway in fact worked on worked
on this problem as well. Oh wow. So
Conway so similar in fact that was one
of inspirations for the Nebby Stokes
project that Conway studied
generalizations of the collapse problem
where instead of multiplying by three
and adding one or dividing by two you
have a more complicated branch but but
instead of having two cases maybe you
have 17 cases and then you go up and
down and he showed that once your
iteration gets complicated enough you
can actually encode touring machines and
you can actually make these problems
undecidable and and do things like this.
In fact, he invented a programming
language for uh these kind of fractional
linear transformations. He called a
factrat as a play on forrat. Uh and he
showed that that you could um you can
program it was too incomplete. You could
you could you could uh um you could make
a program that if if your number you
insert in was encoded as a prime, it
would sync to zero. It would go down
otherwise it would go up uh and things
like that. Um so the general class of
problems is is really uh as complicated
as all of mathematics. some of the
mystery of the cellular automa that we
talked about uh having a fra
mathematical framework to say anything
about cellular automa maybe the same
kind of framework is required yeah
injecture yeah if you want to do it not
statistically but you really want 100%
of all inputs to to fall to earth yeah
so what might be feasible is is
statistically 99% you know go to one but
like everything yeah that looks hard
what would you say is out of these
within reach famous problems is the
hardest problem we have today. Is there
a reman hypothesis? We want is up there.
Um POS MP is a good one because like uh
that's that's a meta problem like if you
solve that in the um in the positive
sense that you can find a PMP algorithm
that potentially this solves a lot of
other problems as well and we should
mention some of the conjectures we've
been talking about. You know a lot of
stuff is built on top of them. Now
there's ripple effects. P equ= 1 P has
more ripple effects than basically any
other right if the reman hypothesis is
disproven um that would be a big mental
shock to the number theorist uh but it
would have follow on effects for um
cryptography
um because a lot of cryptography uses
number theory um it uses number theory
constructions involving primes and so
forth and um it relies very much on the
intuition that number theories are built
over many many years of what operations
involving prime behave randomly and what
ones don't. Um, and in particular, our
encryption
um methods are designed to turn text
with information on it into text which
is indistinguishable from um from random
noise. So um and hence we believe to be
almost impossible to crack um at least
mathematically. Um but uh if something
has core to our belief as human
hypothesis is is wrong it means that
there are there are actual patterns of
the primes that we not aware of and if
there's one there's probably going to be
more. Um and suddenly a lot of our
crypto systems are in doubt. Yeah.
But then how do you then say stuff about
the the primes? Yeah. That you're going
towards the collect conjecture again. Um
because if I I you you want it to be
random, right? You want it to be
randomly. Yeah. So more broadly, I'm
just looking for more tools, more ways
to show that that that things are
random. How do you prove a conspiracy
doesn't happen, right? Is there any
chance to you that P equals NP? Is there
some Can you imagine a possible
universe? It is possible. I mean there's
there's various uh scenarios. I mean
there there's one where it is
technically possible but in practice is
never actually implementable. The
evidence is sort of slightly pushing in
favor of no that we probably is not
equal to NP. I mean it seems like it's
one of those cases similar similar to
reman hypothesis that I think the
evidence is le leaning pretty heavily on
the no. Certainly more on the no than on
on the yes. The funny thing about
picompy is that we have also a lot more
obstructions than we do for almost any
other problem. Um so while there's
evidence we also have a lot of results
ruling out many many types of approaches
to the problem. Uh this is the one thing
that the computer scientists have
actually been very good at. It's
actually saying that that certain
approaches cannot work. No go theorems.
It could be undecidable. We don't Yeah,
we don't know. There's a funny story I
read that when you won the Fields Medal,
somebody from the internet wrote you
and asked uh you know what are you going
to do now that you've won this
prestigious award? and and then you just
quickly very humbly said that, you know,
this a shiny metal is not going to solve
any of the problems I'm currently
working on. So, I'm just I'm going to
keep I'm going to keep working on them.
It's just first of all, it's funny to me
that you would answer an email in that
context, and second of all, it um it
just shows your humility. But anyway, uh
maybe you could speak to the Fields
Medal, but it's another way for me to
ask uh
about Gregoria Pearlman. What do you
think about him famously declining the
Fields Medal and the Millennial Prize,
which came with a $1 million of prize
money? He stated that I'm not interested
in money or fame. The prize is
completely irrelevant for me. If the
proof is correct, then no other
recognition is needed. Yeah. No, he's
he's somewhat of an outlier. Um even
among mathematicians who tend to uh to
have uh somewhat idealistic views. I've
never met him. I think I'd be interested
to meet him one day, but I I never had
the chance. I know people who met him,
but he's always had strong views about
certain things. Um, you know, I mean,
it's it's not like he was completely
isolated from the math community. I
mean, he would he would give talks and
write papers and so forth. Um, but at
some point he just decided not to engage
with the rest of the community. He was
he was disillusioned or something. Um, I
don't know. Um, and he decided to to uh
uh to peace out uh and you know, collect
mushrooms in St. Petersburg or
something. And then that's that's fine.
you know and you can you can do that. Um
I mean that's another sort of flip side.
I mean we are not a lot of our problems
that we solve you know they some of them
do have practical application and that's
that's great but uh like if you stop
thinking about a problem you know so
he's he hasn't published since in in
this field but that's fine there's many
many other people who've done so as
well. Um yeah so I guess one thing I
didn't realize initially with the fields
medal is that it it sort of makes you
part of the establishment. Um you know
so you know most mathematicians you
there's uh just career mathematicians
you know you just focus on publishing
the next paper maybe getting one to
promote one one rank you know and and
starting a few projects maybe taking
some students or something. Yeah. But
then suddenly people want your opinion
on things and uh you have to think a
little bit about you know things that
you might just so foolishly say because
you know no one's going to listen to
you. Uh it's more important now. Is it
constraining to you? Are you able to
still have fun and be a rebel and try
crazy stuff and well play with ideas? I
have a lot less free time than I had
previously. Um I mean mostly by choice.
I mean I I I obviously I have the option
to sort of uh decline. So I decline a
lot of things. I I could decline even
more. Um or I could acquire a reputation
for being so unreliable that people
don't even ask anymore. Uh this is I
love the different algorithms here. This
is great. This is it's always an option.
Um but you know um there are things that
are like
I mean so I mean I I I don't spend as
much time as I do as a postto you know
just just working on one problem at a
time or um fooling around. I still do
that a little bit but yeah as you
advance in your career somehow the more
soft skills so math somehow frontloads
all the technical skills to the early
stages of your career. So um yeah, so
it's as a post office publisher or
parish you're you're incent you're
incentivized to basically focus on on
proving very technical themsel
um as well as proof the theorems. Um but
then as as you get more senior you have
to start you know mentoring and and and
and giving interviews uh and uh and
trying to shape um direction of the
field both research wise and and you
know uh sometimes you have to uh u you
know do various administrative things
and it's kind of the right social
contract because you you need to to work
in the trenches to see what can help
mathematicians. the other side of the
establishment sort of the the really
positive thing is that um you get to be
a light that's an inspiration to a lot
of young mathematicians or young people
that are just interested in mathematics.
It's like it's just how the human mind
works. This is where I would probably uh
say that I like the fields metal
that it does inspire a lot of young
people somehow. I don't this just how
human brains work. Yeah. At the same
time, I also want to give sort of
respect to somebody like Gregoria
Pearlman who
is critical of awards in his mind. Those
are his principles and any human that's
able for their principles to like do the
thing that most humans would not be able
to do. It's beautiful to see. Some
recognition is is necessarily important.
Uh but yeah, it's it's also important to
not let these things take over your
life. um and like only be concerned
about uh getting the next big award or
whatever. Um I mean yeah so again you
see these people try to only solve like
a really big math problems and not work
on on on things that are less uh sexy if
you wish but but but actually still
interesting and instructive as you say
like the way the human mind works it's
um we understand things better when
they're attached to humans um and also
uh if they're attached to a small number
of humans like this this way our human
mind is is wired we can comprehend and
the relationships between you know 10 or
20 people you know but once you get
beyond like 100 people like there
there's a there's a limit I think
there's a name for it um beyond which uh
it just becomes the other um and so we
have you have to simplify the pole
master you know 99.9% of humanity
becomes the other um and uh often these
models are are incorrect and this causes
all kinds of problems but um so yeah so
to humanize a subject you know if you
identify a small number of people and
say you know these
representative people of the subject
role models for example um that has some
role um but it can also be um uh yeah
too much of it can be harmful because
it's
I'll be the first to say that my own
career path is not that of a typical
mathematician um I the very accelerated
education I skipped a lot of classes um
I think I was had very fortunate
mentoring opportunities um and I think I
was at the right place at the right time
just because someone does doesn't have
my um trajectory, you it doesn't mean
that they can't be good mathematicians.
I mean they be ma good mathematician in
a very different style. Uh and we need
people of a different style. Um and you
know even if and sometimes too much
focus is given on the on the person who
does the last step to complete um a
project in mathematics or elsewhere
that's that's really taken you know
centuries or decades with lots and lots
of building lots of previous work. Um,
but that's a a story that's difficult to
tell um if you're not an expert because,
you know, it's easier to just say one
person did this one thing. You know, it
makes for a much simpler history. I
think on the whole it um is a hugely
positive thing to to talk about Steve
Jobs as a representative of Apple when I
personally know and of course everybody
knows the incredible design, the
incredible engineering teams, just the
individual humans on those teams.
They're not a team. They're individual
humans on a team. And there's a lot of
brilliance there. But it's just a nice
shorthand like a very like pi. Yeah.
Steve Jobs. Yeah. Yeah. As as a starting
point, you know, as a first
approximation that's how you and then
read some biographies and then look into
much deeper. First approximation. Yeah.
That's right. Uh so you mentioned you
were a Princeton to um Andrew Wilds at
that time. He's a professor there. It's
a funny moment how history is just all
interconnected. And at that time he
announced that he proved the form last
theorem. What did you think maybe
looking back now with more context about
that moment in math history? Yes. So I
was a graduate student at the time. I
mean I I vaguely remember you know there
was press attention and uh um we all had
the same um we had pigeon holes in the
same mail room you know. So we all
picked our mail and like suddenly Andrew
W's mailbox exploded to be overflowing.
That's a good that's a good metric.
Yeah. um you know so yeah we we all
talked about it at at tea and so forth I
mean we we didn't understand most of us
didn't understand the proof um we
understand sort of high level details um
fact there's an ongoing project to
formalize it in lean right Kevin puzzly
yeah can can we take that small tangent
is it is it how difficult does that cuz
as as I understand the for last the
proof for uh for last theorem has like
super complicated objects yeah really
difficult to formalize now yeah I guess
yeah you're right the objects that they
use um you can define them. Uh so
they've been defined in lean. Okay. So
so just defining what they are can be
done. Uh that's really not trivial but
it's been done. But there's a lot of
really basic facts about um these
objects that have taken decades to prove
and that they're in all these different
math papers and so lots of these have to
be formalized as well. Um Kevin's uh
Kevin Buzzard's goal actually he has a
five-year grraft to formalize fossil
theorem and his aim is that he doesn't
think he will be able to get all the way
down to the basic axioms but he wants to
formalize it to the point where the only
things that he needs to rely on as black
boxes are things that were known by 1980
to um to number theorist at the time. Um
and then some other person some other
work would have to done to to to get
from there. Um so it's it's a different
area of mathematics than um the type of
mathematics I'm used to. Um um in
analysis, which is kind of my area, um
the objects we study are kind of much
closer to the ground. We study I study
things like prime numbers and and
functions and things that are within
scope of a high school um math education
to at least uh define. Um yeah, but then
there's this very advanced algebraic
side of number theory where people have
been building structures upon structures
for quite a while. Um and it's it's a
very sturdy structure. It's it's been
it's been very um at the base at least
is extremely well developed in the
textbooks and so forth. But um um it
does get to the point where um if you if
you haven't taken these years of study
and you want to ask about what what is
going on at um like level six of of this
tower, you have to spend quite a bit of
time before they can even get to the
point where you can see you see
something you recognize. What uh
inspires you about his journey that we
similar as we talked about seven years
mostly working in secret? Yeah. Uh that
is a romantic uh Yeah. So it kind of
fits with sort of the the romantic image
I think people have of mathematicians to
the extent they think of them at all as
these kind of eccentric uh you know
wizards or something. Um so that
certainly kind of uh uh accentuated that
perspective you know I mean it's it is a
great achievement his style of solving
problems is so different from my own um
but which but which is great. I mean we
we need people speak to it like what uh
in in terms of like the you like the
collaborative I like moving on from a
problem if it's giving too much
everybody. Um got it. But you need the
people who have the tenacity and the
fearlessness. Um you I've collaborated
with with people like that where where I
want to give up uh cuz the first
approach that we tried didn't work and
the second one didn't approach but
they're convinced and they have the
third fourth and the fifth approach
works. Um and I have to eat my words.
Okay. I didn't think this was going to
work, but yes, you were right all along.
And we should say for people who don't
know, not only are you known for the
brilliance of your work, but the
incredible productivity, just the number
of papers, which are all of very high
quality. So there's something to be said
about being able to jump from topic to
topic. Yeah, it works for me. Yeah, I
mean there also people who are very
productive and they focus very deeply on
Yeah. I think everyone has to find their
own workflow. Um like one thing which is
a shame in mathematics is that we have
mathematics there's sort of a one size
fits all approach to teach teaching
mathematics um and you know so we have a
certain curriculum and so forth I mean
you know maybe like if you do math
competitions or something you get a
slightly different experience but um I
think many people um they don't find
their their native math language uh
until very late or usually too late so
they they stop doing mathematics and
they have a bad experience with a
teacher who's trying to teach them one
way to do mathematics. They don't like
it. Um my theory is that um humans don't
come evolution has not given us a math
center of a brain directly. We have a
vision center and a language center and
some other centers um which have
evolution has honed but we it doesn't we
don't have innate sense of mathematics.
Um but our other centers are
sophisticated enough that different
people we we we can repurpose other
areas of our brain to do mathematics. So
some people have figured out how to use
the visual center to do mathematics and
so they think very visually when they do
mathematics. Some people have repurposed
their their language center and they
think very symbolically. Um, you know,
um, some people like if they are very
competitive and they they like gaming,
there's a type there's this part of your
brain that's very good at at at uh at
solving puzzles and games and and and
that can be repurposed. But like when I
talked about the mathematicians, you
know, they don't quite think they I can
tell that they're using some different
styles of of thinking than I am. I mean,
not not disjoint, but they they may
prefer visual. Like I I don't actually
prefer visual so much. I need lots of
visual aids myself. Um, you know,
mathematics provides a common language.
So, we can still talk to each other even
if we are thinking in in different ways.
But you can tell there's a different
set of subsystems being used in the
thinking process like they take
different paths. They're very quick at
things that I struggle with and vice
versa. Um, and yet they still get to the
same goal. Um, that's beautiful. And
yeah, but I mean the way we educate
unless you have like a personalized
tutor or something. I mean education
sort of just by natural scale has to be
mass-produced you know you have to teach
to 30 kids and you know if they have 30
different styles you can't you can't
teach 30 different ways. On that topic
what advice would you give to students
uh young students who are struggling
with math and but are interested in it
and would like to get better. Is there
something in this Yeah. um in this
complicated educational context, what
what would you Yeah, it's a tricky
problem. One nice thing is that there
are now lots of sources for mathematical
enrichment outside the classroom. Um so
in in in my day there already there are
math competitions. Um and you know there
also like popular math books in the
library. Um yeah but but now you have
you know YouTube uh there there are
forums just devoted to solving you know
math puzzles and um and math shows up in
other places you know like um for
example there there are hobbyists who
play poker for fun uh and um they they
you know they for very specific reasons
are interested in very specific
probability questions um and and they
actually know there's a community of
amateur proists in in in poker um in
chess, in baseball. I mean, there's
there's there's uh yeah um there's math
all over the place. Um and I'm I'm I'm
hoping actually with with these new sort
of tools for lean and so forth that
actually we can incorporate the broader
public into math research projects um
like this is almost is doesn't happen at
all currently. So in the sciences
there's some scope for citizen science
like astronomers uh they amateurs who
discover comets and there's biologists
there people who could identify
butterflies and so forth. Um and in
mathematics
whereum amateur mathematicians can like
discover new primes and so forth but but
previously because we have to verify
every single contribution um like most
mathematical research projects it would
not help to have input from the general
public. In fact, it would it would just
be be timeconuming because just error
checking and everything. Um but you know
one thing about these formalization
projects is that they are bringing
together more bringing in more people.
So I'm sure there are high school
students who've already contributed to
some of these these formalizing projects
who contributed into math liib. Um you
know you don't need to be a PhD holder
to just work on one atomic thing.
There's something about the
formalization here that also at as a
very first step opens it up to the
programming community too. The people
who are already comfortable with
programming. It seems like programming
is somehow maybe just the feeling but it
feels more accessible to folks than
math. Math is seen as this like extreme
especially modern mathematics seen as
this extremely difficult to enter area
and programming is not. So that could be
just an entry point. you can execute
code and you can get results. You know,
you can print a hello world pretty
quickly. Um, you know, like if uh if
programming was taught as almost
entirely theoretical subject where you
just taught the the computer science,
the theory of functions and and and
routines and so forth and and outside of
some some very specialized homework
assignments, you're not actually program
like on the weekend for fun. Yeah. Or
Yeah. They would be as considered as
hard as math. Mhm. Um Yeah. Yeah. So, as
I said, you know, there are communities
of non- mathematicians where they're
deploying math for some very specific
purpose, you know, like like optimizing
their poker game and and for them then
math becomes fun for them. Uh what
advice would you give in general to
young people how to pick a career, how
to find themselves like that's a tough
tough tough question. Yeah. So um
there's a lot less certainty now in the
world you know I mean I there was this
period after the war where uh at least
in the west you know if you came from a
good demographic you uh you know like
you there was a very stable path to to a
good career you go to college you get an
education you pick one profession and
you stick to it becoming much more a
thing of the past so I think you just
have to be adaptable and flexible I
think people have to get skills that are
transferable you know like like learning
one specific programming language or one
specific subject of mathematics or
something. It's it's it's that itself is
not a super transferable skill but sort
of knowing how to um reason with with
abstract concepts or how to problem
solve when things go wrong. So these are
things which I think we will still need
even as our tools get get better and you
know you you would be working with AI
sport and so forth. But actually you're
an interesting case study. I mean you're
like a
one of the great living mathematicians
right and then you had a way of doing
things and then all of a sudden you
start learning I mean first of all you
kept learning new fields but you learn
lean that's not that's a non-trivial
thing to learn like that's a that's a
for a lot of people that's an extremely
uncomfortable leap to take right yeah
mathematicians um first of all I've
always been interested in new ways to do
mathematics I I I feel like a lot of the
ways we do things right now are
inefficient. Um I I I I spend me my
colleagues, we spend a lot of time doing
very routine computations or doing
things that other mathematicians would
instantly know how to do and we don't
know how to do them. Uh and why can't we
search and get a quick response and so
that's why I've always been interested
in exploring new workflows.
About four or five years ago, I was on a
committee where we had to ask for ideas
for interesting workshops to run at a
math institute. And at the time, Peter
Schulzer had just formalized one of his
his um new theorems. And um there are
some other developments in computer
assisted proof that look quite
interesting. And I said, "Oh, we should
we should uh um we should run a workshop
on this. This be a good idea." Um and
then I was a bit too enthusiastic about
this idea. So I I got volunte.
Um, so I did with a bunch of other
people, Kevin Bisard and Jordan
Ellenburg and and a bunch of other
people. Um, and it was it was a a nice
success. We brought together a bunch of
mathematicians and computer scientists
and other people and and we got up to
speed and state um and it was really
interesting um developments that that
most mathematicians didn't know was
going on. Um that lots of nice proofs of
concept, you know, just sort of hints of
of what was going to happen. this was
just before chat GBD but there was even
then there was one talk about language
models and the potential um capability
of those in the future. So that got me
excited about the subject. So I started
giving talks um about this is something
we should more of us should start
looking at um now that I' arranged to
run this conference and then chat GPT
came out and like suddenly AI was
everywhere and so uh I got interviewed a
lot um about about this topic um and in
particular um the interaction between AI
and formal proof assistance and I said
yeah they should be combined this this
is this is um this perfect synergy to
happen here and at some point I realized
that I have to actually do not just talk
the talk but walk the book you know like
you know I don't work in machine
learning I and I don't work in proof
formalization and there's a limit to how
much I can just rely on authority and
saying you know I I'm a I'm a warn
mathematician just trust me you know
when I say that this is going to change
athletics and I'm not doing it any when
I don't do any of it myself so I felt
like I had to actually uh uh justify it
yeah a lot of what I get into actually I
don't quite see in advance as how much
time I'm going to spend on it and it's
only after I'm sort of waste deep in in
in in in a project that I I I realized
by that point I'm committed. Well,
that's deeply admirable that you're
willing to go into the fray be in some
small way a beginner, right? Or have
some of the sort of challenges that a
beginner would, right?
new concepts, new ways of thinking also,
you know, sucking at a thing that others
I think I think in that talk you could
be a fields med metal winning
mathematician and undergrad knows
something better than you. Yeah. Um I
think mathematics inherently I mean
mathematics is so huge these days that
nobody knows all of modern mathematics.
Um and inevitably we make mistakes and
um you know uh you can't cover up your
mistakes with just sort of bravado and
and uh I mean because people will ask
for your proofs and if you don't have
the proofs you don't have the proofs. Um
I don't love math. Yeah. So it does keep
us honest. I mean not not I mean you can
still it's not a perfect uh panacea but
I think uh we do have more of a culture
of admitting error than because we're
forced to all the time. Big ridiculous
question. I'm sorry for it once again.
Who is the greatest mathematician of all
time? Maybe one who's no longer with us.
Uh who are the candidates? Zyler, Gaus,
Newton, Raman, Hilbert. So, first of
all, as as mentioned before, like
there's there's some time dependent
on the day. Yeah. Like like if if you if
you if you plot cumulatively over time,
for example, Uklid like like sort of
like is is one of the leading
contenders. Um and then maybe some
unnamed anonymous mathematicians before
that um you know whoever came up with
the concept of of numbers you know you
know um do mathematicians today still
feel the impact of Hilbert just oh yeah
directly of everything that's happened
in the 20th century yeah Hilbert spaces
we have lots of things that are named
after him of course just the arrangement
of mathematics and just the introduction
of certain concepts I mean 23 problems
have been extremely influential
there's some strange power to the
declaring ing which problems are hard to
solve. The statement of the open
problems. Yeah. I mean this is bystander
effect in everywhere. Like if no one
says you should do X, everyone just
moves around waiting for somebody else
to to uh to do something and and like
nothing gets done. Um so and and like it
like it's one one thing that actually uh
you have to teach undergraduates in
mathematics is that you should always
try something. So um you see a lot of
paralysis um in an undergraduate trying
a math problem if they recognize that
there's a certain technique that that
can be applied they will try it but
there are problems for which they see
none of their standard techniques
obviously applies and the common
reaction is then just paralysis I don't
know what to do or um or I think there's
a quote from the Simpsons I've tried
nothing and I'm all out of ideas um so
you know like the next step then is to
try anything like no matter how stupid
um and in fact almost as stupid of the
better um which you know and one a
technique which is almost guaranteed to
fail but the way it fails is going to be
instructive um like it fails because you
you you're not at all taking into
account this hypothesis oh this
hypothesis must be useful that's a clue
I I think you also suggested somewhere
this this fascinating approach which
really stuck with me I started using it
and really works I think you said it's
called structured procrastination no yes
it's when you really don't want to do a
thing. Do you imagine a thing you don't
want to do more? Yes. That's worse than
that. And then in that way, you
procrastinate by not doing the thing
that's worse. Yeah. Yeah. It's a nice
It's a nice hack. It actually works.
Yeah. Yeah. This um I mean with anything
like you know I mean like you um
psychology is really important like you
you talk to athletes like marathon
runners and so forth and and they talk
about what's the most important thing is
it their training regimen or the diet
and so forth. Actually so much of it is
actually psychology. Um you know just
tricking yourself to to think that the
problem is feasible um so that you can
you're motivated to do it. Is there
something our human mind will never be
able to comprehend?
Well I sort of as a mathematician I mean
you
there must be some suffer that you can't
understand. That was the first thing
that came to mind. So that but even
broadly is there are we li is there
something about our mind that's we're
going to be limited even with the help
of mathematics well okay I mean like how
much augmentation are you willing like
like for example if if I didn't even
have pen and paper um like if I had no
technology whatsoever okay so I'm not
allowed blackboard pen and paper right
you're already much more limited than
you would be incredibly limited even
language the English language is a
technology
It's a It's one that's been very
internalized. So, you're right. There
really the the the formulation of the
problem is incorrect because there
really is no longer a just a solo human.
We're already augmented in extremely
complicated intricate ways, right? Yeah.
Yeah. We're already like a collective
intelligence. Yes. Yeah. Yes. So,
humanity plural has much more
intelligence in principle on it good
days than than the individual humans put
together. It can also have less. Okay.
But uh um yeah, so yeah, mathemat
mathematical community plural is is is
incredibly super intelligent uh entity
um that uh no single human mathematician
can can come closer to to replicating.
You see it a little bit on these like
question analysis sites. Um so this math
overflow which is the math version of
stack overflow and like sometimes you
get like this very quick responses to
very difficult questions from the
community. Um, and it's it's it's a
pleasure to watch actually as a as an
expert. I'm a fan spectator of that uh
of that site, just seeing the brilliance
of the different people, the um the
depth of knowledge that people have and
the the willingness to engage in the in
the rigor and the nuance of the
particular question. It's pretty cool to
watch. It's fun. It's almost like just
fun to watch. Uh what gives you hope
about this whole thing we have going on,
human civilization? I think uh yeah. Um
the younger generation is always like
like really creative and enthusiastic
and and inventive. Um it's a pleasure
working with with with uh with uh with
young students. Um
you know the uh the progress of science
tells us that the problems that used to
be really difficult can become extremely
you know can become like trivial to
solve. you know, I mean, like it was
like navigation, you know, just just
knowing where you were on the planet was
this horrendous problem. People died um
you know, or or lost fortunes because
they couldn't navigate, you know, and we
have devices in our pockets that do this
automatically for us, I guess, a
completely solved problem, you know. So
things that are seem unfeasible for us
now could be maybe just sort of homework
exercises for
Yeah. But one of the things I find
really sad about the finitness of life
is that I won't get to see all the cool
things we create as a civilization. You
know that cuz in the next 100 years, 200
years, just imagine showing showing up
in 200 years. Yeah. Well, already plenty
has happened, you know, like if if you
could go back in time and and talk to
your teenage self or something, you know
what I mean? Yeah. and just the internet
and and our AI. I mean again they
they've been in they're beginning to be
internalized and say yeah of course an
AI can understand our voice and and give
reasonable you know slightly incorrect
answers to to any question but yeah this
was mind-blowing even 2 years ago and in
the moment it's hilarious to watch on
the internet and so on the the drama uh
people take everything for granted very
quickly and then they we humans seem to
entertain ourselves with drama out of
anything that's created somebody needs
to take one opinion another person needs
to take an opposite opinion, argue with
each other about it. But when you look
at the arc of things, I mean just even
in progress of robotics. Yeah. Just to
take a step back and be like, "Wow, this
is beautiful that we humans are able to
create this." Yeah. When the
infrastructure and the culture is is
healthy, you know, the community of
humans can be so much more intelligent
and mature and and and rational than the
individuals within it. Well, one place I
can always count on rationality is the
comment section of your blog, which I'm
a big fan of. There's a lot of really
smart people there. And thank you, of
course, for uh for putting those ideas
out on the blog, and it's I can't tell
you how
uh honored I am that you would spend
your time with me today. I was looking
forward this for a long time, Terry. I'm
a huge fan. Um you inspire me. You
inspire millions of people. Thank you so
much for talking. Oh, thank you. It was
a pleasure.
Thanks for listening to this
conversation with Terrence Tao. To
support this podcast, please check out
our sponsors in the description or at
lexfreedman.com/sponsors.
And now, let me leave you with some
words from Galileo Galile.
Mathematics is a language with which God
has written the universe.
Thank you for listening and hope to see
you next time.
Loading video analysis...