Terence Tao: Hardest Problems in Mathematics, Physics & the Future of AI | Lex Fridman Podcast #472
HUkBz-cdB-k • 2025-06-14
Transcript preview
Open
Kind: captions
Language: en
The following is a conversation with
Terrence Tao. Widely considered to be
one of the greatest mathematicians in
history. Often referred to as the Mozart
of math, he won the Fields Medal and the
Breakthrough Prize in mathematics and
has contributed groundbreaking work to a
truly astonishing range of fields in
mathematics and physics.
This was a huge honor for me for many
reasons, including the humility and
kindness that Terry showed to me
throughout all our interactions. It
means the world. This is the Lex
Freedman podcast. To support it, please
check out our sponsors in the
description or at
lexfreedman.com/sponsors.
And now, dear friends, here's Terren
Tao.
What was the first really difficult
research level math problem that you
encountered? One that gave you pause
maybe. Well, I mean in your
undergraduate um education, you learn
about the really hard impossible
problems like the reman hypothesis, the
twin primes conjecture. You can make
problems arbitrarily difficult. That's
not really a problem. In fact, there's
even problems that we know to be
unsolvable. What's really interesting
are the problems just at the on the
boundary between what we can do
relatively easily and what are hopeless.
Um but what are problems where like
existing techniques can do like 90% of
the job and then you just need that
remaining 10%. Um I think as a PhD
student the CA problem certainly caught
my eye and it just got solved actually.
It's a problem I've worked on a lot in
my early research. Historically it came
from a little puzzle by the Japanese
mathematician Soji Kaya uh in like 1918
or so. Um, so the puzzle is that you you
you have um a needle um in on the plane.
Um think like like a like driving like
on on on a road something and you you
want it to execute a U-turn. You want to
turn the needle around. Um but you want
to do it in as little space as possible.
So you want to use as little area in
order to turn it around. So um but the
needle is infinitely maneuverable.
So you can imagine just spinning it
around its um as a unit needle. You can
spin it around its center. Um, and I
think, um, that gives you a disc of of
area, I think pi over four. Um, or you
can do a three-point U-turn, which is
what they we teach people in in the
driving schools to do. Uh, and that
actually takes area pi over 8. So, it's
it's a little bit more efficient than um
a rotation. And so, for a while, people
thought that was the most efficient uh
way to turn things around. But,
Mazikovich uh showed that in fact, you
could actually uh turn the needle around
using as little area as you wanted. So
0001 there was some really fancy multi-
um u back and forth U-turn thing that
you could you could do that that you
could turn a needle around and in so
doing it would pass through every
intermediate direction. Is this in the
two dimensional plane? This is in the
two dimensional plane. Yeah. So we
understand everything in two dimensions.
So the next question is what happens in
three dimensions. So suppose like the
Hubble space telescope is tube in space
and you want to observe every single
star in the universe. So you want to
rotate the telescope to reach every
single direction. And here's unrealistic
part. Suppose that space is at a
premium, which it totally is not. Uh you
want to occupy as little volume as
possible in order to rotate your your
needle around in order to see every
single star in the sky. Um how small a
volume do you need to do that? And so
you can modify basic construction. And
so if your telescope has zero thickness,
then you can use as little volume as you
need. That's a simple modification of
the two dimensional construction. But
the question is that if your telescope
is not zero thickness but but just very
very thin some thickness delta what is
the minimum volume needed to be able to
see every single direction as a function
of delta. So as delta gets smaller as
you need gets thinner the volume should
go down but but how fast does it go
down? Um and the conjecture was that it
goes down very very slowly um like
logarithmically um uh roughly speaking
and that was proved after a lot of work.
So this seems like a puzzle. Why is it
interesting? So it turns out to be
surprisingly connected to a lot of
problems in partial differential
equations, in number theory, in
geometry, comics. For example, in in
wave propagation, you splash some some
water around um you create water waves
and they they travel in various
directions. Um but waves exhibit both
both particle and wave type behavior. So
you can have what's called a wave
packet, which is like a a very localized
wave that is localized in space and
moving a certain direction in time. And
so if you plot it in both space and
time, it occupies a region which looks
like a tube. And so what can happen is
that you can have a wave which initially
is very dispersed but it all comes it
all focuses at a single point later in
time. Like you can imagine dropping a
pebble into a pond and ripples spread
out. But then if you time reverse that
that um that scenario and the equations
of wave motion are time reversible. You
can imagine ripples that are converging
um to a single point and then a big
splash occurs um maybe even a
singularity.
Um and so it's possible to do that. Uh
and geometrically what's going on is
that there's always s of light rays. Um
so like if if if this wave represents
light for example um you can imagine
this wave as a superp position of
photons um all traveling at the speed of
light. They all travel on these light
rays and they're all focusing at this
one point. So you can have a very
dispersed wave focus into a very
concentrated wave at one point in space
and time, but then it defocuses again
and it separates. But potentially if the
conjecture had a negative solution. So
what that meant is that there's there's
a very efficient way to pack um tubes
pointing different directions into a
very very narrow region of of of very
narrow volume. Then you would also be
able to create waves that start out some
there'll be some arrangement of waves
that start out very very dispersed but
they would concentrate not just at a
single point but um um there'll be a
large um there'll be a lot of
concentrations in space and time and uh
um and you could create what's called a
blowup where these waves their amplitude
becomes so great that the laws of
physics that they're governed by are no
longer wave equations but something more
complicated and nonlinear. Um and so in
mathematical physics we care a lot about
whether certain equations in in wave
equations are stable or not whether they
can create um these singularities.
There's a famous unsolved problem called
the Navia Stokes regularity problem. So
the Navia Stokes equations equations
that govern the fluid flow for
incompressible fluids like water. The
question asks if you start with a smooth
velocity field of water can it ever
concentrate so much that like the
velocity becomes infinite at some point
that's called a singularity. We don't
see that um in real life. You know, if
you splash around water on the bathtub,
it won't explode on you. Um or or have
have water leaving at the speed of
light, I think. But potentially, it is
possible. Um and in fact, in recent
years, the the consensus has has drifted
towards the uh the belief that uh that
in fact for certain very special initial
configurations of of say water that
singularities can form. But people have
not yet been able to uh to actually
establish this. The clay foundation has
these seven millennium prize problems
has a million dollar prize for solving
one of these problems that this is one
of them. Of these seven only one of them
has been solved the point conjecture by
Pelman. So the Ka conjecture is not
directly directly related to the Navis
Stokes problem but understanding it
would help us understand some aspects of
things like wave concentration which
would indirectly probably help us
understand the Navis problem better. Can
you speak to the neighbors? So the
existence and smoothness like you said
millennial prize problem right you've
made a lot of progress on this one in
2016 you published a paper finite time
blow up for an averaged threedimensional
navia stoke equation right
so we're trying to figure out if this
thing usually doesn't blow up right but
can we say for sure it never blows up
right yeah so yeah that is literally the
the million- dollar question yeah so
this is what distinguishes
mathematicians from pretty much
everybody else like it
If something holds 99.99% of the time,
um that's good enough for most, you
know, uh for for most things, but
mathematicians are one of the few people
who really care about whether every like
100% really 100% of all um situations
are covered by by um yeah, so most fluid
most of the time um water that does not
blow up. But could you design a very
special initial state that does this?
And maybe we should say that this is a
this is a set of equations that govern
in the field of fluid dynamics. Trying
to understand how fluid behaves and it's
actually turns out to be a really comp
you know fluid is yeah extremely
complicated thing to try to model. Yeah.
So it has practical importance. So this
clay price problem concerns what's
called the incompressible navio stokes
which governs things like water. There's
something called the compressible navio
stokes which governs things like air.
And that's particularly important for
weather prediction. Weather prediction
it does a lot of computational fluid
dynamics. A lot of it is actually just
trying to solve the ny stokes equations
as best they can. Um also gathering a
lot of data so that they can get they
can in initialize the equation. There's
a lot of moving parts. So it's very
important practically. Why is it
difficult to prove general things
about the set of equations like it not
not blowing up? Short answer is
Maxwell's demon. Um so exos demon is a
concept in thermodynamics like if you
have a box of two gases and oxygen and
hydrogen uh and maybe you start with all
the oxygen one side and nitrogen the
other side but there's no barrier
between them right then they will mix um
and they should stay mixed right there
there's no reason why they should unmix
but in principle because of all the
collisions between them there could be
some sort of weird conspiracy that that
um like maybe there's a microscopic
demon called Maxwell's demon that will
um every time a oxygen and nitrogen atom
collide they will bounce off in such a
way that the oxygen sort of drifts onto
one side and then goes to the other and
uh you could have an extremely
improbable configuration emerge. Uh
which we never see. Um and and we
statistically it's extremely unlikely
but mathematically it's possible that
this can happen and we can't rule it
out. Um and this is a situation that
shows up a lot in mathematics. Um a
basic example is the digits of pi
3.14159 and so forth. The digits look
like they have no pattern and we believe
they have no pattern. On the long term,
you should see as many ones and twos and
threes as fours and fives and sixes.
There should be no preference in the
digits of pi to favor let's say 7 over
8. Um, but maybe there's some demon in
the digits of pi that that like every
time you compute more digits, it sort of
biases one digit to another. Um, and
this is a conspiracy that should not
happen. There's no reason it should
happen, but um there's there's there's
no way to prove it.
uh with our current technology. Okay. So
getting back to Nabia Stokes, a fluid
has a certain amount of energy and
because a fluid is in motion, the energy
gets transported around and water is
also viscous. So if the energy is spread
out over many different locations, the
natural viscosity of the fluid will just
damp out the energy and will it will go
to zero. Um and this is what happens um
in um uh when we actually experiment
with water like you splash around there.
there's some turbulence and waves and so
forth. But eventually it it settles down
and and and the the lower the amplitude,
the smaller the velocity, the the more
calm it gets. Um but potentially there
is some sort of a demon that keeps
pushing the uh the energy of the fluid
into a smaller and smaller scale and it
will move faster and faster and at
faster speeds the effective viscosity is
relatively less. And so it could happen
that that it it creates a some sort of
um um what's called a self similar
blowup scenario where you know um the
energy of fluid starts off at some um
large scale and then it all sort of um
transfers it energy into a smaller um
region of of of the fluid which then at
a much faster rate um moves into um an
even smaller region and so forth. Um and
and each time it does this uh it takes
maybe half as as long as as the previous
one and then you you could you could
actually uh converge to all the energy
concentrating in one point in a finite
amount of time. Um and that that's uh
that scenario is called finite blow up.
Um so in practice this doesn't happen.
Um so water is what's called turbulent.
Um so it is true that um if you have a
big eddy of water it will tend to break
up into smaller eddies but it won't
transfer all the the energy from one big
eddy into one smaller eddy. It will
transfer into maybe three or four and
then those must split up into maybe
three or four small edies of their own
and so the energy gets dispersed to the
point where the viscosity can can then
keep that thing under control. Um but if
it can somehow um concentrate um all the
energy keep it all together um and do it
fast enough that the viscous effects
don't have enough time to calm
everything down then this blob can
occur. So there were papers who had
claimed that oh you just need to take
into account conservation energy and
just carefully use the viscosity and you
can keep everything under control for
not just Navia Stokes but for many many
types of equations like this and so in
the past there have been many attempts
to try to obtain what's called global
regularity for Navio Stokes which is the
opposite of final time blow up that
velocity say smooth and it all failed
there was always some sign error or some
subtle mistake and and it couldn't be
salvaged. Um so what I was interested in
doing was trying to explain why we were
not able to disprove um planet time blow
up. I couldn't do it for the actual
equations of fluids which were too
complicated. But if I could average the
equations of motion of naval basically
if if um if I could turn off certain
types of of ways in which water
interacts and only keep the ones that I
want. Um, so in particular, um, if, um,
if there's a fluid and it could transfer
energy from a large Eddie into this
small Eddie or this other small Eddie, I
would turn off the energy channel that
would transfer energy to this this one
and and direct it only into um, this
smaller Eddie while still preserving the
law of conservation of energy. So you're
trying to make it blow up. Yeah. Yeah.
So I I I basically engineer um, a blow
up by changing the laws of physics,
which is one thing that mathematicians
are allowed to do. We can change the
equation. How does that help you get
closer to the proof of something? Right?
So, it provides what's called an
obstruction in mathematics. Um, so, so
what I did was that uh basically if I
turned off the um certain parts of the
equation, so which usually when you turn
off certain interactions make it less
nonlinear, it makes it more regular and
less likely to blow up. But I found that
by turning off a very well-designed set
of of of of interactions, I could force
all the energy to blow in finite time.
So what that means is that if you wanted
to prove um global regularity for Navia
Stokes um for the actual equation you
had you must use some feature of the
true equation which which my artificial
equation um does not satisfy. So it it
rules out certain um certain approaches.
So um the thing about math is is it's
not just about finding you know taking a
technique that is going to work and
applying it but you you need to not take
the techniques that don't work. Um and
for the problems that are really hard,
often there are dozens of ways that you
might think might apply to solve the
problem. But uh it's only after a lot of
experience that you realize there's no
way that these methods are going to
work. So having these counter examples
for nearby problems um kind of rules out
um uh it saves you a lot of time because
you you're not wasting um energy on on
things that you now know cannot possibly
ever work. How deeply connected is it to
that specific problem of fluid dynamics
or just some more general intuition you
build up about mathematics? Right. Yeah.
So the key phenomenon that uh my my
technique exploits is what's called
superc criticality. So in partial
differential equations often these
equations are like a tugof-war between
different forces. So in Navia Stokes
there's the dissipation um force coming
from viscosity and it's very well
understood. It's linear. It calms things
down. If if viscosity was all there was,
then then nothing bad would ever happen.
Um but there's also transport um that
that energy from in one location of
space can get transported because the
fluid is in motion to to other
locations. Um and that's a nonlinear
effect and that causes all the all the
problems. Um so there are these two
competing terms in the Davis equation
the dissipation term and the transport
term. If the dissipation term dominates,
if it's if it's large, then basically
you get regularity. And if um if the
transport term dominates, then uh then
we don't know what's going on. It's a
very nonlinear situation. It's
unpredictable. It's turbulent. So
sometimes these forces are in balance at
small scales, but not in balance at
large scales or or vice versa. Um so
Navis Stokes is what's called
supercritical. So at at smaller and
smaller scales, the transport terms are
much stronger than the viscosity terms.
So the viscosity are the things that
calm things down. Um and so this is um
um this is why the problem is hard in
two dimensions. So the Soviet
mathematician ladish skaya she in the
60s shows in two dimensions there is no
blow up and in two dimensions the nav
equations is what's called critical the
effect of transport and the effect of
viscosity about the same strength even
at very very small scales and we have a
lot of technology to handle critical and
also subcritical equations and proof um
regularity but for superc critical
equations it was not clear what was
going on
and I did a lot of work and then there's
been a lot of follow-up showing that for
many other types of superc critical
equations you create all kinds of blow
up examples. Once the nonlinear effects
dominate the linear effects at small
scales, you can have all kinds of bad
things happen. So this is sort of one of
the main insights of this this line of
work is that superc criticality versus
criticality and subcriticality. This
this makes a big difference. I mean
that's a key qualitative feature that
distinguishes some equations for being
sort of nice and predictable and you
know like like planetary motion and I
mean there are certain equations that
that you can predict for millions of
years and or thousands at least. Again,
it's not really a problem, but but
there's a reason why we can't predict
the weather past 2 weeks into the future
because it's a super critical equation.
Lots of really strange things are going
on at very fine scales. So, whenever
there is some huge source of
nonlinearity,
yeah, that can create a huge problem for
predicting what's going to happen. Yeah.
And if the nonlinearity is somehow more
and more featured and interesting at at
small scales. Um I mean there's there's
many equations that are nonlinear but um
in in many equations you can approximate
things by the bulk. Um so for example
planetary motion you know if you want to
understand the orbit of the moon or Mars
or something you don't really need the
micro structure of like the seismology
of the moon or or like exactly how the
mass is distributed. um you just
basically you can almost approximate
these planets by point masses and just
the aggregate behavior is important um
but if you want to model a fluid um like
like the weather you can't just say in
Los Angeles the temperature is this the
wind speed is this for super critical
equations the finance confirmation is is
really important if we can just linger
on the narto's uh equations a little bit
so you've suggested maybe you can
describe it that one of the ways to uh
solve it or to negatively resolve it
would be to
sort of to construct a liquid a kind of
liquid computer, right? And then show
that the halting problem from
computation theory has consequences for
fluid dynamics. So uh show it in that
way. Can you describe this this Yeah. So
this came out of of this work of
constructing this this this average
equation that that blew up. Um so one um
as as part of how I had to do this. So
there this naive way to do it. You you
just keep pushing um um every time you
you get energy at one scale you you push
it immediately to the next scale as as
fast as possible. This is sort of the
naive way to to to to force blow up. Um
it turns out in five and high dimensions
this works. Um but in three dimensions
there was this funny phenomenon that I
discovered that if you if you keep if if
you change the laws of physics you just
always keep trying to push um the energy
into smaller smaller scales. Um what
happens is that the energy starts
getting spread out into multi many
scales at once. Um so that you you have
energy at one scale you're pushing it
into the next scale and then um as soon
as it enters that scale you also push it
to the next scale but there's still some
energy left over from the previous
scale. um you're trying to do everything
at once. Um and this spreads out the
energy too much. Um and then it turns
out that that um it makes it vulnerable
for viscosity to come in and actually
just damp out everything. So um so it
turns out this this direct bush doesn't
doesn't actually work. There was a
separate paper by some other authors
that actually showed this um in three
dimensions. Um so what I needed was to
program a delay. Um so kind of like air
locks. So um I needed an equation which
would start with a fluid doing something
at one scale. It would push this energy
into the next scale but it would stay
there until all the energy from the from
the larger scale got transferred and
only after you pushed all the energy in
then you sort of open the next gate and
and then you you push that in as well.
So um by doing that it kind of the
energy inches forward scale by scale in
such a way that it's always um localized
at one scale at a time. Um and then it
can resist the effects of viscosity
because it's not dispersed. Um so in
order to make that happen um yeah I had
to construct a rather complicated
nonlinearity. Um and it was basically
like um you know like was constructed
like electronic circuit. So I I actually
thank my wife for this because she was
trained as a electrical engineer. Um and
um you know he talked about um uh you
know he had to design circuits and so
forth. And you know if if you want a
circuit that does a certain thing like
maybe have a light that that flashes on
and then turns off and then on and then
off. You can build it from from more
primitive components you know capacitors
and resistors and so forth and you have
to build a diagram and you um and these
diagrams you can you can sort of follow
your eyeballs and say oh yeah the the
current will build up here and then it
will stop and then it will do that. So I
knew how to build the analog of basic
electronic components, you know, like
resistors and capacitors and so forth.
And and I would I would stack them
together um in in such a way that that I
would create something that would open
one gate and then there'll be a clock
that would and then once the clock hits
a certain threshold it would close it
kind of a rude Goldberg type machine but
described mathematically and this ended
up working. So what I realized is that
if you could pull the same thing off for
the actual equations. So if the
equations of water support a computation
so um like if you can imagine kind of a
steampunk but really water punk uh type
of thing where um you know so modern
computers are electronic you know they
they they're powered by by electrons
passing through very tiny wires and
interacting with other electrons and so
forth. But instead of electrons, you can
imagine these pulses of of water moving
at certain velocity and maybe it's
they're two different configurations
corresponding to a bit being up or down.
Probably if you had two of these moving
bodies of water collide, it would come
out with some new configuration which is
which would be something like an ANDgate
or orgate. you know that if the the the
output would depend in a very
predictable way on on the inputs and
like you could chain these together and
maybe create a touring machine and and
then you could you have computers which
are made completely out of water um and
if you have computers then maybe you can
do robotics so I you know hydraulics and
so forth um and so you could create some
machine which is basically a fluid
analog what's called a vonomian machine
so vonomian proposed if you want to
colonize Mars. The sheer cost of
transporting people machines to Mars is
just ridiculous. But if you could
transport one machine to Mars and this
machine had the ability to mine the
planet, create some more materials to
smelt them and build more copies of the
same machine. Um, then you could
colonize a whole planet um over time.
Um, so uh if you could build a fluid
machine, which uh yeah, so it's it's
it's a it's a robot. Okay. And what it
would do it its purpose in life, it's
programmed so that it would create a
smaller version of itself in some sort
of cold state. It wouldn't start just
yet. Once it's ready, the big robot
configuration water would transfer all
his energy into the smaller
configuration and then power down. Okay?
And then like I clean itself up. And
then what's left is this newest state
which would then turn on and do the same
thing but smaller and faster. And then
the equation has a certain scaling
symmetry. Once you do that, it can just
keep iterating. So this in principle
would create a blow up uh for the actual
Navia Stokes and this is what I managed
to accomplish for this average Navia
Stokes. So it provided the sort of road
map to solve the problem. Now this is uh
a pipe dream because uh there are so
many things that are missing for this to
actually be a reality. Um so um I I I
can't create these basic logic gates. Um
I I don't I don't have these in these
special configurations of water. Um, I
mean there's candidates there things
called vortex rings that might possibly
work but um um but also you know analog
computing is really nasty um compared to
digital computing. I mean because
there's always errors um you you have to
you have to do a lot of error correction
along the way. I don't know how to
completely power down the big machine so
that it doesn't interfere with the the
running of the smaller machine but
everything in principle can happen like
it doesn't contradict any of the laws of
physics. Um so it's sort of evidence
that this thing is possible. Um there
are other groups who are now pursuing
ways to make navis blow up which are
nowhere near as ridiculously complicated
as this. Um um they they actually are
pursuing much closer to the the direct
self similar model which can it doesn't
quite work as is but there could be some
simpler scheme than what I just
described to make this work. There is a
real leap of genius here to go from
Navia Stokes to this touring machine. So
it goes from what the self similar blob
scenario that you're trying to get the
smaller and smaller blob to now having a
liquid toying machine gets smaller and
smaller and smaller and somehow seeing
how that
could be used
to say something about a blowup. I mean
that's a big leap. So there's precedent.
I mean um so the the thing about
mathematics is that it's really good at
um spotting connections between what you
think of what you might think of as
completely different um problems. Um but
if if the mathematical form is the same
you you can you you can you can draw a
connection um so um there's a lot of
work previously on what called cellular
automator um the most famous of which is
Conway's game of life. there's this
infinite discrete grid and at any given
time the grid is either occupied by a
cell or it's empty and there's a very
simple rule that uh tells you how these
cells evolve. So sometimes cells live
and sometimes they die. Um and this um
you know um when I was a a student it
was a very popular screen saver to
actually just have these these
animations going and and they look very
chaotic. In fact they look a little bit
like turbulent float sometimes. But at
some point people discovered more and
more interesting structures within this
game of life. Um so for example they
discovered this thing called a glider.
So a glider is a very tiny configuration
of like four or five cells which evolves
and it just moves at a certain direction
and that's like this this vortex rings
this um yeah so this is an analogy the
game of life is kind of like a discrete
equation and and um the flu navis is a
continuous equation but mathematically
they have some similar features um and
um so over time people discovered more
and more interesting things you could
build within the game of life. The game
life is a very simple system. It only
has like three or four rules um to to do
it, but but you can design all kinds of
interesting configurations inside it. Um
there's something called a glider gun
that does nothing to spit out gliders
one at a one one at a time. Um and then
after a lot of effort, people managed to
to create um and gates and or gates for
gliders. Like there's this massive
ridiculous structure which if you if a
if you have a stream of gliders um
coming in here and a stream of gliders
coming in here then you may produce a
stream of gliders coming out. If so
maybe if both of of the um streams um
have gliders then there'll be an output
stream but if only one of them does then
nothing comes out. Mhm. So they could
build something like that. And once you
could build and um these basic gates
then just from software engineering you
can build almost anything. Um you can
build a touring machine. I mean it's
like an enormous steampunk type things.
They look ridiculous. But then people
also generated self-replicating objects
in the game of life. A massive machine a
bon machine which over a huge period of
time and it always look like glider guns
inside doing these very steampunk
calculations. it would create another
version of itself which could replicate.
It's so incredible. A lot of this was
like community crowdsourced by like
amateur mathematicians actually. Um so I
knew about that that that work and so
that is part of what inspired me to
propose the same thing with Navia
Stokes. Um which is a much as I said
analog is much worse than digital like
it's going to be um you can't just
directly take the constructions in the
game of life and plunk them in. But
again it just it shows it's possible.
You know, there's a kind of emergence
that happens with these cellular automa.
Local rules.
Maybe it's similar to fluids. I don't
know. But local rules operating at scale
can create these incredibly complex
dynamic structures. Do you think any of
that is amendable to mathematical
analysis?
Do we have the tools to say something
profound about that? The thing is you
can get this emerg in very complicated
structures but only with very carefully
prepared initial conditions. Yeah. So so
these these these glider guns and and
gates and and so forth machines if you
just plunk down randomly some cells and
you and you will not see any of these.
Um and that's the analogous situation
with Navia Stokes again you know that
that with with typical initial
conditions you you will not have any of
this weird computation going on. Um but
basically through engineering you know
by by by specially designing things in a
very special way you can make clever
constructions. I wonder if it's possible
to prove the sort of the negative of
like basically prove that only through
engineering can you ever create
something interesting. This this is a
recurring challenge in mathematics that
um I call it the dichotomy between
structure and randomness. That most
objects that you can generate in
mathematics are random. They look like
rand like the digits of pi. Well, we
believe is a good example. Um, but
there's a very small number of things
that have patterns. Um, but um, now you
can prove something has a pattern by
just constructing, you know, like if
something has a simple pattern and you
have a proof that it it does something
like repeat itself every so often. You
can do that. But um, and you you can
prove that that for example, you can you
can prove that most sequences of of
digits have no pattern. Um, so like if
you just pick digits randomly, there's
something called low large numbers. It
tells you you're going to get as many
ones as as twos in the long run. Um but
um we have a lot fewer tools to to to if
I give you a specific pattern like the
digits of pi how can I show that this
doesn't have some weird pattern to it.
Some other work that I spend a lot of
time on is to prove what are called
structure theorems or inverse theorems
that give tests for when something is is
very structured. So some functions are
what's called additive like if you have
a function that maps natural numbers
with natural numbers. So maybe um you
know two maps to four three maps to six
and so forth. um some functions what's
called additive which means that if you
add if you add two inputs together the
output gets gets added as well uh for
example multiplying by a constant if you
multiply a number by 10 um if you if you
multiply a plus b by 10 that's the same
as multiplying a by 10 and b by 10 and
then adding them together so some um
functions are additive some are kind of
additive but not completely additive um
so for example if I take a number n I
multiply by the square root of two and I
take the integer part of that So 10 by
square of two is like 14 point
something. So 10 up to 14. Um 20 up to
28. Um so in that case additively is
true then. So 10 + 10 is 20 and 14 + 14
is 28. But because of this rounding
sometimes there's roundoff errors and
and sometimes when you um add a plus b
this function doesn't quite give you the
sum of of the two individual outputs but
the sum plus minus one. Um so it's
almost additive but not quite additive.
Um so there's a lot of useful results in
mathematics and I've worked a lot on
developing things like this to the
effect that if if a function exhibits
some structure like this then um it's
basically there's a reason for why it's
true and the reason is because there's
there's some other nearby function which
is actually um completely structured
which is explaining this sort of partial
pattern that you have. Um and so if you
have these so inverse theorems it um it
creates this sort of dichotomy that they
either the objects that you study are
either have no structure at all or they
are somehow related to something that is
structured. Um and in either way in
either um in either case you can make
progress. Um a good example of this is
that there's this old theorem in
mathematics called sim theorem proven in
the 1970s. It concerns trying to find a
certain type of pattern in a set of
numbers. the patterns that have make
progression things like 3 five and seven
or or or 10 15 and 20 andreli
proved that um any set of of numbers
that are sufficiently big um what's
called positive density has um
arithmetic progressions in it of of any
length you wish um so for example um the
odd numbers have a set of density 1/2 um
and they contain arithmetic progressions
of any length um so in that case it's
obvious because the the odd numbers are
really really structured I can just take
11 13 15 17 I just I can I can easily
find arithmetic progressions in in in
that set. Um but um zerminism also
applies to random sets. If I take the
set of odd numbers and I flip a coin um
and for each number and I only keep the
numbers which for which I got a heads
okay so I just flip coins. I just
randomly take out half the numbers I
keep one half. So that's a set that has
no no patterns at all. But just from
random fluctuations, you will still get
a lot of um um of arithmetic
progressions in that set. Can you prove
that
there's arithmetic progressions of
arbitrary length within a random? Yes.
Um have you heard of the infinite monkey
theorem? Usually mathematicians give
boring names to theorists, but
occasionally they they give colorful
names. Yes. The popular version of the
infinite monkey theorem is that if you
have an infinite number of monkeys in a
room with each with a typewriter they
type out uh text randomly almost surely
one of them is going to generate the
entire screw of Hamlet or any other
finite string of text. Uh it will just
take some time quite a lot of time
actually but if you have an infinite
number then it happens. Um so um
basically the the if you take an
infinite string of of digits or whatever
um eventually any finite pattern you
wish will emerge. Um it may take a long
time but it will eventually happen. Um
in particular arithmetic progressions of
any length will eventually happen. Okay.
But you need that but you need an
extremely long random sequence for this
to happen. I suppose that's intuitive.
It's just infinity. Yeah. Infinity
absorbs a lot of sins. Yeah. How are we
humans supposed to deal with infinity?
Well, you can think of infinity as as as
just an abstraction of um a finite
number for which you you do not have a
bound for um that uh you know I mean so
nothing in real life is truly infinite.
Um but you know you can um you know you
can ask yourself questions like you know
what if I had as much money as I wanted
you know or what if I could go as fast
as I wanted and a way in which
mathematicians formalize that is
mathematics has found a formalism to
idealize instead of something being
extremely large or extremely small to
actually be exactly infinite or zero. Um
and often the the mathematics becomes a
lot cleaner when you do that. I mean in
physics we we joke about uh assuming
spherical cows. um you know like real
world problems have got all kinds of
real world effects but you can idealize
send certain things to infinity send
certain things to zero um and um and the
mathematics becomes a lot simpler to
work with there. I wonder how often
using infinity
uh forces us to deviate from um the
physics of reality. Yeah. So there's a
lot of pitfalls. Um so you know we we
spend a lot of time in undergraduate
math classes teaching analysis. Um and
analysis is often about how to take
limits and and and and whether you you
know so for example a plus b is always b
plus a. Um so when you have a finite
number of terms you add them you can
swap them and there there's no problem.
But when you have infinite number of
terms there these sort of shell games
you can play where you can have a series
which converges to one value but you
rearrange it and it suddenly converges
to another value. And so you can make
mistakes. You have to know what you're
doing when you allow infinity. Um you
have to introduce these epsilons and
deltas and and this there's a certain
type of way of reasoning that helps you
avoid mistakes. Um
in more recent years um people have
started taking results that are true in
infinite limits and what's called
finetizing them. Um so you know that
something's true eventually but um you
don't know when. Now give me a rate.
Okay. Okay, so it's such a if I have
don't have an infinite number of monkeys
but but a large finite number of
monkeys, how long do I have to wait for
H to come out? Um and that's a more
quantitative question. Um and this is
something that you can you can um attack
by purely finite methods and you can use
your finite intuition. Um and in this
case it turns out to be exponential in
the length of the text that you're
you're trying to generate. Um so um and
so this is why you never see the monkeys
create Hamilton. you can maybe see them
create a four-letter word, but nothing
that big. And so I personally find once
you finitize an infinite statement, it's
it does become much more intuitive and
it's no longer so so weird. Um so even
if you're working with infinity, it's
good to finitize so that you can have
some intuition. Yeah. The downside is
that the finite groups are just much
much messier and and uh yeah. So so the
infinite ones are found first usually
like decades earlier and then later on
people finize them. So since we
mentioned a lot of math and a lot of
physics uh what is the difference
between mathematics and physics as
disciplines as ways of understanding of
seeing the world maybe we can throw in
engineering in there you mentioned your
wife is an engineer give it new
perspective on circuits right so this
different way of looking at the world
given that you've done mathematical
physics so you you've you've worn all
the hats right so I think science in
general is interaction between three
things um there's the real world um
there's is what we observe of the
reward, our observations and then our
mental models as to how we think the
world works. Um so um we can't directly
access reality. Okay. Uh all we have are
the observations which are incomplete
and they they have errors. Um and um
there are many many cases where we would
um uh we want to know for example what
is the weather like tomorrow and we
don't yet have the observation we'd like
to a prediction. Um and then we have
these simplified models sometimes making
unrealistic assumptions you know
spherical cow type things. Those are the
mathematical models. Mathematics is
concerned with the models. Science
collects the observations and it
proposes the models that might explain
these observations. What mathematics
does we we stay within the model and we
ask what are the consequences of that
model? what observations would what
predictions would the model make of the
of future observations um or past
observations does it fit observed data
um so there's definitely a symbiosis um
it's ma I guess mathematics is is
unusual among other disciplines is that
we start from hypothesis like the axims
of a model and ask what conclusions come
up from that that model um in almost any
other discipline uh you start with the
conclusions you know I want to do this I
want to build a bridge, you know, I I
want to to make money. I want to do
this. Okay. And then you you you find
the path to get there. Um
a lot there there's a lot less sort of
speculation about suppose I did this,
what would happen? Um you know, planning
and and and modeling um uh speculative
fiction maybe is one other place. Uh but
uh that's about it actually. Most of
things we do in life is conclusions
driven including physics and science.
You I mean they want to know you know
where is this asteroid going to go? What
was what what is the weather going to be
tomorrow? Um but um Bathe also has this
other direction of of going from the uh
the axioms. What do you think there is
this tension in physics between theory
and experiment? Mhm. What do you think
is the more powerful way of discovering
truly novel ideas about reality? Well,
you need both top down and bottom up. Um
yeah, it's it's a real interaction
between all these things. So over time
the observations and the theory and the
modeling should both get closer to
reality. But initially and it is I mean
this is um this is always the case. You
know they're always far apart to begin
with. Um but you need one to figure out
where to push the other you know. So um
if your model is predicting anomalies um
that are not picked up by experiment
that tells experimenters where to look
you know um to to to to find more data
to refine the models. Um yeah so it it
it goes it goes back and forth. Um
within mathematics itself there's
there's also a theory and experimental
component. It's just that until very
recently theory has dominated almost
completely like 99% of mathematics is
theoretical mathematics and there's a
very tiny amount of experimental
mathematics. Um I mean people do do it
you know like if they want to study
prime numbers or whatever they can just
generate large data sets and with a so
once we had computers um we be to do it
a little bit. Um although even before
well like Gaus for example he discovered
he conjectured the most basic theorem in
in number theory to call the prime
number theorem which predicts how many
primes that up to a million up to a
trillion. It's not an obvious question
and basically what he did was that he
computed I mean mostly um by himself but
also hired human computers um people who
whose professional job it was to do
arithmetic um to compute the first
100,000 tribes or something and made
tables and made a prediction um that was
an early example of experimental
mathematics
um but until very recently it was not um
yeah I mean theoretical mathematics was
just much more successful I mean because
doing complicated mathematical
computations is uh was just not not
feasible until very recently. Uh and
even nowadays, you know, even though we
have powerful computers, only some
mathematical things can be um explored
numerically. There's something called
the comatorial explosion. If you want us
to study, for example, Zodius the you
want to study all possible subsets of
the numbers 1 to a,000. There's only
1,000 numbers. How bad could it be? It
turns out the number of different
subsets of of 1 to a,000 is 2 to the^
1,000 which is way bigger than than that
any computer can currently can can in
fact anybody ever will ever um
enumerate. Um so you have you have to be
um there are certain math problems that
very quickly become just intractable to
attack by direct brute force
computation. Uh chess is another um
famous example. The number of chess
positions uh we can't get a computer to
fully explore.
But now we have AI um um we have tools
to explore this space not with 100%
guarantees of success but with
experiment you know so like um we can
empirically solve chess now for example
we have we have very very good AIs that
that can you know they don't explore
every single position in in the game
tree but they have found some very good
approximation um and people are using
actually these chess engines to make uh
to do experimental chess um that they're
revisiting old chess theories about, oh,
you know, when you this type of opening,
you know, this is a good, this is a good
type of move, this is not, and they can
use these chess engines to actually
refine in some case overturn um um
conventional wisdom about chess. And I
do hope that uh that mathematics will
will have a larger experimental
component in the future perhaps powered
by AI. We'll of course talk about that
but in the case of chess and there's a
similar thing in mathematics that I
don't believe it's providing a kind of
formal explanation of the different
positions. It's just saying which
position is better or not that you can
intuit it as a human being and then from
that we humans can construct a theory of
the matter. You've mentioned the Plato's
cave allegory. Mhm. So in case people
don't know, it's where people are
observing shadows of reality, not
reality itself, and they believe what
they're observing to be reality. Is that
in some sense what mathematicians and
maybe all humans are doing is um looking
at shadows
of reality? Is it possible for us to
truly access
reality? Well, there these three
onlogical things. there's actual
reality, there's our observations and
our our models. Um, and technically they
are distinct and I think they will
always be distinct. Um, but they can get
closer um over time. Um, you know, so um
and the process of getting closer often
means that you you have to discard your
initial intuitions. Um so um like
astronomy provides great examples you
know like you know like you an initial
model of the world is is flat because it
looks flat you know and um and that it's
and it's big you know and the rest of
the universe the skies is not you know
like the sun for example looks really
tiny um and so you start off with a
model which is actually really far from
reality um but it fits kind of the
observations that you have um you know
so you know so things look good you know
but but over time as you make more and
more observations bring it closer to to
reality Okay. Um the model gets dragged
along with it and so over time we had to
realize that the earth was round that it
spins. It goes around the solar system.
Solar system goes around the galaxy and
so on and so forth. And the guys
universe is expanding the expansion
itself expanding accelerating and in
fact very recently in this year. So this
uh even the acceleration of the universe
itself is this evidence that this
non-constant and uh the explanation
behind why that is it's catching up. Um
it's catching up. I mean it's still you
know the dark matter or dark energy this
this kind of thing. We have we have a
model that sort of explains that fits
the data really well. It just has a few
parameters that um you have to specify.
Um but so you know people say that's
fudge factors you know with with enough
fudge factors you can explain anything.
Um but uh the mathematical point of the
model is that um you want to have fewer
parameters in your model than data
points in your observational set. So if
you have a model with 10 parameters that
explains 10 10 observations that is a
completely useless model. It's what's
called overfitted. But like if you have
a model with you know two parameters and
it explain
Resume
Read
file updated 2026-02-14 12:02:01 UTC
Categories
Manage