DeepMind Scientist: Causal AI & Intelligence | Andrew Lampinen Ep 13 | CausalBanditsPodcast.com

By Causal Python with Alex Molak

Summary

## Key takeaways - **Passive observation yields causal strategies**: LLMs trained passively on internet text can still acquire generalizable causal strategies because that text contains interventions—science papers, Stack Overflow debugging, conversations—where each utterance is itself an intervention that reveals causal structure. [02:23], [03:41] - **Over-constraining reasoning breeds fragility**: Hard-coding inductive biases like formal logic hierarchies makes AI systems brittle to distribution shift; softer signals such as explanation-prediction objectives preserve flexibility and scale better, echoing Sutton's Bitter Lesson. [44:25], [45:51] - **Symbolic logic is a tool, not intelligence**: Logical and symbolic systems are useful instruments an intelligent fuzzy reasoner can deploy, but intelligence itself is continuous and statistical—humans build theorem provers precisely because they aren't naturally good formal reasoners. [23:38], [24:10] - **Understanding is graded, not binary**: Lampinen rejects the 'just predicting next tokens' framing: understanding is a degree measured by how well a system answers questions about a domain, and by that yardstick LLMs possess some understanding, just as humans differ in how deeply they grasp concepts. [29:09], [29:42] - **Adaptive irrationality beats strict rationality**: Humans and LLMs both reason more accurately when semantic content supports logical conclusions—being selectively irrational on unfamiliar inputs is rational if it optimizes performance over the situations one actually encounters. [40:37], [41:23] - **Gopnik's infants mirror passive causal learning**: Children in Alison Gopnik's famous studies first watch an experimenter intervene on detectors and only later play themselves—strikingly paralleling how passively trained LLMs can later deploy causal strategies they never directly practiced. [27:41], [28:15]

Topics Covered

Passive Learning Yields Active Causal Strategies
Understanding Is Graded, Not Binary
Formal Reasoning Requires Rigorous Training
Selective Irrationality Can Be Adaptive
Constraints Make AI Systems Fragile

Full Transcript

you try to constrain the reasoning processes too strongly that actually is going to make the system more fragile because as soon as there's something weird in the world that doesn't quite match your assumptions the system will

totally break down the objective of humans and of language models isn't to be rational reasoners it's to hey causal Bandits welcome to the

causal Bandits podcast the best podcast on causality and machine learning on the internet this week we're traveling to London to meet our guest as a child even Lov to play chess he studied math and

physics but decided to purse a PhD in cognitive psychology because it seemed less abstract he loves rock climbing and plays guitar senior research scientist

at Google Deep Mind ladies and Gentlemen please welcome Dr Andrew lampin let me pass it to your host Alex

mola ladies and Gentlemen please welcome Dr Andrew lampinen welcome to the podcast Andrew thank you for having me how are you today doing

pretty well how are you I'm very good we have some sun in London yeah it's a lucky day you came in at the right time great Andrew in one of the recent papers

you published which was a part of a series of papers um about large language model and causality released in the second half of

2023 in the title of this paper you talk about active and passive strategies this is a little bit maybe unusual when we think about traditional

ways of talking about machine learning models in general in the context of causality where AOS would tend to extrapolate the naming convention from

Pearl's causal hierarchy and talk about models or training regimes that are Interventional or observational or counterfactual what dictated this choice

of of of wording or this choice of Concepts yeah I'm glad you brought that up because that's actually one of the key distinctions that we wanted to make in this paper um because although

language models are trained passively that is they're just processing language data from the internet that has been generated by someone else that doesn't necessarily imply that that data is

purely observational so for example if you're reading a scientific paper even though you're merely absorbing that data you're not out there making those experimental interventions yourself those data remain Interventional data

and you can learn about real causal information from those and so the first thing that we point out in the paper is that actually language data on the Internet is Interventional it contains science papers it contains stack

Overflow posts where people are debugging something and trying some experiments and seeing what works and what doesn't it contains just conversations where people are talking to each other and each thing they say is an intervention in that conversation and

so even though language models are learning passively they're learning from data that's Interventional and that is an important distinction does this fact um

impact their capabil capabilities to generalize beyond what we what we maybe traditionally think about this that is exactly what we wanted to explore in the paper so in the paper we suggest that

there's two reasons that this kind of training language data on the internet could give some sort of causal strategies or understanding that could generalize beyond the data that's it's been trained on so the first case is

what we call causal strategies and what we mean by that is that by learning from others interventions the models might be able to discover a strategy for intervening that they could apply in a

new situation to discover new causal structures then to use those for some Downstream goal and so what we suggest in the paper and suggest formally and then show empirically is that precisely

you can discover from purely passively observing someone else's interventions a generalizable strategy for intervening to determine causal structures that a system could then deploy Downstream can you tell our audience a little bit more

about how you structured the learning task for for for the language models in your paper and what were the main insights based on the results that you were able to to achieve and show in the

paper yeah so one of the hard things about studying large language models of course is that it's hard to understand everything that's going into the training Corpus even if we can search through it it's hard to know all the things that are in there that might just

be slightly rephrased or something and so one of the things that we try to do in our program of work is to do more controlled experiments where we train a simpler model on a data distribution that we really understand and then see

how well it generalizes or doesn't so what we did in this paper is that we trained a model on a distribution of data which shows interventions on causal

dag so on each document you could think or each episode in the data the model sees a series of interventions on a dag and then it is sees a goal like maximize

this variable and a series of interventions that try to achieve that goal and so what we wanted to test in the paper is if the model sees passively this data of

interventions that are trying to discover a causal structure and then use that causal structure to achieve a goal on a set of dags that hold out certain kinds of causal structures so it doesn't see everything in training can it

generalize to itself intervene actively and really discover new Cal structures and exploit them at test time and so the way we test this is just like a language model when you deploy it suddenly it

becomes active right it's talking to a user interv intervening on the user rather than just passively absorbing data anymore we similarly train the system passively we test it interactively and we show that the

system is able to apply the causal strategies that are passively observed in training and use them to actively intervene at test time discover new causal structures and exploit them then we compare the way that the model is

doing this to various heuristic or associational strategies we show that the model is much more closely approximating correct causal reasoning than any of those simpler baselines in another paper in in in the series of

papers um called causal parrots Co offered by mate zich who was our guest in the zero episode of this podcast uh the offers propose a hypothesis that

says that large language models are learning a so-called Meta Meta structural causal model and this meta structural causal model is learned based

on the how the authors call it the correlations of causal facts that are present in the in the training data and so one of the proposed

conclusions in this in this paper is that causal models can talk causality but they do not reason causally do you feel that the results from your paper are contradicting this hypothesis or

maybe are they complimentary in some sense well I would say to some extent they are contradicting it in the sense that I think our results suggest that actually the models are capable of

discovering some sort of causal reasoning algorithm that they can apply in a new setting in a generalizable way at least if they have a good enough training regime now I think that on

natural data the models probably are learning of a variety of things and of course in our training distribution there were a lot of interventions whereas natural data might have quite a few more correlations and only

occasional Interventional data and so that data mixture will change what the models are learning and they might be learning a lot of strategies that are more sort of like correlational meta like and relatively fewer causal

strategies however what we show at the end of our paper is that actually if you test language models on sort of like experimenting to discover causal Str

structure tasks similar to the ones we used for our simpler experiments in certain cases in particular if you give them explanations in the prompt they're actually able to learn to do this pretty effectively to discover new causal

structures that aren't included in the prompt and so I think that suggest that there is at least enough Interventional data going on in the language model training distributions that they are

able to discover some of these strategies but in terms of what they do on average or in like most of the cases you deploy them it could be something a little bit more like what they're describing in the Cal parot paper

definitely in your paper you also emphasize the importance of of explanations this maybe directs us to to

a broader topic of structuring the call the the training regime in a c in a certain way what are have thoughts on this is the training Paradigm or

training regime that that we today use to train those models helpful for them to use Cal structures or not necessarily I mean I think that the training Paradigm we use today is very driven by

what works and what allows you to use data at scale I think almost always if you were designing a training Paradigm for a model you would try to leverage

more kind of auxiliary tasks for example that help it to understand the data if there's something you know about the data yourself so uh what we did with explanations in this paper and what we've done in some prior papers is to

show that if you know something about a task like you know for example why a reinforcement learning agent is getting a reward from the environment giving it a natural language explanation of that reward and asking it to just predict

those explanations as part of the learning process can actually improve what it's learning and you can even use this in cases where the data are totally confounded to shape how the modeles out

of distribution so I think that to the exent that you know more about the data then it's there's definitely many things you could do to better structure the what the model is learning and to change

the way it generalizes I think the tricky thing is that you know we don't know that much about the structure of data on the internet and it's very you know there's a trade-off between trying

to do something that is that we really understand at small scale and trying to get as general a system as possible and since language modeling is a relatively new well language modeling itself isn't

a new field but training large language models is quite a new paradigm I think people are just barely starting to scrape the surface of what you can do in terms of structuring the training process in a more interesting way so one

example of this is that people have recently started conditioning these models on a signal of quality like a reward estimate for the data or a quality estimate during the training process and that actually can help in

certain cases to disentangle the training a bit more and to get a system that at the end if you say okay now produce only highquality things at test time you can still learn from the worst

data but you can push the system more to generate high quality responses at test time and this is similar to things people have done in reinforcement offline RL type contexts for a while and things like decision Transformer or

upside down RL where you really just condition on a signal of how good something is that allows you to learn from the bad data without necessarily replicating that data at test time in your paper you are talking about large

language models but also agents uh for those people in our audience who are less familiar with with reinforcement learning paradigms could you give a brief introduction or brief description

how about the intuition how large language models and agents are related so actually I think that the term agents is quite General and large language models can fit within the Paradigm of

passively trained agents so I think of an agent as just a system that takes inputs such as observations of the world and produces output actions in a

sequential decisionmaking problem and so this covers for example language models which take inputs in natural language and produce a sequence of language and response to that maybe an interaction with the user it also considers things

like the kinds of like chess and go playing engines that we've worked on in the past at Deep Mind also systems that play video games like Atari it covers quite a wide range of systems the key

part to me is is this aspect of interaction with the environment and one of the key distinctions to make here is systems that are interacting with the environment at training time or only at

testing time if you have an agent it's probably because you want it to interact at some point but often you train this these agents at least partly on data where they are not interacting but for example they're just observing what

expert Starcraft players or expert go players would do and learning from that and then later on you might do some reinforcement learning training where you're actually playing them against each other against other agents and then

giving them rewards if they win and then using that as a way to make to improve them and that's similar to the paradigms people are using with language models now where they first train them on a lot of passive data just language that

humans have generated on the internet and then they do some fine tuning and then they finally do a reinforcement learning step where they train a reward model based on what which kinds of language responses humans prefer and

they use that to actually reward the models for producing the kinds of responses humans will like some people would argue that auto regressive models might not be very well in generating

very long text or predicting long time series because there might be the mechanism of accumulating the error so

the model is conditioning the next token or the next Whatever item data point on its previous generation and this

generation necess necessarily contains some error and this error can be reinforced over time

steps interventions might be used in order to decrease the the the amount of error how much intervention and action

or action do we need to make autoaggressive models reliable for long longer form content let's let's think about text generation or video generation so I think there's a lot of

things that go into this question I mean one point I want to make is that even with current language models at least the state-of-the-art systems they have some ability to air correct in the sequences they're producing if you do something like Chain of Thought you will

occasionally see the models like produce something and then say Oh no that's a mistake actually I should do this right so it's not the case that air is necessarily M monotonically increasing over time although I will agree that the

systems are not particularly good at this yet I think that part of what's important for this is that the system is able to reconsider the prior context and

maybe find errors in its res in that way and so the architecture of the system and the for example its context length and its ability to use the earlier Generations may affect its ability to do

that in non-trivial ways now coming to the intervention part it's a well-known fact that passive learning of systems is not very efficient and tends to lead to worse generalization out of distribution that doesn't mean that it's impossible

for these systems to generalize but simplistic strategies for Passive learning at least you tend to get this problem where on the data distribution the system is accurate but once it starts being active it will move off

that data distribution and then it will start to break down because it's never seen any situation like this before and so in the offline reinforcement learning literature for a long time there's been techniques like dagger which basically

you get a little bit of Interventional data that sort of allows you to recover back to the data distribution if you step off of it and those kind of strategies are quite important for getting a robust system in practice so

you could see things like the active reinforcement learning people do with language models at the end of training as being a kind of way of doing this or even if you're doing supervised tuning but on real Generations the model

produces when interacting with users this is a kind of Interventional data but the question of how much of that data you need to get a robust model is I think a very hard question to answer you would have to make very strong

assumptions on what the data generating process are like what kind of structures you're trying to get the model to learn how how reliably the model can air correct and I think all of those questions are quite open at the moment

so I don't think it's easy to put put a number on it I see a very interesting parallel between what you just mentioned and the ideas from from causal inference literature where people talk about the

causal data Fusion or mixing observational and Interventional data in order to recover on on one hand recover the the causal structure of the problem

on the other hand maximize the efficiency of inference have you ever thought about those two ideas those those two p paths and relations

potential relationships between them yeah that's a great question so we didn't really address this in the paper but we did have some cases where we sort of like got the system to use some what

we called like prior knowledge from the literature you could think of it as coming from like an observational Source where we sort of cue it on which variables in a system might be relevant and then the system learns in a generalizable way to intervene only on

the relevant varibles so it's sort of like most efficiently determining the causal structure that are NE that are necessary and not exploring the things that aren't you could imagine doing this similarly with this system that also

combines observational data in as a way of sort of efficiently discovering which things might be the most promising to intervene on or as a part of the learning process more generally I think

that that similarly would work pretty well and um yeah I I I have some colleagues like Rosemary K who have worked on things in this space so I definitely think it's a promising area

where are we today from your perspective when it comes to building useful sequential decision making systems useful for what well useful for

some problems so I guess like I don't personally use large language models that much but I would consider them a sequential decision-making system and I've talked to a number of people who

have found them quite useful for things like writing for idea generation even for just sort of like critiquing their thinking on a certain problem that I think is you know so in some sense I

would say language models are a useful sequential decision-making system I wouldn't say they're a useful autonomous sequential decision-making system perhaps that would be an important distinction to make right they're useful if you have a human in the loop at least

for right now because they're way too unreliable to use without a human but I think they're useful in certain cases and honestly I'm more interested in AI systems as tools to help humans rather

than AI systems to be an autonomous agent at least in deployment time right I'm I'm more excited about for example things that a scientist could use as a tool like for helping them to understand

some complex system better or helping them to reason about a problem I think that is probably going to be one of the most important and impactful use cases so as far as I'm concerned language

models are a useful sequential decision maker of course there could be a more useful sequential decision maker and I'm sure there'll be improvements in that very soon you said about helping scientists helping humans Building

Systems that are that are helping helpful for someone what drives you in your work yeah that's a great question because I I actually feel like a lot of the work I do is not particularly useful compared to some of my colleagues who

are working on things that help scientists like doing protein structure prediction or Fusion or something um I guess I'm mostly motivated by curiosity and by the hope that by developing some

understanding of the systems that we're working with we can better understand how to improve them and in ways that will be deployed Downstream to help improve scientific applications or other things so I guess I'm motivated

primarily by curiosity and by the hope that doing the sort of pure research that I do or more conceptual work that it will be useful Downstream in some application but we'll see if it is or

not you studied you studed mathematics and physics um and then you mve to cognitive psychology with elements of cognitive science neuroscience and so on and so on U and then you ended up at

Deep Mind or Google Deep Mind as we should call it now working with algorithms that nevertheless uh in many

cases somehow interact with with some environments what qualities or resources were the most helpful in your

journey to go through all those stages and and work in one of the the best known AI labs in the in the world I think my foundation in physics and

Mathematics has been really useful to me across a lot of the stages of my career because I think you know working in physics and in mathematics you get you develop good intuitions for things like linear algebra that crop up all over the

place in the statistics we do in cognitive psychology in thinking about how machine learning systems work and so on so I think that those quantitative foundations have been very useful to me

and also sort of programming foundations that I acquired there and in in high school as well the main resource I'd point to is actually the the people that I've interacted with along the way

because I think one of the things I find most valuable work about working at Deep Mind is the incredible people who were there who come from all sorts of different backgrounds and similarly during my PhD I was very lucky to work

with an adviser and in a department that has a diversity of backgrounds and has just really supportive people who are excited to teach you about the things that they're interested in and so yeah I

think I've learned the most from the people I've met along the way MH do you think that causality is necessary for

generalization I think that causality is one thing you can look for as a tool of generalization I mean I I often think about problems like in mathematics because of my background there and for

example in mathematics if I have a system that's like a a theorem that generalizes to some new theorems I don't know if thinking about that in terms of causal causality is the most useful way to understand that because I tend to

think of math as being fundamentally about equivalences that don't really have a causal direction right however I think in terms of a lot of real world systems and identic systems causality at

least offers very useful intuitions for thinking about how a system will generalize and where things might break down um and so I do find intuitions about confounding and intervention and

so on to be quite useful and practical at least in how I think about the problems of generalization for a gentic system specifically you mentioned

mathematical system fear improvers uh those systems are often are

built or maybe always are built as as um some logical uh as a series of logical

steps um which involve symbolic operations so in contrast to latch language models that learn well at least that's that's my understanding that

learn some kind of a smooth approximation U or smooth representation of Concepts symbolic systems will be more discrete on more discrete

end there's a whole field called neuros symbolic Ai and a Ab with neuros symbolic causal AI as well um in some of your work you all you're also interested

in representation learning do you think that combining representation learning as we usually do it today in in this

soft differentiable way with symbolic systems more discrete systems is um is a way or a path that has a bright future I

do but I think I think this in kind of the opposite way that of many other people and in particular what I'd like to say is that I think symbolic systems logical reasoning systems are useful

tools for an intelligent system to use but they are not intell relevant to intelligence in itself and what I mean by that is actually if you think about humans we are not particularly good logical reasoners we are not

particularly good mathematical reasoners without extensive training right and because of that we build tools like you know mathematical proving systems and things like that to help us with the

parts of solving problems that we're not particularly good at but I would say those tools are most useful in so far as they're used by a system that is intelligent and I think that that intelligence tends to take something

that's more the form of a continuous fuzzy sort of reasoning system and the symbolic logic is just a useful tool for those systems to use to tackle certain kinds of constraint problems that those

logical systems are good at so I do think that using symbolic approaches in certain situations like in mathematics or in physics or in many other computational Sciences is very very

useful but I think that the most effective most General ways to approach that will be through systems that put the fuzzy learning uh continuous systems

in control and just use the logical systems as sort of a a tool to use just like humans use yeah yeah I'm happy you said this I think I agree and not not maybe not many people uh think about

this in this direction so I feel less alone in the world um I often also argue that people are not that good in causality um which might be well

counterintuitive sometimes when we think about causality in humans in particular uh we'll often if you start reading about this we'll often end up at some

point at least that's one of the paths you can take with with research coming from Alison gopnik who's a developmental

psychologist a very prominent one he and she and and people in her team has shown repeatedly that human babies uh when they interact with with

the world they do it in a systematic way that allows them to build World models and it seems that the motivation for this exploration or at least some

structure that allows them to do this in a way that is systematic uh is at least to some extent uh inherited inborn or

developed through evolutionary paths maybe do you think that these types of um solutions that come from Evolution I

don't know if you would agree with me that it comes from from from Evolution but assuming that it does do you think that those can be

imitated uh in in a simple computational manner or is there something maybe some underlying complexity in the EV evolutionary process that is not that

easy to to ulate yeah I think that's a great question I don't think we know the answer yet one different instance I'll point to is the fact that you know for a long time people thought that

grammatical abilities of humans were evolutionary and couldn't possibly be learned from any finite amount of data particularly only positive examples I think what language models show is that

actually you know you might need more language data than humans learn from but you don't need that much more and actually people are trying to push that amount of data down even further and get it to a really humanlike regime as a

side note I actually think that embodied experience might be quite important in our learning efficiency in language and we had a paper a few years back showing that systems that are embodied in richer en environments show better language

generalization for example now coming back to the causality point though I don't think we know yet exactly what it will take to get human-like inductive biases to a system maybe you do need

more data or maybe the active experience that infants have even when they're growing up before they come to Allison's lab is quite important one thing that I think is very interesting I'm glad you brought up Allison's work because some

of her most famous studies are actually quite close to the Paradigm I'm describing of passive observation because the children don't perform the experiments in these studies themselves at least not at first they observe a lab

assistant putting some things onto different detectors and seeing what lights them up and what doesn't and then it's only from observing that uh those interventions that the children get some sense of the causal structure that is

then tested in some trials where the kids are actually allowed to play with the detectors and the different things so I think it is actually it has a somewhat close analogy to the kinds of learning that I'm describing in this passive learning paper um and I think

there's some very interesting connections to be made there now Allison and some people from her lab did try these kind of studies with language models uh in a paper that came out a little bit before minded and they showed

that the language models are still not particularly good at them so it does seem like there's something there that's missing and perhaps it is sort of this early interactive experience that the models are missing but I think we'll

have to do some more exploring to resolve that often when we talk about large language models we can see some people giving comments about understanding so they say ah they are just predicting the next token but they

don't really understand what language is they don't have the semantics are in a way uh similar to the Chinese rumor

argument what is understanding I guess I tend to think of understanding and many other properties as being sort of graded things that are you know it's not like a

system is discreetly oh it has understanding or it doesn't rather it's like well understanding is something you have to a degree it's the degree of like richness of representation you have of something degrees of ways you can

generalize that so again returning to math like I think about this a lot in the context of you know if you have a student in an intro calculus class they might understand differentiation for example to some extent but probably

their math teacher understands it better and someone who's like a researcher in sort of an analysis would understand it even better right there's sort of different degrees of understanding of a

concept even if you're able to work work with it to some extent so I think that language models do have some degrees of understanding of certain features of course there's some aspects of our

understanding of the world for example that are really about the phenomena of perception that they can't possibly learn about but that doesn't mean they don't have any understanding at all and similarly the

fact that they are trained by just predicting the next token I don't think that means they don't have understanding I would think of understanding as you know the degree to which that if you test them on something by like asking

them questions about it they get those questions right that's how I would measure understanding and I think by that measure they have some understanding often imperfect but then if you ask the average human questions

they'll often get some of them wrong so I would still say that there's some understanding there we started our conversation today um talking about your paper about passive strategies and

active strategies and and so on and you shared with us some of the results showing that large language models can

generalize to new um causal context if the training have certain certain properties some time ago uh franois Chet

published uh his set of tasks to measure what he called in calls intelligence um these are tasks called Arc or sets of tasks called Arc and Arc 2 and it seems

that large language models are um well there's a lot of detail to this but in some sense they are not performing very well in in those tasks in in some sense

those tasks what is important about them is that there is a concept and there is a structure and the concept and the structure are orthogonal what kind kind of training or what kind of data would

we need for large language models in order to make them perform better on this type of tasks yeah that's a great question so one thing I want to bring up in this context is there is a nice paper

by Taylor web and Keith Holio um came out maybe 6 months or a year ago looking at the ability of language models to understand analogical reasoning problems like the ones that humans are have been

tested out in the past and Keith Holio is one of the people who did a lot of the earlier work on human analogical re and so an analogical problems are really a an example of this case where you have this structure and you have some

superficial details around it the details are orthogonal to the structure and so you can test the same underlying abstract structure in different situations so they tested these language

models on some problems that are analogy problems with letter strings and things like that that are somewhat similar to some of the arc problems and they did find that the later generation models are doing increasingly well on these

problems even in Fairly comp often nearly as well as are um so I do think that to some ex you can get some performance on these problems out of

these systems at the same time I think that you know one of the things I think about a lot is in human in humans formal reasoning is really an ability that we

really have to train in depth right like you learn to be a formal Reasoner through years of education maybe like 20 years of education to become like a researcher in mathematics or logic or

something like that right and so I think that the we perhaps underestimate the extent to which a system needs really rigorous and perhaps interactive

training to efficiently learn these kind of formal reasoning strategies and we maybe overestimate the extent to which humans are just naturally good reasoners and I think part of the reason that we

overestimate this is that the people who run these studies are exceptional and they tend to be highly educated humans right they are the people who have gone through these years of rigorous training

in order to become a more formal Reasoner I think it we sort of suffer from this expert blind spot where it's hard to step back and say well actually how easy is it for the average person to

reason about a logical problem or to do these sort of structure mapping uh reasoning processes and so one slightly different example of this is in a recent paper we looked at how language models

in humans reason about logical problems that have some underlying semantic content and we looked at what we showed is that basically language models and humans both do better when the semantic

content of the problem supports their logical reasoning and they do much worse when the semantic content contradicts that the sort of conclusions that you would draw from logic and so what I think this is pointing to is the fact

that you know like humans language models are imperfectly using logic in order to address a problem but that's maybe kind of a natural thing to do unless you've been very carefully

educated to take a more rigorous strictly formal approach we as humans are also exceptionally good in building defense

mechanisms that allow us to think about as ourselves as better than than we might be in certain uh in certain contexts it's about reasoning as well

I'm pretty convinced yeah definitely I mean I think that you know I make all kinds of reasoning mistakes all the time and you know if if someone's skeptical of reasoning saw some of those they would say oh that's a system that really

doesn't understand but I think I like to think at least that I do yeah yeah I think we we we we all like to and I think it's healthy actually to think

this way about ourselves at least to an healthy extent whatever it is Andrew what would be your advice to people who are entering a complex field

like causality or reinforcement learning or machine learning or theoretical physics my perspective on this which comes from perhaps from from physics to some extent is that the best way to

learn about something is to play around with it right so if you're learning about a mathematical concept or if you're learning about a causality concept or a reinforcement learning concept the best way to learn is to like

code That Thing Up try it out like change some parameters see how things vary see you know if you change from more observational data to more Interventional data what does the system learn or fail to learn doing those kind

of experiments yourself and playing around with it I think it's the best way to build the intuitions that I think are kind of the foundation of our understanding what's your message to the coal python code

Community I guess one of the things that I take away from our paper that I think is kind of an important point is that you know if you have a system that

treats different levels of abstraction for example the raw data themselves but also the causal abstractions about them and even counterfactual questions like what would have happened if we'd done

something else in sort of a homogeneous way you actually enable kinds of reasoning that might not so obvious like being able to learn about causal strategies for example and so I think

that my message would be you know think about how you can build more holistic systems that are capable of bridging understanding across multiple levels and I think I'd be really excited to see

where the Cal Community takes that in your work uh you extensively refer to reinforcement learning on different levels also in in

this paper for many people who are maybe less aware about your work but we interested in reinforcement learning in the context of causality they might be

familiar with the works of Elias SP andb uh what would you say are the main similarities and differences between your work and in Elias's work yeah that's a great question so I'm not

familiar with all of Elias's work but he does have some papers that are quite closely related to the kinds of work that we did in this paper looking at what you can learn from sort of passively imitating a system I would say

the key difference between the work that I'm familiar with from him and our work is that we considered sort of fundamentally a metal learning regime where you don't have a system that's learning about a single problem but you

have a system that's learning how to adapt in new situations to new problems and that's sort of the general distinction between metal Learning Systems they're learning to adapt to learn and new learning how to learn

that's the metal learning part whereas a basic Learning System is just learning about a single structure or a single problem so in L's work there looking at what you can learn by imitating in a

single case and of course there you're really Bound by the fact that this dat the data are observational the key point we make in our paper is that actually if

you're learning about how to learn how to learn by intervening that is something you can learn just by passively imitating what someone else has done and you can actually deploy

that to acquire like more General causal understanding in new situations so I think that is the fundamentally important distinction between those works when we're speaking uh a second

ago you mentioned the systems that maybe could learn in this soft way and then just

use logical symbolic systems as a tool that is uh that is being used in cases where where it can be

useful Antonio damasio in one of his books uh was describing a case of one of his patients he he nicknamed him him

Elliot so Anton demasu is is a neuro sergeon neuroscientist and his patient uh has a particular brain injury that

disconnected lower parts of his BL brain with his prefrontal Contex cortex and this patient was doing

exceptionally well in logical reasoning so in like task like int Ence uh intelligence tests and mathematical reasoning tests he was everywhere there

he was above the average but when it comes to his personal life he was suffering uh he was suffering greatly he he lost his family

one day he went on the street he met a bunch of people somewhere on the on the car parking and started talking about business with them and they asked him to

to to give them his money and he just gave gave them like a huge amount of money like most of his money I I don't remember exactly and all those happened

to a person who was exceptionally intelligent in terms of how we me how we tend to measure intelligence perhaps this story tells us

something important about our own rationality what are your thoughts about rationality of artificial systems and what role those symbolic uh capab

abilities would play in those systems when we think about them from the point of view of rationality and decision making I think that's a really interesting question so I mentioned

earlier this paper on how humans and language models show similar patterns of content entangled logical reasoning and one of the arguments we make in that paper is that there's actually a

rational reason to do this if it helps the system to make better decisions across the distribution of situations that it encounters every day so humans tend to make better logical reasoning

inferences in situations that are sort of more familiar to them which are of course naturally the ones that they tend to encounter more often and similarly uh language models perhaps make better inferences in situations that they

intend to counter encounter in their data distribution so I think one way of framing these results is that well the objective of humans and of language

models isn't to be rational reasoners it's to be sort of a adaptive to the situations we encounter and that might actually mean that it's better to be less rational but to do better on the

kinds of situations that you tend to encounter every day and I think that might be one thing that's playing out in these kind of cases you know humans are sort of adapted to be irrational in selective ways that perhaps make us

better able to handle the kinds of social situations and other things that we encounter and maybe if you're too rational in a logical sense that will actually work against you in certain

cases and so uh there's actually a whole literature on sort of like things that are called bounded rationality for example that try to come up with normative theories of why humans are

irrational based on sort of similar arguments that like overall it helps us to be irrational because like for example we have limited resources and so we can't make the fully perfect rational inferences all the time or because it

helps us to be more accurate in the kinds of situations we encounter every day perhaps also our definition of rationality as as related to formal reasoning and maybe

this example shows this the example from the masio is is just not a useful definition in terms of what is really rational in terms of in terms of decisions that we make in real world in

everyday life so I think like a lot of these approaches that try to come up with a more normative Theory draw in sort of more probabilistic accounts of what's rational and sort of how you would be the best in expectation over a probabilistic distribution and that

tends to lead to slightly different patterns of behavior than you would get from just doing sort of strictly formal logic it seems that when we look at complex

systems dynamical dynamical and complex systems when we try to regularize those

systems overly makes them perfect it might cost us very unpleasant surprises and very uh abrupt surprises later on

when when it systems when the system D Dynamics just explodes in a sense at some point one example of this has been given by by by Nicholas talp in his in his

book about anti fragility when you work with learning systems does this intuition about bounding or or trying to

limit systems in a certain way that leads to detrimental consequences down the road is something that is also like reflected in your work that you find

useful in your work yeah definitely so so I think that we've written a fair amount about uh the sort of idea that instead of building constraints into a

system you should try to learn about those constraints from data and we've demonstrated this in various contexts but I think the general intuition is similar to what you're describing it's

that if you have some idea about how you think a system should approach a certain class of problems like you think oh it should really be a formal logical Reasoner and you try to constrain the

reasoning processes to too strongly that actually is going to make the system more fragile because as soon as there's something weird in the world that doesn't quite match your assumptions the

system will totally break down uh and so I think that the approach that I try to take for example is to use things like these explanation prediction objectives that try to encourage the system to

represent the things we think are important but without overly constraining the internal computations of the system and perhaps that's a way that you can in a softer way influence what the system is doing without putting

too hard of constraints on it but of course I think that there's a trade-off like if you know exactly what the problem is that you're trying to tackle you can build in a really strong inductive bias and you can make the

system do better of course but I think a lot of the cases I'm concerned with are things like you know things that work at scale like how do you build a better language model or how do you build a better agent that can interact in a rich

virtual environment and for those kind of situations I don't think we really know what the right solution is and so when we try to impose something like oh I think there should be a formal

hierarchical grammar here we tend to probably get some things wrong about how you actually need to solve that problem in practice therefore we make the system more brittle and so I guess there's this

idea that's gone around for a while now in the machine Learning Community called The Bitter lesson from Rich Sutton which basically says that you know if you try to build in what you think is the right

way to solve a problem it'll work at small scale but as soon as you scale the system it tends to break break down and I think empirically we've encountered that a lot and so I tend to think that

these sort of softer Solutions like giving explanations or changing the data distribution that you're learning from can be more effective and scalable ways to solve the problem at least in certain

situations in both machine learning research and and causality research uh we sometimes have very interesting

papers that show some very interesting phenomena but those phenomena are are only demonstrated empirically in the

context of a specific set of tasks in contrast when we take those systems to the real world the tasks are usually not that well defined or the distribution of

task is is is is completely unknown what are your thoughts about this discrepancy between the lab and the reality yeah that's a great question so I think it's important to do work in controlled settings where we really understand

what's going on and usually that necessitates doing something that's a little bit more toy because otherwise we can't possibly understand it but I think that the real world is a good forcing

function for encouraging you to not overfit because I think a problem that comes up is that people are designing a a new algorithm for example that they think will work better and then they're

designing the test for it simultaneously and so they sort of test the algorithm on the perfect case that it will perfectly work for and of course their algorithm is better for that particular case but then if it's like well I want

it to work across all the language tasks that you could learn from the internet maybe the assumptions that you made there in designing that system won't hold anymore and so I think that working

in the real world like trying to build image processing systems that can handle any natural image they encounter or trying to build language models that can handle sort of any natural language

query from a user while it's a very hard problem that neither of those is even close to fully solved I would say I think it's a really good way to test your ideas and make sure that they're

really solving the problem and not just solving some toy problem that you designed what would be the main lessons that you learned during your studies in Psychology that help you in

your work today I think there's so many things I mean I I think cognitive scientists have spent a lot of time thinking about some of these causality related issues about things like confounding or different ways that you

could explain the same different underlying processes that could explain the same observations that you make about humans in an experiment and so I think from an experimental design perspective and also just thinking about

doing rigorous statistics on the experiments we perform I think cognitive psychology has a lot to teach machine learning where often they don't even do any statistics it's just like this number is bigger than that other one so

the system must be better so yeah I think experimental design and doing rigorous statistical analyses are two of the main things I've learned um I guess

the the other thing might be just sort of thinking at a more abstract level and trying to bridge from sort of the observations we make to more abstract models that could explain them I think

that's something that's really emphasized in computational cognitive science and uh is maybe a valuable skill for thinking about the kinds of systems that we're working with today you mentioned

statistics uh in the context of of machine learning um you said that sometimes we just say hey this is a bigger number right that that's true unfortunately maybe unfortunately I

don't know um what are your thoughts about statistical learning Theory and how it applies to modern machine Learning

Systems yeah great question so I think you know when you make a theory just like when you make a simplified model you build in some assumptions to it and it's important to consider what those assumptions are in order to understand

how well the theory is going to apply to a new situation so statistical learning theory I think has some assumptions that are perhaps too strong in some cases for

modern machine learning systems or maybe another way of saying it is that we don't quite understand how modern machine Learning Systems fit into statistical learning theory appropriately so one example of this is

that there's been a lot of theory work maybe trying to bridge this Gap in understanding arguing that there's some implicit inductive biases in for example the architectures of our models or the learning processes of gradient descent

or how those things interact that causes the models to effectively be not as overparameterized as you would think of them as being at least at early stages in the training process and I think that

those kind of explanations you know point to a Nuance that's maybe not really captured in at least the most naive versions of statistical learning theory where you sort of assume that like the capacity of the system is sort

of uniformly distributed over all the different ways that it could fit the function maybe that is not really true in practice when you train a system by something like gradient descent for example is the future

causal of course yeah I mean I tend to think that the world is causal and so the future must be

from all you learned in your in your journey what do you think is a good life wow that's an interesting question

I think the the life I enjoy is being able to both engage with sort of intellectual issues like ask questions about causality and intelligence and even like dabble in a bit of philosophy

although I'm not an expert in that not an expert in very many things honestly but also to have sort of a fulfilling life outside of research and I do a lot of rock climbing as I think you mentioned in the introduction and I uh

you know have a lot of friends and my partner Who I Really value our time together and so I think it's uh yeah it's good to have a balance of engaging in intellectual topics but also you know

having a more sort of like physical engagement with the world and having social relationships too I think that's a very important part of what it means to be human who would you like to thank I would like to thank all of my

collaborators at Deep Mind I mean there's so many amazing people there people like Stephanie Chan initiative do scoter who I've worked pretty closely on a lot of this work also my uh Felix Hill

who's been a real mentor to me over the years and um my grad school adviser James molland who has been a huge hero of mine and a real inspiration and it's really helped me to think carefully

about science and statistics and everything else and I'd also like to thank my partner Julie kaser who has been a huge support to me and uh just

tremendously helpful before we conclude um what question would you like to ask me I'd like to ask you where you see the

future intersection of causality and machine learning going like what is your vision for how causality research and machine learning research will come together to make the future more

causal that's a great question um I'm recently thinking a lot about um combining uh the the formalisms the causal

formalisms like P formalism with things like uh differential equations and uh chaos in mathematics dynamical systems and this kind of stuff and I'm also

thinking about something I feel very similar to what what you mentioned before so combining this soft learning with more formalized ways of of of

inference of doing inference and um my intuition is that we in everyday life as humans using more of

soft learn uh soft learn skills than than formal reasoning but we probably also have certain constraints imposed on those

soft soft landar in a soft way uh associations or maybe maybe that maybe there's some structure I don't know maybe there are some constraints that

are enforcing structure on our associative learning and and I agree with you with

what you said before that formal reasoning is something that we need to be trained in and I experienced this firsthand because I studied philosophy

so I studied logic and then more logic and even more logic and then I started learning causality already having experience in machine learning and I

found it deeply counterintuitive um to internalize like things like do calculus for instance so I think these tools are

useful they're not necessarily the main ingredient if and I think this is maybe the main point of of of this answer if

we want to build a system that is humanik and this conditional goes down to something that I think is very fundamental and this is a question that

I think we don't ask ourselves as a machine learning community in general uh often enough and it this is a question about

what are we actually trying to do so do we want humanlike systems that will give us a feeling that we're interacting with something that is similar to us or do we

want to build systems that will be superum in certain regards and then in which regards actually we are interested

in building uh in systems like like this so these are the things I'm thinking about and I must say that this is probably a little bit on the

philosophical end this is a little bit further further away from from practice on the Practical uh on the Practical

end I'm thinking a lot about um providing the community of ways to provide the community with more narratives about causality that make kazal more

accessible to more people because there's a certain learning curve to jump into this things definitely yeah and there's also I think a lot of misconception that

comes from many different angles so we have people uh I see sometimes people from the experimental Community saying things like oh we are doing KB testing and is superior to causal inference

because causal inference has to be from observational data which is not true uh we have people in simulation and they digal twins that are saying hey we're doing simulation why we need actually

any causal inference so I think a unified perspective that shows that interventions or AB tests are a a special case of causal inference and so

are simulations because they are basically some kind of oper operationalization over a over structural causal models is something that we that that could be very

beneficial for the community uh to move forward so so this unified perspective I I found it something fascinating and I work every day

on making building narratives that that could be readable to to to more people regarding this yeah I think that's super valuable I mean I think understanding technical topics is always hard and so I

really admire the work you're doing in the podcast and with your book and everything to try to bring these things in an accessible way to a broader Community I think that's that's super important work um and yeah trying to

trying to arrive a more unified perspective of everything is the grand goal of many Sciences right so yeah that would be awesome not an easy one thank

you so much uh thank you Andrew where can people learn more about you and your your and your team's work well you can check out my website if you just Google my name Andrew lypin or my Twitter I

often post about my work on Twitter and my colleagues work but also just other papers that I find exciting other things in the field that I'm excited about so yeah follow me on Twitter that's probably good good Solution that's great

I have one more question I want to ask you before we before we finish what are two books that change your life the first one I would say is

perhaps a more recent one which is becoming Human by Michael tomasella I think it's a really beautiful book for it's sort of a technical book you know he's a researcher who thinks about

humans and What Makes Us special um and does a lot of comparative work with uh other primates for example and it's it's a beautiful book for thinking about how social interactions from a really early

age are a fundamental part of human experience and that book provided a lot of inspiration for some of the ideas we talked about in a paper called symbolic behavior that we had a few years back um thinking about how social interactions

might be really important for building more natural artificial intelligence for a second book oh man there's so many it's hard to pick just one I think one

of my this is a bit of a a trite answer but one of my favorite books ever is uh ulyses by James Joyce and I think that one of the things I really like about

that book is the way that it plays the sort of style of the writing and the content together with the narrative in different sections of the book and I actually think that like this bridging

between different levels of abstraction right what is just the superficial style what is the content what is the overall underlying narrative is actually quite similar to some of the themes I'm interested in in my research like how do

you bridge between different levels of causal reasoning how do you bridge uh in my dissertation I was interested in bridging between sort of like reasoning within a task and reasoning a cross

tasks and things like that so I think this perhaps I'm totally just re retrospectively attributing this to myself but perhaps some of the ways I thought about like literature when

reading ulyses and other books like that uh percolated into the way that I think about more complex formal systems that I reason about in my everyday work thank you so much for the conversation it was

a pleasure Andrew thank you very much

Loading...

Loading video analysis...