On Scientific Understanding With Artificial Intelligence: Mario - Krenn@mpl - Mpg.de

On scientific understanding with artificial intelligence
Mario Krenn,1, 2, 3, 4, ∗ Robert Pollice,2, 3 Si Yue Guo,2 Matteo Aldeghi,2, 3, 4 Alba

Cervera-Lierta,2, 3 Pascal Friederich,2, 3, 5 Gabriel dos Passos Gomes,2, 3 Florian Häse,2, 3, 4, 6
Adrian Jinich,7 AkshatKumar Nigam,2, 3 Zhenpeng Yao,2, 8, 9, 10 and Alán Aspuru-Guzik2, 3, 4, 11, †
1
Max Planck Institute for the Science of Light (MPL), Erlangen, Germany.
2
Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Canada.
3
Department of Computer Science, University of Toronto, Canada.
4
Vector Institute for Artificial Intelligence, Toronto, Canada.
5
Institute of Nanotechnology, Karlsruhe Institute of Technology, Eggenstein-Leopoldshafen, Germany.
6
Department of Chemistry and Chemical Biology, Harvard University, Cambridge, USA.
7
Division of Infectious Diseases, Weill Department of Medicine,
Weill-Cornell Medical College, New York, USA.
8
Center of Hydrogen Science, Shanghai Jiao Tong University,
800 Dongchuan Road, Shanghai 200240, China.
arXiv:2204.01467v1 [cs.CY] 4 Apr 2022
9
The State Key Laboratory of Metal Matrix Composites,
School of Materials Science and Engineering, Shanghai Jiao Tong University,
800 Dongchuan Road, Shanghai 200240, China.
10
Innovation Center for Future Materials, Zhangjiang Institute for Advanced Study,
Shanghai Jiao Tong University, 429 Zhangheng Road, Shanghai 201203, China.
11
Canadian Institute for Advanced Research (CIFAR) Lebovic Fellow, Toronto, Canada.
Imagine an oracle that correctly predicts the outcome of every particle physics experiment, the
products of every chemical reaction, or the function of every protein. Such an oracle would revolu-
tionize science and technology as we know them. However, as scientists, we would not be satisfied
with the oracle itself. We want more. We want to comprehend how the oracle conceived these
predictions. This feat, denoted as scientific understanding, has frequently been recognized as the
essential aim of science. Now, the ever-growing power of computers and artificial intelligence poses
one ultimate question: How can advanced artificial systems contribute to scientific understanding
or achieve it autonomously?
We are convinced that this is not a mere technical question but lies at the core of science. There-
fore, here we set out to answer where we are and where we can go from here. We first seek advice
from the philosophy of science to understand scientific understanding. Then we review the current
state of the art, both from literature and by collecting dozens of anecdotes from scientists about how
they acquired new conceptual understanding with the help of computers. Those combined insights
help us to define three dimensions of android-assisted scientific understanding: The android as a I)
computational microscope, II) resource of inspiration and the ultimate, not yet existent III) agent
of understanding. For each dimension, we explain new avenues to push beyond the status quo and
unleash the full power of artificial intelligence’s contribution to the central aim of science. We hope
our perspective inspires and focuses research towards androids that get new scientific understanding
and ultimately bring us closer to true artificial scientists.
I. INTRODUCTION distinguish between this and communicating with a

human expert” [4].
Artificial Intelligence (A.I.) has recently been called However, this excitement has not been shared
a “new tool in the box for scientists”[1] and that “ma- among all scientists. Specifically, it has been ques-
chine learning with artificial networks is revolutioniz- tioned whether advanced computational approaches
ing science“[2]. Additionally, it has been conjectured can go beyond numerics [5–9] and contribute funda-
“that machines could have a significantly more cre- mentally to one of the essential aims of science, that
ative role in future research.” [3]. For instance, it has is, gaining of new scientific understanding [10–12].
even been postulated that “[t]he new goal of theoret- In this work, we address how artificial systems can
ical chemistry should be that of providing access to contribute to scientific understanding – specifically,
a chemical ’oracle’: an A.I. environment which can what is the state-of-the-art and how we can push fur-
help humans solve problems, associated with the fun- ther. Besides a thorough literature review, we sur-
damental chemical questions of the fourth industrial veyed dozens of scientists at the interface of biol-
revolution [...], in a way such that the human cannot ogy, chemistry or physics on the one hand, and arti-
ficial intelligence and advanced computational meth-
ods. These personal narratives focus on the concrete
discovery process of ideas and are a vital augmenta-
∗ mario.krenn@mpl.mpg.de tion to the scientific literature. We put the literature
† alan@aspuru.com and personal accounts in the context of a philosophi-
2
Figure 1. How can Androids contribute to new scientific understanding? In addition to scientific literature,
we take inspiration from the philosophy of science and from dozens of stories provided by active computational natural
scientists. Thereby we identify three fundamental dimensions of computer-assisted scientific understanding. From there,
we look into the future and develop a roadmap on how to develop Androids that can contribute to understanding – the
essential aim of science.
cal theory of Scientific Understanding recently devel- III) Lastly, in an ultimate dimension of android-
oped by Dennis Dieks and Henk de Regt [12, 13], who assisted scientific understanding, computers are
was awarded the Lakatos Award in 2019 for the de- the agents of understanding. While we have not
velopment of this theory. We thereby introduce three found any evidence of computers acting as true
fundamental dimensions for scientific androids1 con- agents of understanding in science yet, we out-
tribution towards new scientific understanding: line important characteristics of such an artifi-
I) Androids acting as a microscope in the re- cial system of the future and potential ways to
sponses, i.e., akin to an instrument revealing achieve it.
properties of a physical system that are other- In the first two dimensions, the android enables hu-
wise difficult or even impossible to probe. Hu- mans to gain new scientific understanding while in the
mans then lift these insights to scientific under- last one the machine gains understanding itself. These
standing. classes enable us to layout a vibrant and mostly unex-
II) Androids acting as muses, i.e., sources of inspi- plored field of research, which will hopefully manifest
ration for new concepts and ideas that are sub- itself as a guiding star for future developments of ar-
sequently understood and generalized by human tificial intelligence in the natural sciences.
scientists. The goal of this perspective is to put Scientific
Understanding back to the limelight – where we are
convinced it belongs. We hope to inspire physicists,
1 We encapsulate all advanced artificial computational systems chemists and biologists and A.I. researchers to go be-
under androids, independent of their working principles. In yond the status quo, focus on these central aims of
this way, we are focusing on the operational objective rather science, and revolutionize computer-assisted scientific
than the methodology. understanding. In that way, we believe that androids
3
will become true agents of understanding that con- Concretely, de Regt and Dieks define two inter-
tribute to science in a fundamental and creative way. linked criteria:
1. Criterion of Understanding Phenomena:

A phenomenon P can be understood if a theory
II. SCIENTIFIC UNDERSTANDING
T of P exists that is intelligible.
Let us imagine an oracle providing non-trivial pre- 2. Criterion for the Intelligibility of Theo-
dictions that are always true. While such a hypo- ries: A scientific theory T is intelligible for sci-
thetical system would have a very significant scien- entists (in context C) if they can recognise qual-
tific impact, scientists would not be satisfied. We itatively characteristic consequences of T with-
want “to be able to grasp how the predictions are gen- out performing exact calculations.
erated, and to develop a feeling for the consequences
in concrete situations” [13]. Colloquially, we refer to We decided to use this specific theory because of
this goal as “understanding” – But what does that one particular strength: We can use it experimen-
really mean? Can we find criteria for scientific under- tally to evaluate whether scientists have understood
standing? To do that, we seek guidance from the field new concepts or ideas, rather than by inspecting their
of philosophy of science. Notably, while hardly any methodology, by simply looking at the scientific out-
scientist would argue against “understanding” as an come and the consequences. This also coincides with
essential aim of science (next to explanation, descrip- Angelika Potochnik’s argument that “understanding
tion and prediction [14]), this view was not always requires successful mastery, in some sense, of the tar-
accepted by philosophers. Specifically, Carl Hempel, get of understanding”[11]. We will follow this ap-
who made foundational contributions clarifying the proach and, consequently, here explore its relationship
meaning of scientific explanation, argued that “un- to the role of A.I. in science. Accordingly, we believe
derstanding” is subjective and merely a psychologi- we can significantly advance A.I.’s contribution to this
cal by-product of scientific activity and is therefore central aim of Science if we have a clear picture of
not relevant for the philosophy of science [15]. Other how scientists gain conceptual understanding, and in-
philosophers criticized these rather unsatisfying con- stil it to artificial systems afterwards. We approach
clusions, and they tried to formalize what scientific this goal by applying ideas of de Regt and Dieks di-
understanding means. Proposals include that under- rectly to android assisted science (and ultimately, to
standing is connected to the ability to build causal android scientists themselves).
models (Lord Kelvin said “It seems to me that the
test of ’Do we or not understand a particular subject
in physics?’ is, ’Can we make a mechanical model
III. WHAT IS NEXT?
of it?’ ”[13]), connected to providing visualizations
(or Anschaulichkeit, as its strong proponent Erwin
Schrödinger called it[16, 17]) or that understanding A. Beyond Re-Discovery
corresponds to providing unification [18, 19].
In recent years, Henk de Regt and Dennis Dieks In recent years, scientists at the interface between
have developed a new theory of scientific understand- A.I. and the natural sciences tried to rediscover scien-
ing, which is both contextual and pragmatic [12–14]. tific laws or concepts with machines. The question is,
Importantly, they find that techniques such as visu- however, whether an android is capable of contribut-
alization or unification are “tools for understanding”, ing to new scientific understanding if it can rediscovers
thereby unifying previous ideas in one general frame- physical laws and concepts, such as the heliocentric
work. Their theory is agnostic to the specific “tool” world view [21], the arrow of time [22] or mechani-
being used, making it particularly useful for applica- cal equations of motions [23]? We believe that this
tion in scientific disciplines. They extend crucial in- is not guaranteed. The human creators of these an-
sights by Werner Heisenberg [20] and rather than in- droids know what they are looking for in these case
troducing mere theoretical or hypothetical ideas, the studies. Therefore, it is unclear how both conscious
main motivation behind their theory is that a “satis- and unconscious biases (in the broadest sense, e.g., by
factory conception of scientific understanding should choosing particular representations) in the code or the
reflect the actual (contemporary and historical) prac- data analysis can be prevented. Consequently, even if
tice of Science”. Put simply, they argue that: an algorithm can rediscover interesting physical phe-
nomena, we cannot know whether and how they can
A phenomenon P can be understood if there be used to advance Science by helping to uncover new
exists an intelligible theory T of P such that scientific understanding.
scientists can recognise qualitatively charac- Hence, we believe we need to go beyond rediscovery
teristic consequences of T without performing tasks. Therefore we focus explicitly on the question
exact calculations [12, 13]. of how to get new scientific understanding.
4
B. Beyond Discovery In those two classes, the human scientist is essential

to take the new insight and inspiration and develop it
Importantly, other central aims of science such as to full understanding. Finally, an android can be
prediction and discovery can lead to scientific and
technological disruptions while not directly contribut- III) an agent of understanding, replacing the hu-
ing to scientific understanding as discussed above man in generalizing observations and transfer-
[11, 14]. For instance, imagine the hypothetical dis- ring scientific concepts to new phenomena.
covery of the hitherto best material for energy stor- We stress that these three classes should not be un-
age that could revolutionize batteries. However, this derstood dogmatically but rather guide future possi-
game-changing discovery would not qualify as under- bilities. In the following sections, based on concrete
standing if chemists could not use the underlying prin- examples, we discuss each class in more detail and pro-
ciples fruitfully in other contexts (without computa- pose avenues for pushing the boundaries of the current
tion). computational faculties.
Similarly, the recent breakthrough in protein fold-
ing will undoubtedly change the landscape of biochem-
istry. However, so far, AlphaFold is a black box – an A. Computational microscope for scientific
oracle[24, 25]. As such it does not directly provide understanding
new scientific understanding in the sense of de Regt
and Dieks (but could of course in the future enable Microscopes are devices that enable us to inves-
humans to gain new scientific understanding). Hence, tigate objects and phenomena imperceptible to the
we believe we must go beyond artificial discoveries in naked eye. In a similar way, computational micro-
science. scopes enable the investigation of objects or processes
that we cannot visualize or probe in any other way.
C. Where to go from here?
One main objective is to simulate biological, chemical
or physical processes that happen at length and time
scales not perceivable by experiment.
The ultimate goal is to get new understanding from As we are interested in understanding, the new
androids. Loosely speaking, we want to find new ideas computer-generated data needs to be generalized to
or concepts that we can apply and use in different other contexts without complete computation[13]. We
situations without (complete) computations. show now two concrete examples.
This article aims to explain precisely what such The first example is molecular dynamics simulations
a goal requires, what previous approaches have of the SARS-CoV-2. The authors uncovered new bio-
achieved, and how we can go further. We want to logical functions that show different behaviours in the
clearly lay out this underappreciated but essential re- open and closed conformations of the spike protein.
search question and thereby give a clear goal for the This explanation changed the view upon glycans in
future of A.I. in the natural sciences. biological systems and inspired new ways to analyze
these systems without the need to perform full com-
putations [26].
IV. THREE DIMENSIONS OF
COMPUTER-ASSISTED UNDERSTANDING
In the second example, the authors describe how
molecular dynamics simulations helped to uncover
fundamental patterns called glycoblocks. The sys-
We use scientific literature and personal anecdotes tematic use of glycoblocks can both be used to un-
of dozens of scientists, and the context of the philos- derstand sequence-structure-property relationships of
ophy of science, to introduce a new classification of biomolecules and can also inform the design of syn-
androids contribution to scientific understanding2 . It thetic structures with desired functions without the
helps to see diverse unexplored journeys that can be need for simulating the entire system [27].
investigated in the future.
An android can act
I) as a computational microscope, providing in- The next computational microscope
formation not (yet) attainable by experiment
A computational microscope aims to provide data
II) as a resource of inspiration or an artificial via computation that are not (yet) accessible by ex-
muse, expanding the scope of human imagina- periments that humans can understand. How could
tion and creativity. we make computational microscopes even more in-
sightful and make it easier for human scientists to use
this data to gain scientific understanding? There are
2 Wecall the classification dimensions, as they are independent two vibrant directions going forwards. First, more ad-
and non-exclusive. vanced computational systems will allow to analyze of
5
Figure 2. The future computational microscope. We envision two types of advances in the next-generation
computational microscopes which aim to advance genuine scientific understanding. First (left), larger and more complex
computations will allow the computational observation of phenomena not accessible so far. There, new computational
paradigms will play a significant role, such as Graphical Processor Units (GPU), Tensor Processor Units (TPU), Optical
Processor Units (OPU) and – ultimately – quantum computers. Second (right), new ways to represent the highly complex
data will advance our ability to sense structure and recognize underlying patterns. The involvement of all our senses
could, for sensing computer-generated data, be an exciting pathway to advanced understanding.
more complex physical systems. Second, representing to advance scientific understanding. In addition, we
the information in a more interpretable way will help expect that going beyond the visual sense can open
to lift the indications from computers to true scientific entirely new ways to experience scientific data. For
understanding. example, the auditory sense is excellent in detecting
structure or symmetries in (periodic) time-dependent
More Complex Systems – One obvious but nev-
data [40]. Furthermore, including the sense of touch,
ertheless important research direction is increasing
smell and taste could further expand the horizon of
the complexity as well as the accuracy of computer
experiences. We expect that in order to realize that,
simulations[28]. For example, increasing the size of
physical scientists need to work closely together with
the systems, the time-scale of the simulations, the
psychologists and neurologists (and potentially even
number of interactions that can be modelled will sig-
with artists), to develop suitable data representations
nificantly increase the applicability in complex dy-
that can be efficiently recognized by scientists with
namic systems. In general, this can be achieved
all their senses. An ultimate, admittedly futuristic
by either algorithmic improvements or hardware im-
version of a computational microscope could circum-
provements, or both. In that regard, we expect that
vent the receptors of human senses and instead use
modern neural network technologies together with ad-
a computer-brain interface to enhance further experi-
vanced hardware such as GPUs, TPUs or even OPUs
encing computed data.
[29, 30] will have an enormous impact. Furthermore,
the recent progress in experimental quantum com-
puting for quantum chemistry [31] and physics[32–34] B. Resource of inspiration for scientific
promises that entirely new algorithms, based on quan- understanding
tum mechanics itself, will play an important role in
this area [35, 36]. Algorithmic improvements could in-
Surprising and creative ideas are the foundation of
volve adaptive and intelligent resolution during simu-
Science. Computer algorithms are a means to provoke
lation and advanced visualization methods[13], which
such ideas systematically, thereby significantly accel-
directly leads to the second future techniques:
erating scientific and technological progress. Already
Full spectrum of senses – We believe that human 70 years ago, Alan Turing realized that computers
scientists can get more out of data if the full capabil- could surprise their human creators: “Machines take
ities of all our senses are addressed. At the moment, me by surprise with great frequency. This is largely be-
we analyze data largely in (potentially animated) 2- cause I do not do sufficient calculation to decide what
dimensional pictures. As a first step, we believe that to expect them to do, or rather because, although I
real 3D environments (realized either via virtual or do a calculation, I do it in a hurried, slipshod fash-
augmented reality glasses, or holography) will signifi- ion, taking risks.” and ”Naturally I am often wrong,
cantly help in understanding complex systems or com- and the result is a surprise for me for by the time
plex data. Initial advances in that regard have been the experiment is done these assumptions have been
demonstrated in the domain of chemistry [37–39], and forgotten“ [41].
we expect this to become a standard tool for scientists A much more recent study provides stories by
6
dozens of researchers of artificial life and evolution. works into the direction of autonomous anomaly de-
They demonstrate in an impressive way how computer tection have been applied on scientific data from the
algorithms can surprise their human creators and lead Large Hadron Collider (LHC) at CERN [51–53]. Such
to behaviour that the authors would denote as cre- techniques have the potential to identify new physics
ative[42]. Accordingly, we believe that androids can signatures, which can then be conceptualized and un-
be artificial muses of Science in a metaphorical sense. derstood by human physicists [54, 55]. Neural net-
Those examples demonstrate that computers can works that autonomously discover symmetries could
indeed be used as a source of surprises. But what are become an efficient discovery tool for outliers in sci-
the most general ways to get inspirations from com- entific data where the underlying rules might not be
puters? And how can they be lifted by humans to true known beforehand [56, 57].
scientific understanding? We will outline a number of Estimating the confidence of predictions will be one
ways to develop ways to provoke surprising behaviours method to directly search for anomalies in data [58].
of algorithms and use their solutions, internal or exter- The ability to uncover hidden regularities was very
nal states as a source of inspiration for new scientific recently demonstrated in mathematics, where an A.I.
ideas. hinted on relations between previously unconnected
invariants in knot theory, which allowed mathemati-
cians to conjecture and prove new theorems [59]. Al-
The future resource of inspiration ternatively, an A.I. capable of constructing new scien-
tific hypotheses could uncover outliers or unexpected
Identifying surprises in data – Exceptional data patterns that are not discernible with standard statis-
points or unexpected regularities obtained from ex- tical methods.
periments or simulations can surprise human scien- It would be truly exciting to see an A.I. uncover hid-
tists and inspire new ideas and concepts. Our survey den patterns or irregularities in scientific data previ-
shows that these exceptional points are usually iden- ously overlooked by humans, which leads to new ideas
tified by humans, such as the following two examples, and, ultimately, to new conceptual understanding. As
which use high-throughput computations in chemistry of now, we are not aware of cases like that.
[43] and quantum optics [44, 45]. The data points for these systems could be ob-
The first example deals with a surprising phase of tained from computational methods (involving those
crystal structures in high-pressure physics. There, the described in section IV A), with exciting opportunities
authors found an unexpected stable configurations of for mathematics or theoretical physics [60]. Alterna-
alternating NH2 and NH4 layers, rather than a dense tively, the data could be obtained directly from exper-
NH3 phase. The authors conceptualized this phe- iments. Here we can imagine a closed-loop approach
nomenon as spontaneous ionization, a common pro- where an algorithm tries to explore the environment
cess in acid-base chemistry, which is now a widely and steer the exploration into unexpected regions. If
accepted phenomenon in the high-pressure phase di- the data-source is an experiment, this future system
agram of NH3 . Spontaneous ionization in the high- will require access to complex lab automation with
pressure behaviour of matter has become a more gen- large parameter spaces to explore, as demonstrated
eral principle that can be used without performing any recently in biology [61], chemistry [62–67] or physics
simulations [46]. [68, 69].
In the second example, a search for new quan- Identifying surprises in the scientific literature –
tum experiments uncovered a solution with consid- The number of scientific papers in essentially every
erable larger quantum entanglement than expected. scientific domain is growing enormously[70]. Conse-
The authors understood the underlying principles and quently, researchers have to specialise in narrow sub-
thereby discovered a new concept of entanglement disciplines, which makes finding new interdisciplinary
generation [47, 48]. The principle can be used without ideas difficult. In the future, we believe that comput-
any computation and, for example, acts now as a new ers will be able to use the scientific literature in an
representation in more advanced artificial intelligence automated way[71–74] and identify exceptional and
systems for quantum physics [49], demonstrating the surprising phenomena for further investigation. While
application of the computer-inspired idea in more gen- the large-scale automated analysis of the scientific lit-
eral different contexts. erature, to our knowledge, has not yet been able to
In contrast to these examples and many others from induce new scientific understanding, there is signifi-
literature and from personal accounts, the anomalies cant progress in the field. One promising approach
could manifest themselves in a more involved com- towards this goal is unsupervised word embedding of
bination of variables, which might be very difficult a large corpus of scientific papers. In that technique,
for humans to grasp. Accordingly, applying advanced the content of the scientific literature is transformed
statistical methods and machine learning algorithms into a high-dimensional vector space. Recently, this
(e.g., see reference [50]) to this type of problem will technique has been applied in the domain of mate-
be an important future research direction. Exciting rial science [75] and rediscovered central scientific con-
7
Figure 3. The future re-source of inspiration: An android could act as a computational muse and inspire the
human scientist by (A) identifying surprises in data, (B) identifying surprises in the scientific literature, (C) finding
surprising concepts by inspecting models or (D) by probing the behaviour of artificial agents or (E) my finding new
concepts from interpretable solutions.
cepts such as the periodic table of the elements. Ad- and probe its behaviour. Recently, this approach has
ditionally, the results also suggested the existence of been applied to rediscover thermodynamical proper-
previously undiscovered structure-property relation- ties [22], and design principles for functional molecules
ships. Examples include new candidates for thermo- [89]. An alternative and remarkable application is the
electric materials. Moreover, several other advanced disentanglement of variables in neural networks [90].
computational techniques are being developed in ma- The goal is to understand the internal representation
terial science to extract knowledge from the scien- the neural network has learned. Recently, astronom-
tific literature and investigate it systematically by A.I. ical data, represented in geocentric coordinates, was
technologies[76], and can lead to complex scientific used to train a neural network and disentanglement
conclusions as demonstrated for instance on zeolite of variables enabled the rediscovery of heliocentric co-
transformations [77]. ordinates via the internal representation of the model
An alternative approach aims to build semantic [21]. In a related study, using gradient boosting with
knowledge networks from large bodies of scientific lit- decision trees, feature importance has been used to
erature. In these networks, scientific concepts are explain properties of molecules, and quantum optics
nodes, and edges carry relational information. In the circuits [91]. Related to this is a study where the inter-
simplest case, that means two scientific concepts are nal representation of an unsupervised deep generative
mentioned in the same scientific paper [78, 79]. Thus, model for quantum experiments has been inspected
scientific knowledge is represented as an evolving net- to understand the model’s internal worldview[92]. In
work, which can be used to identify both islands and the chemical domain, counterfactual explanations for
unexplored regions of the scientific literature. This machine learning models have been demonstrated to
type of network was used in biochemistry to identify produce rationale behind a model’s prediction. Coun-
efficient global research strategies [78] and in quantum terfactual explanations illustrate what differences to
physics to predict and suggest future research direc- an event or instance would generate a change in an
tions [79]. Advances in A.I. technology could improve outcome. Wellawatte et al. [93] showed how this can
this type of system significantly. For example, natural be achieved in a model-independent way (it has been
language processing architectures such as BERT [80], demonstrated for random forest, sequence models and
or GPT3 [81] could help extract more scientific knowl- graph neural networks), indicating great future poten-
edge from research papers, and large graph-based neu- tials for opening the black-box of AI in science. Albeit
ral networks could improve the prediction of new re- not in science, recent work has investigated what the
search topics from semantic networks [82]. chess-playing A.I AlphaZero has learned about chess
Surprising concepts by inspecting models – We also and how human-like knowledge is encoded in the in-
expect considerable progress by rationalising what ternal representation [94]. The concepts rediscovered
A.I. algorithms have learned in order to solve a specific in all of those works were not new, and thus the most
problem, i.e., explainable or interpretable A.I. [83–86]. important challenge for the future is to learn how to
One idea towards this goal is inspired by DeepDream- extract previously unknown concepts. Progress to-
ing, a method first used in computer vision [87, 88]. wards resolving that challenge will be essential in the
Put simply, the idea is to invert a neural network near future to inspire new scientific ideas.
8
New concepts from interpretable solutions – Rather scientists can observe how they navigate this space.
than getting inspirations from the A.I. algorithms Instead of following a strict external reward, e.g., max-
themselves, scientists can also be surprised by the cor- imise a specific property of a physical system, intrinsic
responding solutions. When solutions are represented rewards such as artificial curiosity can be implemented
in an interpretable way, they can provoke new ideas [103, 104]. Instead of maximizing directly some func-
and lead to new concepts. An example of interpretable tions, the artificial agent tries to learn and predict
representations is a mathematical formula. Thus, the behaviour of the environment. It then chooses
scientists can inspect formulae derived by computer actions that lead to situations it cannot predict well,
algorithms to solve mathematical problems directly thus maximizing its own understanding of the envi-
and derive more general solution strategies. Several ronment. It has been shown using curious agents in
publications demonstrated extracting symbolic mod- simulated virtual universes [105] and robot agents in
els from experimental data of mechanical systems real laboratories [67] that curiosity is an efficient ex-
[23, 95], of quantum systems [96] and in astronomy ploration strategy. Alternative intrinsic rewards for
[97]. It will be exciting to see how these approaches, artificial agents are computational creativity[106, 107]
e.g., combined with methods such as causal inference and surprise [108]. These intrinsic rewards can pro-
[98], can be improved to propose reasonable physical duce exceptional and unexpected solutions, ultimately
models of unknown systems that advance scientific un- inspiring human scientists.
derstanding. Altogether, exciting advances have been
achieved in the field of mathematics[99, 100], and we
foresee similar approaches making a significant impact C. Agent of Understanding
in the physical sciences as well.
One concrete, recent example in astronomy is the The third and final class we consider are algorithms
rediscovery of Newton’s law of gravitation from real- that can autonomously acquire new scientific under-
world observational data of planets and moons in our standing, a feat that has neither been described by
solar system from the last 30 years [101]. The appli- the respondents of our survey nor in the scientific lit-
cation of graph neural networks allowed for the high- erature. Therefore, we will approach this class by list-
quality prediction of the object’s motion. Further- ing the requirements of these agents, proposing tests
more, a symbolic regression technique called PySR to detect their successful realization and speculating
(introduced in [97]) was able to extract reasonable what such computer programs could look like.
mathematical expressions for the learned behaviour.
First, it is important to realize that finding new
Interestingly, besides the equations of motions, the
scientific understanding is context-dependent. What
method simultaneously predicts the masses of the
is new depends on whether we consider an individ-
planetary objects correctly. The technique required
ual scientist and their field of expertise, a scientific
the assumption of several symmetries and other phys-
domain, the whole scientific community or even the
ical laws. It will be interesting to see whether these
entire scientific endeavour throughout history. Hence,
prerequisites can be reduced further and how related
true agents of understanding must be able to evaluate
approaches can be applied to modern physics ques-
whether an insight is new, at least in the context of
tions.
a specific scientific domain that requires access to the
Another concrete example of this methodology has knowledge of a scientific field.
been showcased in the field of quantum optics [49]. Secondly, de Regt emphasized the importance of
There, an A.I. algorithm with a graph-theoretical rep- underlying scientific theories that allow us to recog-
resentation of quantum optical setups designs config- nize qualitatively characteristic consequences [12]. It
urations for previously unknown quantum systems. is not enough to simply interpolate data points or pre-
The final solutions were represented in a physically- dict new ones using advanced statistical methods such
interpretable graph-theoretical representation. From as machine learning. Thus, even though such meth-
there, human scientists can quickly interpret the un- ods can approximate complex and expensive compu-
derlying reasons why the solutions work and apply tations, naı̈ve applications of neural networks cannot
it in other contexts without further computation. be agents of understanding. Scientific understanding
Accordingly, developing interpretable representations requires more than mere calculation. To illustrate this
and methods to extract underlying concepts in other point even further, let us consider one concrete exam-
domains will be an important future research direc- ple in quantum physics from the literature: A com-
tion. putational method solved an open question about the
Probing the behaviour of artificial agents – Another generation of important resource states for quantum
only rarely explored opportunity is interpreting the computing. Then it extracted the conceptual core of
behaviour of machines when tasked to solve a scien- the solution in the form of a new quantum interfer-
tific problem [102]. Algorithms that take actions such ence effect in such a fashion that human scientists
as genetic algorithms or reinforcement learning agents can both understand the results and apply the ac-
adopt policies to navigate the problem space. Human quired understanding in different contexts [49]. Even
9
if the computer itself was able to apply the conceptual This implies that humans need to understand the
core to other situations, it would not be a priori clear new concepts that androids devised. If a machine
whether the computer truly acquired scientific under- truly understands something, it will be able to ex-
standing. What is still missing is an explanation of plain it and transfer the understanding to someone
the discovered technique in the context of a scientific else.4 We believe that this should always be possible,
theory. In this particular example, the android and even if the understanding is far beyond what human
the human scientist would need to recognize the un- experts know at this point. We envision that com-
derlying quantum interference in the context of the puters will use advanced human-computer interaction
theory of quantum physics. Thus, we can propose the techniques together with the tools we described for
first sufficient condition for agents of understanding: (future) computational microscopes.
Additionally, scientific discussions between a hu-
Condition for Scientific Understanding I: man and a computer could be realized using advanced
An android gained scientific understanding if it queries in natural language processing tools such as
can recognize qualitatively characteristic conse- BIRD [80] or GPT-3 [81]. That way, the scientist
quences of a theory without performing exact could probe the computer with scientific questions.
computations and use them in a new context. Suppose the scientist gains new scientific understand-
This condition closely follows the ideas of de Regt ing by communicating with the algorithm, as judged
and Dieks [13]. Let us go one step further and imagine by our scientific understanding test. In that case, they
that there is an android capable of explaining discov- can confirm that the computer truly acquired under-
eries in the context of scientific theories. How could standing.5 We are optimistic that more efforts will
human scientists recognize that the machine acquired be directed at developing the necessary technologies,
new scientific understanding? We argue that human which will lead to ever more convincing demonstra-
scientists would do it in the exact same way they can tions of android scientists acting as true agents of un-
recognize that other human scientists acquired new derstanding in the future.
scientific understanding. That is, let the other human
scientists transfer the newly acquired understanding
to themselves. This allows us to propose the second V. CONCLUSION
sufficient condition for agents of understanding:
Undoubtedly, advanced computational methods in
Condition for Scientific Understanding II: general and artificial intelligence specifically will fur-
An android gained scientific understanding if it ther revolutionize how scientists investigate the se-
can transfer its understanding to a human ex- crets of our world. We outline how these new methods
pert. can directly contribute to one of the main aims of sci-
We argue that one can only recognize indirectly ence, namely acquiring new scientific understanding.
whether a computer (or human) has gained scientific We suspect that significant future progress in the use
understanding. Therefore, finally, we propose a test of androids to acquire scientific understanding will re-
in the spirit of the Turing test [41] or the Feigenbaum quire multidisciplinary collaborations between natu-
test[109] (or adaptations thereof in the natural sci- ral scientists, computer scientists and philosophers of
ences such as the Chemical Turing Test or the Feyn- science. Thus, we firmly believe that these research
man Test [4]): efforts can – within our lifetimes – transform androids
into true agents of understanding that will directly
The Scientific Understanding Test: contribute to one of the most essential aims of sci-
A human (the student) interacts with a ence, namely Scientific Understanding.
teacher, either a human or an android sci-
entist. The teacher’s goal is to explain a sci-
entific theory and its qualitative, characteris- ACKNOWLEDGEMENTS
tic consequences to the student. Another hu-
man (the referee) tests both the student and The authors thank Anastassia Alexandrova, Rom-
the teacher independently3 . If the referee can- mie Amaro, Curtis Berlinguette, Lillian Chong,
not distinguish between the qualities of their Gerardo Cisneros, Andy Cooper, Graeme Day,
non-trivial explanations in various contexts,
we argue that the teacher has scientific un-
derstanding.
4 We leave aside the question whether the explanation of the
android is true or false. It has been argued that also false
theories can lead to genuine understanding [110].
3 Inprinciple, there is no reason for the student or the referee not 5 We would like to point out that our test, like the ones originated
to be androids. However, to keep the test as simple as possible, by Turing and Feigenbaum, are not clear-cut, leaving room for
we want to keep the number of possible variations small. situations that do not allow a clear judgement.
10
Francois-Xavier Coudert, Lee Cronin, Elisa Fadda, Lilienfeld and Eva Zurek for answering our questions
Rafael Gomez-Bombarelli, Leticia Gonzalez, Johannes on understanding, Xuemei Gu for Figure 2 and 3,
Hachmann, Roald Hoffmann, Jan Halborg Jensen, and Nora Tischler and Robert Fickler for valueable
Erin R. Johnson, Lynn Kamerlin, Heather J. Ku- comments on the manuscript. A.A.-G. and his group
lik, Jean-Paul Malrieu, Anat Milo, Frank Noe, acknowledge generous support from the Canada 150
Jens Kehlet Nørskov, Artem Oganov, Juan Perez- Research Chairs Program, the University of Toronto,
Mercader, Chris Pickard, Markus Reiher, Jean-Louis and Anders G. Frøseth. M.K. acknowledges support
Reymond, Dennis Salahub, Stefano Sanvito, Franziska from the FWF (Austrian Science Fund) via the Erwin
Schoenebeck, Ilja Siepmann, Alex Sodt, Isaac Tam- Schrödinger fellowship No. J4309. R.P. acknowledges
blyn, Donald Truhlar, Alexandre Tkatchenko, Koji funding through a Postdoc.Mobility fellowship by the
Tsuda, Alexandre Varnek, Tejs Vegge, Anatole von Swiss National Science Foundation (SNSF, Project
No. 191127).
[1] Lenka Zdeborová, “New tool in the box,” Nature [15] Carl G Hempel, Aspects of scientific explanation
Physics 13, 420–421 (2017). (Free Press New York, 1965).
[2] Thomas Fösel, Petru Tighineanu, Talitha Weiss, [16] Erwin Schrödinger, ’Nature and the Greeks’ and ’Sci-
and Florian Marquardt, “Reinforcement learning ence and Humanism’ (Cambridge University Press,
with neural networks for quantum feedback,” Phys- 1996).
ical Review X 8, 031084 (2018). [17] Henk W De Regt, “Visualization as a tool for un-
[3] Alexey A Melnikov, Hendrik Poulsen Nautrup, derstanding,” Perspectives on science 22, 377–396
Mario Krenn, Vedran Dunjko, Markus Tiersch, An- (2014).
ton Zeilinger, and Hans J Briegel, “Active learning [18] Michael Friedman, “Explanation and scientific un-
machine learns to create new quantum experiments,” derstanding,” The Journal of Philosophy 71, 5–19
Proceedings of the National Academy of Sciences (1974).
115, 1221–1226 (2018). [19] Philip Kitcher, “Explanatory unification,” Philoso-
[4] Alán Aspuru-Guzik, Roland Lindh, and Markus phy of science 48, 507–531 (1981).
Reiher, “The matter simulation (r)evolution,” ACS [20] Werner Heisenberg, “über den anschaulichen inhalt
central science 4, 144–152 (2018). der quantentheoretischen kinematik und mechanik,”
[5] Roald Hoffmann and Jean-Paul Malrieu, “Simula- Zeitschrift für Physik 43, 172–198 (1927).
tion vs. understanding: A tension, in quantum chem- [21] Raban Iten, Tony Metger, Henrik Wilming, Lı́dia
istry and beyond. part a. stage setting,” Angewandte Del Rio, and Renato Renner, “Discovering physi-
Chemie 132, 12690–12710 (2020). cal concepts with neural networks,” Physical Review
[6] Roald Hoffmann and Jean-Paul Malrieu, “Simula- Letters 124, 010508 (2020).
tion vs. understanding: A tension, in quantum chem- [22] Alireza Seif, Mohammad Hafezi, and Christopher
istry and beyond. part b. the march of simulation, for Jarzynski, “Machine learning the thermodynamic ar-
better or worse,” Angewandte Chemie International row of time,” Nature Physics 17, 105–113 (2021).
Edition 59, 13156–13178 (2020). [23] Silviu-Marian Udrescu and Max Tegmark, “Ai feyn-
[7] Roald Hoffmann and Jean-Paul Malrieu, “Simula- man: A physics-inspired method for symbolic regres-
tion vs. understanding: A tension, in quantum chem- sion,” Science Advances 6, eaay2631 (2020).
istry and beyond. part c. toward consilience,” Ange- [24] John Jumper, Richard Evans, Alexander Pritzel,
wandte Chemie International Edition 59, 13694– Tim Green, Michael Figurnov, Olaf Ronneberger,
13710 (2020). Kathryn Tunyasuvunakool, Russ Bates, Augustin
[8] Gary Marcus, “The next decade in ai: four Žı́dek, Anna Potapenko, et al., “Highly accurate
steps towards robust artificial intelligence,” protein structure prediction with alphafold,” Nature
arXiv:2002.06177 (2020). 596, 583–589 (2021).
[9] Jesse Thaler, “Designing an ai physicist,” CERN [25] Kathryn Tunyasuvunakool, Jonas Adler, Zachary
Courier 9–10, 49 (2021). Wu, Tim Green, Michal Zielinski, Augustin Žı́dek,
[10] Angela Potochnik, “The diverse aims of science,” Alex Bridgland, Andrew Cowie, Clemens Meyer,
Studies in History and Philosophy of Science Part Agata Laydon, et al., “Highly accurate protein struc-
A 53, 71–80 (2015). ture prediction for the human proteome,” Nature
[11] Angela Potochnik, Idealization and the Aims of Sci- 596, 590–596 (2021).
ence (University of Chicago Press, 2017). [26] Lorenzo Casalino, Zied Gaieb, Jory A Goldsmith,
[12] Henk W De Regt, Understanding scientific under- Christy K Hjorth, Abigail C Dommer, Aoife M Har-
standing (Oxford University Press, 2017). bison, Carl A Fogarty, Emilia P Barros, Bryn C Tay-
[13] Henk W De Regt and Dennis Dieks, “A contextual lor, Jason S McLellan, et al., “Beyond shielding: the
approach to scientific understanding,” Synthese 144, roles of glycans in the sars-cov-2 spike protein,” ACS
137–170 (2005). Central Science 6, 1722–1734 (2020).
[14] Henk W De Regt, “Understanding, values, and the [27] Carl A Fogarty, Aoife M Harbison, Amy R Dug-
aims of science,” Philosophy of Science 87, 921–932 dale, and Elisa Fadda, “How and why plants and
(2020). human n-glycans are different: Insight from molecu-
11
lar dynamics into the ”glycoblocks” architecture of Bryson, et al., “The surprising creativity of digital
complex carbohydrates,” Beilstein journal of organic evolution: A collection of anecdotes from the evolu-
chemistry 16, 2046–2056 (2020). tionary computation and artificial life research com-
[28] Pascal Friederich, Florian Häse, Jonny Proppe, and munities,” Artificial life 26, 274–306 (2020).
Alán Aspuru-Guzik, “Machine-learned potentials for [43] Chris J Pickard and RJ Needs, “Ab initio random
next-generation matter simulations,” Nature Mate- structure searching,” Journal of Physics: Condensed
rials 20, 750–761 (2021). Matter 23, 053201 (2011).
[29] Sylvain Gigan, Florent Krzakala, Laurent Daudet, [44] Mario Krenn, Mehul Malik, Robert Fickler, Radek
and Igor Carron, “Artificial intelligence: From elec- Lapkiewicz, and Anton Zeilinger, “Automated
tronics to optics,” Photoniques 104, 49–52 (2020). search for new quantum experiments,” Physical re-
[30] Xingyuan Xu, Mengxi Tan, Bill Corcoran, Jiayang view letters 116, 090405 (2016).
Wu, Andreas Boes, Thach G Nguyen, Sai T Chu, [45] Mario Krenn, Manuel Erhard, and Anton Zeilinger,
Brent E Little, Damien G Hicks, Roberto Moran- “Computer-inspired quantum experiments,” Nature
dotti, et al., “11 tops photonic convolutional acceler- Reviews Physics 2, 649–661 (2020).
ator for optical neural networks,” Nature 589, 44–51 [46] Chris J Pickard and RJ Needs, “Highly compressed
(2021). ammonia forms an ionic crystal,” Nature materials
[31] Google AI Quantum et al., “Hartree-fock on a super- 7, 775–779 (2008).
conducting qubit quantum computer,” Science 369, [47] Mario Krenn, Armin Hochrainer, Mayukh Lahiri,
1084–1089 (2020). and Anton Zeilinger, “Entanglement by path iden-
[32] Jiehang Zhang, PW Hess, A Kyprianidis, P Becker, tity,” Physical review letters 118, 080401 (2017).
A Lee, J Smith, G Pagano, I-D Potirniche, Andrew C [48] Mario Krenn, Xuemei Gu, and Anton Zeilinger,
Potter, A Vishwanath, et al., “Observation of a dis- “Quantum experiments and graphs: Multiparty
crete time crystal,” Nature 543, 217–220 (2017). states as coherent superpositions of perfect match-
[33] Christian Schweizer, Fabian Grusdt, Moritz Bern- ings,” Physical review letters 119, 240403 (2017).
gruber, Luca Barbiero, Eugene Demler, Nathan [49] Mario Krenn, Jakob Kottmann, Nora Tischler,
Goldman, Immanuel Bloch, and Monika Aidels- and Alán Aspuru-Guzik, “Conceptual understand-
burger, “Floquet approach to z2 lattice gauge theo- ing through efficient automated design of quantum
ries with ultracold atoms in optical lattices,” Nature optical experiments,” Physical Review X 11, 031044
Physics 15, 1168–1173 (2019). (2021).
[34] Esteban A Martinez, Christine A Muschik, Philipp [50] Pankaj Malhotra, Lovekesh Vig, Gautam Shroff,
Schindler, Daniel Nigg, Alexander Erhard, Markus and Puneet Agarwal, “Long short term memory net-
Heyl, Philipp Hauke, Marcello Dalmonte, Thomas works for anomaly detection in time series,” Proceed-
Monz, Peter Zoller, et al., “Real-time dynamics of ings on Atifical Neural Networks, Computational In-
lattice gauge theories with a few-qubit quantum telligence and Machine Learning (ESANN) 89, 89–
computer,” Nature 534, 516–519 (2016). 94 (2015).
[35] Yudong Cao, Jonathan Romero, Jonathan P Ol- [51] ATLAS Collaboration, “Dijet resonance search with
son, Matthias Degroote, Peter D Johnson, Mária weak supervision using s= 13 tev p p collisions in the
Kieferová, Ian D Kivlichan, Tim Menke, Borja Per- atlas detector,” Physical review letters 125, 131801
opadre, Nicolas PD Sawaya, et al., “Quantum chem- (2020).
istry in the age of quantum computing,” Chemical [52] CMS Collaboration, “Probing effective field theory
reviews 119, 10856–10915 (2019). operators in the associated production of top quarks
[36] Christian Gross and Immanuel Bloch, “Quantum with a z boson in multilepton final states at s=13
simulations with ultracold atoms in optical lattices,” tev,” Journal of High Energy Physics 2021, 1–55
Science 357, 995–1001 (2017). (2021).
[37] Michael O’Connor, Helen M Deeks, Edward Dawn, [53] Sang Eon Park, Dylan Rankin, Silviu-Marian
Oussama Metatla, Anne Roudaut, Matthew Sutton, Udrescu, Mikaeel Yunus, and Philip Harris, “Quasi
Lisa May Thomas, Becca Rose Glowacki, Rebecca anomalous knowledge: searching for new physics
Sage, Philip Tew, et al., “Sampling molecular confor- with embedded knowledge,” Journal of High Energy
mations and dynamics in a multiuser virtual reality Physics 2021, 1–26 (2021).
framework,” Science advances 4, eaat2731 (2018). [54] Matthew D Schwartz, “Modern machine learning
[38] Daniel Probst and Jean-Louis Reymond, “Exploring and particle physics,” arXiv:2103.12226 (2021).
drugbank in virtual reality chemical space,” Journal [55] Gregor Kasieczka, Benjamin Nachman, David Shih,
of chemical information and modeling 58, 1731–1735 Oz Amram, Anders Andreassen, Kees Benk-
(2018). endorder, Blaz Bortolato, Gustaaf Broojimans, Flo-
[39] Jonas R Schmid, Moritz J Ernst, and Günther rencia Canelli, Jack Collins, et al., “The lhc olympics
Thiele, “Structural chemistry 2.0: Combining aug- 2020: a community challenge for anomaly detec-
mented reality and 3d online models,” (2020). tion in high energy physics,” Reports on Progress
[40] Davide Castelvecchi, “Using sound to explore events in Physics (2021).
of the universe,” Nature 597, 144 (2021). [56] Haizi Yu, Igor Mineyev, and Lav R Varshney,
[41] Alan M Turing, “Computing machinery and intelli- “A group-theoretic approach to computational ab-
gence,” Mind 50, 433–460 (1950). straction: Symmetry-driven hierarchical clustering,”
[42] Joel Lehman, Jeff Clune, Dusan Misevic, Christoph arXiv:1807.11167 (2018).
Adami, Lee Altenberg, Julie Beaulieu, Peter J Bent- [57] Nima Dehmamy, Robin Walters, Yanchen Liu,
ley, Samuel Bernard, Guillaume Beslon, David M Dashun Wang, and Rose Yu, “Automatic symmetry
12
discovery with lie algebra convolutional network,” [70] Peder Larsen and Markus Von Ins, “The rate of
Advances in Neural Information Processing Systems growth in scientific publication and the decline in
34 (2021). coverage provided by science citation index,” Scien-
[58] AkshatKumar Nigam, Robert Pollice, Matthew FD tometrics 84, 575–603 (2010).
Hurley, Riley J Hickman, Matteo Aldeghi, Naruki [71] James A Evans and Jacob G Foster, “Metaknowl-
Yoshikawa, Seyone Chithrananda, Vincent A Voelz, edge,” Science 331, 721–725 (2011).
and Alán Aspuru-Guzik, “Assigning confidence to [72] Aaron Clauset, Daniel B Larremore, and Roberta
molecular property prediction,” Expert Opinion on Sinatra, “Data-driven predictions in the science of
Drug Discovery , 1–15 (2021). science,” Science 355, 477–480 (2017).
[59] A Davies, P Velickovic, L Buesing, S Blackwell, [73] Santo Fortunato, Carl T Bergstrom, Katy Börner,
D Zheng, N Tomasev, R Tanburn, P Battaglia, James A Evans, Dirk Helbing, Staša Milojević,
C Blundell, A Juhasz, et al., “Advancing mathe- Alexander M Petersen, Filippo Radicchi, Roberta
matics by guiding human intuition with ai,” Nature Sinatra, Brian Uzzi, et al., “Science of science,” Sci-
(2021). ence 359 (2018).
[60] Michael R Douglas, “Machine learning as a tool in [74] Dashun Wang and Albert-László Barabási, The sci-
theoretical science,” Nature Reviews Physics , 1–2 ence of science (Cambridge University Press, 2021).
(2022). [75] Vahe Tshitoyan, John Dagdelen, Leigh Weston,
[61] Ross D King, Jem Rowland, Stephen G Oliver, Alexander Dunn, Ziqin Rong, Olga Kononova,
Michael Young, Wayne Aubrey, Emma Byrne, Maria Kristin A Persson, Gerbrand Ceder, and Anubhav
Liakata, Magdalena Markham, Pinar Pir, Larisa N Jain, “Unsupervised word embeddings capture latent
Soldatova, et al., “The automation of science,” Sci- knowledge from materials science literature,” Nature
ence 324, 85–89 (2009). 571, 95–98 (2019).
[62] Anne-Catherine Bédard, Andrea Adamo, Kosi C [76] Elsa A Olivetti, Jacqueline M Cole, Edward Kim,
Aroh, M Grace Russell, Aaron A Bedermann, Olga Kononova, Gerbrand Ceder, Thomas Yong-Jin
Jeremy Torosian, Brian Yue, Klavs F Jensen, and Han, and Anna M Hiszpanski, “Data-driven materi-
Timothy F Jamison, “Reconfigurable system for au- als research enabled by natural language processing
tomated optimization of diverse chemical reactions,” and information extraction,” Applied Physics Re-
Science 361, 1220–1225 (2018). views 7, 041317 (2020).
[63] Sebastian Steiner, Jakob Wolf, Stefan Glatzel, Anna [77] Daniel Schwalbe-Koda, Zach Jensen, Elsa Olivetti,
Andreou, Jaroslaw M Granda, Graham Keenan, and Rafael Gómez-Bombarelli, “Graph similarity
Trevor Hinkley, Gerardo Aragon-Camarasa, Philip J drives zeolite diffusionless transformations and in-
Kitson, Davide Angelone, et al., “Organic synthesis tergrowth,” Nature materials 18, 1177–1181 (2019).
in a modular robotic system driven by a chemical [78] Andrey Rzhetsky, Jacob G Foster, Ian T Foster, and
programming language,” Science 363 (2019). James A Evans, “Choosing experiments to acceler-
[64] Connor W Coley, Dale A Thomas, Justin AM Lum- ate collective discovery,” Proceedings of the National
miss, Jonathan N Jaworski, Christopher P Breen, Academy of Sciences 112, 14569–14574 (2015).
Victor Schultz, Travis Hart, Joshua S Fishman, Luke [79] Mario Krenn and Anton Zeilinger, “Predicting re-
Rogers, Hanyu Gao, et al., “A robotic platform for search trends with semantic and neural networks
flow synthesis of organic compounds informed by ai with an application in quantum physics,” Proceed-
planning,” Science 365 (2019). ings of the National Academy of Sciences 117, 1910–
[65] Benjamin Burger, Phillip M Maffettone, Vladimir V 1916 (2020).
Gusev, Catherine M Aitchison, Yang Bai, Xiaoyan [80] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and
Wang, Xiaobo Li, Ben M Alston, Buyi Li, Rob Kristina Toutanova, “Bert: Pre-training of deep bidi-
Clowes, et al., “A mobile robotic chemist,” Nature rectional transformers for language understanding,”
583, 237–241 (2020). arXiv:1810.04805 (2018).
[66] Sourav Chatterjee, Mara Guidi, Peter H Seeberger, [81] Tom B Brown, Benjamin Mann, Nick Ryder,
and Kerry Gilmore, “Automated radial synthesis of Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal,
organic molecules,” Nature 579, 379–384 (2020). Arvind Neelakantan, Pranav Shyam, Girish Sastry,
[67] Jonathan Grizou, Laurie J Points, Abhishek Sharma, Amanda Askell, et al., “Language models are few-
and Leroy Cronin, “A curious formulation robot en- shot learners,” arXiv:2005.14165 (2020).
ables the discovery of a novel protocell behavior,” [82] William L Hamilton, Rex Ying, and Jure Leskovec,
Science advances 6, eaay4237 (2020). “Inductive representation learning on large graphs,”
[68] Hyungil Moon, Dominic T Lennon, James Kirk- in Proceedings of the 31st International Conference
patrick, Nina M van Esbroeck, Leon C Camenzind, on Neural Information Processing Systems (2017)
Liuqi Yu, Florian Vigneau, Dominik M Zumbühl, pp. 1025–1035.
G Andrew D Briggs, Michael A Osborne, et al., “Ma- [83] Grégoire Montavon, Wojciech Samek, and Klaus-
chine learning enables completely automatic tuning Robert Müller, “Methods for interpreting and un-
of a quantum device faster than human experts,” derstanding deep neural networks,” Digital Signal
Nature communications 11, 1–10 (2020). Processing 73, 1–15 (2018).
[69] Mogens Dalgaard, Felix Motzoi, Jens Jakob [84] Wojciech Samek, Grégoire Montavon, Andrea
Sørensen, and Jacob Sherson, “Global optimization Vedaldi, Lars Kai Hansen, and Klaus-Robert
of quantum dynamics with alphazero deep explo- Müller, Explainable AI: interpreting, explaining and
ration,” npj Quantum Information 6, 1–9 (2020). visualizing deep learning, Vol. 11700 (Springer Na-
ture, 2019).
13
[85] Ribana Roscher, Bastian Bohn, Marco F Duarte, Spergel, and Shirley Ho, “Discovering symbolic
and Jochen Garcke, “Explainable machine learning models from deep learning with inductive biases,”
for scientific insights and discoveries,” IEEE Access arXiv:2006.11287 (2020).
8, 42200–42216 (2020). [98] Kyle Cranmer, Johann Brehmer, and Gilles Louppe,
[86] Scott M Lundberg, Gabriel Erion, Hugh Chen, Alex “The frontier of simulation-based inference,” Pro-
DeGrave, Jordan M Prutkin, Bala Nair, Ronit Katz, ceedings of the National Academy of Sciences 117,
Jonathan Himmelfarb, Nisha Bansal, and Su-In Lee, 30055–30062 (2020).
“From local explanations to global understanding [99] Gal Raayoni, Shahar Gottlieb, Yahel Manor, George
with explainable ai for trees,” Nature machine in- Pisha, Yoav Harris, Uri Mendlovic, Doron Haviv,
telligence 2, 56–67 (2020). Yaron Hadad, and Ido Kaminer, “Generating con-
[87] Aravindh Mahendran and Andrea Vedaldi, “Under- jectures on fundamental constants with the ramanu-
standing deep image representations by inverting jan machine,” Nature 590, 67–73 (2021).
them,” in Proceedings of the IEEE conference on [100] Adam Zsolt Wagner, “Constructions in combi-
computer vision and pattern recognition (2015) pp. natorics via neural networks,” arXiv:2104.14516
5188–5196. (2021).
[88] Alexander Mordvintsev, Christopher Olah, and [101] Pablo Lemos, Niall Jeffrey, Miles Cranmer, Shirley
Mike Tyka, “Inceptionism: Going deeper into neural Ho, and Peter Battaglia, “Rediscovering orbital me-
networks,” https://ai.googleblog.com/2015/06/ chanics with machine learning,” arXiv:2202.02306
inceptionism-going-deeper-into-neural.html (2022).
(2015). [102] Iyad Rahwan, Manuel Cebrian, Nick Obradovich,
[89] Cynthia Shen, Mario Krenn, Sagi Eppel, and Josh Bongard, Jean-François Bonnefon, Cynthia
Alan Aspuru-Guzik, “Deep molecular dreaming: In- Breazeal, Jacob W Crandall, Nicholas A Christakis,
verse machine learning for de-novo molecular design Iain D Couzin, Matthew O Jackson, et al., “Machine
and interpretability with surjective representations,” behaviour,” Nature 568, 477–486 (2019).
Machine Learning: Science and Technology (2021). [103] Jürgen Schmidhuber, “Driven by compression
[90] Christopher P Burgess, Irina Higgins, Arka Pal, Loic progress: A simple principle explains essential as-
Matthey, Nick Watters, Guillaume Desjardins, and pects of subjective beauty, novelty, surprise, interest-
Alexander Lerchner, “Understanding disentangling ingness, attention, curiosity, creativity, art, science,
in beta-vae,” arXiv:1804.03599 (2018). music, jokes,” Workshop on anticipatory behavior in
[91] Pascal Friederich, Mario Krenn, Isaac Tamblyn, and adaptive learning systems , 48–76 (2008).
Alán Aspuru-Guzik, “Scientific intuition inspired by [104] Deepak Pathak, Pulkit Agrawal, Alexei A Efros,
machine learning-generated hypotheses,” Machine and Trevor Darrell, “Curiosity-driven exploration by
Learning: Science and Technology 2, 025027 (2021). self-supervised prediction,” International Conference
[92] Daniel Flam-Shepherd, Tony Wu, Xuemei Gu, Alba on Machine Learning , 2778–2787 (2017).
Cervera-Lierta, Mario Krenn, and Alan Aspuru- [105] Luca A Thiede, Mario Krenn, AkshatKumar Nigam,
Guzik, “Learning interpretable representations of en- and Alan Aspuru-Guzik, “Curiosity in exploring
tanglement in quantum optics experiments using chemical space: Intrinsic rewards for deep molecular
deep generative models,” arXiv:2109.02490 (2021). reinforcement learning,” arXiv:2012.11293 (2020).
[93] Geemi P Wellawatte, Aditi Seshadri, and Andrew D [106] Lav R Varshney, Nazneen Fatema Rajani, and
White, “Model agnostic generation of counterfac- Richard Socher, “Explaining creative artifacts,”
tual explanations for molecules,” Chemical Science arXiv:2010.07126 (2020).
(2022). [107] Lav R Varshney, Florian Pinel, Kush R Varsh-
[94] Thomas McGrath, Andrei Kapishnikov, Nenad ney, Debarun Bhattacharjya, Angela Schörgendor-
Tomašev, Adam Pearce, Demis Hassabis, Been Kim, fer, and Y-M Chee, “A big data approach to compu-
Ulrich Paquet, and Vladimir Kramnik, “Acquisition tational creativity: The curious case of chef watson,”
of chess knowledge in alphazero,” arXiv:2111.09259 IBM Journal of Research and Development 63, 7–1
(2021). (2019).
[95] Michael Schmidt and Hod Lipson, “Distilling free- [108] Laurent Itti and Pierre Baldi, “Bayesian surprise at-
form natural laws from experimental data,” Science tracts human attention,” Vision research 49, 1295–
324, 81–85 (2009). 1306 (2009).
[96] Antonio A Gentile, Brian Flynn, Sebastian Knauer, [109] Edward A Feigenbaum, “Some challenges and grand
Nathan Wiebe, Stefano Paesani, Christopher E challenges for computational intelligence,” Journal
Granade, John G Rarity, Raffaele Santagati, and of the ACM (JACM) 50, 32–40 (2003).
Anthony Laing, “Learning models of quantum sys- [110] Henk W De Regt and Victor Gijsbers, “How false
tems from experiments,” Nature Physics 17, 837–843 theories can yield genuine understanding,” Explain-
(2021). ing understanding: New perspectives from episte-
[97] Miles Cranmer, Alvaro Sanchez-Gonzalez, Pe- mology and philosophy of science , 50–75 (2017).
ter Battaglia, Rui Xu, Kyle Cranmer, David

On Scientific Understanding With Artificial Intelligence: Mario - Krenn@mpl - Mpg.de

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

On Scientific Understanding With Artificial Intelligence: Mario - Krenn@mpl - Mpg.de

Uploaded by

Copyright:

Available Formats

On scientific understanding with artificial intelligence

Mario Krenn,1, 2, 3, 4, ∗ Robert Pollice,2, 3 Si Yue Guo,2 Matteo Aldeghi,2, 3, 4 Alba

I. INTRODUCTION distinguish between this and communicating with a

1. Criterion of Understanding Phenomena:

B. Beyond Discovery In those two classes, the human scientist is essential

You might also like