Bruno Beltran

Musings

Almost everything that I would normally have made into a blog post is either a coding tutorial (see my code page) or has now become a published paper (see my science page).

However, if you are interested in the philosophy of science, perhaps you'll find something worthwhile here...


The best way to create pseudoscience

Is imprecise language.

Okay, maybe multi-million-dollar misinformation campaigns can also be very effective. But I would argue that imprecise language damaged science in the 20th century at least as much as the cigarette and oil lobbies.

There are innumerable ways that imprecise language can lead to pseudoscience. For one, simple gaffes from prominent scientists can often be enough. After all, if I asked my family back home how the tone of a typical scientist makes them feel, they would probably complain that scientists always seem to want people to know "I am right, whether you like it or not". People can forgive mistakes, but when they feel spoken down to, even the smallest mistakes can be made grounds for complete dismissal of an idea.

On the other hand, I dare anyone to name a single language issue that plagues science more at the turn of the 21st century than attempts to appropriate common words into the scientific parlance. (Its less insidious converse, trying to introduce scientific definitions to the public via too-simple analogies, is better known.)

If you need convincing that this is the case, I've provided in what follows three diverse examples of the kinds of issues that arise when common language words are given new, "more rigorous" definitions by scientists. One from the social sciences, one from game theory, and one from quantum mechanics. I close with some of my thoughts about how to prevent these issues in the first place.

I especially try to demonstrate with an example from game theory how trying to come up with scientific definitions of common words tends to lead to incredibly persistent miscommunication, even among other scientists in the same field. However, to see how pernicious this problem can really be, we'll first examine the damage that can be done when the confused party is instead the general public.

For non-scientist readers, hopefully this is a window into one of the many ways in which common journalistic practices from "pop-science" can easily deceive you. For scientist readers, I hope to convince you that if a word is already in common use, and you try to give it a rigorous definition, no matter how well intentioned you may be, or how close that word is to capturing the original colloquial meaning, at best you're going to cause confusion, and at worst you're going to piss people off.

When bad marketing meets existing prejudice

An obvious example of this worst case has unfortunately already played out with the word "gender". Social scientists now uniformly use this word to refer to the socially-constructed aspects of the ideas colloquially referred to as masculinity and femininity. However, before the 1950s, the word gender was already in use, and referred only to grammatical categories. In many places, for example in the southern United States (where I went to university), this old convention is still the only way the word is commonly used (especially among those above the age of 30).

There is no better way to alienate someone on a topic than to start by trying to convince them that a word they've been using their whole life suddenly has a new definition (entire comedy routines are based on this fact). While the abuses that continue to be perpetrated towards trans and non-binary people in the South are totally inexcusable, I have personally seen countless potential allies (especially religious people) driven away from LGBT+ issues due to simple misunderstandings caused by well-intentioned attempts to change language by fiat.

Had the word chosen for what we now call gender been anything else, maybe something with a Latin root that invokes the idea that it's socially constructed, or maybe something that explicitly signaled an independence from the concept of biological "sex", I posit that any number of needless, hateful internet fights about whether or not there are "only two genders" could have been completely avoided. And if you really think that the people crying that there are "only two genders" are all bigots and never just trying to argue in good faith that we should just use the word gender to mean "chromosome-encoded, biological sex", you just aren't paying attention.

So, clearly, the choice by sociologists as a scientific community to legitimize this potentially confusing terminology directly fed one of the largest sources of internet hatred in the early 21st century.

But don't think for a second that this is something that can only happen in the "soft" sciences. Non-social scientists often feel immune to this type of problem due to a perception that they are more "rigorous" and so cannot fall into the traps of creating a situation where a well-defined word is more confusing than it is helpful. But as we'll see soon, that is far from the case.

(Disclaimer: thankfully, the sociologist's definition of the word "gender" seems to be on track to become well-accepted within the next generation. I fully support the charge to normalize this word in the public mind, as whatever problems may have occurred due to the initial pushback against this usage now pale in comparison to the practical need to adopt a word that can be used by, for example, trans and non-binary people to accurately describe their experiences. Hopefully normalization of this word is the first step to a more compassionate and productive discourse on the positive and negative roles of both masculinity and femininity.

That said, if I could go back to 1955 and talk to John Money, I would plead that he choose a different word.)

The finite iterated prisoner's dilemma

If you're already familiar with the Nash equilibrium of the finite iterated prisoner's dilemma, feel free to skip ahead.

Suppose you're on a game show. A celebrity moderator stands between you and your "opponent". The rules of the game are simple: each turn, you each press one of two buttons. If you both press the "C" button ("C" for "cooperate"), you both get $100,000. If you both press the "D" button ("D" for "defect"), you each get a measly $10,000.

Easy enough? Except that if you betray the other person (by pressing "D" when they press "C"), you gain $150,000 instead of $100,000! Now that's a lot of extra money (almost two years of a graduate student's salary, if anyone's asking).

Figure: prisoner's dilemma payoff grid, with the four possibilities (A,A), (A,B), (B,A), and (B,B) drawn out.

In real life, this could be a reasonably tense game show. With lots of pleading, promises (sometimes broken) and potentially many tears. But suppose now that instead of being out on a stage, with a celebrity host and a cheering crowd, you're instead playing from inside an interrogation room. All you were told about your opponent is that you've never met them. The windows are blacked out and the only contact you have with the outside world is the two buttons in front of you, and a timer above you, telling you how much longer you have to make your final choice.

What should you do?

Suppose your goal is to make as much money as possible, and you've been told there are ten turns. What is the rational thing to do? Is there a "rational" thing to do?

If you've taken microeconomics or read about game theory (or are particularly cynical), you'll recognize that if there were only one turn, you would always be "better off" pressing "D". You have no control over what your opponent will do, but we can think about what will happen for each of their two options.

Suppose your opponent presses "C". Then if you press "D", you get $50,000 more than if you press "C"! If they are going to press "D", then you may only get $10,000 for pressing "D", but it's better than the $0 you would get for pressing "C". So in either case you're strictly better off pressing "D".

This is microeconomics's favorite "unintuitive" result. Just like a statistics professor presenting the birthday problem, an econ professor must typically jump through several hoops to convince the class that while they may not think it's "right" to defect and press "D", it is the strictly more "rational" decision. And so we arrive at the definition of "rationality" used in game theory: if there is a choice present that has a reward strictly higher than any other choice, regardless of what the opponent does (a strictly dominant strategy), a "rational" player will always choose such a strictly dominant strategy. In other words, the player is rational if they can follow the train of thought above.
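For readers who prefer to check this sort of claim mechanically, here is a minimal sketch of the dominance test for the one-turn game. The payoff numbers come from the game show description above; the names `PAYOFFS`, `MOVES`, and `strictly_dominant_move` are made up for this illustration.

```python
# Payoffs in dollars as (your_payoff, opponent_payoff), keyed by
# (your_move, opponent_move). The numbers are the ones from the text.
PAYOFFS = {
    ("C", "C"): (100_000, 100_000),
    ("C", "D"): (0, 150_000),
    ("D", "C"): (150_000, 0),
    ("D", "D"): (10_000, 10_000),
}
MOVES = ("C", "D")


def strictly_dominant_move(payoffs, moves=MOVES):
    """Return the move that pays strictly more than every alternative
    against every possible opponent move, or None if no such move exists."""
    for mine in moves:
        alternatives = [m for m in moves if m != mine]
        if all(
            payoffs[(mine, theirs)][0] > payoffs[(other, theirs)][0]
            for other in alternatives
            for theirs in moves
        ):
            return mine
    return None


print(strictly_dominant_move(PAYOFFS))  # prints D
```

Note that the check never asks what the opponent is likely to do. That is exactly what makes the strategy "dominant", and also exactly what makes the word "rational" such a loaded label for it.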

This may seem like a pretty good definition of the word rational. After all, why would a "rational" agent ever do something not in their best interest? That is, however, until you ask how the original game plays out if both players are "rational". Obviously on the last turn, the game is equivalent to the one-turn version, so both players will choose "D". But then if the last turn of the game is fixed, the second-to-last turn is also equivalent to the one-turn case, since the players don't have to take the last turn into consideration (they're both rational and each knows the other is rational, so they can deduce what the outcome will be for the last turn). So that means the second-to-last turn can be considered in isolation, so the players also both choose "D" on the second-to-last turn, and on the third-to-last turn, and so on and so on.

This may seem like a sneaky argument, but a rigorous version of this analysis exists (it's called "backwards induction"), and it can be used to prove that in any finite (fixed number of turns) prisoner's dilemma, no matter how many turns there are, two "rational" players, for this game theoretic definition of "rational", will always play "D".
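The argument above can be sketched in a few lines of code. This is a simplified illustration rather than a general backward induction solver: it assumes, as the argument does, that play on later turns is already fixed and does not depend on earlier moves, so later winnings just add the same constant to every cell of the current turn's grid. The names `STAGE`, `dominant_move`, and `backward_induction` are hypothetical.

```python
# One-turn payoffs as (row_player, column_player), keyed by (row_move, column_move).
STAGE = {
    ("C", "C"): (100_000, 100_000),
    ("C", "D"): (0, 150_000),
    ("D", "C"): (150_000, 0),
    ("D", "D"): (10_000, 10_000),
}
MOVES = ("C", "D")


def dominant_move(payoffs, player):
    """Strictly dominant move for `player` (0 = row, 1 = column), or None."""
    def value(mine, theirs):
        key = (mine, theirs) if player == 0 else (theirs, mine)
        return payoffs[key][player]

    for mine in MOVES:
        if all(value(mine, t) > value(o, t)
               for o in MOVES if o != mine
               for t in MOVES):
            return mine
    return None


def backward_induction(turns, stage=STAGE):
    """Work from the last turn to the first.

    Returns the list of (row_move, column_move) per turn and the total payoffs.
    Only meant for stage games like this one, where a dominant move exists.
    """
    future = (0, 0)  # what each player will earn on all later turns
    plan = []
    for _ in range(turns):
        # Later play is already decided, so it shifts every cell equally.
        game = {k: (v[0] + future[0], v[1] + future[1]) for k, v in stage.items()}
        row, col = dominant_move(game, 0), dominant_move(game, 1)
        plan.append((row, col))
        future = game[(row, col)]
    plan.reverse()
    return plan, future


plan, totals = backward_induction(10)
print(plan[0], totals)  # prints ('D', 'D') (100000, 100000)
```

Because adding a constant to every cell never changes which move dominates, every turn collapses to the one-turn game, no matter how many turns are left.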

And depending on your risk tolerance, this might make some sense intuitively. After all, walking away with 10*$10,000 is a pretty sweet deal, and maybe better than $0 to some theoretical "rational" agent.

However, changing the game slightly makes this result totally nonsensical to most people:

Figure: prisoner's dilemma payoff grid, with the four possibilities (A,A), (A,B), (B,A), and (B,B) drawn out. This time the prize for defecting is only $1.

If the prize for both people defecting is only one dollar, you'd be hard pressed to find anyone that would agree that the "rational" choice is to always defect. In fact, almost any other strategy will do better! So is game theory wrong?
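To put rough numbers on this, here is a small simulation of the ten-turn game with the $1 prize. It assumes the other three payoffs are unchanged from the original game, and it uses tit-for-tat (cooperate first, then copy the opponent's last move) purely as one example of "another strategy". The function names are invented for this sketch.

```python
def stage_payoffs(mutual_defection_prize):
    """Payoff grid as (player_a, player_b), keyed by (move_a, move_b)."""
    return {
        ("C", "C"): (100_000, 100_000),
        ("C", "D"): (0, 150_000),
        ("D", "C"): (150_000, 0),
        ("D", "D"): (mutual_defection_prize, mutual_defection_prize),
    }


def always_defect(my_history, their_history):
    """The game theoretic "rational" strategy."""
    return "D"


def tit_for_tat(my_history, their_history):
    """Cooperate on the first turn, then copy the opponent's previous move."""
    return their_history[-1] if their_history else "C"


def play(strategy_a, strategy_b, payoffs, turns=10):
    """Play the repeated game and return each player's total winnings."""
    history_a, history_b = [], []
    total_a = total_b = 0
    for _ in range(turns):
        move_a = strategy_a(history_a, history_b)
        move_b = strategy_b(history_b, history_a)
        pay_a, pay_b = payoffs[(move_a, move_b)]
        total_a += pay_a
        total_b += pay_b
        history_a.append(move_a)
        history_b.append(move_b)
    return total_a, total_b


one_dollar_game = stage_payoffs(1)
print(play(always_defect, always_defect, one_dollar_game))  # (10, 10)
print(play(tit_for_tat, tit_for_tat, one_dollar_game))      # (1000000, 1000000)
print(play(tit_for_tat, always_defect, one_dollar_game))    # (9, 150009)
```

Two "rational" players walk away with $10 each, while two tit-for-tat players each take home a million. The last line shows the caveat: against a committed defector, tit-for-tat earns $9 rather than $10, so whether a strategy "does better" depends on who is sitting in the other room.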

Well, of course not. Game theory is a mathematical framework; it simply tells us the logical conclusions given the definitions that we have laid out. On the one hand, it could be argued that the fact that the game theoretic version of "rationality" doesn't align with our colloquial understanding of the word is not a problem. After all, as long as we're clear on which version of the word we're using, we should be fine, right?

On the other hand, it's easy to see how this definition of rationality could be incorrectly used in a headline: "game theory teaches us that betrayal is rational". Anybody who tries to claim that such a headline is purely the fault of a subpar journalist has never tried to teach game theory. When you have to spend most of the class hammering home the ideas that "humans aren't rational", and "that strategy does make the most sense, but it's not technically 'rational', because...", at some point I think you have to realize that the problem is not with the students, but with the choice of words being taught.

Even successful game theorists have been known to accidentally tie themselves into knots trying to explain how to best interpret counterintuitive results on rationality, forgetting that "rational" is just a mathematical concept that we happened to choose an extremely loaded word for, and that game theory does not provide any guidance about what rationality "should" mean, nor about the faculties that govern human reason (the traditional meaning of the word rationality).

A great example of this very issue can be found towards the end of the excellent book by Martin Osborne and Ariel Rubinstein, "A Course in Game Theory". The authors come to such disagreement about how to interpret the experimental result that humans act as if there were an infinite number of turns (i.e., they do not choose to defect immediately) that they include in the book itself a back and forth with each other (starting on page 135). Their disagreement fundamentally boils down to the question of whether or not it makes sense to model human interactions in the finite prisoner's dilemma using the methods described above. And, well, of course it doesn't. There's a whole field of economics (behavioral game theory) devoted to how far human reason is from game theory's definition of "rational".

So even scientists are not immune to the dangers of misappropriated words.

And if you think that economics is still too much of a social science to count as an example of word misappropriation by the hard sciences, let's talk about:

The example of quantum "foundations"

While I don't have nearly enough room in a blog post to defend such a thesis, I can't resist writing this, if only to piss off my philosophy of physics friends.

There has been an absolute boatload of human time devoted to explaining (and trying to solve) various problems at the "heart" of quantum mechanics, collectively known as the "foundations of quantum mechanics". If you've heard of the many-worlds interpretation, the Copenhagen interpretation, the measurement problem, or other "paradoxes" of quantum mechanics, you've been exposed to these ideas before.

I think that the whole business of trying to "solve" these paradoxes is just another waste of time that is distracting perfectly good minds from attacking real, useful problems. I think that once you start to carefully frame these problems from within any reasonable, logically consistent model for what science "is", the problems completely go away.

What do I mean by that? And how does it tie into the problem of trying to rigorously define common words?

Let's take the measurement problem for example. People always complain that the quantum wavefunction needs someone to "measure" it in order to produce "observables". As David Albert recounts to Sean Carroll in Carroll's excellent podcast, even prominent, Nobel-winning physicists like Wigner have been known to speculate about totally wacky-sounding ideas in response to this "problem", even going as far as to claim things to the effect of "dogs could likely collapse wavefunctions, but mice probably not".

But the framing of the measurement problem itself has a problem. Wikipedia defines the measurement problem as "the problem of how (or whether) wave function collapse occurs".

To explain why this is a silly question, I'll use a simple (if slightly flawed) analogy.

In the theory of polymers, there is the notion of a "Rouse" polymer. Any polymer, when made sufficiently long, will be described well by the "Rouse" polymer model. However, the "Rouse" polymer itself (as a mathematical object) looks like a fractal, random walk through space with infinite energy. Obviously, there is no sense in which a strand of DNA looks like an infinitely jagged fractal path. If we zoom in enough, we will see individual base pairs (and eventually individual atoms) which do not resemble linear fractals in the slightest. However, DNA is a polymer, and sure enough, the Rouse polymer model has been wildly successful at describing its behavior on long length scales.

Similarly, quantum mechanics has been a hugely successful theory, and the mathematical object at its center is known as the wavefunction. This obviously does not have any bearing on whether there "is" such a wavefunction, to which we must actually assign some higher ontological status beyond its mathematical utility for calculating things that we observe in our actual universe.

In this sense, asking whether the wavefunction "actually" collapses makes a category error. The wavefunction that most physicists know and love is a mathematical object, not a physical one. Asking whether it "actually" collapses sneaks in the assumption that it "actually" exists.

So why not go and try to measure it then? Well, the real crux of the issue is that it is logically inconsistent to measure the wavefunction directly according to the theory of quantum mechanics. It doesn't even make sense to ask the math that kind of question. That doesn't mean that it's impossible to ever know whether or not there is a measurable, high-dimensional, wave-like object that permeates space-time. It just means that if it is possible to ask such a question, it requires a more general theory which we do not currently have.

Until then, people asking whether or not the wavefunction "really" collapses are wasting their time.

But what does this have to do with trying to assign rigorous, scientific meaning to common words? Well, quantum theory is a zoo of such assignments. The most relevant to the measurement problem is the definition of the word "observable".

The definition of a "measurement" in quantum mechanics is typically given to be "anything which is an observable". And while this may seem like a play on words from outside of the theory, it turns out to make sense because in quantum theory, the word "observable" actually has a very specific mathematical definition (a self-adjoint, i.e. Hermitian, operator on a Hilbert space of "states").

I cannot speak to what drove Wigner to postulate that dogs may be capable of "observing" the wavefunction, yet mice not so much. But I can say that if you teach quantum mechanics for long enough, you notice that time and time again, students will come to you with questions about the philosophy of quantum mechanics that accidentally treat this mathematical model of "observables" as corresponding to the intuitive notion of actually performing a physical measurement out in the world.

Most of these students go on to become successful practitioners of quantum mechanics, but some unfortunate few end up wasting years of their life unnecessarily pondering silly questions like the "ontological status of measurement".

How to carefully appropriate words

So what do we do about this issue? We've seen that not being careful about our choice of words can lead to (in vaguely increasing order of insidiousness):

Wasting good minds on useless philosophical interpretations of perfectly well-defined (if complex) topics
Misapplying scientific results to situations they were never designed for
Completely alienating the public

Is the answer really to have better foresight about how our definitions might be misinterpreted? I, for one, don't think so. While it is incumbent on scientists to police themselves to prevent egregiously confusing terms from coming into common use in the first place, no matter how hard you try, you're never going to be able to accurately predict the connotations society as a whole will assign to a word after you start using it in a new way.

Here are a few thoughts from my own personal experience:

Be up front: if you're not sure that your audience is aware of the actual process underlying supervised neural networks, don't just say "this AI has learned to understand what criminal faces look like". I am usually against the use of jargon, but make sure that you use words that are clearly technical so that nobody can think that you mean (for example) "learning" in the colloquial sense as opposed to "extracted statistical patterns from human labelings of who is or is not a criminal".
Stand up against misuse: because let's face it, "A description of statistical patterns in faces humans classify as 'criminal'" is much less sexy than "AI learns how to spot a criminal" is. As scientists, the onus is often on us, as the only people with the technical understanding of the subject being spoken about, to police such...criminal...equivocation between scientific language and the common definitions of words. If Amazon starts selling these systems to police departments, we need to sound the alarm bells.
Don't try to force words on others: maybe you finally feel inspired to confront uncle Fred about the fact that his healing crystals can't tap into "quantum energies of his chakra" because energy is well defined and chakras are not. Don't make the mistake of trying to fight him on what he means by "chakra". People's irrational beliefs are their own to live with, and persuading someone to abandon their definition of a word, no matter how problematic, is typically an uphill battle not worth fighting. What you can do is gently explain to him what "energy" actually refers to in quantum mechanics. If he independently arrives at the conclusion that it cannot be related to the spiritual idea he has of chakras, he is much more likely to listen than if you try to force that idea on him. As scientists, we occupy a position of relative intellectual authority on scientific topics, and in my experience people will tend to listen, as long as you
Stay within the realm of your expertise: let the climate scientists debate climate science. Let the physicists debate quantum mechanics. Admit the limits of your understanding, and remember: being a neuroendocrinologist does not make you qualified to talk about quantum mechanics.

What is science?

Science, at the end of the day, is really just the process by which we map our observations of reality onto the world of mathematics (the persnickety philosopher can find an incoherent rant with more details about what I mean here):

Flowchart: a scientific theory is simply a map from observational data ($d_i$) to a set of mathematical objects ($X_i$).

As long as we're living completely on the right side of this diagram (in the world of mathematics) it makes sense to say that "assumptions A, B, and C prove conclusion X". As long as you specify A, B, and C well enough, either "$A+B+C \Rightarrow X$" or it doesn't. There is no way for someone to disagree with a valid proof that "$A+B+C \Rightarrow X$". Sure, anyone can misunderstand a complicated enough mathematical statement, but anyone that continues to disagree with valid logic once it's been made clear doesn't deserve to be part of the conversation.

On the other hand, scientific statements tend to be more complicated creatures. Often, they take the form of predictions about new data made using existing data:

Diagram: a theorem like $X_1 \implies X_2$, and a scientific theory that associates $X_1 \to d_\text{existing}$ and $X_2 \to d_\text{new}$, leads to a scientific claim that $d_\text{existing} \implies d_\text{new}$.

Suddenly, there are several levels on which you can disagree with someone. Let's take a concrete example. Say my physical theory is just Newton's gravity ($T_i$), and suppose you have some set of measurements, like the initial trajectory of a satellite that you've launched ($d_\text{existing}$). If I say to you, "given this initial trajectory, the satellite's orbit will be so-and-so ($d_\text{new}$) in two days", there are several ways you can disagree with that statement:

You can disagree with the mathematical content of my claim (i.e. if I messed up my calculations, then $X_1 \nRightarrow X_2$)
You can disagree with the accuracy of the initial set of measurements, in which case my calculations could be right but I'd still be wrong (i.e. you reject $d_\text{existing}$)
You can disagree with my choice of model (i.e. you reject $T_i$). And in this case, you'd probably be right: since relativistic effects are likely not negligible here, we should probably use Einstein's theory ($T_{i+1}$) instead.

On the other hand, if you accept $d_\text{existing}$, you accept that $X_1 \Rightarrow X_2$, and you believe that the physical theory $T_i$ should hold, then you must agree with my conclusions.

Science is the process by which we select increasingly accurate scientific models.

And this process is entirely based on heuristics.

When Eric Weinstein and Peter Thiel get together...

...you're sure to hear grandiose claims about the stagnation of science and no real substance to back it up. Or at least that's what you'll get if you watch Eric's recently released first podcast episode (I would recommend against it).

Potshots aside though, I was pretty flabbergasted by their claim that since the early 1970s, science as a whole has more or less frozen in time, and that academic progress has ground to what amounts to a complete halt. Perhaps I am blinded by young naiveté, but from my perspective several fields (especially biology, which they seem to have it out for in particular) have done nothing but accelerate the pace of their development over the past ten years. From the human genome project to CRISPR, it's hard to deny that our ability to probe and modify our own biology is advancing at an exponential rate even faster than Moore's law. (And this comes just as Moore's law itself is slowing down for the first time since the 70s, the point in time at which Peter and Eric claim the great demise of science began.)

So why the pessimism on their part? Well, they really do harp on the idea that biology (after molecular biology was "founded by physicists") never progressed at a pace comparable to that of our understanding of "the atom". They make sure to never pin down exactly what would constitute a "fast" pace of biological discovery, but they make sure to explicitly posit that biology is full of "not [as] talented people" as physics.

If you spend any non-zero amount of time at the interface of math, physics, and biology, this is a pattern that you'll see come up over and over again. Physicists (or in Thiel's case, simply self-proclaimed "intellectuals"), always think that with their superior creativity/intelligence/mathematical ability, they can swoop into biology, arguably the most populated area of modern research, and swiftly lay down some kind of simple foundational principles that have managed to elude biologists for the past century.

Invariably, some fraction of these physicists actually do start doing biological research, and quickly find themselves drowning in a sea of protein names, apparently useless acronyms for genes, and systems that require much more specification and have many fewer "universal" properties than they would have anticipated. Most run away before getting very far, but many make non-trivial contributions (after all, it's true that typical biologists tend to prefer having other people do their math/modeling for them, so that they have more time to do experiments). That said, only a very few become "card-carrying" biologists, able to hold their own in a conversation with the mythical "real" biologist, whose encyclopaedic knowledge of historical experiments, networks of genes, and post-translational modifications related to their system of interest literally dwarfs even the literature itself, due to years of hearing folkloric, unpublished results through the grapevine.

What separates these "card-carrying" biologists from the physicists that run away? Is it, as Eric and Peter would probably suggest, that they have simply become focused on less interesting questions? Have they simply lost their love or ability to do mathematics?

I think what separates the wheat from the chaff is the realization that the fundamental principles of biology have long been well known, just as they have been for physics. Namely:

Physics still works inside living cells. The laws of physics still lie at the heart of how cells, tissues and organisms work. Supposing God himself had gifted you with a list of every relevant molecule and interaction in a biological system, you won't meet many biologists who will claim that an appropriate computer simulation wouldn't be able to reproduce experimental results as well as can be done in fundamental physics research.
Evolution by natural selection has, broadly speaking, been in charge of the development of all modern forms of life. Therefore, all control systems within living organisms must have developed through random mutation. As a consequence, there are an absolutely massive number of leftover and/or partially-functioning systems in any given organism. You can't search for a protein by knowing what the optimal way for the cell to solve a problem would be (although biology does optimize some things pretty well) because natural selection need not find the best solution, it just finds some solution, then "tries" to make it better over time.

What the physicists-turned-biologists that I respect will all universally admit is that biology seems to follow what I like to call the "principle of most action": if you think you know the answer to how something is being done by a cell, you are probably missing 10 more key players, several dozen more backup systems, and the part that you think is most important may even be just vestigial.

Is this status quo (that most biological "models" are doomed to fail) just a side effect of biologists not being smart enough to make good models? NO! It's built right into the thing that's being studied. Organisms that spent billions of years stochastically searching DNA sequence space for genomes that would keep them around for evolutionary time are just going to be complex. The fact that biologists tend to be skeptical of the physicist's approach of building "simple" models to gain understanding in a "bottom-up" approach is well founded, because they have decades of experience as a field finding that the simplest explanation just wasn't the one that stood up to more experiments.

If you've made it this far somehow, you might be wondering why I'm writing this. I guess it upsets me that so many of Eric and Peter's valid criticisms of academia (the fact that tenure is a pyramid scheme, the perverse incentive structures, etc.) are mixed with these grandiose ideas about how much smarter they must be than people in these other fields, since those fields are "moving so slow".

Hopefully someday I can come back and rewrite this into a guide: "how to be a good biologist for dummies physicists". But for now I guess it will remain a rant about the pitfalls of pontificating about a field you know nothing about.

What makes science...trustworthy?

And why? Recent polls show that scientists are largely still trusted by the American public. However, dive into conversation with anyone in the "mistrusts science" camp, and you'll quickly see that the internet age has led to an erosion of trust in science that would have been unfathomable to the generation that welcomed the atomic age.

While this eroded trust has serious societal consequences, I would challenge scientists to restrain themselves from casting aspersions simply because somebody answers "no" to a question like "do you, overall, trust the scientific process to produce reliable conclusions?" After all, even among scientists, this statement needs to be heavily qualified to be agreed with. Most scientists, in private, would admit that the scientific process is only right "eventually", or "most of the time" at best.

We hesitate to speak up about these nuances in public forums (or to aunt Betty on Facebook) because we know that any concession that science can produce bad results might be used as ammunition to not vaccinate children (or to justify all of the money spent on healing crystals).

Instead, when people make legitimate complaints about headline whiplash, we tend to claim that those effects are just due to reporting of "bad" science, or "preliminary" results. We attribute the public's frustration to the replication crisis, or to bad journalism. Or maybe we complain that the public simply does not know how to judge their sources of information.

But we rarely let ourselves admit that the process of science is imperfect, and that it often produces quite bad results.

We admonish these people for not trusting science, and in the next breath we complain about how so many papers published in Nature, Science, and Cell are full of just-plain-wrong results or conclusions. We complain that people don't believe us when we say that our climate models will still have the same predictions tomorrow, while being fully aware that they do not have any of the resources required (nor the time) to actually verify that this is true to any satisfying degree.

If you're a scientist, you might already be cringing, scared that this blog post will stray too close to dangerous waters, where my words might be misinterpreted by those seeking to denigrate science.

In this environment, it is easy for an outsider to see science as nothing but another religion trying to be sold to them. Science is the new scripture, and scientists the new priests. We are meant to trust what they say, and where we find inconsistency, we should understand that under further reflection, we would find that no inconsistency actually exists. Every time we blaspheme or dare challenge this new scripture, its priests disparage us and call us heathens, a blight upon society. They force reconciliation or exile. And so, people come to feel like they are not meant to understand science, but to simply believe it.

Clearly, if this is the impression that a person has received about science, then there is a fundamental problem in how science has been communicated to them. Many people have proposed that the solution is simply to more carefully and explicitly communicate uncertainty in scientific results.

But I think the problem runs deeper than uncertainty. It seems to me that, as a community, we lack a cohesive language with which to talk about different kinds of scientific results. You don't need to make a statistical statement to understand that building a laser trap to test predictions about entangling electrons is different in kind from measuring IQ and trying to make predictions about people's behavior (even though the error bars might appear to be of similar size!). Our intuition that these predictions have different levels of trustworthiness doesn't represent different levels of uncertainty; it represents an understanding that these are totally different types of activities.

And the dividing line is not between the social and "hard" sciences either! The hard sciences are chock full of purely statistical studies, such as GWAS studies that map out every gene that correlates with this or that disease. While genetics is a "hard" science, these results are not going to be as robust as a double-blind, longitudinal study on the effects of various evidence-based interventions designed to, for example, help traumatized children self-regulate (a "properly" scientific investigation).

The scientific community already internally uses various colloquialisms to classify different types of scientific inquiry by what their "model space" looks like:

"bottom-up" vs "top-down" modeling
"exploratory" vs "perturbatory" experiments
"mechanistic" vs "phenomenological" explanations
and many more...

In what follows, I hope to present an accounting of this language, along with recommendations on when each type of science is useful, and how much each type should be trusted.

I hope that with this more nuanced language, we can fearlessly admit that scientific results can be horribly, dangerously wrong, without jeopardizing our ability to speak authoritatively when it matters (climate change, vaccination, etc). Because what matters isn't building people's allegiance to some new religion of science, it's giving them the language to understand how scientific results should be interpreted, so that they can make their own judgements about what this means for how they should act.

Defining science

You'd be hard-pressed to find a working scientist that doesn't have a personal answer to this question. If they don't have it penned into words, they surely have years of accumulated intuition that they can quickly use to decide whether something is "good science", "bad science", or "not science at all" (depending on how opinionated they are, they might even be willing to eschew the middle category altogether!)

However, you'd be equally hard-pressed to find a conference or scientific forum where two scientists aren't arguing about exactly whether a particular result is or is not "good science". I have always felt that these conversations tend to largely involve people talking right past each other. Sure, there are cases when the entire room agrees that somebody has not done good science. However, it is far more often the case that the real issue seems to be that different people simply have different approaches to "science". After the references to Popper and Hume are over, and the participants have stopped comparing their personal scientific process to the development of the Standard Model, it always seems that both parties end up agreeing that everybody's approach is important (except for the social scientists, they never get to be in the "in" group).

I whole-heartedly believe that this is because these conversations largely boil down to people trying to pitch (or defend) their particular type of science as being the "one true definition" of science. If people had the language to more easily admit that the types of activities they are doing are as different as night and day (often equally valuable) without having to defend them as living up to some arbitrary standard of "true science", I think we'd all waste a lot less time at conferences.

A Model of Models

In any case, it makes the most sense to follow my diatribe against people who try to "define" what science is with a definition of science:

Science is a set of strategies we use to build predictive models about the world around us

Hopefully this is a sufficiently abstract (read: useless) definition that nobody can argue against it. It says nothing about what those strategies are, nor does it even mention experimentation! However, "predictive model" is quite a loaded word here, so it makes sense to unpack what is meant.

By "model", I tend to mean a mapping between observables and mathematical objects. However, I don't require the "model" to be written down in the conventional language of early 21st century mathematics; I merely want whatever language is used to describe the observables to be unambiguous enough to allow you to draw clear, logical conclusions about the observables. Some examples:

Kepler's "Law of Orbits": all planets move in elliptical orbits, with the sun at one focus.
- The observable (distance to the sun) is mapped onto a mathematical object (the distance between an ellipse and one of its foci).
- While not expressed in terms of algebraic equations, the language is sufficiently clear that you can draw unambiguous logical conclusions, like "the rate of change of the distance to the sun depends on both the speed around the ellipse and where on the ellipse we are".
The inverse square law of gravity: a much more classically "mathematical" statement. Two objects $m_1$ and $m_2$ with positions $r_1$ and $r_2$ and masses $M_1$ and $M_2$ will interact with a force of magnitude $GM_1M_2/|r_1 - r_2|^2$.
- The choice of observables and mathematical objects here can be done with varying levels of sophistication, but as a simple case, we can take the observables to be the positions and velocities of any set of masses (defined as objects which we observe to travel as a unit when force is exerted on them) and the mathematical object is a set of differential equations in $\mathbb{R}^3$ describing their time-evolution due to a force that acts on $m_i$ as $\sum_{j\neq i} GM_iM_j(r_j - r_i)/|r_j - r_i|^3$.
- By virtue of being a mathematical statement (a differential equation), the unambiguity of the logical conclusions of this mapping is clear.
Descriptive statements, e.g.: We have a sun.
- The observable (existence of a sun) is trivially mapped onto the value "True" in some 2-tuple ("True", "False").
- This technically allows us to draw only the "trivial" conclusion, which is the restatement of our observable. Nonetheless, this is the foundation upon which more complicated models (which include multiple observables as input and predict others as output) are built.
Diffusion of gas molecules at equilibrium: a particle of gas at a given temperature will have a mean squared displacement proportional to the time elapsed (equivalently, a root-mean-square displacement proportional to the square root of the time elapsed).
- Here, properly mapping between a mathematical object (a random variable) that captures what is meant by the plain English statement and the gas particle's position requires some sophistication, but it can be done.
- In particular, I just wanted to point out that models need not be deterministic.
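
To make the non-deterministic example a little more concrete, here is a minimal sketch of what "mapping the observable onto a random variable" can look like. It is only an illustration: the gas particle is replaced by a one-dimensional walker taking unit steps left or right (my simplifying assumption, not a real model of a gas), and the function name `mean_squared_displacement` and its parameters are made up for this example.

```python
import random

def mean_squared_displacement(n_steps, n_walkers=2000, seed=0):
    """Estimate the mean squared displacement of unbiased 1D walkers.

    Each walker is a stand-in for a diffusing particle (an assumption made
    purely for illustration). Returns a list whose entry t - 1 is the
    average of x**2 over all walkers after t steps.
    """
    rng = random.Random(seed)
    positions = [0] * n_walkers
    msd = []
    for _ in range(n_steps):
        for i in range(n_walkers):
            positions[i] += rng.choice((-1, 1))  # one random unit step
        msd.append(sum(x * x for x in positions) / n_walkers)
    return msd

msd = mean_squared_displacement(n_steps=100)
# The model's statistical prediction: the MSD grows in proportion to the
# elapsed time, so these pairs should stay roughly equal to each other.
for t in (10, 50, 100):
    print(t, round(msd[t - 1], 1))
```

No single walker's position is predicted here, yet the relationship between elapsed time and the average of the squared displacement is something we expect to hold every time the experiment is repeated with enough walkers.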

By "predictive", I mean something very subtle (because I don't think "causality" is really important to science at all, more on that below, I promise I'm not crazy). For a model to be "predictive", it merely needs to express a relationship between different observables that we expect to always be true. For example:

Kepler's "Law of Orbits": once we have observed enough points to pin down the ellipse, we can predict every value that the observable ("distance from the sun") can take.
Newtonian gravity: this is a parametric model, meaning it requires some measurement of our observable to fully specify (in this case, we need to take enough measurements of various bodies interacting gravitationally to measure $G$). After initial parameterization, the model is predictive: it can take any set of measured velocities and positions and (in theory) predict the positions and velocities of the masses for all of time.
- NOTICE: I did not say "all of time in the future". We can just as well predict what the positions should have been in the past as we can predict what they will be in the future. We may not be able to retroactively test this type of prediction (someone would have had to measure the past ahead of time, hide the values from us, then give them to us later to compare our predictions to), but it makes just as much sense to say that we've "predicted" the past as it does to say we've "predicted" the future. This is what I meant when I said that I don't think "causality" is important to science at all. Aliens in a universe where time can be traversed forwards or backwards may have an easier time than us doing science, but they would be doing science just the same.
Descriptive statements fail to be predictive. They do not link multiple observables, and so when interpreted as "models" they can only "predict" themselves.
Diffusion of gas molecules at equilibrium: this is a simple parametric model like Newton's laws above. The important thing to notice is that while the prediction is statistical, it is expected to hold exactly in probability (hence satisfying my "always true" requirement above).
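
The point about "predicting" the past can be sketched in a few lines. What follows is a toy, not a real ephemeris calculation: a single test body orbits a fixed central mass in two dimensions, units are chosen so that $G$ times the central mass equals 1, and the integrator is velocity Verlet (my choice, because it is time-reversible). The names `acceleration` and `evolve` are hypothetical helpers for this example only.

```python
GM = 1.0  # assumed units: G times the (fixed) central mass equals 1

def acceleration(x, y):
    """Inverse-square attraction toward a central mass at the origin."""
    r_cubed = (x * x + y * y) ** 1.5
    return -GM * x / r_cubed, -GM * y / r_cubed

def evolve(state, dt, n_steps):
    """Velocity-Verlet integration of (x, y, vx, vy).

    A negative dt runs the very same model backwards in time.
    """
    x, y, vx, vy = state
    ax, ay = acceleration(x, y)
    for _ in range(n_steps):
        vx += 0.5 * dt * ax
        vy += 0.5 * dt * ay
        x += dt * vx
        y += dt * vy
        ax, ay = acceleration(x, y)
        vx += 0.5 * dt * ax
        vy += 0.5 * dt * ay
    return x, y, vx, vy

now = (1.0, 0.0, 0.0, 1.1)  # one "measurement" of position and velocity

future = evolve(now, dt=1e-3, n_steps=5000)   # predict forwards
past = evolve(now, dt=-1e-3, n_steps=5000)    # "predict" backwards

# Starting from the predicted future and running backwards should land
# (up to floating-point error) on the original measurement.
recovered = evolve(future, dt=-1e-3, n_steps=5000)
print(now)
print(recovered)
```

Nothing in `evolve` knows which direction of time is "the future". The same relationship between observables is used in both directions, which is all that "predictive" is meant to require here.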

I guess it makes sense to explicitly compare my definition of "prediction" with the common usage of the word "counterfactuals" within the philosophy of science. In my opinion, this is an unnecessarily supercilious word. Basically, a model is predictive if it can take some information about how the world is and predict some other information about the world. Once one has a predictive model, predictions can be made based on actual observations, or else the inputs to the model can be drawn some other way, in which case the resulting prediction is called a counterfactual.

That's not science!

"But wait!", you might be screaming. "$X$ satisfies your definition, but I most definitely would not call it science!"

I probably agree. But as I stated from the outset, I'm not in the business of defining what science is and isn't. I'm much more interested in describing the many things that people call science, so that we can have a language for discussing their relative merits. And if $X$ satisfies my definition, I bet there's at least one person out there right now trying to pass it off as science.

Types of Science

The meat and potatoes. I'll be updating these lists as people suggest new entries to me, hopefully allowing me to apply the scientific process to my model of what science is.

Classifying science by goals

The goals of people doing science vary wildly:

Searching for trends
- All exploratory research goes here, whether experimental or computational
- Identifying a trend in observations is a necessary first step to hypothesizing the existence of a more fundamental "natural law", and so this goal is a core component of modern "science".
- "Machine learning" can largely be thought of as the partial automation of this step.
Discovering and testing "invariants"
- A trend becomes an invariant whenever it is demonstrated to be robust enough within some particular domain that there is no reasonable expectation that a new experiment or replication would find it to now be false within that domain.
- An invariant is not sometimes true, it is always true.
- For example: Newton's gravitational laws are an invariant within a particular domain (sub-relativistic interactions).
- Establishing the correct level of confidence that a trend is in fact an invariant is one of the cornerstones of modern scientific "intuition".
- Finding invariants is always the central goal of doing fundamental physics research. I would go so far as to define "physics" to be exactly this type of science (the type that searches for invariants).
- Each new "universality" result in mathematics leads to a new kind of "invariant" that we can understand, and use to link together more and different kinds of observables.
- When the community is collectively convinced that something is an invariant, we call it a "natural law".
- We will describe many ways in which we search for and test invariants below, but it is worth pointing out here that it is typically useful to not just directly probe the trend itself, but to instead probe its logical consequences (within the framework of existing, established invariants) in order to more thoroughly test its validity (e.g. searching for gravitational waves to "test" whether Einstein's relativity describes an invariant of space-time).
Classifying emergent behavior
- A predictive model (in the sense described above) is simply built by selecting a specific set of "invariants". Once those invariants are chosen as assumptions, they will often have arbitrarily complex (even incomputable) logical consequences. Understanding these consequences is one goal of "science", because without this understanding we never know if we've discovered something "new" or something we "already knew" (i.e. could have predicted with existing theory).
- Often when exploring emergent behavior, trends will be shown to come from existing (known) invariants, and instead of being promoted to invariants themselves, will be better understood as consequences of existing invariants.
- Sometimes, even things which were previously taken to be invariants (such as electricity and magnetism) will be shown to equivalently be emergent behavior from a simpler invariant (in this case local U(1) symmetry).
- Understanding the complex interplay between the many ways that known invariants and potential new (candidate) invariants interact with each other to create logical consequences (predictions), allows us to construct increasingly comprehensive (and comprehensible) models of the diverse observations that we encounter in our everyday lives.
- The coarse-graining of emergent behavior also allows us to make robust predictions about the world around us that would typically require more energy to compute than we have available, by allowing us to use heuristics that come from "first-principles" (i.e. well-established invariants).

Every scientific study I have ever participated in has been driven by the three goals above: the exploration of trends, their classification into invariants, and the logical study of how invariants interact to reveal novel emergent behavior.

Classifying science by approach

However, the methods and heuristics that I have learned to apply when reaching for each of these goals are varied and overlapping, so it's worth listing them independently (as opposed to trying to associate each type of "scientific practice" with a particular "scientific goal"):

Deriving logical implications of well-established invariants
- These can be tested in order to verify the underlying assumptions (invariants)
Attempting to prove independence of particular sets of invariants
- This is the goal of those trying to axiomatize quantum mechanics
Deriving logical implications of trends that are potentially invariants
- This is typically just called experimental design, but can be a science in and of itself if the invariant set is large/complicated enough.
- In order to be a scientist specializing in this process, you must have the mathematical sophistication and sufficient knowledge of your system to generate sufficiently non-trivial logical implications while making sure they are also testable.
Exploratory experimentation
- Carrying out new tests to look for trends.
- Clever design of exploratory experiments will ensure that the observables measured span the largest possible space, while intersecting with previously measured observables just enough to give confidence that the techniques used are working.
Perturbatory experimentation
- This is basically the experimental version of deriving the logical implications of trends that are potentially invariants (the third approach above). Often, it is impractical to predict what the consequences of a particular set of known (or proposed) invariants are, and so perturbatory experiments are used to explore the space of what would otherwise have been "counterfactuals".
- Good perturbatory experimentation is reminiscent of good unit testing. You want to hit all the edge cases, see how the different variables that affect the system work individually before studying how they interact, etc.
- Perturbatory experiments are often used to construct "phenomenological" models, where no invariants are known, in order to allow us to ask scientific questions at higher levels of abstraction.
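
Since I compared perturbatory experimentation to unit testing, here is a rough sketch of the "one variable at a time, then interactions" pattern. Everything in it is invented for illustration: `system` is a hypothetical black box standing in for an experimental readout, and the variable names (`temperature`, `salt`, `crowding`) and the formula inside are arbitrary assumptions, not a model of any real system.

```python
from itertools import combinations

def system(temperature=1.0, salt=1.0, crowding=1.0):
    """Hypothetical black box standing in for an experiment's readout."""
    return temperature + 2.0 * salt + 0.5 * salt * crowding

BASELINE = {"temperature": 1.0, "salt": 1.0, "crowding": 1.0}

def perturb(names, factor=2.0):
    """Change in the readout when the named variables are scaled by factor."""
    conditions = dict(BASELINE)
    for name in names:
        conditions[name] *= factor
    return system(**conditions) - system(**BASELINE)

# Step 1: perturb each variable on its own.
single = {name: perturb([name]) for name in BASELINE}

# Step 2: perturb pairs, and ask how much of the response is NOT explained
# by simply adding up the single-variable responses.
interaction = {
    (a, b): perturb([a, b]) - single[a] - single[b]
    for a, b in combinations(BASELINE, 2)
}

print(single)
print(interaction)
```

In this toy, only the pair that is actually coupled inside the black box shows a non-zero interaction term. That is the kind of phenomenological statement a perturbatory experiment can deliver without any invariant being known in advance.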

Model selection in practice

Okay, so those are the goals of science and its methods, but recall that my initial definition of science was

Science is a set of strategies we use to build predictive models about the world around us

So largely what I've said above reduces to the statement that "predictive models" are just "trends" that we are confident enough to label "invariants", and that science is about uncovering these invariants and their consequences.

But that still leaves the most important part: what are the "strategies", in the definition above? What are the ways in which we choose what sets of invariants we care about, exclude those that are impossible or trivial, and construct the best possible predictive model?

It's just heuristics.

Thousands of years of heuristics that humans have either built into them, or discovered (and lost, and rediscovered) about what types of evidence should be considered trustworthy.

Here are some of the most well-known ones

Trivial invariants are useless unless they parametrize more complex models
- saying "the sun exists" is not science
- measuring the force pulling two massive bodies together is useless unless it's simply as a proxy to measure a parameter in a more broadly predictive model like Newton's
(Occam's razor) A model that can predict more from fewer assumptions is typically better
- At the end of the day, humans are supposed to understand science, so simpler is better
A model that cannot be differentiated from other valid models via an actualizable observable is useless
- this is the main counter-example to Occam's razor above, and the main complaint about e.g. string theory
- another example of this heuristic at play is distinguishing between, let's say, a model of Newton's gravity and a model of Newton's gravity that contains an unmeasurable, invisible horse in low earth orbit. While Occam's razor is the more common thing to cite when discarding this kind of theory, the fact that the invisible horse is unmeasurable is actually a bigger ding than the added model complexity by far.
(Occam's shaving cream) There exist many systems which are complicated enough that a more complicated model should not be dismissed out of hand just because a simple model explains some set of observations.
- The set of all models that can explain existing observations is always important to consider. This is especially true in biological and soft-matter systems, i.e. in regimes where emergent, complex behavior dominates our observables.
(Anti-Hume) If something has been observed to be true enough times, then we should just act as if it's always true in practice, lest we be frozen in inaction.
(Hume's wisdom) Just because an invariant is well-established does not mean it should not be challenged.
- But maybe it should be challenged less often.
(The blight of the social sciences) Models that are proposed in systems where few invariants are known are less likely to stand up to the test of time.
- A large fraction of the complaints leveled against the social sciences by the "hard sciences" boil down to this idea: that there are no Maxwell's equations of human behavior.
- Said more formally, the less we know about a system, the larger the space of possible models can be (and concomitantly, the larger the space of logically-sound predictions can be). So the less we know about a system (i.e. the less we understand it mechanistically), the less we can trust that a model that predicts something correctly is representing an actual invariant of the system, as opposed to a coincidence or fluke.
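
The last heuristic can be illustrated with a deliberately silly toy. Suppose the "observations" are a handful of coin flips and each candidate "model" is nothing but another sequence of coin flips, so that no model captures any invariant at all. The function name `count_flukes` and all of its parameters are made up for this sketch, and the setup is an assumption chosen to make the point, not a description of any real field.

```python
import random

def count_flukes(n_models, n_observations=8, seed=0):
    """Count random "models" that match random observations perfectly.

    Both the observations and the models' predictions are fair coin flips,
    so every perfect match is a coincidence by construction.
    """
    rng = random.Random(seed)
    observations = [rng.random() < 0.5 for _ in range(n_observations)]
    flukes = 0
    for _ in range(n_models):
        predictions = [rng.random() < 0.5 for _ in range(n_observations)]
        if predictions == observations:
            flukes += 1
    return flukes

# A small model space versus a large one, judged on the same amount of data.
print(count_flukes(n_models=10))
print(count_flukes(n_models=100_000))
```

Each model has a 1 in 256 chance of matching all eight observations, so the number of "successful" models is expected to grow in proportion to the size of the model space. Known invariants are what shrink that space; without them, a correct prediction on its own says very little.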

So what?

So I've laid out some words that I like to use in conversation that describe the various goals of different activities which are all called "science" colloquially (and often, by the practitioners). But I have yet to make many value judgements about which parts are the most important.

All parts are important.

I hope that by merely laying out a framework for discussion, I have managed to avoid too much grandiosity. The entire scientific process is important, and scientists at all levels should be able to say "I'm an X!" proudly, where X is "data scientist" (searching for trends), "bottom-up modeling specialist" (e.g. mathematician doing physics or basic research), or "exploratory experimentalist" (e.g. throwing money and sequencing at medical research problems and hoping something sticks).

What matters, from my perspective, is that the statistical results of the data scientist not be mistaken for having the same weight as the fundamental results of e.g. an experimental biochemist.

While these are both important pursuits, they are different pursuits.

And if they are both called "science", then the public is right to not blindly trust all scientists, as different types will have different types of evidence for the statements that they make, not just different levels of confidence.

Code

Python Tooling How-To and Examples

(Multiple) Trajectory Analysis Toolkit

Nucleosome Chain Simulation and Theory

Science

Nucleosome spacing heterogeneity drives chromatin elasticity

Beyond Meier-Kaplan: a statistical correction required when analyzing "lifetimes" in a finite observation window

A molecular heat engine for segregating and partitioning bacterial chromosomes

A "constant extension"/"adder" model for bacterial cell size control