The facts of life


"Facts are sacred"
Well, no - sacred things are sacred: facts are just facts.
But I think the quote is trying to say that facts are true, which is different from sacred - except in the sense that one can use 'sacred' to mean 'inviolable', or 'unquestionable' - or 'matters of faith".
A short talk by James at Woking's Cafe Scientific:
and an animated discussion that preceded it, reminded me of a statement that is commonly made but that I suspect is not so.
In a discussion, one participant may say that something they are asserting is a 'fact' - and that should end the argument, because facts are unquestionable. Except they are not. And I don't mean just in the sense that people sometimes use, where they imply that the person questioning their fact is doing so only in some deluded counter-factual sense.
In fact (sic) I would suggest that facts are inherently questionable, and mutable. In fact (sorry, I can't resist..) we can't get to the facts, we can only get to the observations that lead us to suggest what the facts might be. And worse, in science and mathematics we (or at least some of us) have learnt that facts are just not available - so even the apparent 'facts' that we infer from our observations, turn out to be only mathematical or logical models - simplifications - whose aim it to allow us to predict what the result of other observations will yield. In fact (OK, I'll stop now..) that is all that science, maths and logic do - they let us predict observations, based on models. If our predictions match observation, we continue to use the model - if not, then we discard that model and use another one. No tears, no impassioned arguments (well, OK, scientists are human, so quite a few impassioned arguments, tears and rage) - if the model doesn't work, we throw it out and adopt a new one.
So at the moment we often say we live in a 'post-fact' world - one where facts are dependent on opinion. And we are right, in the sense that if my opinion is fixed firmly enough that I will ignore the observations and stick to my own model, then I am indeed counter-factual - or at least, counter-observational. But here I am not talking about the current political 'elite', who know things that are not so just by being so clever that they don't need to doubt - I am talking about facts as ordinary people - and ordinary scientists - see them. Even our sensible, logical, clear-thinking, scientific facts are questionable, mutable, evanescent - they shift and change and evaporate in the face of new evidence - so they cannot ever have been sacred, can they?
Why do I say this?
First, let's assume that at least in some discussions, opposing parties to the discussion may both be sane and logical, clear thinking, 'scientific' people. So if one advances a 'fact' and the other questions its factuality, and agreement cannot be reached through reasoned and informed debate, then I would say the fact is perhaps not so much as fact as an assertion - perhaps even an opinion.
Second, let's take James's talk. He was talking about Information Warfare and Cyber Security, and he introduced the CIA acronym - which says that information should be Confidential, Available - and crucially should have Integrity: this last meaning, the information that was received should be the same as that which was sent. And Integrity of information, we all know in this 'post-truth' age, is incredibly hard to verify.
But we can verify the integrity of information, if we follow sound practices - mainly, keeping an audit trail. Suppose I make a measurement - say, of the distance between two magnets in a particle accelerator. As it happens, designing a computer-controlled instrument to do just this was one of my first projects as a Fellow at CERN in the 1980s - its outcome, the Distinvar, measures half a metre to an accuracy of 10 microns, as quickly as it can. So I make my measurement and it is passed on through a chain of processing and coding and transmission. The eventual result is perhaps read by some engineer whose job is to adjust the positions of the magnets. That engineer might justifiably ask if the measurement is correct - which means, really, if it matches the actual distance between the magnets. He might ask me - if I am still available - and I might say yes, I was careful, and yes, the measurement is correct. If he accepted that, then he was basing that acceptance on trust - trusting me, perhaps because he knows me, or knows my reputation, or qualification. But before he could do that, he would need to know who made the measurement, so he could know it was me, and so he could then ask me. That is one element in an audit trail.
That type of measurement, which is critical to the functioning of a very expensive system (in fact, it was the LEP, the particle accelerator that preceded the LHC that produced the Higgs Boson) - that type of measurement is in fact accompanied by a rigorous audit trail, that goes like this:
  • The Distinvar works by pulling taut a half metre length of Invar wire - a platinum-iridum alloy that has the strange property of not changing in length as it warms up. That piece of wire is a ruler - so the first thing we record is which ruler we used - and who used it, in which Distinvar instrument, when, and where.
  • The invar wire is calibrated - checked, made accurate - by comparing it to a better ruler - in fact (sorry, I really can't help it..) a bar of platinum-iridium alloy that is kept as a standard reference in controlled conditions in a tunnel at CERN for that purpose. So the next thing we record is the calibration - when it was done, what was its result, who made it, when and where, against what.
  • The CERN platinum-iridium bar is itself calibrated, by being taken to Paris each year, where it is calibrated against the bar of platinum-iridium that is (or was, before we changed to wavelengths of krypton lasers) the standard definition of the standard metre. This is the end of the audit trail - at the defined standard reference.

There will be other entries in the audit trail too - the computer that recorded the measurement, the software processes, and their version numbers, that processed it and transmitted it.
This audit trail is what makes the measurement a measurement. In conversation with a head of metrology at the National Physical Laboratory a few years ago, I was explaining a software processing chain that did not record such an audit trail, and this eminent metrologist had difficulty understanding that anyone could possibly be so daft as to not record an audit trail. (Sadly for him, software engineers don't know squat about metrology so that sort of daft mistake is common as muck.) And he said this:
"If it lacks an audit trail, then it isn't a measurement"
which is insightful - and typical of metrologists.
If you don't know who made the observation, you don't know if you can trust them. If you don't know what they did, you don't know if they did it right. If you don't know what they did it with, you don't know if that was right. This is a chain of trust - a way of establishing the integrity of the observation - the integrity of the information, the I in James's CIA acronym. To establish the integrity of the information you need its audit trail, its chain of trust.
This applies to news too, and to scams, and to hacked emails offered to newspapers. If I read in the Sunday Sport that a statue of Elvis was discovered on Mars (yes, it is true - that the Sunday Sport said it, not that there is a statue of Elvis on Mars) then I could check it by asking who published it (the Sunday Sport, now closed, was not considered by some to be a trustworthy source of news - its famous headline 'Aliens changed our son into a fish finger' saw to that) - and where they got the story from, how that person knew, and so on. If that audit trail is unavailable, or suspiciously short, than we might accord less trust to the information, and doubt its integrity.
So a chain of trust can establish the integrity of the information, insofar as it maintains the same thing at the end of the chain of communication as it did at the beginning. In science this is de rigeur: you always publish your sources, you litter your publications with references to the source that provided that information, you show your working. In fact it gets in the way of reading science, this constant referencing, even though it is essential. And it doesn't establish as much as it perhaps should: the referenced sources might not be as trustworthy as you would hope: the weakest link in the chain determines the strength of the chain. With news, when it is published on the internet, we can sometimes establish a chain of trust: you may trust the news source, or the news source may provide links to their own sources. It isn't obvious you can trust a source even if you think you can: I well remember in the Falklands War the BBC insisting that we (the British) had not lost a capital ship: only to discover after several days that the BBC - bastion of honest truth, as I then thought - had lied, parroting the Government line for propaganda purposes and to stop us (the ignorant populace) from despairing and perhaps demanding a stop to the casualties. So the chain of references - the audit trail - becomes more significant as an indicator of truth. Traditional media don't do audit trails as much as I would like: so you have to search - Googling, or verifying on a site such as Snopes that is dedicated to checking the provenance of stories. (But do I trust Snopes?)
So when the President-elect of the United States says the CIA are making stuff up, and the BBC say they probably aren't, the chain of trust can come down to opinion - to who you do trust.
But the really interesting question is, what is a fact?
In fact, we shouldn't really talk about facts, but about observations, and inferences.
An observation is something that was observed - an inference is something you… infer … to explain your observations.
Let me try to clarify.
If someone sees a saber-tooth tiger enter a cave, hears screams, and after the tiger has left, enters the cave and finds dead cavemen, then those are observations: the saber-tooth tiger was seen to enter the cave, the screams were heard, the dead cavemen were seen in the cave after the tiger left. Those observations could become facts, if I trusted their provenance enough. That chain of observations leads to an obvious conclusion: that the saber-tooth tiger killed the cavemen. That conclusion is an inference: the observations (which can be verified and audit trailed) lead to the inference of what connected, or led to, those observations. But the inference is not a fact: it is a model, an explanation, an inference. If we were doing science, we would put more cavemen in caves and send more saber-tooth tigers into those caves, and then go and see if we found dead cavemen. If we found dead cavemen in all cases, we would feel safe in inferring that saber-tooth tigers kill cavemen - and by further inference, that in the first case, the saber-tooth tiger did indeed kill the cavemen because now we have established that is what saber-tooth tigers do. But this inference is not a fact in the same way as the observations could be: because the inference might be wrong, and I would change my belief in it according to new observations. Imagine for instance that I observe more carefully - I follow the saber-tooth tigers into the caves - and I see that when I do, the cavemen are so freaked at seeing a saber-tooth tiger that they scream so much they die from hyperventilation. Now I have a new inference - that seeing a saber-tooth tiger freaks cavemen out so much that they die of hyperventilation. I am just as happy with my new inference as I was with my old one. Just as happy in fact as physicists were with Einstein's new-fangled Relativity, which replaced Newton's old Mechanics.
So I would say that we need to distinguish between observations - which can at least in principle be checked for integrity - and inferences, which are contingent upon observations. An observation is a measurement, an inference is an explanation for - or strictly speaking a predictor of - measurements.
This of course leads us into the science that most prizes observation - Quantum Mechanics.
Everybody knows about quantum mechanics, and most people know that quantum mechanics is fact - it must be, since no-one understands it, so we have to believe it in and trust it as fact.
Quantum Mechanics is the most proven of sciences - in the sense of the most tested. It predicts observations to an amazing degree. For example the finding of the Higgs Boson at CERN has a statistical significance of one in 3.5 million: whereas in fields such as cancer research a significance of 1 in 20 is considered good. Basically, that means that no measurement ever (no trusted measurement, anyway..) has ever, ever, disagreed with the predictions of Quantum Mechanics.
So one might think that Quantum Mechanics, of all sciences, deals in facts.
Well, yes and no. There is a nice equation in quantum mechanics called the Schroedinger Equation. Yes, Schroedinger of the Cat. It used to be called Schroedinger's Wave Equation, but then people started thinking everything was waves and so people left out the wave bit - which is significant. Schroedinger's Equation describes the probability of an observation yielding a particular result. That is, it predicts observations. Actually, that is all it does. Schroedninger's Equation doesn't in itself say anything - it does not infer any fact, any basis for what it predicts - it just predicts.
Which is interesting - because there is a basis, Jim, but not as we know it.
When Quantum Mechanics was first proposed, furious arguments broke out. Einstein - a trusted source if ever there was one - ridiculed it, even though he played his part in inventing it, by saying he 'refused to believe that God plays dice'. That statement in itself is intriguing - containing the word 'belief' - opinion, you might say. Those arguments turned on the oddity of Quantum Mechanics - an uncertainty intrinsic to it, encapsulated in Heisenberg's Principle of Uncertainty. The Principle of Uncertainty basically says that you can't ever know everything about a system - because to know about it you have to measure it, and to measure it you have to perturb  it - to change it. That uncertainty is intrinsic - inevitable - it arises from fundamental properties of the formulae of Quantum Mechanics. And the argument was, whether the fact that you could not know everything, in fact meant that there was nothing to know: if what you could not measure in fact existed at all, or - being unknowable - could not be considered to be real in any way. Einstein argued no: that there must be an underlying reality - unknowable, yes, but existing and following its own rules, if only we could know them. Niels Bohr - the Father of Quantum Mechanics - on the other hand argued that those things that could not be observed, that could not be measured, could not be considered as being real: could not be considered as facts. Einstein and Bohr, and many others - including a grand total of 19 scientists who either then or later won Nobel Prizes - argued this out at Copenhagen, and arrived at the Copenhagen Interpretation: agreeing ultimately with Niels Bohr, that what you can't observe, cannot be considered real.
It is this aspect of Quantum Mechanics that non Quantum Mechanicians find hardest: just because you can't see something doesn't mean it is isn't there, does it? Well, no and yes. If a tree falls in the forest, and no-one sees it fall, did the tree fall? Answer: don't know. If you looked later and saw the tree fallen you could say it fell: or maybe it was pushed? You could draw up a rule of observation: that if you see a tree standing in the forest, and later you see the same tree fallen, then the tree fell. But if someone showed you that what in fact happens is that trees sort of flip between standing and fallen, without actually falling, then you would have to give up your 'tree falling' inference and accept another, weirder, explanation.
But even this isn't my point. (I will get to my point, in the end, with luck..).
Schroedinger's Equation is an inverse equation. It is an equation that encapsulates the probability that an observation will yield a particular result, but the equation does not directly yield the result: you have to invert it - to 'solve' it. And like many such inverse equations, Schroedinger's lacks a solution.
Think of an inverse equation as something like:
distance = speed * time
(which is an equation of motion).
You can invert this to find the time as a function of speed and distance:
time = distance / speed
But you can only do this because we already know the solution: someone, in ancient times, discovered the solution to this inverse equation and it was passed down to us through generations, so we know it and can use it. But most equations aren't like that: they don't have simple known inverse solutions: we can't solve them. Schroedinger's Equation is like that: it hasn't been solved.
Well, that is not quite true. Schroedinger's Equation has been solved for a very few, very simple, cases: like the very simple system of a hydrogen atom, or a hydrogen nucleus.
And if we solved an equation for some simple cases, then we can use a trick: known as a Transformation. A transform transforms a problem into a different problem. Specifically, if we know a number of simple solutions to an inverse equation, we can transform a more complex, unknown problem into a sum of those simpler problems: and so its solution becomes a sum of the solutions to those simpler solutions. This is the magic of a transform: it is the basis (a pun, as we will soon see..) for much of western science. It is sometimes called 'the whole is the sum of its parts' - and the matching phrase 'the whole is more than the sum of its parts' shows it weakness, too. It is an approximation: but often a good one. What we do is, to break down a very complex problem into a sum of lots of simple ones: then we solve each of those simple problems, add those solutions together, and hope the result is the solution to the complex problem (it isn't, but it is often so close as to be satisfying).
The simple problems are called the 'basis' functions. We break down the complex problem into a sum - a 'linear superposition' - of basis functions, solve those, and say the sum of the solutions is the solution to the complex problem.
Neat, huh?
Except that the our desire for facts leads us to think the basis functions are real.
Take what are called 'atomic orbitals'. These are the 'clouds' within which electrons orbit the nuclei of atoms. They are lovely - beautiful spheres, figure-eights - and they define the shape of molecules (which are assemblages of atoms, linked by their atomic orbitals). Each orbital contains a pair of electrons (electrons always go round in pairs, but that is another story…). Except that they don't. Electrons don't follow the atomic orbitals at all: the orbitals are just basis functions - simplifications, mathematical abstractions to help us to solve a more complex problem. The electrons don't follow orbits - but even more, they don't even follow the orbitals as probability distributions. The orbitals are basis functions - actually, the solutions to Schroedinger's Equation for the simplest of systems, a hydrogen atom. In fact the electron's probability distribution for any mildly complex atomic system is incredibly complex - a weird Spirograph cloud - but it can be approximated reasonably (we think) as the sum of the solutions to the simple hydrogen atom case.
This wouldn't matter - in a way it is a philosophical question, whether the electron is in the orbitals or not, because its behaviour can be predicted using them. But text books are full of such cute pictures of these orbitals. You can buy plastic models of them, with which you can build models of atoms and molecules. People talk of them as if they are real. But the Copenhagen Interpretation specifically, clearly, says they are not: the electron state can be modelled by a linear superposition of basis functions, but that does not mean the basis functions are real.
How easy, how seductive, how compelling, is the slippery slide from inference to fact: from model to reality.
One last example to illustrate my point.
We all learn at school how naïve were the ancient astronomers, who thought the sun and all the planets went round the Earth. We learnt to smile at poor old Ptolemy, who modelled the observed planetary motions with the crystal spheres - and then, because the planets move in strange ways, had to add spheres within spheres, crystal spheres like cogs meshing, circles on circles, in ever-increasing complexity, to model the motions. They thought of the circles within circles as showing the harmony of the universe - they talked of the harmony of the crystal spheres. How simplistic, to model planetary motions with perfect circles.
It is one of the best known facts we learn: that the Earth goes round the Sun, not the Sun round the Earth. In fact, knowing this fact is almost a badge of being Scientific: of living in the age of the triumph of fact over belief. If anyone suggested otherwise we would think them ignorant or possibly deluded. The Earth going round the Sun is a central fact, it anchors our understanding of our solar system, we all know it to be a fact.
There is a funny story, about a man whose son asks him why the ancients thought the Sun went round the Earth. The father says: "because it looked like it did" - and the son, after a pause, asks innocently: "what would it have looked like if the Earth had gone round the Sun". To which of course the answer is: "the same". Silly Ancients, thinking otherwise...
But the same applies the other way. It did indeed look like the Sun went round the Earth - it still does.
The difference is, you need less circles within circles if you make the Sun the centre of the circles. The model isn't better, except in this sense that it is simpler - less circles. If you let the circles be ellipses it gets even simpler - less ellipses than circles - so we now choose to model with ellipsoidal functions (which are, by the way, a damn sight more awkward mathematically than circles)
There is a transformation called the Fourier Transform, that is very popular. It models all sorts of problems as sums of circular functions - harmonics. People use the Fourier Transform to model the incredible complexity of speech, of music, of images and moving pictures and of radar and sonar and medical imaging. The Fourier Transform is one of the most widely used transforms, used in almost every consumer device in all sorts of ways. All of these uses of the Fourier Transform user harmonic functions - circular functions - as basis functions to model the complex behaviours. In speech and music we even talk of the 'harmonics' - the circles within circles that make up the richness and harmoniousness of those sounds.
You can predict the observed motions of the planets by a Fourier Transform - the circles within circles are the harmonic functions of the transform. It works perfectly well: easily as well as the ellipse model - you can't tell the difference. So then, are the Ptolemaic spheres like the atomic orbitals - real things? Or are they like the atomic orbitals - abstractions whose only purpose is to make a simple mathematical model that predicts observations?
I think, and here I am following the Copenhagen Interpretation, that the problem lies in saying what is a fact. An observation - a measurement - can be a fact. It can be verified, checked, audited. Observations of planetary motions are facts in this sense. An inference Is not a fact. Ellipsoidal, or circular, orbits of planets are inferences, so in the same sense they are not facts.
So we do need to be sure of our facts: but we also need to be sure what a fact is.

Comments

Popular posts from this blog

Wave Watching

Bread and cocktails

Homeschool maths