ARVAC Bulletin 99

Problems with evidence-based policy

Secretly recorded at a private dinner last year, the government’s Respect Tsar, Louise Casey, was overheard uttering the unministerial phrase: “If No.10 says bloody ‘evidence-based policy’ to me once more I’ll deck them.” As researchers on the track of evidence, it seems to me we should look a little bit more closely at what she said. What, after all, is wrong with evidence-based policy? Surely it is better than policy based on hunch.

And yet anyone who flies too close to the official mind will realise that there is a misunderstanding there about what constitutes evidence. Those in government who demand hard evidence, and mean figures – and have never been taught that such things just aren’t possible, because what is most important is not susceptible to such things – have created a bastard form of evidence-based policy which is no such thing. That is why people like Louise Casey, who are frustrated because nothing happens without this chimera that never arrives, see ‘evidence-based policy’ as an excuse for complete inactivity. Why does it never arrive? Well, here are some reasons that social researchers know all too well but officials often fail to grasp.

1. Counting means comparing like with like, and that is inexact.
It means defining what is being counted, so that it can be categorised, when different people, different cultures and different eras, count differently – because they define things differently.

The problem of translation is well-known. Pepsi-Cola found that their famous slogan ‘Come alive with the Pepsi generation’, translated into Chinese, meant ‘Pepsi will bring your ancestors back from the grave’. Anyone who has studied crime statistics knows that they change with people’s perception of different crimes, as well as with the definitions of them – and whether people think there is any point in reporting them. They are, as much as anything else, measurements of fear and security.

These problems are particularly important when money values get involved. The first practitioner of cost-benefit analysis, Adolphe Jullien in the 1840s, was asked to work out the precise cost per kilometre travelled of each rail passenger, to work out what the fares should be on the new railways. He came up with the figure of 0.01254 francs per kilometre. “Ah, but what about administration and interest on capital?”, he was asked. And to meet these objections, he arbitrarily doubled the figure. It was more accurate, but no longer precise. These kinds of trade-offs lie behind the hardest government statistics anywhere.

2. You cannot count what is really important
Simply because it is so hard to measure what is really important, governments and institutions pin down something else. They have to. But the consequences of pinning down the wrong thing are severe: all their resources get focussed on achieving something they did not quite intend. How do you measure the success of a military unit in the Vietnam War? Answer: body count. Result: terrible loss of life among the Vietnamese, but no American victory. How do you make sure schools are living up to parents’ expectations? Answer: test the children as much as possible. Result: exhausted kids who can see no further than exams. How do you make trains more punctual? Answer: measure how often they’re late. Result: train companies simply lengthen the official journey times.

3. Controlling people with numbers does not work
The principle that numerical measurements will always be inaccurate if they are used as a means of control is now known as Goodhart’s Law. The reason is that, however incompetent staff may be, they will always be skilful enough to make targets work for them rather than against them. Take for example, the rule that patients shouldn’t be kept on hospital trolleys for more than four hours. In practice, some hospitals have got round this by putting them in chairs. Others have bought more expensive kinds of trolleys and re-designated them as ‘mobile beds’. This is the fundamental flaw of the target culture, which social researchers find themselves drawn inadvertently into collusion with.

4. Numbers have lives of their own
The story of Soviet glass manufacturing is horribly reminiscent of 21st century Whitehall. When output was evaluated by weight, it was too thick; when it was switched to targets by area, it was found to be too thin. Most of the key figures were falsified – something you can do during a reign of terror – but they carried within them a terrible authoritarianism to try to force them to be true. Which is why strikes had to be redefined as ‘sabotage’, and why after 1939, employees had to be fired if they were once more than twenty minutes late for work.

We don’t do that kind of thing nowadays. But we do measure our progress towards so-called ‘best practice’, locking local authorities into a dull and uninspired version of what may once have been best, but is now hopelessly second rate.

5. Number-crunchers tend to find what they look for
In quantum physics, the mere presence of the observer in sub-atomic particle experiments can change the results. In anthropology, researchers have to report on their own cultural reactions as a way of offsetting the same effect. Once you start looking critically at the measurement obsession, you keep falling over a strange phenomenon, which is that the official statistics tend to get worse when society is worried about something.

UK child abuse statistics stuck at the 1,500-a-year mark until 1984, when unprecedented publicity catapulted the issue to the top of the public agenda. Between 1984 and 85, child sex abuse cases shot up by 90 per cent. And in the following year they did the same again. Whatever the reason for this – hidden abuse or moral panic – what you get is not an accurate reflection of the real situation.

Conclusions
The real problem is how to interpret numbers. The basic problem of interpretation is that, no matter how many screeds of figures are available, they will not automatically tell you what causes what. That requires common sense, good judgement and intuition – none of which are encouraged by the neo-utilitarian ‘evidence-based’ policy at the heart of government. It is accepted wisdom to say the real answer is distinguishing between measuring outputs and outcomes, but this begs the question.

What is the outcome measure for the NHS for example? The number of patients successfully treated? Or is it the health of the population? The statistics for those would be diametrically different.

Outcome measurements assume that our institutions should be permanent. They are about organisational control. They don’t let us imagine whether we might be better off with different institutions instead. Real outcome measurements – if you can find them – are usually outside the control of institutions anyway.

What do we do as social researchers in the face of this serious confusion in the official mind? We can de-standardise the counting process so that the subjects of measurement do their own measuring - the pupils, the patients or the poor. One Latin American city measures air pollution by the number of days you can see the Andes from the city centre. That is a so-called ‘hot indicator’ that can inspire and involve people, when measuring ozone parts per million has to be done silently in a laboratory by a technician. We can make more use of stories than figures because they can often communicate complex, paradoxical truths better than figures. And we can ask questions: because they can devastate political statistics.

Yes, the carbon monoxide rate’s gone down, but is the air cleaner? Yes, our university professors have produced a record number of published papers, but is their teaching any good? Evidence-based policy is as vulnerable as the Emperor’s New Clothes to the incisive, intuitive human question.

The real problem is one of centralisation. Empires require measurement systems to control their outlying departments which they dare not trust to take their own decisions. Any measure of decentralisation can roll back the contradictions at the heart of evidence-based policy. We need school league tables that much less if we know our local headmaster.

So yes, social researchers need to provide evidence, but we have to remember we are part of a wider movement that in the end depends as much on common sense and humanity than it does on technocracy. We need to tell the truth about the importance of being generally right. As James Anyon, the first accountant in the USA, urged his students:
“Use figures as little as you can. Think and act upon facts, truths and principles and regard figures only as things to express these.”

David Boyle is an associate of the New Economics Foundation and the author of The Tyranny of Numbers.

Contact: Davidboyle1958@aol.com

Contact Details | About ARVAC | Aims and Values | Community Research Project
Publications | Information Service | Membership | Seminars and Conferences


About ARVAC Contact Details Aims and Values Community Research Project Publications Information Service Membership Seminars and Conferences