Wednesday, January 27, 2021

Discontinuities and Discounting

Summary: Even with a zero rate of pure time preference, and persistent zero real interest rates, we should use discount rates of at least 1% for any real-world project or policy. Health, lives, or utility should be discounted because of the probability that a discontinuity in resources or policy or the state of the world will cause projected future effects to not happen. With policies that provide economic benefits, we must also increase the discount rate to reflect the fact that it will probably be more expensive to purchase lives, health, or utility as overall conditions improve.

This post is meant for economists, effective altruists, and others who are doing cost-benefit calculations over a long time horizon to decide what actions we should be taking now. I am trying to be as plain-language and non-technical as possible, which will involve simplifications of well-studied things. I do assume basic knowledge of cost-benefit analysis and time value of money calculations. Greaves 2017 is an excellent survey article for some of the relevant technical aspects, and this post builds on its foundation. 

Past use of Discount Rates
In the past, cost-benefit analysis was mainly used for investment projects, and was mainly concerned with money flows. Discount rates were based on the agent's opportunity cost of money, which was usually based on a market rate. There were no hard questions or philosophical considerations beyond "Don't waste money." Market rates were never close enough to zero, and time horizons were never long enough, for the considerations in this post to be relevant.

However, in recent years, people have started to use cost-benefit analysis as a decision analysis tool for many more kinds of policies, including those that do not use money at all but involve trade-offs between lives or health or utility today and in the future. This has forced a rethink of how to apply discount rates.

Pure Time Preference
In experiments, tests, and surveys, people always prefer things now to things later even when there is no possibility of trade and investment. This has been called 'pure time preference'. There are many good arguments that society or a government should have a zero rate of pure time preference i.e. it should care just as much about a life today as a life in 100 years.

I don't believe that there's really any such thing as a pure rate of time preference. I think that people discount the future based on uncertainty that the expected benefits will ever actually arrive. Of course, usually they may do this subconsciously, out of evolved heuristics. The monkeys who assumed things would reliably be around in the future died off, and the monkeys that grabbed what they could at each moment reproduced. We observe throughout history that whenever the economic and political situation becomes more stable, people's discount rate goes down and they start caring more about the future. Attempts at measuring time preference typically fail to adequately correct for these uncertainties.

I suspect, but am too lazy to prove, that hyperbolic discounting is actually based on a subconscious assumption of some fixed probability that the person is lying to you about ever delivering future benefits, plus a variable probability over time that they will be unable to even if they intend to. Hyperbolic discounting is entirely rational in any situation other than our weird world with giant reliable financial systems and strong enforcement of contracts, and people's instincts haven't caught up. (If I am right, an experiment that deliberately made the promise more trustworthy would see a lower discount rate, and I would love to see someone try to falsify this, even if I am proven wrong.)

Real Interest Rates
Everything will be discussed in terms of real values, not nominal ones, because tracking inflation is an unnecessary complication.

It makes sense to discount the future based on how much our money would grow if we just put it in the bank. There's no sense implementing a policy that has a 2% return if it costs us 3% to borrow money. For almost all of human history, governments paid a positive real interest rate to borrow. Now they don't, and it may be decades or more before they do again. Similarly, no individual can or will earn a positive real risk-free return on their money for the foreseeable future. A full discussion of this is beyond the scope of this post, but for our purposes it lets us zero out a complication. I'll make the simplifying assumption that governments will keep borrowing more until their borrowing costs increase and they pay zero real interest rates on all their debt, so we can just assume no interest.

Increased Cost of Utility
We don't care about money; we care about what money buys us. This could be lives, or health, or utility. The exchange rate between money and these things changes over time. When society is poor, or unequal, a little money goes a long way. As things get better, it is more expensive to do good.

Discussions of discounting often involve the Ramsey Rule, which is based on increases in wealth and the decreasing marginal utility of consumption, but I think that is both too technically intimidating, and also too simplistic because it just uses averages and does not take inequality into account.  

If people are using money at all intelligently to purchase utility, they'll be using it to increase the utility of the people who are worst off. This means that if inequality goes down in the future, or if the social safety net improves, the cost of purchasing utility will go up, probably significantly more than the overall economic growth rate. 

Our discount rate for a policy that has economic effects will be, at minimum, the expected annual rate of increase in the cost of purchasing utility. Ideally this would be estimated by forming a historical data set on the cost per QALY achieved by the 10% most effective charities, or by tracking the percentage of the population in various states of extreme distress or subject to easily preventable deaths, but as a kludge we can just double the real per capita rate of economic growth.

Discontinuities
The primary message of this post is that we must discount based on the fact that everything we do can only affect the future with a probability rather than a certainty. There are many things that could cause our actions to have no effect, and each of them causes us to increase the discount rate. I call these discontinuities: anything that causes a break between what we hoped to happen and what actually happened.

Any competent cost-benefit analysis takes into account known unknowns, listing various scenarios and averaging them for an expected value. But I've never seen one that explicitly accounts for the annual tail risks of unknown unknowns, the discontinuities that could scramble plans and predictions by making big changes in the world. These things can usually be safely ignored on a short time horizon, but over long periods they matter a lot more.

I will generally be speaking in terms of an annual discount rate, but as a simple example, imagine that there are three different things that each cause a 10% cumulative chance, between now and the time period we want to affect, of the plan failing. The expected value of the benefits is roughly 70% of what they would be if the plan was guaranteed to do exactly what we projected. (Technically the expected payoff is 0.9^3 ≈ 73% chance of success, but that assumes risk neutrality. A more realistic assumption of risk aversion decreases the expected utility, so I'll cancel them out and just add up failure probabilities for simplicity.) So if the plan gains us 2 utility on a success, our expected value is 1.4 utility.

However, we still spent the resources, meaning that we lost the ability to do anything with them today. This means that, when doing a cost-benefit analysis, we should discount all future benefits by 30%. If this plan gives us at least 30% more benefits than spending money today, we should do it, and if it gives us less, we should not.

In this example, the failure probabilities were fixed, so it is no different than a normal expected-value scenario analysis. But in reality, the chances of these things increase as more time passes. Every year, there is a chance of something scrambling our plans, either directly or by scrambling the entire world.

To gain an intuition for this, imagine that you are a university professor, with a big endowment and research budget, in a random developed country in the year 1920, and that you are implementing a plan to make life better for people in the year 2020. What is the probability that your plan will survive? (Keep in mind that there's a good chance you are Jewish and live in a country in continental Europe.) Then imagine that you live in 1820, or 1720. Consider all of the economic, political, moral, and scientific shifts that have happened in those years. What is the probability that anything you do can even persist at all in a form you intended? Even then, what is the probability that your plan actually made the world better, in a way that the people of the future agree with and appreciate?

So, our annual discount rate must always include the sum total of all the annual probabilities of everything that could possibly happen to make the plan fail or become irrelevant.

As a concrete example, consider a climate policy that costs a billion dollars a year but would, if consistently implemented for a hundred years, prevent a trillion dollars worth of damages in year 100 of the policy. (This is simplistic, but many proposed policies have a similar payoff structure.) The break-even discount rate is between 3 and 4 percent: with a discount rate of 3%, it's a good idea, but with a discount rate of 4%, it's a bad idea.

When determining our discount rate to use, we must add up:
1) the probability that something else will solve climate change in the meantime, perhaps a cheap geoengineering solution. If that happens, we wasted the resources.   
2) the probability that a future administration will overturn the policy. If they do so, the future benefits will never materialize. 
3) the probability that the country ceases to exist (or becomes conquered by a different power that changes the policy). If that happens, we don't get the benefits.
4) the probability of any form of existential risk that would not be affected by the proposed policy.
5) the probability of any form of catastrophic risk that reduces the population enough to make the benefits irrelevant.

All of those estimates should be based on an outside view whenever possible. For example, for #3, look at how many nations in the last 200 years have collapsed, or undergone a revolution, or been conquered, and how often they tend to last before that happens. Given that data set, I'd guess there is a roughly 50% chance of the country ceasing to exist in its current form over the next 100 years, so the discount rate, for just point 3, should be about 0.5%. 

Estimates of x-risk add 0.15% for point 4, and probably another 0.15% for point 5. Point 2 probably adds another 1-2% to the discount rate, as does point 1 (I am a tech optimist). So the point estimate is a 3.8% discount rate.

Of course, this assumes that discontinuities result in zero benefit. If they still allow some benefit, the math gets more complicated, but we can approximate that by adjusting the discount rate downward for the policies where a discontinuity still gives some benefit. If there is a 0.5% chance of government change, but we assume a 50% chance that a revolution or invasion does not change the policy, then the adjusted discount rate is 0.25%.

A less controversial policy, with less chance of being made irrelevant by other means, might have a discontinuity-based discount rate as low as 1%, but I think this is as low as we can safely go. Chaos happens, and we are entering an uncertain age.

(Addition March 2022: The damages from carbon emissions that happen today will eventually happen regardless of intervening policy, but could still be made irrelevant by geoengineering or x-risk, so I would use a 1.5% discount rate for the Social Cost of Carbon, even with a zero rate of time preference. By a happy coincidence, this is the utility discount rate used by the standard models.)

Because people don't deal well with very small probabilities, it may be best, when doing the estimation (which will require surveys of experts) to focus on 10- or 25- or even 100-year blocks of time, and then divide.

Then, if the benefit is an economic effect rather than a pure impact on health or lives or utility, we also have to add the expected annual increase in the price of utility, conditional on no discontinuity. (Technically the total discount rate we should use is (1+u)/(1-d) - 1, where u is the growth rate in utility cost and d is the annual chance of a discontinuity. This will always be slightly higher than d + u, which will matter in the long run, but since these things are estimated with high uncertainty the difference is well within the confidence interval so we might as well keep things simple.)

To make an analogy to finance and investing, the discontinuity probability is the risk premium, and the utility price increase is the risk-free rate. We should only invest if our return is higher than both of these put together. In finance, the risk premium of the safest investment is assumed to be zero, and everything else is compared to that, but in this case, there is nothing with a zero risk premium, because we are comparing a certain result today with an uncertain result in the future. Anything we do has risk.

As a second example, consider a private action, perhaps setting up a think tank to research ways to stop a catastrophe, and that nobody else is trying to solve the problem or ever will. The discount rate is based on:
1) the probability that the organization suffers value drift.
2) the probability that the organization is taken over by a malevolent or selfish actor.
3) the probability that an outside force seizes all of its resources.
4) the probability that the government, society, and/or economy collapse for some other reason, making it impossible to continue work, or that another catastrophe makes the work pointless.
All of these risks increase with time, so the further out the benefit is, the less likely it is that the money spent will accomplish anything.

Even in the extreme case where the plan, once set in motion, cannot be disrupted, for example a rocket launch to nudge the trajectory of an asteroid that will hit the Earth in 200 years, we should discount. There are still discontinuities. Every year in the intervening time, there is a probability that people will develop a cheap asteroid deflection system, or that the asteroid will be consumed by mining, or that some other x-risk will make our effort irrelevant.

It should be clear by now that this is not an estimation problem for economic data sets and equations with greek letters. This is something we must think of by looking at historical trends in the appropriate reference class. Look at what has happened to similar plans or organizations in the past, and estimate the annual probability of a discontinuity of policies or resources between now and the time period we wish to affect. 

Scientific and Moral Uncertainty  
I'm less confident about this, and it doesn't affect the main result, but I suspect that epistemic uncertainty also increases our expected discount rate. Learning that the world doesn't work the way we thought it does is another kind of discontinuity. The annual probability that our theories are wrong, and that our policy is therefore mistaken, should be added to the discount rate.

Given how much our science and understanding of the world has changed and improved in the last hundred years, it seems reasonable to increase the discount rate by another 1% to account for epistemic humility and the potential realization that what we think we know is wrong. 

I think the annual discount makes sense, because long chains of causality introduce more chaos and more opportunities to be wrong. It's kind of like weather forecasting: we can be pretty confident in our ability to predict things a few days out, but after 10 days predictions are basically just swallowed up in chaos. 

Currently, with economic forecasting of government policies, it is rare to go out past 10 years, because of this kind of uncertainty. Assuming no large-scale change in economic conditions, we can predict the short-term effects of a tax or transfer policy fairly well. The Congressional Budget Office can know with decent precision what the effects of a tax cut will be over the next few years, and how that will affect labor supply and similar things. But there are a lot of second order effects and long-term adjustments that they don't know how to deal with, so as we project the changes of a policy more years out into the future we become less and less certain of anything. Typically they do this by increasing the error bars, and just stopping when they get too big.

But if we were doing our best to predict the effect of a large-scale change, because if we don't then people will treat all the side effects as though they did not exist, and while we are uncertain we know they are not zero, it might be more responsible to run the model the best we can, include its probabilistic output in the expected value calculation, and then add in an extra 1% or 2% discount rate on all future effects to account for the extra chaotic uncertainty.

Conclusion
None of this changes the fact that the value of preventing human extinction is huge, at least three orders of magnitude more than the entire annual world GDP. And that is just counting the VSL-based value that currently-existing humans place on preventing their own death; it does not include the value that future people place on existing.

In fact, this discounting strengthens the case for making serious sacrifices to stop x-risks. Anything else we might do to affect the far future is rendered irrelevant by the discounting, including the harmful effects of significantly slowing down economic growth. This is because there will be great unpredictability in human affairs as long as they still exist.

It can be argued that, ironically, the ethical implication of all this consequentialist math is to push people into a precautionary-principle-based deontology where we focus all of our actions on stopping whatever is likely to cause existential or catastrophic risk in the short term.  

However, if we cannot do anything about an x-risk, then we should be doing things to help people now rather than in the far future. The certainty of actually helping now is better than a tiny probability of a plan surviving and doing what we think it will, and this is reflected in the discounting.

No comments: