# Approaches to Moral Weights: How GiveWell Compares to Other Actors

Published: November 2017

In a previous blog post, we described how we use cost-effectiveness analyses when deciding which charities to recommend to donors.

This report discusses how GiveWell and other actors, such as governments and global health organizations, approach one of the most subjective and uncertain inputs into cost-effectiveness analyses: how to morally value different good outcomes.

For example, GiveDirectly, one of GiveWell's top charities, increases recipients' consumption, while the primary benefit we see from our top charity the Against Malaria Foundation is that it averts the deaths of young children. How can one make a direct comparison between the amount of "good" achieved by each of these charities?

GiveWell does this by assigning quantitative "moral weights" to different outcomes in our cost-effectiveness analyses. As a check on how sensitive our recommendations are to our moral assumptions, we investigated how others typically answer these questions in their cost-effectiveness analyses. This report discusses our findings from this investigation.

### Summary

We focus on the following questions:

• Why does GiveWell explicitly include moral weights in our cost-effectiveness analyses, and how do we decide on moral weights?
• Is there a "standard" approach to moral weights in cost-effectiveness analyses? How do other actors, such as governments and the World Health Organization, make these judgments?
• How much would GiveWell's cost-effectiveness analyses change if we took a "standard" approach to moral weights?

In brief:

• We include moral weights in our cost-effectiveness analyses because they are an important part of any giving decision and we think it is valuable to be transparent about them. The moral weights that drive our cost-effectiveness estimates are based on our staff's personal values.1
• Governments and other prominent actors often use "value of a statistical life" estimates to compare the value of improving health relative to raising incomes. These estimates often imply that a year of healthy life is roughly 2-3x as valuable as a year of doubling someone's income. However, there is little relevant research to inform such estimates in low- and middle-income country (LMIC) contexts; we understand that how income is valued relative to health may shift when a population is much poorer.
• There does not seem to be a standard approach for comparing the value of life at different ages; the most commonly used framework that we have seen (the disability-adjusted life year framework) explicitly does not provide judgments on this topic. Nevertheless, most other analyses that we have seen assume that averting death during childhood is about 1-2x more valuable than averting death during adulthood.
• Our initial analysis suggests that using relatively "standard" moral weight assumptions (i.e., the assumptions in the previous two bullet points) instead of our staff's moral weights would not change our overall view of the relative cost-effectiveness of our current top charities. It may affect how we view some interventions in the future, particularly those that disproportionately focus on averting deaths for young children or adults. We plan to include explicit comparisons between staff moral weights and relatively "standard" moral weights in our analyses going forward.

### Why does GiveWell explicitly include moral weights in its cost-effectiveness analyses, and how does GiveWell decide on moral weights?

#### Why we include moral weights

GiveWell aims to find and recommend charities that are evidence-backed and cost-effective and to make a bottom-line recommendation for donors about where to give. In order to do this, we spend a lot of time thinking about the relative cost-effectiveness of different opportunities.

In order to reach an overall comparison of the cost-effectiveness of different charities, we first use empirical analysis to estimate the cost per outcome achieved by different programs. For example, we currently estimate that it costs about $7,000 to avert the death of one under-5-year-old by distributing malaria nets via the Against Malaria Foundation and that it costs about $1,200 to double a household's consumption for a year via GiveDirectly.2 These estimates are subject to substantial uncertainty and should not be taken literally, but they largely involve judgment calls about evidence (for example, how similar you think the organization's current work is to the randomized controlled trials that were done of its program) and not ethics.

However, when it comes to making a decision about where to donate or how cost-effective one charity is relative to another, one also needs to consider moral questions such as:

• Valuing health vs. income: How much do I value averting the death of a 2-year-old relative to doubling the income of an extremely poor household?
• "Age-weighting": How much do I value averting the death of a 2-year-old relative to averting the death of a 30-year-old?

Setting moral weights (incorporating the value judgments in the bullet points above, or similar, by giving them numeric value in our cost-effectiveness analyses) is uncomfortable and challenging. In response to a question about whether giving insecticide-treated nets (bed nets) to prevent malaria or giving cash transfers would be a better use of marginal funds, the development economist Esther Duflo responded:3

> I think it’s the type of question that will be frankly difficult to address because you’re required to value benefits across sectors. So it depends what your objective is. If your objective is to control malaria, you can do an experiment where you give money and you give people bed nets or you do something else and you can see how many people sleep under a bed net at the end of the day, and what’s the number of malaria infections you’ve averted. So that’s a question that’s well defined. Whether it’s better to give cash or give bed nets would require you to make a judgement about what is the importance of making people healthy versus having them buy a roof. And that’s one I’m not prepared to make.

However, answering these questions is unavoidable. Anyone deciding to donate to one charity over another is implicitly using moral weights, even if that person is not explicitly engaging with them.

GiveWell openly engages with these questions for a variety of reasons, including:

• We want to be transparent about the moral values underlying our recommendations,
• We want to give donors who rely on our research the opportunity to change the moral weights in our analyses so that they can choose a charity based on their own values, and
• We hope that by being explicit about these tradeoffs, we will be more likely to reflect carefully on them and find the best giving opportunities according to our values.

#### How we include moral weights

We would ideally incorporate the views of additional people who don't work on GiveWell directly, including our recommended charities' beneficiaries, in the moral weights we use. Unfortunately, little information about others' moral weights exists. We are working to partially address this by funding new research on beneficiaries' preferences through our Incubation Grants program.

In the meantime, we set moral weights by asking our staff for their values; when setting their weights, staff consider a variety of factors including the approaches of other organizations (described below). You can see the moral weights that our staff assign to different outcomes here. Anyone can make a copy of our cost-effectiveness analysis from this page and input their own moral weights to determine which charity is most cost-effective, given their values.
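
The mechanics of swapping in one's own moral weights can be sketched in a few lines of Python. This is a minimal illustration, not GiveWell's actual model: the two cost-per-outcome figures are the rough estimates quoted earlier in this report, while the `Program` class, the function names, and above all the moral weights are hypothetical and chosen only to show how the comparison responds to different values. The real cost-effectiveness analysis includes many more outcomes and adjustments, so the ratios printed here should not be read as GiveWell estimates.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Program:
    name: str
    outcome: str                 # key into the moral-weights mapping
    cost_per_outcome_usd: float  # rough cost to achieve one unit of the outcome


def value_per_dollar(program: Program, moral_weights: Mapping[str, float]) -> float:
    """Units of moral value bought per dollar under the given weights."""
    return moral_weights[program.outcome] / program.cost_per_outcome_usd


def relative_cost_effectiveness(
    program: Program, baseline: Program, moral_weights: Mapping[str, float]
) -> float:
    """How many times as much value per dollar the program buys vs. the baseline."""
    return value_per_dollar(program, moral_weights) / value_per_dollar(
        baseline, moral_weights
    )


def break_even_weight_ratio(program: Program, baseline: Program) -> float:
    """Weight ratio (program outcome : baseline outcome) at which the two tie."""
    return program.cost_per_outcome_usd / baseline.cost_per_outcome_usd


# Cost figures quoted earlier in this report (rough, not to be taken literally).
amf = Program("Against Malaria Foundation", "under5_death_averted", 7000.0)
cash = Program("GiveDirectly", "consumption_doubled_one_year", 1200.0)

# HYPOTHETICAL moral weights: value of averting one under-5 death, measured in
# units of "doubling one household's consumption for a year".
for death_weight in (10.0, 50.0):
    weights = {"under5_death_averted": death_weight,
               "consumption_doubled_one_year": 1.0}
    ratio = relative_cost_effectiveness(amf, cash, weights)
    print(f"weight {death_weight:>4}: nets are {ratio:.1f}x cash")

print(f"break-even weight: {break_even_weight_ratio(amf, cash):.2f}")
```

Under these simplified inputs, the comparison flips at the break-even weight: a donor who values averting a young child's death at less than the cost ratio of the two programs would favor cash, and a donor who values it more would favor nets.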

For previous discussion of some philosophical considerations relevant to setting moral weights, see our December 2016 blog post on this topic.

### Is there a "standard" approach to moral weights in cost-effectiveness analysis?

We found that:

• Using estimates of the "value of a statistical life" seems to be a fairly standard approach by governments and other major actors to compare the value of income relative to health. These estimates typically find that one year of healthy life is worth about two to three times a country's gross domestic product per capita, though there is high variability in estimates and major methodological limitations of the research on which they are based. In addition, little of this research has been conducted in LMIC contexts, where GiveWell's top charities work, and so may have limited applicability to outcomes there.
• There is less relevant literature and discussion about the value of averting deaths at different ages than there is about estimating the value of a statistical life. The most common approach that we have seen for valuing averting death at different ages is the disability-adjusted life year (DALY) framework. This framework is not intended to fully account for moral considerations. Our impression is that there is not a "standard" way to assign moral weights related to age. However, most other analyses that we have seen typically assume that the lives of children are about one to two times as valuable as the lives of adults.

#### Research process

In order to assess the sensitivity of our recommendations to our staff's moral weights, we looked at how others approach these questions. To limit this investigation, we began by focusing on the assumptions other actors make about the two questions mentioned above: 1) how to value income relative to health, and 2) how to value averting the deaths of young children relative to those of adults. We focused particularly on the approaches of governments and international institutions, such as the World Health Organization, because these actors play major roles in allocating resources in a variety of contexts.

What follows reflects our impressions based on a literature review and conversations with researchers who have worked on these topics.4 In general, the researchers we have spoken with seem to agree that there is too little research on this topic given its importance.

#### Valuing income versus health

Our understanding is that the most common way to estimate the value of income versus health is to estimate how much people are willing to pay to avert death or to add healthy years to their lives. Such estimates are often presented as the "value of a statistical life" (VSL) or the cost per disability-adjusted life-year (DALY) averted.

Governments and other major international actors such as the World Health Organization sometimes use such estimates to determine which programs to support. For example, if a government is considering an environmental regulation that would decrease economic output but also save many lives, it might estimate the value of a life to determine whether the benefits of the regulation outweigh the costs. Or say, for example, that an international organization distributes medicine and cash. If we assume that someone would be willing to pay $3,000 for an additional year of healthy life and that it would cost $1,000 to purchase a medicine that would provide them with an additional year of healthy life, then it is 3x more cost-effective to provide that person with the medicine than to give them $3,000, or the amount of cash that would achieve an equivalently good outcome.5

##### Revealed and stated preference research

Governments and other actors generally use at least two major methodologies to arrive at these estimates: "revealed preference" research and "stated preference" research. Revealed preference methods look at people's choices in real-world environments to assess how much they must be paid to take a particular risk of death. For example, they may estimate how much additional money someone needs to be paid to take a job that carries a 1% higher mortality risk than similar jobs they could attain. Stated preference methods directly ask people questions about these tradeoffs, such as how much they would be willing to pay to reduce their risk of death by 1 in 10,000.6 The U.S. government more commonly uses revealed preference analyses to estimate VSL; other Organisation for Economic Co-operation and Development (OECD) countries more commonly use stated preference analyses.
We do not know why these groups prefer different methodologies.7

Research based on these methods often concludes that a year of healthy life is roughly as valuable as 2-3x gross domestic product (GDP) per capita. For example:

• WHO’s CHOosing Interventions that are Cost-Effective (CHOICE) team, which assists country policymakers with decisionmaking, distinguishes between the following tiers of cost-effectiveness:8
  • "Very cost-effective": Cost per DALY averted is less than GDP per capita
  • "Cost-effective": Cost per DALY averted is between 1-3x GDP per capita
  • "Not cost-effective": Cost per DALY averted is greater than 3x GDP per capita
• The Lancet Commission on Investing in Health's "Global Health 2035" project estimated "that the value of a life year (VLY) averages 2.3 times GDP per capita for low and middle–income countries (LMICs)" based on U.S. VSL estimates that were adjusted for lower-income contexts using a variety of assumptions.9
• High-income country governments often use VSLs that range from about $3 million to $7 million, which can be converted to a value per DALY of about $60,000 to $230,000, or about 1-6x GDP per capita.10

We have not yet vetted the research that leads to these estimates and we see a variety of major limitations of both revealed preference and stated preference research.

##### Limitations of these analyses

A few general issues that limit the usefulness of both types of research are:

• Difficulty of comprehending small probabilities: It may be challenging for people to understand probabilities well enough to indicate how much they value small chances of averting death.
• Lack of information: People may not have or consider basic information relevant to thinking about the value of mortality risks.
• Preferences may not maximize well-being: Even if people perfectly understood the probability and information components of trading off income and mortality risk, they might not be able to reliably anticipate what would maximize their well-being, all things considered. This may apply to people in general, including both the populations surveyed in the literature mentioned above and the beneficiaries of programs recommended by GiveWell. For example, people may not realize how important health is to their happiness and their long-term goals, and so may undervalue it. Donors who use GiveWell's recommendations may want to consider the potential disconnect between preferences and well-being when making giving decisions.

A further challenge is that there is little revealed preference or stated preference research conducted in LMICs; most VSL and similar analyses estimate how much people value life in LMICs by extrapolating from high-income country research.11 A key issue with extrapolation is that one needs to make an assumption about how much the relative value of income versus health changes when a population is much poorer (often referred to as "the elasticity of demand for health"). Perhaps someone who barely has enough money to survive would greatly prefer any increase in income to an additional year of life.
Different assumptions about how to extrapolate can lead to estimates of the value of a DALY that vary by at least an order of magnitude.12

Though the literature on VSL in LMIC contexts is limited, we are aware of a few potentially relevant empirical papers on the topic, which are briefly summarized in León and Miguel 2016, itself an estimate of VSL in an LMIC context (see following footnote).13 These papers generally appear to find substantially lower values of health relative to income than are estimated in high-income countries.14 We have not yet carefully vetted these papers and expect to review them more closely in the future, but our impression is that estimates of the value of life from these papers have not yet been used by major decisionmakers and are based on different methodologies than typical VSL estimates, so they should not yet be interpreted as "standard" assumptions.15

Because of limitations in the existing literature, we do not see current "best guess" estimates of the relative value of income versus health in LMICs as robust.

#### Valuing deaths of young children versus adults

##### DALY framework

The method that we have seen used most often for valuing deaths averted at different ages, the DALY framework, is described and updated as part of the series of Global Burden of Disease (GBD) reports. GBD is a global observational epidemiological study that reports on mortality and morbidity from a variety of causes. The DALY framework is used by the World Health Organization (WHO)16 and the Disease Control Priorities Project (a major research initiative to help prioritize spending within global health),17 and our impression is that it is widely used in other research about the cost-effectiveness of global health and development programs.

Broadly, the DALY framework values a death averted by adding up the total years of life saved by an intervention.
For example, the death of a male infant (life expectancy 80 years) would be counted as 80 years of life lost, while the death of a 45-year-old female (life expectancy 83 years) would be counted as 38 years of life lost. Without further adjustments, this implies that the death of a single infant is considered about as bad as the death of two adults. (We provide more background on the DALY framework in two 2008 blog posts: here and here.)

The DALY framework also provides optional adjustments to 1) factor in the idea that life years might be more valuable when one is in the "prime" of one's life (called "age-weighting"), and 2) assign less value to years of life saved that occur farther in the future ("time discounting").18 The GBD has varied its assumptions about age-weighting and time discounting over its history.19 For example, the GBD's 2004 report included age-weighting and time discounting in its main analyses, but in 2010 both the GBD and WHO decided not to factor in either adjustment in their core analyses.20

The GBD and WHO's justification for removing age-weighting and time discounting was that they wanted to leave these kinds of moral judgments to policymakers. Therefore, we do not interpret their decision as reflecting researchers' or decisionmakers' moral views.21 However, in practice we have not seen policymakers explicitly adjust the GBD's output based on their views about age weights and discount rates. This leaves us in a position where we are not able to look to one of the most commonly used tools in cost-effectiveness analysis (the DALY framework) for guidance on how to approach age-weighting and time discounting as part of our decisionmaking process.
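
The years-of-life-lost arithmetic in this section's infant and 45-year-old examples can be written out as a short sketch. The undiscounted calculation uses only the figures given above. The `discounted_years` helper is a generic year-by-year discount added purely for illustration; it is not the GBD's published formula, the 3% rate is an assumed input, and age-weighting is left out entirely. Function names are hypothetical.

```python
def years_of_life_lost(age_at_death: int, life_expectancy: int) -> int:
    """Undiscounted years of life lost: remaining years up to life expectancy."""
    if age_at_death < 0 or life_expectancy < age_at_death:
        raise ValueError("need 0 <= age_at_death <= life_expectancy")
    return life_expectancy - age_at_death


def discounted_years(years: int, annual_rate: float) -> float:
    """Sum of yearly discount factors: year t counts as 1 / (1 + rate) ** t.

    Generic illustration only (NOT the GBD's exact discounting formula).
    """
    return sum(1.0 / (1.0 + annual_rate) ** t for t in range(years))


# Figures from the examples in this section.
infant_yll = years_of_life_lost(age_at_death=0, life_expectancy=80)   # 80
adult_yll = years_of_life_lost(age_at_death=45, life_expectancy=83)   # 38

undiscounted_ratio = infant_yll / adult_yll

# ASSUMED discount rate, for illustration only.
RATE = 0.03
discounted_ratio = discounted_years(infant_yll, RATE) / discounted_years(adult_yll, RATE)

print(f"undiscounted infant:adult ratio = {undiscounted_ratio:.2f}")
print(f"discounted infant:adult ratio   = {discounted_ratio:.2f}")
```

Because later years count for less under discounting, the infant's additional years of life add relatively little, and the infant-to-adult ratio shrinks. This is one reason the choice of whether to discount is a substantive moral input and not a technical detail.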

##### Other estimates

In our brief literature search, we also came across a few other estimates of the relative value of averting deaths of young children versus adults, including:

• An OECD literature review of VSL research that recommended that, based on information about how much parents value averting mortality risk for their children, the VSL for a child should be ~1.5-2x higher than the mean adult VSL.22 We have not vetted the underlying methodology for this estimate and do not know whether any decisionmakers actually followed the OECD report's recommendation. The OECD literature review also noted that policymakers in the U.S. have been reluctant to make age adjustments to VSL figures because there was controversy when it was reported that the Environmental Protection Agency (EPA) assigned lower VSL estimates for elderly people as part of its analyses.23
• A survey by Julian Jamison conducted via Amazon Mechanical Turk, a website on which workers can be paid to complete "human intelligence tasks." Jamison published a paper reporting that respondents valued adult women roughly 1.5-2x as much as fetuses and infants up to 1 week old, but that they valued 1-year-old children roughly equally to adult women.24 Jamison notes that roughly 40% of respondents failed to input a response and that this study was done on a relatively narrow population, among other limitations.25

Unfortunately, we did not see other highly relevant research on this topic in our preliminary search, though it is possible that a more comprehensive search would identify additional research.

##### Philosophers' views on moral weights

One might expect that questions such as these would be frequently researched and discussed by philosophers.
However, even though philosophers have long considered the question of what makes life valuable in general, our understanding from speaking with experts and searching for literature is that philosophers have not done much work to consider how best to assign quantitative value to different kinds of outcomes.26 For further discussion of philosophical work that is potentially relevant to assigning moral weights, see our previous blog post on the Against Malaria Foundation and population ethics.

### How much would GiveWell's cost-effectiveness analyses change if we took a "standard" approach to moral weights?

Changing our cost-effectiveness analyses to use "standard" moral weights rather than staff values would not substantially change the estimated relative cost-effectiveness of our current top charities (which mainly focus on averting children's deaths and increasing adult income and consumption), though it could make a large difference to our estimates of the impact of programs we may work on in the future, such as antiretroviral therapy (which is mainly focused on extending adult lives).

A table comparing the median cost-effectiveness estimates for top charities and antiretroviral therapy charities under varying assumptions is below:

Median cost-effectiveness, relative to unconditional cash transfers (GiveDirectly)27

| Moral weights | Deworm the World Initiative | Schistosomiasis Control Initiative | Against Malaria Foundation | Malaria Consortium | Antiretroviral therapy |
| --- | --- | --- | --- | --- | --- |
| "Standard" moral weights | ~10x | ~8x | ~4x | ~5x | ~0.5x |
| Actual staff moral weights | ~10x | ~8x | ~3x | ~3x | ~2x |

The above "standard" results are based on a version of our cost-effectiveness analysis that replaces staff moral weights with relatively "standard" moral weights based on the conclusions of the research discussed above. The "standard" moral weight assumptions that we made were:28

• Averting a DALY is roughly as valuable as providing a cash transfer of ~2.5x GDP per capita.
• Averting the death of a young child is equivalent to averting about 37 DALYs. This is consistent with the 2004 GBD methodology, which factored in discounting and age-weighting.29 We are uncertain whether this estimate was intended to fully reflect the GBD researchers' moral views at the time, but it seems to us more likely to be a reflection of their moral views than the 2010 GBD figures, which exclude age-weighting and time discounting adjustments.
• Averting the death of an adult is equivalent to averting about 30 DALYs. This also relies on the 2004 GBD methodology.

Staff-specific parameters for judgment calls about evidence (e.g., how much to discount the expected effect of deworming due to concerns about evidence quality) are unchanged in this model.

It seems that using "standard" assumptions could make a large difference to our cost-effectiveness estimates for programs that are primarily focused on reducing mortality for very young children or mortality for adults, as illustrated by the relatively large cost-effectiveness changes under different assumptions for Malaria Consortium's seasonal malaria chemoprevention program (mainly focused on averting infant deaths) and antiretroviral therapy (mainly focused on extending adult lives).30 GiveWell staff often value averting adult deaths about 1-5x more than infant deaths relative to "standard" approaches, so changes to these weights become particularly relevant for interventions where mortality reduction of either group is the primary outcome.31 The cost-effectiveness of deworming charities and the Against Malaria Foundation did not change considerably under "standard" assumptions; further explanation for this is in the following footnote.32

In future cost-effectiveness analyses, we plan to include an entry for "standard" assumptions as one of the published inputs so that we and others who use our research can track how much our moral weights differ from our best guess of the kinds of inputs that would be used by other mainstream decisionmakers.
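
As a rough illustration of how the three "standard" assumptions listed in this section fit together, the sketch below converts a death averted into a cash-transfer equivalent. The three constants come from the assumptions above; the function name and the GDP per capita figure are hypothetical placeholders, and the calculation is a simplification for intuition, not the cost-effectiveness model itself.

```python
# "Standard" assumptions listed in this section.
DALY_VALUE_IN_GDP_PER_CAPITA = 2.5    # one DALY ~ cash transfer of 2.5x GDP per capita
DALYS_PER_CHILD_DEATH_AVERTED = 37.0  # 2004 GBD methodology (discounted, age-weighted)
DALYS_PER_ADULT_DEATH_AVERTED = 30.0  # 2004 GBD methodology


def cash_equivalent_usd(dalys_averted: float, gdp_per_capita_usd: float) -> float:
    """Cash transfer judged equally valuable to averting the given DALYs."""
    return dalys_averted * DALY_VALUE_IN_GDP_PER_CAPITA * gdp_per_capita_usd


# HYPOTHETICAL GDP per capita, chosen only to make the arithmetic concrete.
EXAMPLE_GDP_PER_CAPITA_USD = 1000.0

child = cash_equivalent_usd(DALYS_PER_CHILD_DEATH_AVERTED, EXAMPLE_GDP_PER_CAPITA_USD)
adult = cash_equivalent_usd(DALYS_PER_ADULT_DEATH_AVERTED, EXAMPLE_GDP_PER_CAPITA_USD)

print(f"child death averted ~ ${child:,.0f} in cash transfers")
print(f"adult death averted ~ ${adult:,.0f} in cash transfers")
print(f"child:adult value ratio = {child / adult:.2f}")
```

The child-to-adult ratio depends only on the two DALY figures (37 versus 30), so it is unaffected by the GDP per capita placeholder; the absolute cash equivalents scale directly with it.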