Weighting effect sizes
After calculating effect sizes from your primary studies, weights can be calculated for those effects to determine how much they should each contribute to the overall effect size estimate. Weighting can be used to take into account the relative precision of the effect sizes calculated from primary studies, rather than assuming that they should all contribute equally to the final estimate of the overall effect size. Although weighting is not a mandatory step in meta-analysis, it is generally a recommended one, as studies can vary in the quality and precision of the information that they provide, whether that is due to their sample size or due to variation caused by measurement error, variation in experimental methodology, or environmental variation. Weighting allows studies with higher precision (i.e., a lower variance) to contribute more to the overall effect size estimate, helping to improve its accuracy and precision. However, weighting is not always necessary, so in future sub-sections we will discuss circumstances when unweighted analyses may be preferable or necessary.
Weighting can be executed in a variety of ways, and the best weighting scheme depends on characteristics of the effect size data, and on the type of statistical model that you plan to implement to assess your effect sizes. Weights come in parametric or nonparametric varieties.
Parametric weights
- Based on variance
- fixed effect model weights: 1 / (within-study variance)
- random effects model weights: 1 / (within-study + between-study variance)
Nonparametric weights
- Based on sample size, study “quality”, or other criteria defined by meta-analyst
- most commonly implemented in fixed effect models
- occasionally implemented in random effects models; however, this approach has not been evaluated
- limitation: you can’t conduct heterogeneity tests with nonparametric weights, because you can’t partition the variance components
Parametric weights are the most commonly used and the most informative weights, so let’s look at how they are calculated for different types of statistical models.
Parametric weights
Parametric weights are based on estimates of variance.
In fixed effect models, the weight \(w_i\) for study \(i\) is the reciprocal of the within-study variance of the observed effect size, \(V(e_i)\). Thus, \[w_{i} = \frac{1}{V(e_{i})}\]
In random effects models, parametric weights incorporate both within- and among-study variance. \[w^*_{i} = \frac{1}{V(e_i) + T^2}\]
- \(e_i\) = effect size of the \(i\)th study
- \(V(e_i)\) = estimate of the within-study variance of \(e\) for the \(i\)th study
- \(T^2\) = estimate of among-study variance
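The two weighting formulas above can be sketched in a few lines of Python. This is only an illustration: the effect sizes, variances, and the value of `tau2` (standing in for \(T^2\)) are made-up numbers, and the function names are our own rather than part of any package.

```python
def fixed_effect_weights(variances):
    """Fixed effect weights: w_i = 1 / V(e_i)."""
    return [1.0 / v for v in variances]


def random_effects_weights(variances, tau2):
    """Random effects weights: w*_i = 1 / (V(e_i) + T^2).

    tau2 is assumed to have been estimated elsewhere.
    """
    return [1.0 / (v + tau2) for v in variances]


def weighted_mean(effects, weights):
    """Weighted mean effect size: sum(w_i * e_i) / sum(w_i)."""
    return sum(w * e for w, e in zip(weights, effects)) / sum(weights)


# Hypothetical effect sizes and within-study variances for three studies
effects = [0.2, 0.5, 0.8]
variances = [0.01, 0.04, 0.25]
tau2 = 0.05  # assumed among-study variance, for illustration only

w_fixed = fixed_effect_weights(variances)
w_random = random_effects_weights(variances, tau2)

mean_fixed = weighted_mean(effects, w_fixed)
mean_random = weighted_mean(effects, w_random)
print(mean_fixed, mean_random)
```

Adding \(T^2\) to every denominator makes the weights more similar to one another, so in this example the random effects mean sits closer to the simple average of the three effect sizes than the fixed effect mean does.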
Within-study variance calculations
Within-study variances of effect sizes can sometimes be taken directly from a primary study, but in most cases they must be calculated by the meta-analyst. It’s important to note that the formula for estimating the within-study variance is specific to each effect size metric. The equations for calculating within-study variance for common effect size metrics can be looked up in the literature, and include:
Within-study variance of the log-response ratio (lnRR): \[V(\ln RR_{i}) = V(\ln(\bar{X}_{i}/\bar{C}_{i})) \approx \frac{V(\bar{X}_{i})}{\bar{X}_{i}^2} + \frac{V(\bar{C}_{i})}{\bar{C}_{i}^2}\]
- \(\bar{X}_{i}\) is the mean response in group \(X\) in study \(i\)
- \(\bar{C}_{i}\) is the mean response in group \(C\) in study \(i\)
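As a minimal sketch of the lnRR calculation, assuming each study reports a mean, standard deviation, and sample size for both groups, and assuming the variance of each group mean is estimated as \(s^2/n\). The function names and the numbers are hypothetical.

```python
import math


def ln_rr(mean_x, mean_c):
    """Log-response ratio: ln(mean_x / mean_c)."""
    return math.log(mean_x / mean_c)


def var_ln_rr(mean_x, sd_x, n_x, mean_c, sd_c, n_c):
    """Approximate within-study variance of lnRR.

    Assumes V(mean) is estimated as sd^2 / n for each group.
    """
    var_mean_x = sd_x ** 2 / n_x
    var_mean_c = sd_c ** 2 / n_c
    return var_mean_x / mean_x ** 2 + var_mean_c / mean_c ** 2


# Hypothetical study: group X (mean 12, SD 3, n 9), group C (mean 10, SD 2, n 4)
effect = ln_rr(12.0, 10.0)
variance = var_ln_rr(12.0, 3.0, 9, 10.0, 2.0, 4)
print(effect, variance)
```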
Within-study variance of Hedges’ \(d\): \[V(d_{i}) \approx \frac{n_{C,i} + n_{X,i}}{n_{C,i}~n_{X,i}} + \frac{d^2_{i}}{2(n_{C,i} + n_{X,i})}\]
- \(n_{X,i}\) is the sample size of the treatment group in study \(i\)
- \(n_{C,i}\) is the sample size of the control group in study \(i\)
- \(d_i\) is the Hedges’ \(d\) effect size in study \(i\)
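The variance formula for Hedges’ \(d\) translates directly into code. In this sketch the effect size `d` is taken as already calculated, and the function name and example values are our own.

```python
def var_hedges_d(d, n_x, n_c):
    """Approximate within-study variance of Hedges' d.

    d   : Hedges' d for the study (assumed to be computed already)
    n_x : sample size of the treatment group
    n_c : sample size of the control group
    """
    return (n_c + n_x) / (n_c * n_x) + d ** 2 / (2 * (n_c + n_x))


# Hypothetical study with d = 0.5 and 10 observations per group
variance = var_hedges_d(0.5, 10, 10)
print(variance)
```

Note that the first term depends only on the sample sizes, so for small values of \(d\) the variance is driven almost entirely by how many observations each group contains.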
Within-study variance of the response difference (\(D = \bar{X}_E - \bar{X}_C\)):
When we assume the populations that the two groups represent have the same standard deviation: \[V_D = \frac{n_E + n_C}{n_E n_C}s_{pooled}^2\] where \(s_{pooled}\) is \[s_{pooled} = \sqrt{\frac{(n_E - 1)s_E^2 + (n_C - 1)s_C^2}{n_E + n_C - 2}}\]
and when we assume the populations have different standard deviations: \[V_D = \frac{s_E^2}{n_E} + \frac{s_C^2}{n_C}\]
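The two versions of \(V_D\) can be compared with a short sketch. The function names and the example standard deviations and sample sizes are hypothetical.

```python
def var_diff_pooled(sd_e, n_e, sd_c, n_c):
    """V_D assuming both populations share one standard deviation."""
    s2_pooled = ((n_e - 1) * sd_e ** 2 + (n_c - 1) * sd_c ** 2) / (n_e + n_c - 2)
    return (n_e + n_c) / (n_e * n_c) * s2_pooled


def var_diff_unpooled(sd_e, n_e, sd_c, n_c):
    """V_D allowing the two populations to have different standard deviations."""
    return sd_e ** 2 / n_e + sd_c ** 2 / n_c


# Balanced design with equal SDs: the two formulas agree
equal_pooled = var_diff_pooled(2.0, 10, 2.0, 10)
equal_unpooled = var_diff_unpooled(2.0, 10, 2.0, 10)

# Unbalanced design with unequal SDs: the two formulas diverge
unequal_pooled = var_diff_pooled(3.0, 5, 1.0, 20)
unequal_unpooled = var_diff_unpooled(3.0, 5, 1.0, 20)
print(equal_pooled, equal_unpooled, unequal_pooled, unequal_unpooled)
```

When the group with the larger standard deviation is also the smaller group, as in the second example, the pooled formula gives a noticeably smaller variance than the unpooled one, so the choice of assumption can change a study's weight.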
Within-study variance of a slope or other parameter-based effect size
For an effect size based on a parameter estimate like a regression slope, you can obtain the within-study variance directly from the model output.
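Most regression software reports a standard error for each slope, and squaring that standard error gives the within-study variance. To show where that number comes from, here is a sketch that assumes a simple linear regression with one predictor and computes the slope and its variance by hand. The data and the function name are made up for illustration.

```python
def slope_and_variance(x, y):
    """Ordinary least squares slope and its sampling variance.

    Assumes a simple linear regression y = a + b * x with one predictor.
    The returned variance is the squared standard error of the slope.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Residual variance uses n - 2 degrees of freedom (two fitted parameters)
    rss = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    residual_variance = rss / (n - 2)
    return slope, residual_variance / sxx


# Hypothetical data from one primary study
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
slope, slope_variance = slope_and_variance(x, y)
print(slope, slope_variance)
```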
Calculating within-study variance for a new effect size metric
However, if you constructed a new effect size metric (\(e = f(X_{1}, X_{2})\)), you will have to derive an estimator for the within-study effect size variance. You can do this through application of the Delta Method.
The Delta Method (one variable)
Starting at the most basic level, a one-variable system, the Delta Method is used to estimate how variation in \(x\) translates into variation in \(f(x)\) (i.e., in \(y\)). The mathematics of the Delta Method are built from a Taylor series expansion, which allows you to estimate the \(y\) that results from adding a small change \(\Delta\) to \(x\).
\[y = f(x + \Delta) = f(x) + f'(x)\Delta + \frac{f''(x)\Delta^2}{2!} + \dots\]
The estimated \(y\) is the \(y\) calculated by the function \(f(x)\) at the original \(x\) value, plus the effect of the small increment given the slope of the function at \(x\) (i.e., the first derivative of \(f(x)\)), plus the effect of the acceleration of the function (the second derivative), plus additional terms representing higher derivatives of the function. The higher order terms, meaning the terms with the second derivative and every derivative above that, are generally so small that you can discard them without much loss of information. Because of that, we’re going to drop the second order term and higher. Thus, we have an approximation for the new \(y\): \(f(x+\Delta) \approx f(x) + f'(x)\Delta\). \(f'(x)\) gives the slope at point \(x\), so that slope multiplied by the small increment \(\Delta\), added to the starting point, \(f(x)\), approximates where \(f(x + \Delta)\) will be located. Error in the estimates arises from non-linearity in \(f\) (i.e., due to the higher order terms that we dropped).
Now let’s consider variation around \(x\) (e.g., increasing \(x\) by \(\Delta\) and decreasing it by \(\Delta\)). \(\Delta\) essentially represents variation in \(x\). What we are trying to figure out is how this variation in \(x\) produces variation in \(f(x)\), i.e., how variation in \(x\) produces variation in \(y\).
This can be visualized in Fig. 1:
Fig. 1) Visualization of how a Taylor series expansion allows the estimation of how variation in \(x\) (i.e., as measured by \(\Delta\)) produces variation in \(f(x)\).
Recall that the equation for the variance (\(V\)) of \(y\) is: \[V(y) = V(f(x)) = \frac{\sum_{i=1}^{n}(y_{i} - \bar{y})^2}{n-1}\]
We know \(\bar{y}\), and we can use the Delta Method to estimate \(y_i\), with the slope \(f'(x)\) evaluated at \(\bar{x}\): \[y_i \approx \bar{y} + f'(x)(x_i - \bar{x})\] If we then substitute the equation for \(y_i\) into the variance equation for \(y\), we obtain: \[V(y) \approx \frac{\sum_{i=1}^{n} ~[\bar{y} + f'(x)(x_i - \bar{x}) - \bar{y}]^2}{n-1}\] Now, canceling out the \(\bar{y}\) and pulling out the slope, which is a constant, we can rearrange the equation to: \[V(y) \approx f'(x)^2 \frac{\sum_{i=1}^{n} ~(x_i - \bar{x})^2}{n-1}\] The fraction in the last half of the equation is the variance in \(x\) (i.e., \(V(x)\)), thus \[V(y) \approx f'(x)^2 V(x)\] and now you’ve used the Delta Method to estimate the variance in \(y\).
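To see how well the one-variable approximation works, we can compare it with a direct calculation on simulated data. The sketch below assumes \(f(x) = \ln(x)\), so that \(f'(x) = 1/x\), and uses made-up values of \(x\) drawn from a normal distribution with a small spread relative to its mean. None of these choices come from a real study.

```python
import math
import random
import statistics

random.seed(1)  # fixed seed so the sketch is reproducible

# Hypothetical measurements of x, with small variation around a mean of 10
xs = [random.gauss(10.0, 0.5) for _ in range(10_000)]


def f(x):
    """Example transformation: natural log."""
    return math.log(x)


def f_prime(x):
    """First derivative of ln(x)."""
    return 1.0 / x


# Direct estimate: transform every x, then take the sample variance of y
var_y_direct = statistics.variance([f(x) for x in xs])

# Delta Method estimate: squared slope at the mean of x, times V(x)
mean_x = statistics.mean(xs)
var_y_delta = f_prime(mean_x) ** 2 * statistics.variance(xs)

print(var_y_direct, var_y_delta)
```

The two estimates should be close here because the variation in \(x\) is small relative to its mean, so \(\ln(x)\) is nearly linear over the range of the data. With much larger variation, the dropped higher order terms would matter more and the approximation would degrade.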
Delta Method (2 variables)
In meta-analysis, you typically have an effect size that is based on two or more variables (e.g., the mean for the control group and the mean for the treatment group). Thus, we need to apply the Delta Method for a two-variable system. This is slightly more complicated than the example provided above, but is still doable. Let’s think of the effect size, \(e\), as a function of two variables, \(x\) and \(y\). We now have to consider how variation in both the \(x\) and \(y\) directions alters the effect size.
The Taylor series expansion for this two-variable system is: \[e = f(x + \Delta_x, y + \Delta_y) = f(x,y) + f'_x(x,y)\Delta_x + f'_y(x,y)\Delta_y + \dots\]
- Note that in this case the derivatives are partial derivatives.
Now, apply the Delta Method to obtain an estimate for the variance of the effect size, \(V(f(x,y))\): \[V[f(x,y)] = V(e) \approx \left( \frac{\partial f}{\partial x} \right)^2 ~V(x) + \left( \frac{\partial f}{\partial y} \right)^2 ~V(y)\]
- \(V(x)\) and \(V(y)\) are estimated from the data (e.g., \(s^2/n\) for each treatment group), while the derivatives are defined based upon the functional form of the effect size metric (and thus will depend on how the effect size has been defined: e.g., lnRR vs. Hedges’ \(d\)).
- This form assumes that \(x\) and \(y\) are estimated independently (e.g., from separate treatment and control groups); otherwise a covariance term would also be needed.
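As a check on the two-variable formula, the sketch below applies it to the log-response ratio, \(f(x, y) = \ln(x/y)\), using numerical partial derivatives, and compares the result with the lnRR variance formula given earlier. The function names, the step size `h`, and the example means and variances are our own choices for illustration, and the two means are assumed to be independent.

```python
import math


def delta_method_variance(f, x, y, var_x, var_y, h=1e-6):
    """Two-variable Delta Method with numerical partial derivatives.

    Assumes x and y are independent, so no covariance term is included.
    Partial derivatives are approximated by central finite differences.
    """
    df_dx = (f(x + h, y) - f(x - h, y)) / (2 * h)
    df_dy = (f(x, y + h) - f(x, y - h)) / (2 * h)
    return df_dx ** 2 * var_x + df_dy ** 2 * var_y


def ln_rr(mean_x, mean_c):
    """Log-response ratio as a function of the two group means."""
    return math.log(mean_x / mean_c)


# Hypothetical group means and the variances of those means (s^2 / n)
mean_x, var_mean_x = 12.0, 1.0
mean_c, var_mean_c = 10.0, 1.0

v_numeric = delta_method_variance(ln_rr, mean_x, mean_c, var_mean_x, var_mean_c)

# Analytic result: the partial derivatives of ln(x / y) are 1/x and -1/y
v_analytic = var_mean_x / mean_x ** 2 + var_mean_c / mean_c ** 2
print(v_numeric, v_analytic)
```

The same function could be pointed at any new effect size metric of two variables, which makes it a convenient way to check a hand-derived variance formula before using it.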
[Future: add example calculations of simple effect size metrics]
Non-parametric weights
Non-parametric weights, or weights based on anything besides the inverse of the variance, such as sample size, study “quality”, etc. are also used in meta-analysis, especially in the social sciences. These types of weights are used when variance-based weights can’t be calculated, or are problematic for analyses. For example, sometimes the within-study variance can’t be calculated for a study due to zeroes that lead to undefined terms in the variance formula, or because the study didn’t report variances or sample sizes for both groups.
Implementation in fixed effect models
Typically, non-parametric weights are used in fixed effect models. In this case, their implementation is straightforward: weights should be designed to let studies of higher quality, or studies estimated with higher precision, contribute more to the estimation of the overall mean effect size. Thus, for example, the weight of study \(i\) could be calculated as a function of the sample sizes in the two groups, e.g., \[w_i = \frac{n_E n_C}{n_E + n_C}\] In other words, a study with larger sample sizes would be given more weight than a study with smaller sample sizes.
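A minimal sketch of sample-size weighting, using the weight formula above. The sample sizes, effect sizes, and function names are hypothetical.

```python
def sample_size_weight(n_e, n_c):
    """Non-parametric weight based on group sample sizes: n_E * n_C / (n_E + n_C)."""
    return n_e * n_c / (n_e + n_c)


def weighted_mean(effects, weights):
    """Weighted mean effect size: sum(w_i * e_i) / sum(w_i)."""
    return sum(w * e for w, e in zip(weights, effects)) / sum(weights)


# Hypothetical studies: (n_E, n_C) pairs and their effect sizes
sample_sizes = [(10, 10), (50, 50), (5, 95)]
effects = [0.2, 0.5, 0.8]

weights = [sample_size_weight(n_e, n_c) for n_e, n_c in sample_sizes]
mean_effect = weighted_mean(effects, weights)
print(weights, mean_effect)
```

The third study has 100 observations in total but is very unbalanced, and it receives slightly less weight than the first study with only 20 observations. This weighting scheme rewards balanced designs as well as large ones.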
Implementation in random effects models
In random effects models, which incorporate among-study variance into the weighting term, the usage of non-parametric weights is more complicated and less understood. Let’s assume that we have an index, \(I_i\), that represents the quality of study \(i\). Let’s further assume that this index is proportional to the within-study variance, although we don’t know exactly how. If this proportionality holds across all studies that we are summarizing, then: \[w_i = \frac{1}{\alpha I_i + T^2}\] Of course, we do not know the proportionality constant, \(\alpha\). We can consider two extreme cases. 1) If the among-study variance (\(T^2\)) is reasonably small relative to the within-study variance, then the weights for each study will be based primarily on the within-study variance. As a result, we can conduct a fixed effect analysis (because the \(T^2\) term is unimportant). 2) On the other hand, if \(T^2\) is extremely large relative to the within-study variance term, then the weights will be similar across all studies, and it would be appropriate to conduct an unweighted analysis.
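The two extreme cases can be illustrated numerically. In the sketch below, the index values, the choice of \(\alpha = 1\), and the two values of \(T^2\) are all made up; the point is only to show how the relative weights behave as \(T^2\) changes.

```python
def relative_weights(index, alpha, tau2):
    """Weights 1 / (alpha * I_i + T^2), rescaled to sum to 1."""
    raw = [1.0 / (alpha * i + tau2) for i in index]
    total = sum(raw)
    return [w / total for w in raw]


# Hypothetical index values for three studies (assumed proportional to variance)
index = [0.01, 0.04, 0.25]
alpha = 1.0  # unknown in practice; set to 1 purely for illustration

fixed = relative_weights(index, alpha, tau2=0.0)     # fixed effect weights
small = relative_weights(index, alpha, tau2=1e-6)    # T^2 much smaller than alpha * I_i
large = relative_weights(index, alpha, tau2=100.0)   # T^2 much larger than alpha * I_i

print(fixed)
print(small)
print(large)
```

With a very small \(T^2\), the relative weights are almost identical to the fixed effect weights. With a very large \(T^2\), every study receives close to one third of the total weight, which is what an unweighted analysis would assign.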
[Future: provide illustration of why.]
The solution for intermediate cases is unclear, and approaches that have been used in the literature need to be evaluated before any recommendations can be made.
[Future: recommendations for determining when non-parametric weights are appropriate]
Unweighted analyses
The overall mean effect size (\(\bar{E}\)) of an unweighted analysis is just the simple mean of the effect sizes (\(e\)) of all the studies in the meta-analysis (studies \(1\) to \(k\)): \[\bar{E} = \frac{\sum_{i=1}^k e_i}{k}\]
An unweighted analysis gives unbiased estimates of the mean effect size. However, unweighted analyses generally result in less precise estimates (e.g., estimates with larger confidence intervals), as well as reduced power to detect significant effects of moderator variables. Therefore, unweighted analyses are usually not recommended if it is possible and appropriate to conduct a weighted analysis.
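The loss of precision can be illustrated with a small sketch. It assumes a fixed effect setting with independent studies and known within-study variances, in which case the variance of the unweighted mean is \(\sum V(e_i) / k^2\) and the variance of the inverse-variance weighted mean is \(1 / \sum w_i\). The numbers and function names are hypothetical.

```python
def unweighted_mean(effects):
    """Simple mean of the effect sizes."""
    return sum(effects) / len(effects)


def var_unweighted_mean(variances):
    """Variance of the simple mean of k independent effect sizes."""
    k = len(variances)
    return sum(variances) / k ** 2


def var_weighted_mean(variances):
    """Variance of the inverse-variance weighted mean (fixed effect setting)."""
    return 1.0 / sum(1.0 / v for v in variances)


# Hypothetical effect sizes and within-study variances for three studies
effects = [0.2, 0.5, 0.8]
variances = [0.01, 0.04, 0.25]

mean_u = unweighted_mean(effects)
var_u = var_unweighted_mean(variances)
var_w = var_weighted_mean(variances)
print(mean_u, var_u, var_w)
```

In this example the unweighted mean has a variance several times larger than the weighted mean, because the least precise study contributes as much as the most precise one.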
Summary
Weighting is an optional, but recommended, step in meta-analysis, which allows the meta-analyst to systematically assign greater importance to studies where the effect size was estimated with higher precision, and allows these studies to factor more heavily into the estimation of the overall mean effect size. Parametric weights based on the inverse of the variance are the most common and statistically efficient weights for meta-analysis. These weights differ in how they are implemented for fixed vs. random effects models. When variance-based weights are not feasible to implement, non-parametric weights can be used instead; however, their appropriateness has not been examined in detail. In many cases, unweighted analyses provide a reasonable, but often less precise, alternative for conducting a meta-analysis.
Next steps
The above discussion focuses on the use of weights to obtain mean effect sizes over a collection of studies. Note that with random effects models, these weights require estimates of \(T^2\), which we discuss in the module on statistical models. Once we’ve obtained effect sizes and their weights, and used these estimates to obtain mean effect sizes, the next step is to determine the variance of this estimate, explore if there is heterogeneity in the effect sizes estimated in different studies (i.e., evaluate if \(T^2\) is demonstrably different from 0), and evaluate if moderating variables can explain any observed heterogeneity. We’ll discuss how to do this in the section on Exploring heterogeneity.
Last updated: 2019, January 22