Capstone: Meta-analysis with publication-bias correction

Part 10 — Real-research capstones

Learning objectives

Synthesise study-level effect estimates using fixed-effect and DerSimonian-Laird random-effects meta-analysis
Diagnose publication bias visually (funnel plots) and formally (Egger's regression)
Apply Duval-Tweedie trim-and-fill to estimate a publication-bias-corrected pooled effect
Report and interpret heterogeneity statistics Q, τ², I²
Communicate the limitations of all bias-correction methods honestly

The third research capstone walks an end-to-end meta-analysis: synthesise K study-level effect estimates, characterise between-study heterogeneity, diagnose and correct for publication-bias-driven funnel-plot asymmetry. This is the workhorse evidence-synthesis tool of Cochrane reviews, evidence-based medicine, and policy-research syntheses worldwide.

The data-generating model

Random-effects meta-analysis models each study's true effect as a draw from a Normal population:

\theta_k \sim N(\mu, \tau^2), \quad \hat{\theta}_k | \theta_k \sim N(\theta_k, \mathrm{SE}_k^2).

μ is the population-mean effect (the "synthesis"). τ² is the between-study variance — the heterogeneity. SE_k is the within-study sampling error, typically inversely proportional to study sample size √n_k.

Fixed-effect pool: inverse-variance weighting

The classical pool assumes τ² = 0 (all studies estimate the SAME effect):

\hat{\mu}_{FE} = \frac{\sum_k w_k \hat{\theta}_k}{\sum_k w_k}, \quad w_k = 1/\mathrm{SE}_k^2.

Larger studies (small SE) get more weight. Fixed-effect is appropriate when heterogeneity is low (I² < 25 %) and inappropriate otherwise — because it understates uncertainty.

Random-effects pool (DerSimonian-Laird)

Add the between-study variance τ̂² to each weight:

\hat{\mu}_{RE} = \frac{\sum_k w^*_k \hat{\theta}_k}{\sum_k w^*_k}, \quad w^*_k = \frac{1}{\mathrm{SE}_k^2 + \hat{\tau}^2}.

τ̂² is estimated via DerSimonian-Laird from Cochran's Q statistic. RE has wider CIs than FE because it accounts for between-study variation — closer to honest under heterogeneity.

Heterogeneity statistics

Cochran's Q: $Q = \sum_k w_k (\hat{\theta}$ . Distributed χ²(K-1) under no heterogeneity.
τ̂²: between-study variance, in the same units as θ. Crucial for interpretation: τ̂ ≈ 0.5 means the typical study's true effect deviates by 0.5 from the mean.
I² = max(0, (Q - (K-1)) / Q): fraction of total variance due to between-study heterogeneity. < 25 % low; 25-50 % moderate; > 50 % substantial; > 75 % considerable.

Publication bias and funnel-plot asymmetry

Studies showing statistically significant (often positive) results are more likely to be PUBLISHED than null or negative findings. Result: the meta-analytic pool over-represents extreme positive results, biasing μ̂ upward.

Visual check: the FUNNEL PLOT (θ̂_k vs SE_k). Under no selection, points should be symmetrically distributed around μ. Under selection, the lower-left corner (small studies with null / negative results) is empty.

Formal test: EGGER'S REGRESSION (Egger et al. 1997). Regress θ̂_k / SE_k on 1/SE_k. The intercept is the asymmetry parameter — significant ≠ 0 → bias suspected.

Trim-and-fill correction (Duval & Tweedie 2000)

An iterative procedure:

Compute the pooled estimate.
Identify the most extreme (right-asymmetric) studies whose removal would symmetrise the funnel.
Recompute the pooled estimate from the symmetric subset.
MIRROR the trimmed studies onto the left side around the new pooled estimate (the "fill" step).
Report the final pooled estimate from the full augmented set as a publication-bias-corrected estimate.

Trim-and-fill is HEURISTIC, not gold-standard. It assumes that the asymmetry is due to selection (not heterogeneity) and that suppressed studies mirror published ones. Modern alternatives: selection models (Vevea & Hedges 1995), p-curve (Simonsohn et al.), p-uniform.

Try it

Defaults (μ = 0.3, τ = 0.15, K = 20, selZ = 0): no selection. FE and RE both recover μ̂ ≈ 0.3. The funnel plot is symmetric. Egger's test fails to reject.
Crank selection threshold to z* = 2.0. Now only studies with |θ̂/SE| ≥ 2 survive. Watch the bias: μ̂_FE jumps to ≈ 0.5 (massively over-estimated). Funnel becomes asymmetric — small-SE studies (top of funnel) keep both positive and negative; large-SE studies (bottom) keep only positive. Egger's rejects.
Now toggle trim-and-fill on (it's automatic in the readout). It imputes 4-6 missing studies (red dots on funnel) and corrects μ̂ downward — often back to ≈ 0.35-0.40, much closer to truth than the uncorrected 0.5.
Crank τ to 0.3 (high heterogeneity, I² ≈ 50 %). RE's CI widens but its centre is correct. FE's CI is too narrow — RE is appropriate here.
Reduce K from 20 to 5. Watch all CIs widen and bias-test power drop. Meta-analysis on few studies is unreliable; small K should trigger conservative bias-correction.

A meta-analyst reports I² = 85% — substantial heterogeneity. Should they report the random-effects pooled estimate as the headline number, or do something else? Justify in two sentences.

What you now know

You can run an end-to-end meta-analysis: synthesise K study-level estimates, characterise heterogeneity, diagnose publication bias, and correct for it. The RE estimator is the workhorse for heterogeneous evidence; trim-and-fill is a useful heuristic correction for funnel asymmetry. Reporting standards (PRISMA) require explicit disclosure of all these analyses; sensitivity analyses across different bias-correction methods are best practice.

References

DerSimonian, R., Laird, N. (1986). "Meta-analysis in clinical trials." Controlled Clinical Trials 7(3), 177–188.
Egger, M., Smith, G.D., Schneider, M., Minder, C. (1997). "Bias in meta-analysis detected by a simple, graphical test." BMJ 315, 629–634.
Duval, S., Tweedie, R. (2000). "Trim and fill: a simple funnel-plot-based method." Biometrics 56(2), 455–463.
Higgins, J.P.T., Thompson, S.G. (2002). "Quantifying heterogeneity in a meta-analysis." Statistics in Medicine 21(11), 1539–1558.
Borenstein, M., Hedges, L.V., Higgins, J.P.T., Rothstein, H.R. (2009). Introduction to Meta-Analysis. Wiley.
Vevea, J.L., Hedges, L.V. (1995). "A general linear model for estimating effect size in the presence of publication bias." Psychometrika 60(3), 419–435.