Uncertainty-aware velocity inversion
Learning objectives
- Combine §9.1-§9.5 machinery into a posterior-FWI workflow
- Generate empirical posterior samples via bootstrap data resampling
- Compute mean ± std velocity model and 95% credible intervals
- Diagnose posterior coverage of truth (coverage probability)
- Wrap up Part 9 and look ahead to Part 10 (capstone real-data examples)
Part 9 capstone. The previous five sections gave us tools: (a) PINN as initialiser/regulariser/solver (§9.1); (b) λ-annealed PINN-augmented FWI (§9.2); (c) AE-based learned priors (§9.3); (d) generative priors for sampling (§9.4); (e) ensemble PINN UQ (§9.5). §9.6 puts them together to produce what production seismic-imaging teams actually need: a velocity model AND an uncertainty map.
Bootstrap-posterior FWI
The simplest production UQ recipe: bootstrap-resampling. For each of posterior samples:
- Re-sample noise on the data. where .
- Run a regularised inversion on the perturbed data: .
- Record as one posterior sample.
The empirical distribution is an approximation to the true posterior . Mean is the central estimate; std gives uncertainty bars; 1.96·std is the 95% credible interval (under normality).
This is NOT a fully rigorous Bayesian posterior — it conditions on point estimates of the prior + regulariser + initial guess rather than marginalising over them. But it captures the dominant uncertainty source for clean-data FWI (data picking noise) and is the production simplest-thing-that-works.
Try it
The widget:
- Generates from the truth (, , ) at 8 surface receivers.
- For 8 posterior samples: re-noise with s AND jitter the smart initial guess by , then run a 50-iteration mildly-regularised inversion (§9.2 hand-crafted prior, λ=0.3). The combined data-bootstrap + init-bootstrap is a simplified production-style "prior bootstrapping" that captures BOTH data uncertainty AND optimization-basin uncertainty (without it, every sample would cluster in the same partial-convergence well).
- Plot the posterior samples + mean ± 2σ band.
Two panels:
- Posterior c(z) profile: truth (yellow dashed), individual posterior samples (faint orange), posterior mean (orange bold), 2σ band (light orange shaded). Where the band is narrow, the inversion is confident; where it widens, the data is uninformative about that region.
- Joint (v₁, v₂) posterior cloud: 8 samples plotted as orange dots, truth as yellow star. The cloud's shape reveals correlations — for this 2-layer problem the cloud is elongated along the constraint enforced by the §9.2 prior, with cross-cloud spread reflecting how much the data-misfit gradient + init jitter pushes individual samples off that ridge.
The summary box reports posterior mean ± 1σ for each parameter, the 95% credible interval, and a coverage check (does each CI include the truth?). For an honest posterior, all 3 of 3 CIs should cover truth.
What production codes do beyond this
The widget illustrates the simplest bootstrap-posterior recipe. Production seismic-imaging UQ pipelines layer on additional uncertainty sources:
- Prior bootstrapping. Run inversions from MANY different initial guesses to capture model-form uncertainty. Each initial guess samples a different basin of the loss landscape.
- Regulariser bootstrapping. Sample from a hyperprior in each run — captures regularisation-strength uncertainty.
- Generative-prior bootstrapping. §9.4 generative-prior sampling: each posterior sample uses a different draw from the corpus prior as warm-start, ensuring the recovered model is on-manifold.
- Full Bayesian inference. HMC or SVGD over the FULL posterior using the generative prior . Most rigorous but costliest. Smith et al 2022 HypoSVI uses this approach for hypocentre location.
For 2-D / 3-D production FWI with thousands of unknowns, computational tractability forces approximations. The cheapest is bootstrap-posterior (this section). Next cheapest is ensemble PINN over multiple inversions. Most expensive but most rigorous is full HMC over the generative-prior latent.
Calibration: are the uncertainty bars HONEST?
An uncertainty estimate is HONEST (or "calibrated") if its credible intervals contain the truth at the claimed rate. For 95% CIs, an honest method covers truth in 95% of independent runs. Calibration matters MORE than width — narrow intervals that miss the truth are worse than wide intervals that contain it.
Production calibration tests:
- Coverage probability test. Run inversions on synthetic data with KNOWN truth. Check what fraction of runs have CIs containing truth. Should match the claimed coverage (95% for 95% CIs).
- Reliability diagram. Plot empirical coverage vs claimed coverage across . Diagonal line = perfect calibration.
- Cross-validation. Hold out some receivers, predict their travel times via the posterior, check if held-out values fall in predicted intervals at the claimed rate.
Pure data-bootstrap (re-noise only, fixed init) typically UNDER-COVERS — credible intervals are narrower than the true posterior because all samples cluster in the same optimization basin. The widget here adds INIT-jitter on top of data-bootstrap, which inflates the cloud enough to give honest 95% coverage on this toy problem. Production codes layer in even more uncertainty sources (regulariser bootstrapping, prior sampling, cross-validation calibration) and inflate the bootstrap intervals by 1.5-3× when held-out data shows under-coverage.
Operational notes
- Number of samples. N=8 is the educational minimum. Production: N=100-1000 for tight CIs. Each sample is independent so embarrassingly parallel.
- Picking noise estimate matters. Set from the actual pick variance. Too small → over-confident posterior; too large → useless wide intervals.
- Reporting standard. Always quote median + 5-95th percentile rather than just mean ± 1σ — captures asymmetry in the posterior. For seismic, the SEG-D (Society of Exploration Geophysicists Data) format includes uncertainty fields specifically.
- Decision-theoretic interpretation. A velocity model + uncertainty enables decisions about where to drill, where to acquire more data, and what oil-reserve confidence intervals to quote. Without uncertainty, FWI is just a point estimate that may or may not be reliable.
Part 9 wrap-up
Part 9 covered the production hybrid + UQ stack:
- §9.1 PINN's three roles in classical FWI: solver, initialiser, regulariser.
- §9.2 PINN-augmented FWI with the U-curve of regularisation strength + annealing.
- §9.3 Learned regulariser via autoencoder reconstruction loss.
- §9.4 Generative priors: AE-as-sampler, bridge to VAE / diffusion.
- §9.5 Ensemble PINNs for epistemic UQ (deep ensembles recipe).
- §9.6 Bootstrap-posterior FWI: velocity model + uncertainty map.
The reader who completes Part 9 has the toolkit to ship a production-grade UQ-aware seismic-inversion pipeline. Part 10 will walk eight real-field capstones — Marmousi, Sleipner, BP velocity benchmark, ANT (ambient-noise tomography), microseismic monitoring — applying the techniques from Parts 1-9 end-to-end on realistic data.
References
- Efron, B. (1979). Bootstrap methods: another look at the jackknife. Ann. Stat. 7(1), 1–26. The bootstrap paper that started the field.
- Sambridge, M., Mosegaard, K. (2002). Monte Carlo methods in geophysical inverse problems. Rev. Geophys. 40(3), 1009. Comprehensive review of MCMC for seismic inversion.
- Tarantola, A. (2005). Inverse Problem Theory and Methods for Model Parameter Estimation. SIAM. The standard textbook on Bayesian inverse problems including FWI.
- Smith, J.D., Ross, Z.E., Azizzadenesheli, K., Muir, J.B. (2022). HypoSVI. GJI 228, 698–710. Production Bayesian PINN-FWI with SVGD.
- Zhang, X., Curtis, A. (2020). Variational full-waveform inversion. GJI 222(1), 406–411. Variational Bayesian FWI on real seismic data.
- Gebraad, L., Boehm, C., Fichtner, A. (2020). Bayesian elastic full-waveform inversion using Hamiltonian Monte Carlo. JGR 125(3), e2019JB018428. HMC over elastic-FWI posterior.