ML-assisted first-break picking

Part 9 — Machine Learning in Processing

Learning objectives

  • State the first-break picking problem and why it matters for refraction statics
  • Compare STA/LTA, AIC, and CNN pickers and their failure modes
  • Describe the U-Net training regime for first-break picking
  • Recognise why ML picking has become the production standard

The first break — the onset of the direct or refracted arrival on a shot record — is the single most-picked quantity in seismic processing. Every shot on every survey gets first-break picks used for: refraction static corrections (land), direct-wave velocity QC, near-surface tomography, overburden velocity building. Classical pickers were heuristic and required constant human oversight. Modern CNN-based pickers have taken over this role in production.

1. The three classical pickers

  • STA/LTA (short-term / long-term average). Compute the ratio of a short-window (10–40 ms) energy to a long-window (50–200 ms) energy as a function of time; trigger when the ratio crosses a threshold (typically 2–5). Simple, fast, but sensitive to pre-arrival noise and threshold choice. Systematic late-pick bias at low SNR.
  • AIC (Akaike Information Criterion). Model the trace as two segments split at index kk: pre-arrival (white noise) + post-arrival (white noise with different variance). Compute AIC as a function of kk; the minimum marks the split. More robust than STA/LTA to noise level but assumes noise is white and stationary.
  • Correlation. Cross-correlate each trace with a template (an assumed wavelet). The correlation peak marks the first break. Good when the wavelet is known; poor when it varies shot-to-shot.

2. Why ML wins at this task

First-break picking is a pattern-recognition problem with a tight labelled training set available (any historical survey has hand-picks). A CNN sees:

  • The full waveform context of each trace (not just a moving window).
  • Adjacent traces' arrivals (moveout consistency).
  • Wavelet variations learned from training examples.
  • Noise patterns learned across many surveys.

A trained U-Net pick is ~1 ms/trace vs 10–100 ms/trace for a careful human. Accuracy typically 2–5 ms RMS vs hand picks (which are themselves uncertain at this level).

3. The widget

Fbp DemoInteractive figure — enable JavaScript to interact.

24-trace shot gather with first arrivals on a linear direct-wave moveout (V = 1800 m/s, 50 m spacing). The user-controlled pre-arrival noise level affects classical pickers more than the CNN. Three pick lines overlaid on the gather: yellow STA/LTA, orange AIC, pink CNN. Ground truth (green dashed) lies between them for reference.

At low noise (~0.02) all three pickers are within a few ms of truth. At moderate noise (~0.10) STA/LTA and AIC start to show systematic late-picking and scatter; the CNN stays within ~3 ms. At high noise (~0.25) STA/LTA and AIC produce wild outliers; the CNN still roughly tracks truth because it was trained on noisy examples and learned to use trace-to-trace consistency.

4. U-Net for first-break picking

Input: a 2D patch of the shot gather (traces × time). Output: a binary mask where 1 marks the first-break arrival. The network is trained with a cross-entropy loss on the mask. At inference, the argmax of the predicted mask per trace gives the pick time. Training data: tens of thousands of shots with hand-picks, usually from multiple surveys to capture wavelet variety.

Hybrid variants use the mask output as an initial pick, then refine with AIC over a narrow window around it; this combines ML robustness with AIC's sub-sample accuracy.

5. Production deployment considerations

  • Cross-survey generalisation. A network trained on one survey (one wavelet, one noise regime) may systematically miss arrivals on another. Standard practice: pre-train on a diverse set of surveys, fine-tune per project.
  • Human QC. Every production workflow retains interpreter QC on a subset of picks. The CNN replaces the bulk labour but not the final review.
  • Uncertainty flagging. CNN outputs a confidence map; low-confidence picks are flagged for manual review. Saves time by focusing QC on ambiguous cases.
  • Outlier rejection. Post-CNN, apply a moveout-consistency filter: picks that deviate by more than some threshold from a smooth fit are flagged or rejected.

6. Failure modes

  • Refraction static errors. If the first breaks are systematically mis-picked by a few ms, near-surface velocity models become biased, and refraction statics are wrong — the downstream imaging shows residual statics artefacts.
  • Cycle-skipping. At very low SNR the CNN may pick the wrong cycle (one period late or early). Harder to catch than STA/LTA cycle-skipping because CNN outputs look confident. Mitigation: compare predicted picks to a smoothed moveout model, flag outliers.
  • Domain shift under novel noise. Electrical hum, drone noise, unexpected infrastructure — the network has not seen these in training and may produce plausible but wrong picks. Mitigation: add diverse noise to training; include "synthetic noise augmentation" during training.
**The one sentence to remember**

CNN first-break picking (typically U-Net on 2D gather patches) has replaced STA/LTA and AIC in production because it is ~100× faster at comparable accuracy, with the main caveat that cross-survey domain shift requires fine-tuning per project.

Where this goes next

§9.5 closes Part 9 with the most recent ML frontier in seismic processing: accelerating FWI. Instead of replacing the physics, ML initialises FWI with a learned prior, replaces expensive iterations with network inference, or provides a learned regulariser that keeps the solution geologically plausible.

References

  • Yilmaz, Ö. (2001). Seismic Data Analysis (2 vols.). SEG.
  • Sheriff, R. E., Geldart, L. P. (1995). Exploration Seismology (2nd ed.). Cambridge UP.
  • Claerbout, J. F. (1976). Fundamentals of Geophysical Data Processing. McGraw-Hill.

This page is prerendered for SEO and accessibility. The interactive widgets above hydrate on JavaScript load.