Probabilistic earthquake early warning in complex earth models using prior sampling

Probabilistic earthquake early warning in complex earth models using prior sampling Andrew Valentine, Paul Käufl & Jeannot Trampert EGU 2016 21 st April www.geo.uu.nl/~andrew a.p.valentine@uu.nl

A case study: The Whittier and Chino faults

An ideal EEW source determination system Ø Treats all physical effects properly, particularly: Ø Complex wave propagation phenomena Ø Strongly heterogeneous crustal structures Ø Operates in a probabilistic framework includes full treatment of uncertainties Ø Has low computational costs during operation Achieving any two of these is reasonably straightforward but can we have all three?

Treating physical effects properly COMPUTATIONAL INFRASTRUCTURE FOR GEODYNAMICS (CIG) PRINCETON UNIVERSITY (USA) CNRS and UNIVERSITY OF MARSEILLE (FRANCE) ETH ZÜRICH (SWITZERLAND) SPECFEM 3D Cartesian User Manual Version 3.0 g Ø Full numerical regional wavefield simulations: SPECFEM 3D (e.g. Peter et al., GJI, 2011). Ø 3D structural model for Southern California, CVMH11.9 (Tape, Liu, Maggi & Tromp, Science, 2009). Ø Simulation cost: ~100 CPU hours/source REPORTS 121 120 119 118 117 116 115 121 120 119 118 117 116 115 121 120 119 118 117 116 115 G 35 Depth = 10.0 km Depth = 20.0 km Vs km/s (3.10 ± 15 %) SA SA Depth = 2.0 km 2.8 3.0 3.2 3.4 3.6 G G SA 34 33 SA SA SA 36 Vs km/s 3.2 3.4 3.6 3.8 (3.50 ± 10 %) Vs km/s 3.4 3.6 3.8 4.0 (3.67 ± 10 %) Fig. 4. Horizontal cross sections of V tomographic model m at depths of 2, 10, and 20 km. See fig. S1 for locations of major features; Garlock (G) and San

MCMC: Posterior sampling Sampling process cannot start until after observations have been made: infeasible for EEW scenarios

An alternative approach: Prior sampling 1. Draw model parameter(s) at random from the prior distribution 2. Compute synthetic data corresponding to that model 3. Add random noise chosen to represent observational and modeling uncertainties

Joint data-model probability density function

Learning a representation of the pdf How can we represent and store the joint distribution effectively? Ø Assume it to be smooth and continuous Ø Represent 1D marginal distributions as Gaussian mixture models (GMMs) Ø Coefficients of GMM are the outputs of a neural network, called a Mixture Density Network (MDN) p(m d 0 )= NX i (d 0 )exp i=1! (m µ i (d 0 )) 2 2 2 i (d 0) Bishop, 1995. Neural networks for pattern recognition, OUP

Prior sampling: Ø Separates computationally-intensive sampling stage from time-critical evaluation stage Ø Is highly inefficient for a single observation but ideally-suited to a monitoring setting, where the same inverse problem must be solved repeatedly Ø Results in conservative uncertainty estimates that could be reduced with more targeted sampling Evaluation takes a fraction of a second!

Can we make sampling tractable for EEW? Source parameters of interest: Ø 3 location parameters Ø 5 moment tensor components Ø Source half-duration 9-dimensional model space Number of samples required to achieve a given sampling density in D dimensions ~ exp(d)

Can we make sampling tractable for EEW? Source parameters of interest: Ø 3 location parameters Ø 5 moment tensor components Ø Source half-duration 9-dimensional model space However: Point source seismogram can be expressed as linear combination of 6 independent Green s functions:! 6X s(m, x,t,, x c, )=f(t, ) M i i (x,t,x c, ) i=1 Expensive sampling only needs to be done in 3 dimensions! Strategy: 1. Randomly sample locations; compute 6 Green s functions using SPECFEM3D 2. Randomly sample moment tensors and source halfdurations at each location and construct seismograms

150 ( CPU allocation/(6*cost per simulation)) locations distributed randomly in box enclosing Whittier & Chino faults

Record wavefield at: Ø 1300 real stations in Southern California, plus Ø Regular grid of 600 virtual stations Simulate wave propagation for 200s after origin time, complete to 0.5Hz Waveforms available for download (12Gb!): www.geo.uu.nl/~jeannot 36 N 35 N With thanks to: Cartesius, the Dutch national supercomputer 34 N 33 N 122 W 121 W 120 W 119 W 118 W 117 W 116 W 115 W

Ø From 150 sets of Green s functions we generate ~2 million samples of (location, moment tensor, half duration, seismograms) Ø Assume all source mechanisms are equally likely but would be straightforward to prefer alignment with fault Ø Use only a small subset of stations; filter data at 0.2Hz Ø Train learning algorithms to provide a smooth representation of the joint data-model distribution

Does it work? Synthetic test Synthetic test 6s 15s 30s 45s 0.02 0.92 0.72 0.78 0 2 apple (strike) Ø Window starts when the first station triggers Ø Wait t seconds and use recordings from all stations some may not have recorded signal Ø Each window length is implemented as a separate learning algorithm

Does it work? Synthetic test Synthetic test 6s 15s 30s 45s 0.02 0.06 0.44 0.20 0.02 0.92 0.72 0.78 0.10 0.28 0.63 0.56 /6 0 /6 0.06 0.02 0.24 0.03 0 2 apple (strike) 1.74 2.04 2.81 2.55 /2 0 /2 (rake) 0.24 0.30 0.81 0.86 0 0.5 1 h (cos(dip)) 0.92 1.57 1.80 1.71 5 6.5 8 M w 0.85 1.82 1.94 1.86 1.5 10.75 20 depth [km] 1.63 1.66 1.87 1.87 33.79 33.92 34.05 lat [ ] 118.05-117.81 117.57 lon [ ] 0 5 10 [s]

Does it work? M w 5.4, Chino Hills, 2008 USGS CMT USGS BW GCMT SCSN SCSN DC Hauksson 0.01 0.09 0.37 0.40 0.18 0.41 0.83 0.70 0.07 0.12 0.12 0.38 /6 0 /6 0.02 0.01 0.09 0.07 0 2 apple (strike) 1.66 1.89 2.38 2.76 /2 0 /2 (rake) 0.23 0.68 0.79 1.13 0 0.5 1 h (cos(dip)) 0.82 1.41 1.45 1.35 5 6.5 8 M w 0.88 1.38 1.77 1.73 1.5 10.75 20 depth [km] 1.46 1.64 1.70 1.80 33.79 33.92 34.05 lat [ ] 118.05-117.81 117.57 lon [ ] 0 5 10 [s]

T = 10s T =8s 34 N Should we bother with the source at all? 33 N 33 N 5 log(pgd [m]) 1 If we have seismic observations at 10 randomly-chosen locations, 32 N 32 N what can we say about regional peak ground acceleration? 0 116 W 118 W 116 W 10 8 6 4 Inputs to system: log(pgd [m]) 34 N 34 N 34 N 34 5 N 5 Ø N receiver locations 4 4 Ø N seismograms, at those locations 3 3 Ø Point(s) at 33 which N 33PGA N estimation is desired 33 N 33 N 2 2 1 1 Output: P 32 logn 10 (P32GA) N d 1,...,d N, x 1,...x N 32 N 32 0 N 0 118116 W W 116 W 118 W 118116 W W 116 W 10 810 6810 46 8 24 6 0 34 N 34 N 5 5 log(pgd 5 log(pgd [m]) log(pgd [m]) 34 4 N 34 4 N 4 3 3 3 33 N 33 N 33 2 N 33 2 N 2 1 1 1 118116 W W 32 N 32 N 32 0 N 32 0 N 0 116 W 118 W 118116 W W 116 W 10 810 6810 46 8 24 6 0 8 8 8 34 N 4 3 2

Summary Ø By making use of learning algorithms in a prior sampling framework, EEW results can be made available within milliseconds. Ø It is becoming feasible to take advantage of state-of-the-art numerical wave propagation codes and heterogeneous 3D crustal models Ø The approach can be extended to allow direct inversion for almost any quantity of interest Waveform database: Ø Download from www.geo.uu.nl/~jeannot References: Ø Käufl, Valentine, de Wit & Trampert, BSSA, 2015. (Waveform inversion) Ø Käufl, Valentine, de Wit & Trampert, GJI, in press. (Prior sampling) Ø Käufl, Valentine & Trampert, in prep. (EEW in 3D media) a.p.valentine@uu.nl www.geo.uu.nl/~andrew