Probabilistic earthquake early warning in complex earth models using prior sampling

Similar documents
Cluster Analysis. (see also: Segmentation)

Support Vector Machines

Vision for SCEC. John E. Vidale

Case Bb (elastic, 1D vertical gradient)

Overview. Ø Neural Networks are considered black-box models Ø They are complex and do not provide much insight into variable relationships

Tengyu Ma Facebook AI Research. Based on joint work with Yuanzhi Li (Princeton) and Hongyang Zhang (Stanford)

Data Assimilation in Geosciences

4th International Industrial Supercomputing Workshop Supercomputing for industry and SMEs in the Netherlands

MIPAS Temperature and Pressure Validation by RO Data

Processes. Criteria for Comparing Scheduling Algorithms

Tengyu Ma Facebook AI Research. Based on joint work with Rong Ge (Duke) and Jason D. Lee (USC)

Case 0:09-cv ADM-FLN Document 42 Filed 07/30/10 Page 1 of 15 UNITED STATES DISTRICT COURT DISTRICT OF MINNESOTA

Theory and practice of falsified elections

Comparison Sorts. EECS 2011 Prof. J. Elder - 1 -

FM Legacy Converter User Guide

Michael Laver and Ernest Sergenti: Party Competition. An Agent-Based Model

A comparative analysis of subreddit recommenders for Reddit

Biogeography-Based Optimization Combined with Evolutionary Strategy and Immigration Refusal

Measuring Bias and Uncertainty in Ideal Point Estimates via the Parametric Bootstrap

Climate Change Around the World

NBER WORKING PAPER SERIES THE PERFORMANCE OF THE PIVOTAL-VOTER MODEL IN SMALL-SCALE ELECTIONS: EVIDENCE FROM TEXAS LIQUOR REFERENDA

Robust Electric Power Infrastructures. Response and Recovery during Catastrophic Failures.

Fall 2016 COP 3223H Program #5: Election Season Nears an End Due date: Please consult WebCourses for your section

Revisiting the Effect of Food Aid on Conflict: A Methodological Caution

Constraint satisfaction problems. Lirong Xia

P(x) testing training. x Hi

The HeLIx + inversion code Genetic algorithms. A. Lagg - Abisko Winter School 1

The Performance of Pivotal-Voter Models in Small-Scale Elections: Evidence from Texas Liquor Referenda

UTS:IPPG Project Team. Project Director: Associate Professor Roberta Ryan, Director IPPG. Project Manager: Catherine Hastings, Research Officer

Climate Change Around the World

CS 5523: Operating Systems

The National Earthquake Hazards Reduction Program (NEHRP): Issues in Brief

Lecture 6 Cryptographic Hash Functions

A procedure to compute a probabilistic bound for the maximum tardiness using stochastic simulation

Political Economics II Spring Lectures 4-5 Part II Partisan Politics and Political Agency. Torsten Persson, IIES

Search for Dark Matter Captured in the Sun with the IceCube Neutrino Observatory

Classifier Evaluation and Selection. Review and Overview of Methods

Random Forests. Gradient Boosting. and. Bagging and Boosting

Rumor Spreading and Voting

Do Individual Heterogeneity and Spatial Correlation Matter?

Dimension Reduction. Why and How

The predictive power of subjective probabilities: probabilistic and deterministic polling in the Dutch 2017 election

DYNAMIC RISK MANAGEMENT IN ELECTRICITY PORTFOLIO OPTIMIZATION VIA POLYHEDRAL RISK FUNCTIONALS

Report No. UCB/CSD November Computer Science Division (EECS) University of California. Berkeley, California 94720

The National Earthquake Hazards Reduction Program (NEHRP): Issues in Brief

The National Earthquake Hazards Reduction Program (NEHRP): Issues in Brief

Modeling delay relations based on historical train monitoring data: A Chinese high-speed railway case

Final Review. Chenyang Lu. CSE 467S Embedded Compu5ng Systems

Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner. Abstract

Chapter. Estimating the Value of a Parameter Using Confidence Intervals Pearson Prentice Hall. All rights reserved

Supreme Court of Florida

Supreme Court of Florida

Malta s Demographic Challenges

Deep Learning and Visualization of Election Data

Iterated Prisoner s Dilemma on Alliance Networks

Frequency-dependent fading bad for narrowband signals. Spread the narrowband signal into a broadband signal. dp/df. (iii)

Optimizing space-based microlensing exoplanet surveys

The parametric g- formula in SAS JESSICA G. YOUNG CIMPOD 2017 CASE STUDY 1

Hoboken Public Schools. Project Lead The Way Curriculum Grade 8

Statistical Analysis of Endorsement Experiments: Measuring Support for Militant Groups in Pakistan

UC-BERKELEY. Center on Institutions and Governance Working Paper No. 22. Interval Properties of Ideal Point Estimators

Ø Project Description. Ø Design Criteria. Ø Design Overview. Ø Design Components. Ø Schedule. Ø Testing Criteria. Background Design Implementation

Hoboken Public Schools. Algebra II Honors Curriculum

A Calculus for End-to-end Statistical Service Guarantees

Hoboken Public Schools. AP Statistics Curriculum

Applica'on of UQ Principles to Calibra'on, Sensi'vity, and Experimental Design

Coalition Formation and Selectorate Theory: An Experiment - Appendix

The Analytics of the Wage Effect of Immigration. George J. Borjas Harvard University September 2009

American Manufacturing: The Growth since NAFTA*

The Timeline Method of Studying Electoral Dynamics. Christopher Wlezien, Will Jennings, and Robert S. Erikson

Supreme Court of Florida

RECOMMENDED CITATION: Pew Research Center, May, 2017, Partisan Identification Is Sticky, but About 10% Switched Parties Over the Past Year

PPIC Statewide Survey Methodology

Do two parties represent the US? Clustering analysis of US public ideology survey

Introduction to Text Modeling

Instructors: Tengyu Ma and Chris Re

RECOMMENDED CITATION: Pew Research Center, October, 2016, Trump, Clinton supporters differ on how media should cover controversial statements

Can Ideal Point Estimates be Used as Explanatory Variables?

UTAH LEGISLATIVE BILL WATCH

Announcements. HW3 Due tonight HW4 posted No class Thursday (Thanksgiving) 2017 Kevin Jamieson

Response to the Report Evaluation of Edison/Mitofsky Election System

CS 229: r/classifier - Subreddit Text Classification

Computational challenges in analyzing and moderating online social discussions

AERONAUTICAL MOBILE COMMUNICATIONS PANEL (AMCP) EIGHTH MEETING. Montreal, 4 to 13 February 2003 AGENDA ITEM 5

Real-Time Wireless Control Networks for Cyber-Physical Systems

Do ideologically intransigent parties affect the policy positions of other parties?

ADVANCED SCHEDULING - OVERVIEW OF CHANGES COMING AUGUST 2014

POLITICAL EQUILIBRIUM SOCIAL SECURITY WITH MIGRATION

A 209 SRO Telecommunications (Terminal Equipment and Public Network) Regulations

Model-Based Pre-Election Polling for National and Sub-National Outcomes in the US and UK

The John Marshall Journal of Information Technology & Privacy Law

Income, Deprivation, and Perceptions in Latin America and the Caribbean:

Prediction of Refugee Population in School Districts. Chunhao SHAO* Takeyoshi TANAKA**

Collective Decisions, Error and Trust in Wireless Networks

arxiv: v2 [physics.soc-ph] 3 Jul 2016

Abstract: Submitted on:

Intersections of political and economic relations: a network study

A Dead Heat and the Electoral College

The Effectiveness of Receipt-Based Attacks on ThreeBallot

CS 5523 Operating Systems: Intro to Distributed Systems

Transcription:

Probabilistic earthquake early warning in complex earth models using prior sampling Andrew Valentine, Paul Käufl & Jeannot Trampert EGU 2016 21 st April www.geo.uu.nl/~andrew a.p.valentine@uu.nl

A case study: The Whittier and Chino faults

An ideal EEW source determination system Ø Treats all physical effects properly, particularly: Ø Complex wave propagation phenomena Ø Strongly heterogeneous crustal structures Ø Operates in a probabilistic framework includes full treatment of uncertainties Ø Has low computational costs during operation Achieving any two of these is reasonably straightforward but can we have all three?

Treating physical effects properly COMPUTATIONAL INFRASTRUCTURE FOR GEODYNAMICS (CIG) PRINCETON UNIVERSITY (USA) CNRS and UNIVERSITY OF MARSEILLE (FRANCE) ETH ZÜRICH (SWITZERLAND) SPECFEM 3D Cartesian User Manual Version 3.0 g Ø Full numerical regional wavefield simulations: SPECFEM 3D (e.g. Peter et al., GJI, 2011). Ø 3D structural model for Southern California, CVMH11.9 (Tape, Liu, Maggi & Tromp, Science, 2009). Ø Simulation cost: ~100 CPU hours/source REPORTS 121 120 119 118 117 116 115 121 120 119 118 117 116 115 121 120 119 118 117 116 115 G 35 Depth = 10.0 km Depth = 20.0 km Vs km/s (3.10 ± 15 %) SA SA Depth = 2.0 km 2.8 3.0 3.2 3.4 3.6 G G SA 34 33 SA SA SA 36 Vs km/s 3.2 3.4 3.6 3.8 (3.50 ± 10 %) Vs km/s 3.4 3.6 3.8 4.0 (3.67 ± 10 %) Fig. 4. Horizontal cross sections of V tomographic model m at depths of 2, 10, and 20 km. See fig. S1 for locations of major features; Garlock (G) and San

An ideal EEW source determination system Ø Treats all physical effects properly, particularly: Ø Complex wave propagation phenomena Ø Strongly heterogeneous crustal structures Ø Operates in a probabilistic framework includes full treatment of uncertainties Ø Has low computational costs during operation Achieving any two of these is reasonably straightforward but can we have all three?

MCMC: Posterior sampling Sampling process cannot start until after observations have been made: infeasible for EEW scenarios

An alternative approach: Prior sampling 1. Draw model parameter(s) at random from the prior distribution 2. Compute synthetic data corresponding to that model 3. Add random noise chosen to represent observational and modeling uncertainties

Joint data-model probability density function

Joint data-model probability density function

Learning a representation of the pdf How can we represent and store the joint distribution effectively? Ø Assume it to be smooth and continuous Ø Represent 1D marginal distributions as Gaussian mixture models (GMMs) Ø Coefficients of GMM are the outputs of a neural network, called a Mixture Density Network (MDN) p(m d 0 )= NX i (d 0 )exp i=1! (m µ i (d 0 )) 2 2 2 i (d 0) Bishop, 1995. Neural networks for pattern recognition, OUP

Prior sampling: Ø Separates computationally-intensive sampling stage from time-critical evaluation stage Ø Is highly inefficient for a single observation but ideally-suited to a monitoring setting, where the same inverse problem must be solved repeatedly Ø Results in conservative uncertainty estimates that could be reduced with more targeted sampling Evaluation takes a fraction of a second!

Can we make sampling tractable for EEW? Source parameters of interest: Ø 3 location parameters Ø 5 moment tensor components Ø Source half-duration 9-dimensional model space Number of samples required to achieve a given sampling density in D dimensions ~ exp(d)

Can we make sampling tractable for EEW? Source parameters of interest: Ø 3 location parameters Ø 5 moment tensor components Ø Source half-duration 9-dimensional model space However: Point source seismogram can be expressed as linear combination of 6 independent Green s functions:! 6X s(m, x,t,, x c, )=f(t, ) M i i (x,t,x c, ) i=1 Expensive sampling only needs to be done in 3 dimensions! Strategy: 1. Randomly sample locations; compute 6 Green s functions using SPECFEM3D 2. Randomly sample moment tensors and source halfdurations at each location and construct seismograms

150 ( CPU allocation/(6*cost per simulation)) locations distributed randomly in box enclosing Whittier & Chino faults

Record wavefield at: Ø 1300 real stations in Southern California, plus Ø Regular grid of 600 virtual stations Simulate wave propagation for 200s after origin time, complete to 0.5Hz Waveforms available for download (12Gb!): www.geo.uu.nl/~jeannot 36 N 35 N With thanks to: Cartesius, the Dutch national supercomputer 34 N 33 N 122 W 121 W 120 W 119 W 118 W 117 W 116 W 115 W

Ø From 150 sets of Green s functions we generate ~2 million samples of (location, moment tensor, half duration, seismograms) Ø Assume all source mechanisms are equally likely but would be straightforward to prefer alignment with fault Ø Use only a small subset of stations; filter data at 0.2Hz Ø Train learning algorithms to provide a smooth representation of the joint data-model distribution

Does it work? Synthetic test Synthetic test 6s 15s 30s 45s 0.02 0.92 0.72 0.78 0 2 apple (strike) Ø Window starts when the first station triggers Ø Wait t seconds and use recordings from all stations some may not have recorded signal Ø Each window length is implemented as a separate learning algorithm

Does it work? Synthetic test Synthetic test 6s 15s 30s 45s 0.02 0.06 0.44 0.20 0.02 0.92 0.72 0.78 0.10 0.28 0.63 0.56 /6 0 /6 0.06 0.02 0.24 0.03 0 2 apple (strike) 1.74 2.04 2.81 2.55 /2 0 /2 (rake) 0.24 0.30 0.81 0.86 0 0.5 1 h (cos(dip)) 0.92 1.57 1.80 1.71 5 6.5 8 M w 0.85 1.82 1.94 1.86 1.5 10.75 20 depth [km] 1.63 1.66 1.87 1.87 33.79 33.92 34.05 lat [ ] 118.05-117.81 117.57 lon [ ] 0 5 10 [s]

Does it work? M w 5.4, Chino Hills, 2008 USGS CMT USGS BW GCMT SCSN SCSN DC Hauksson 0.01 0.09 0.37 0.40 0.18 0.41 0.83 0.70 0.07 0.12 0.12 0.38 /6 0 /6 0.02 0.01 0.09 0.07 0 2 apple (strike) 1.66 1.89 2.38 2.76 /2 0 /2 (rake) 0.23 0.68 0.79 1.13 0 0.5 1 h (cos(dip)) 0.82 1.41 1.45 1.35 5 6.5 8 M w 0.88 1.38 1.77 1.73 1.5 10.75 20 depth [km] 1.46 1.64 1.70 1.80 33.79 33.92 34.05 lat [ ] 118.05-117.81 117.57 lon [ ] 0 5 10 [s]

T = 10s T =8s 34 N Should we bother with the source at all? 33 N 33 N 5 log(pgd [m]) 1 If we have seismic observations at 10 randomly-chosen locations, 32 N 32 N what can we say about regional peak ground acceleration? 0 116 W 118 W 116 W 10 8 6 4 Inputs to system: log(pgd [m]) 34 N 34 N 34 N 34 5 N 5 Ø N receiver locations 4 4 Ø N seismograms, at those locations 3 3 Ø Point(s) at 33 which N 33PGA N estimation is desired 33 N 33 N 2 2 1 1 Output: P 32 logn 10 (P32GA) N d 1,...,d N, x 1,...x N 32 N 32 0 N 0 118116 W W 116 W 118 W 118116 W W 116 W 10 810 6810 46 8 24 6 0 34 N 34 N 5 5 log(pgd 5 log(pgd [m]) log(pgd [m]) 34 4 N 34 4 N 4 3 3 3 33 N 33 N 33 2 N 33 2 N 2 1 1 1 118116 W W 32 N 32 N 32 0 N 32 0 N 0 116 W 118 W 118116 W W 116 W 10 810 6810 46 8 24 6 0 8 8 8 34 N 4 3 2

Summary Ø By making use of learning algorithms in a prior sampling framework, EEW results can be made available within milliseconds. Ø It is becoming feasible to take advantage of state-of-the-art numerical wave propagation codes and heterogeneous 3D crustal models Ø The approach can be extended to allow direct inversion for almost any quantity of interest Waveform database: Ø Download from www.geo.uu.nl/~jeannot References: Ø Käufl, Valentine, de Wit & Trampert, BSSA, 2015. (Waveform inversion) Ø Käufl, Valentine, de Wit & Trampert, GJI, in press. (Prior sampling) Ø Käufl, Valentine & Trampert, in prep. (EEW in 3D media) a.p.valentine@uu.nl www.geo.uu.nl/~andrew