Bandit Approaches for Border Patrol

STOR-i Conference 2017, Thursday 12th January
James Grant (1), David Leslie (1), Kevin Glazebrook (1), Roberto Szechtman (2)
(1) Lancaster University; (2) Naval Postgraduate School

Border Patrol 12/01/2017 Bandits for Border Patrol 2

Border Patrol with Drones

When is this interesting (mathematically)?
- When you can't be in all places at once: have to accept detection probabilities < 1 somewhere.
- Heterogeneous drones: combinatorial aspect.
- Unknown intensity of events: uncertainty around what is a good/bad allocation.

Previous Work
- Multiple searchers, single event
  - E.g. missing persons, life rafts
  - Focus on collaboration between drones
  - Design of flightpaths etc.
- Search on a border
  - Szechtman et al. (2008)

What will we consider?
- Discretisation
- Multiple drones
- Sequential problem
- Looking for a best allocation

Modelling the Problem
- Generation of events: piecewise constant, nonhomogeneous Poisson process (NHPP).
- Probability of detection: assumed to be calculable. Depends on:
  - Drone in question
  - Number of cells to search
  - Other exogenous variables?
- We can combine these two to find the expected rate of event detection.

Full information problem
- $\lambda_i$: rate of the NHPP in cell $i$, for $i = 1, \dots, m$.
- $\tau_{i,j,k} = P(\text{drone } j \text{ detects event in cell } i \text{ if it has to search } k \text{ cells} \mid \text{event has occurred})$.
- Can calculate the expected number of detections for any drone $j$ and subset of cells $A_j$: $\sum_{i \in A_j} \tau_{i,j,|A_j|} \, \lambda_i$.
- Thus can determine a best action by Integer Programming.
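A minimal sketch of the full-information calculation, under stated assumptions: each cell is searched by at most one drone, the instance (rates `lam`, drone `quality`, and the form of `tau`) is invented for illustration, and exhaustive enumeration stands in for the integer program mentioned on the slide, which is only workable for tiny instances.

```python
from itertools import product

def expected_detections(allocation, lam, tau):
    """Sum of tau(i, j, |A_j|) * lam[i] over all searched cells.

    allocation[i] is the drone assigned to cell i, or None if unsearched.
    """
    # Count how many cells each drone has to search (|A_j|).
    load = {}
    for j in allocation:
        if j is not None:
            load[j] = load.get(j, 0) + 1
    return sum(tau(i, j, load[j]) * lam[i]
               for i, j in enumerate(allocation) if j is not None)

def best_allocation(lam, n_drones, tau):
    """Brute-force stand-in for the IP: try every cell-to-drone assignment."""
    options = [None] + list(range(n_drones))
    return max(product(options, repeat=len(lam)),
               key=lambda a: expected_detections(a, lam, tau))

# Hypothetical instance: 3 cells, 2 drones.
lam = [3.0, 1.0, 2.0]   # assumed event rates per cell
quality = [0.9, 0.6]    # assumed per-drone detection quality

def tau(i, j, k):
    # Assumed form: detection probability is diluted by the number of cells searched.
    return quality[j] / k

best = best_allocation(lam, 2, tau)
value = expected_detections(best, lam, tau)
print(best, value)
```

With this particular `tau`, spreading a drone over more cells only averages the rates it sees, so each drone ends up on a single cell; a different detection model would change the optimal allocation.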

Reinforcement Learning

Exploration v Exploitation
- With low information, no choice but to explore
- What do we do in between?
- With high information, freedom to exploit

Multi-Armed Bandits
- Play one arm in each round, receive reward
- Underlying reward distribution associated with each arm
- Want to find the optimal arm (highest mean $\mu^*$)
- Challenging due to stochasticity

Algorithms
- Rules for decision making
- Consider data observed so far
- Balance
  - exploration (long-term benefit)
  - exploitation (immediate gain)
- Quality metrics?
  - Expected regret: $R_n = n\mu^* - \sum_{t=1}^{n} \mathbb{E}(\mu_{A_t})$
  - Difference between best and selected
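As a small worked example of the regret formula, the sketch below evaluates it for one fixed sequence of arm choices, so the expectation over the algorithm's randomness is dropped. The arm means and the action sequence are invented for illustration.

```python
def expected_regret(means, actions):
    """R_n = n * mu_star - sum over rounds of the mean of the arm played."""
    mu_star = max(means)
    return len(actions) * mu_star - sum(means[a] for a in actions)

# Hypothetical 3-armed problem and a 6-round action sequence.
means = [0.2, 0.5, 0.8]
actions = [0, 1, 2, 2, 1, 2]

regret = expected_regret(means, actions)
print(round(regret, 6))  # 6 * 0.8 - 3.6 = 1.2
```

Every round spent on a suboptimal arm adds that arm's gap to the best mean, which is why a good algorithm must pull suboptimal arms only rarely as $n$ grows.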

Regret

UCB Algorithms
- Indices based on: mean estimate + uncertainty measure
- Play arm with highest index
- Large indices may reflect
  - Large previous rewards
  - High level of uncertainty
- Example: UCB1 algorithm (Auer et al., 2002)
  - INITIALISATION: Play each arm once
  - LOOP: For each arm $i$ calculate $\bar{\mu}_{i,t} = \hat{\mu}_{i,t} + \sqrt{2 \ln t / T_i(t)}$, where $\hat{\mu}_{i,t}$ is the mean estimate and $T_i(t)$ the number of plays of arm $i$ so far. Play the maximising arm.
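The UCB1 steps above can be sketched as follows. The Bernoulli arms, their means, the seed, and the horizon are assumptions chosen for illustration; the index itself is the UCB1 rule of Auer et al. (2002) for rewards in [0, 1].

```python
import math
import random

def ucb1(pull, n_arms, horizon):
    """Run UCB1 for `horizon` rounds; `pull(arm)` returns a reward in [0, 1]."""
    counts = [0] * n_arms     # T_i(t): number of plays of each arm
    means = [0.0] * n_arms    # running mean estimate for each arm
    for t in range(1, horizon + 1):
        if t <= n_arms:
            arm = t - 1       # INITIALISATION: play each arm once
        else:
            # LOOP: mean estimate + uncertainty bonus, play the maximiser
            arm = max(
                range(n_arms),
                key=lambda i: means[i] + math.sqrt(2 * math.log(t) / counts[i]),
            )
        reward = pull(arm)
        counts[arm] += 1
        means[arm] += (reward - means[arm]) / counts[arm]  # incremental mean
    return counts, means

# Hypothetical Bernoulli arms.
rng = random.Random(0)
true_means = [0.2, 0.5, 0.8]

def pull(arm):
    return 1.0 if rng.random() < true_means[arm] else 0.0

counts, means = ucb1(pull, n_arms=3, horizon=2000)
print(counts)
```

The bonus term shrinks as an arm is played more often, so rarely played arms keep being revisited while most plays concentrate on the arm with the highest estimated mean.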

Back to our problem
- Rather more complicated setup
- Combinatorial
- Poisson reward
- Thinning of true counts
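To illustrate what thinning of true counts means for the observed reward, the sketch below simulates one cell: the true number of events is Poisson with rate `lam`, each event is detected independently with probability `tau`, and only the detected count is seen. The values of `lam`, `tau`, the seed, and the number of rounds are assumptions for illustration. By the standard Poisson thinning property the detected count is itself Poisson with mean `lam * tau`.

```python
import math
import random

def sample_poisson(rate, rng):
    """Knuth's multiplication method; adequate for small rates."""
    threshold = math.exp(-rate)
    count, prod = 0, rng.random()
    while prod > threshold:
        count += 1
        prod *= rng.random()
    return count

def observe_detections(lam, tau, rng):
    """Return (true event count, detected count) for one round in one cell."""
    true_count = sample_poisson(lam, rng)
    # Thinning: each event is detected independently with probability tau.
    detected = sum(1 for _ in range(true_count) if rng.random() < tau)
    return true_count, detected

rng = random.Random(1)
lam, tau = 4.0, 0.6   # hypothetical rate and detection probability
rounds = 20000

samples = [observe_detections(lam, tau, rng) for _ in range(rounds)]
mean_detected = sum(d for _, d in samples) / rounds
print(mean_detected)  # close to lam * tau = 2.4
```

Only the detected count is available to the learner, so the rate has to be inferred from thinned observations rather than from the true event counts.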

Robust-F-CUCB
- INITIALISE: Play combinations of arms such that each arm is played once
- LOOP:
  - For each arm, calculate its index
  - Play the combination of arms that maximises the IP with respect to $\mu$
  - Observe rewards and update mean estimates

Where to now?

Thank you
Questions?

References:
- Auer, P., Cesa-Bianchi, N., and Fischer, P. (2002). Finite-Time Analysis of the Multiarmed Bandit Problem. Machine Learning, 47(2-3): 235-256.
- Bubeck, S., Cesa-Bianchi, N., and Lugosi, G. (2013). Bandits with Heavy Tail. IEEE Transactions on Information Theory, 59(11): 7711-7717.
- Chen, W., Wang, Y., and Yuan, Y. (2013). Combinatorial Multi-Armed Bandit: General Framework and Applications. In Proceedings of the 30th International Conference on Machine Learning, 151-159.

@STORiJamesG