The Pennsylvania State University The Graduate School College of the Liberal Arts. A Dissertation in Political Science. James E.

Size: px

Start display at page:

Download "The Pennsylvania State University The Graduate School College of the Liberal Arts. A Dissertation in Political Science. James E."

Branden Osborne
5 years ago
Views:

1 The Pennsylvania State University The Graduate School College of the Liberal Arts A NUANCED STUDY OF POLITICAL CONFLICT USING THE GLOBAL DATASETS OF EVENTS LOCATION AND TONE (GDELT) DATASET A Dissertation in Political Science by James E. Yonamine c 2013 James E. Yonamine Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy August 2013

2 The dissertation of James E. Yonamine was reviewed and approved* by the following: Bumba Mukherjee Associate Professor of Political Science Dissertation Advisor and Co-Chair of Committee Philip A. Schrodt Professor of Political Science Co-Chair of Comittee James Honaker Lecturer in Political Science Le Bao Assistant Professor of Statistics Lee Ann Banaszak Professor of Political Science Director of Graduate Studies of the Department of Political Science *Signatures are on file in the Graduate School ii

3 ABSTRACT As you read this sentence, know that a riot is occurring somewhere in the world. Elsewhere, or perhaps in the same location, government forces are constructing a barricade and a politician is being abducted. These types of politically motivated conflictual events are always occurring. In most places, accounts of these events quickly make their way online in the form of electronic news stories. In this dissertation, I utilize a new datasets called the Global Dataset of Events, Location, and Tone (GDELT) which contains almost 200 million politically relevant events that have been extracted from freely available online news articles. With this data, I analyze the effects of political violence on the Tel Aviv Stock exchange, measure how much civil war effects interstate war, and build an empirical model that generates accurate predictions of future conflict in Afghanistan. iii

4 TABLE OF CONTENTS List of Tables v List of Figures vi Acknowledgements vii Chapter 1: Motivation Introduction 1 The Evolution of Political Conflict Data 2 The Substance 19 References 30 Chapter 2: Using Political Event Data to Analyze Variance in Tel Aviv 100 Index Returns Introduction 42 The Literature 44 Brief Explanation of Competing Hypotheses 47 Data and Research Design 49 The GARCH Models 55 Results 57 Robustness Checks Focusing on Insurance Equities 59 Conclusion 61 References 64 Chapter 3: Effects of Domestic Conflict on Interstate Conflict: An Event Data Analysis of Monthly Levels of Onset and Intensity Introduction 78 Building Hypotheses from the Literature 80 Research Design 83 Empirical Tests 89 Conclusion 98 References 100 Chapter 4: Predicting Future Levels of Violence in Afghanistan Districts Introduction 115 Literature Review 117 Research Design 120 Forecasting Approach 121 Results 127 Future Directions 129 Conclusion 133 References 135 Chapter 5: Concluding Remarks Conclusion 147 Empirical Findings and future avenues of research 148 The Future of event data and political violence research 152 Final Thoughts 153 References 156 Appendix: A Guide to Aggregation Choices when Working with Event Data iv

5 LIST OF TABLES Chapter 1: Motivation Table 1: Example of a COW interstate war observation 39 Table 2: Example of a MID observation 39 Table 3: Example of a UCDP observation 39 Table 4: Example of an ACLED observation 39 Table 5: Example of a COW interstate war observation 40 Table 6: Example of a UCDP-GED observation 40 Table 7: Example of a KEDS Event 40 Table 8: Example of a GDELT Event Data Event 40 Chapter 2: Using Political Event Data to Analyze Variance in Tel Aviv 100 Index Returns Table 1: Correlation Matrix of Counts 67 Table 2: Correlation Matrix of material conflict MAs 67 Table 3: GARCH models of daily TA100 with DJIA control 68 Table 4: GARCH models of daily TA100 without DJIA control 69 Table 5: GARCH models of daily MGDL with DJIA control 70 Table 6: GARCH models of daily CLIS with DJIA control 71 Chapter 3: Effects of Domestic Conflict on Interstate Conflict: An Event Data Analysis of Monthly Levels of Onset and Intensity Table 1: The Effects of Lagged Domestic Conflict Onset on Interstate Conflict Onset with 1-month Lag 103 Table 2: The Effects of Lagged Domestic Conflict Onset on Interstate Conflict Onset with 2-month Lag 104 Table 3: The Effects of Changes in Intensity of 1-month Lagged Ongoing Domestics Conflicts on Interstate Conflict Onset 105 Table 4: The Effects of Changes in Intensity of 2-month Lagged Ongoing Domestics Conflicts on Interstate Conflict Onset 106 Table 5: The Effects of Changes in Intensity of 1-month Lagged Ongoing Domestics Conflicts on Whether an Ongoing Interstate Conflict Becomes more Intense 107 Table 6: The Effects of Changes in Intensity of 2-month Lagged Ongoing Domestics Conflicts on Whether an Ongoing Interstate Conflict Becomes more Intense 108 Table 7: The Effects of Changes in Intensity of 1-month Lagged Ongoing Domestics Conflicts on Changes in Interstate Conflict Intensity 109 Table 8: The Effects of Changes in Intensity of 2-month Lagged Ongoing Domestics Conflicts on Changes in Interstate Conflict Intensity 109 Table 9: How much more likely is an interstate conflict onset in month t relative to neither state experiencing an onset of domestic conflict in month t Table 10: How much more likely is an interstate conflict onset in month t relative to neither state experiencing an onset of domestic conflict in month t Table 11: How many times more likely is an interstate conflict onset in month t relative to neither state experiencing a more intense ongoing domestic conflict in month t Table 12: How many times more likely is an interstate conflict onset in month t relative to neither state experiencing a more intense ongoing domestic conflict in month t Chapter 4: Predicting Future Levels of Violence in Afghanistan Districts Table 1: Assessing Accuracy at the District Level 141 Table 2: Assessing Accuracy at the Province Level 142 Table 3: Assessing Accuracy at the Country Level 143 v

6 LIST OF FIGURES Chapter 1: Motivation Figure 1: The Number of GDELT-derived Violent Events in Homs and Aleppo from January 2012 through June Figure 2: The Number of Ground-truthed Violent Events in Homs and Aleppo from January 2012 through June Chapter 2: Using Political Event Data to Analyze Variance in Tel Aviv 100 Index Returns Figure 1:Total number of material conflict events, daily from 1992 to 2012 Figure 2: The number of material conflict events with the time trend removed, daily from 1992 to Figure 3: Raw, first-differenced, and logged first-differenced of TA100 prices 74 Figure 4: Comparison of closing TA100, CLIS, and MGDL prices 75 Figure 5: Raw, first-differenced, and logged first-differenced MGDL prices 76 Figure 6: Raw, first-differenced, and logged first-differenced CLIS prices 77 Chapter 3: Effects of Domestic Conflict on Interstate Conflict: An Event Data Analysis of Monthly Levels of Onset and Intensity Figure 1: The Effects of Domestic Conflict Onset in Month t-1 on the Likelihood of Interstate Conflict Onset in Month t 111 Figure 2: The Effects of Domestic Conflict Onset in Month t-2 on the Likelihood of Interstate Conflict Onset in Month t 112 Figure 3: The Effects of an Increasingly Severe Domestic Conflict in Month t-1 on the Likelihood of Interstate Conflict Onset in Month t 113 Figure 4: The Effects of an Increasingly Severe Domestic Conflict in Month t-2 on the Likelihood of Interstate Conflict Onset in Month t 114 Chapter 4: Predicting Future Levels of Violence in Afghanistan Districts Figure 1: The Number of Material Conflict events per Afghani District from 2001 to Figure 2: One-month Forecast of the # of Material Conflict Events in Bughran District using arfima package, with mean, 90%, and 95% confidence intervals. 145 Figure 3: Average Farm-Gate Prices for Dry Opium in Afghanistan, September 2004-March vi

7 ACKNOWLEDGEMENTS Throughout my academic career, I have received tremendous support by a host of individuals. First and foremost, I would like to thank my advisor Phil Schrodt. When deciding where to attend graduate school, Phil was the primary factor that caused me to choose Penn State and throughout my time here, I cannot overstate how valuable Phil has been in educating, mentoring, and motivating me. Additionally, James Honacker has provided invaluable methodological training and support. Likewise, Bumba Mukherjee has consistently supported my academic development. And of course, I must thank Robert Packer, who initially sparked my interest in the empirical study of conflict in 2004 and has been a steady mentor and friend since then. Lastly, a number of other people deserve mention, including Chris Zorn, Burt Monroe, Doug Lemke, Glenn Palmer, Scott Bennett, Le Bao, Daniel Kifer, Kevin Jones, and Burt Johnson. vii

8 For Mom, Dad, Grandma, and Grandpa. viii

9 CHAPTER 1. MOTIVATION 1. Introduction For better of worse, human history has been defined by the violent competition for political power. Given its ubiquity and importance, scholars have been interested in understanding and even predicting political violence for centuries. If we conservatively assume Thucydides to be the first rigorous student of political conflict, then scholars have been at this for over 2,500 years. For the better part of those years, scientific progress was relatively non-existent, as we could prove little more in the 1960s about the causes and effects of political violence than we could in 400 B.C.. But why? Especially when most other disciplines made such tremendous progress in the same time frame: in 400 B.C., leading physical scientists thought that the world was flat and that the sun was a chariot of fire, but by 1959 they had landed a man on the moon. For one, unlike physical particles, humans are complex creatures that exhibit free will, meaning that rules and laws governing their behavior are less concrete and more difficult to empirically demonstrate. Additionally, and perhaps more importantly, unlike other disciplines, the amount and quality of data on political violence was basically stagnant for a millennium. Historically, experimental data had been scarce, and whereas it has been feasible for scholars of the biological or physical science to conduct experiments, this has not been possible for scholars interested in dynamics of large-scale political violence. Moreover, nuanced observational data had been similarly difficult to collect on a large scale since this wold have required unfeasibly large numbers of highly educated and disciplined manpower to witness, record, store, and then distribute data about political events, as well as the technology to do so. Fast-forward to today, and experiments to learn about political violence are still not feasible (which is probably a good thing), but our ability to collect, store, and analyze observational data has grown exponentially. It is this rapid and relatively recent explosion in both the volume and quality of data on political conflict that makes this dissertation possible. The overarching goal of this dissertation is to illustrate how we are able to perform increasingly nuanced empirical analyses of political conflict as our data simultaneously becomes increasingly voluminous and fine-grained. Released in 2012, the Global Database of Event Location and Tone (GDELT) dataset is the largest and arguably the most technologically advanced publicly available, 1

10 political-conflict dataset. Equally importantly, this dissertation is the first ever rigorous, empirical analysis of the GDELT dataset. This introductory chapter proceeds in two main sections. In Section 2, I provide a history of political conflict data collection efforts, culminating with a discussion of GDELT. In Section 3, I frame the three distinct substantive chapters of this dissertation within the relevant literature on political conflict and provide brief overviews of findings. 2. The evolution of political conflict data Large-scale, systematic efforts at representing political violence as data did not occur until the 20th century. 1 In the last 50 years, researchers have created dozens, if not hundreds of datasets about political violence. As with any historical account, there are a number of ways to structure this discussion of the history of political violence data. I choose to structure my discussion in terms of three conceptual schools of development. Two of these schools actually center around specific universities, while the third was decentralized across institutions and organizations. As we will see, the most commonly used political violence datasets fit cleanly into one of these three categories, with little overlap. 2 First, is the University of Michigan school, which produced the Correlates of War (COW) and Militarized Interstate Dispute (MID) data sets; second, is what I deem the Scandinavian school, comprised of the Peace Research Institute of Oslo (PRIO) and Uppsala University. The Scandinavian school produced the Uppsala Conflict Data Program (UCDP), UCDP/PRIO, UCDP Georeferenced Event Dataset (UCDP-GED), and the Armed Conflict Location Event Dataset (ACLED) datasets, as well as motivating additional, ACLED-like offshoots. Third, is what I call the Machinecoded school since unlike the Michigan school and the Scandinavian school, this lineage of data collection efforts have been decentralized. The Machine-coded school is responsible for the Kansas Event Dataset (KEDS), Virtual Research Associates (VRA), the 10 Million International Dyadic Events dataset, the Integrated Conflict and Early Warning System (ICEWS), and GDELT. 1 In order to qualify as political violence, the goal of the violent act, which may either be carried out, attempted, or simply threatened, must be to intimidate or coerce a government or civilian population in furtherance of social objective or access to positions of power. This definition is borrowed from 2 I do not include in my discussion datasets that focus on a specific subset of political conflict, such as terrorism or ethnic conflict. 2

11 To facilitate discussion of the history of data collection efforts of these three schools, I use some additional terminology. First, Schrodt (2012) provide a useful scheme for differentiating different types of data based on the degree of aggregation: 3 (1) Episodic Episodic data are those coding characteristics of an extended set of events, such as a war or a crisis: the COW project [discussed in detail below] is the archetype. (2) Composite Composite events are those which occur in a relatively short period of time and limited geographical space for example, a specific skirmish that occurs during a war and multiple characteristics of the indecent are coded (3) Atomic Atomic events are basic units of political interaction date, source, target, event found in classic event data sets as World Event Interaction Survey (WEIS) and Conflict and Peace Data Bank (COPDAB), and in contemporary coding schemes such as Integrated Data for Event Analysis (IDEA) and Conflict and Mediation Event Observations (CAMEO) [which are discussed in detail below]. Second, from the earliest attempts nearly 100 years ago to the most cutting edge programs today in 2013, scholars have followed three main steps in order to build the types of datasets above: (1) Build coding rules and ontology (2) Obtain sources (3) Generate data by applying coding rules and ontology to sources As I discuss the history of political violence data collection across the three schools, I provide as detailed information as possible about this three-step data generating process University of Michigan School. In 1963, David Singer launched the COW project at the University of Michigan, which produced the first conflict dataset widely used in empirical analysis (see Singer (1972)). Although COW is often credited with producing the first dataset on political conflict, it was actually preceded by at least four earlier efforts, including Woods and Baltzly (1915), Sorokin (1937), Wright (1942), and Richardson (1960). According to Geller (2004) and Ward et al. 3 I borrow identical language from Schrodt (2012), with one minor difference. Where I write a specific skirmish that occurs during a war, Schrodt (2012) writes a terrorist attack. I believe this change provides additional conceptual clarity. 3

12 (2012), these previous projects, and especially Richardson (1960), helped to motivate and shape the COW project. The primary breakthrough of the COW project was the rigor of its coding rules and ontology (i.e. Step 2 ) for defining exactly what must occur for a series of events to be considered a war, which is still employed (with a few changes) today. For an interstate war, a minimum of 1,000 battle fatalities between two officially recognized armies within a 12-month period, with at least 100 fatalities occurring on both sides. Thus, COW provided episodic data, meaning that it thought about political conflict in terms a broader war, rather than the specific battles comprising the war or the individual events making up the battles. The release of Singer (1972) and the accompanying dataset, which provided detailed dyadiclevel data on all inter-state wars from , ushered in a new era in the study of conflict, enabling scholars to use empirical models to test hypotheses about the causes and consequence of wars. Subsequent COW projects expanded beyond inter-state wars and provided data on intrastate(small and Singer (1980)), non-state (Sarkees and Wayman (2010)), and extra-state (Sarkees and Wayman (2010)) wars. In the 40 years since the COW s first published dataset, it has been the most heavily used source of political conflict data. The process of building the COW datasets is similar today to how it was in 1963, and likely to how it was while Woods and Baltzly (1915) were working almost 100 years ago. First, COW researchers initially constructed an ontology of conflict, specifying rigorous requirements that must be met in order to political violence to qualify as various types (inter-, intra-, non-, and extra-state) of wars. Next, researchers (i.e. low paid or volunteer undergraduate/graduate students) combine Step 2 and Step 3 of the conflict-data generating process, pouring over various sources, such as newspapers, books, microfilms, and (more recently) online documents in order to determine whether historical events qualify as a type of war. These researchers then discuss amongst themselves, and eventually reach agreement about how to code the wars. 4 Below, I provide an example of how the COW database presents an interstate war: 5 [INSERT TABLE 1 HERE] Table 1 reflects that a war according to COW definition is a dyadic event occurring between two states. Each row reflects one of the two states involved in the war. In the entry above, the 2 4 The resulting data are available at 5 In the actual dataset, the day, month, and year are provided in separate columns. 4

13 entry in the WhereFought reflects that the bulk of the fighting occurred in the second state listed, which in this instance is France. Despite the popularity of COW data among empirical studies of political conflict, the coarse, episodic nature of the data constrained the types of questions that researchers could ask. Three aspects of COW were particularly limiting. First, the requirement of 1,000 battle-related fatalities excluded a large number of smaller-scale, yet important conflicts. Second, since the unit of analysis of the COW datasets is a war, COW does not provide specific, sub-state level information regarding where the war or its component battles occurred. Third, since COW uses the war as its unit of analysis, it is impossible to study the more intricate dynamics of violence that occur during the fighting. 6 Cognizant of the limitations of using COW to study conflictual activities short of war (as defined by COW), Gochman and Maoz (1984) introduced the MID dataset (also housed at the University of Michigan), which provides data regarding three types of dyadic level (i.e. occurring between two states in the international system) conflictual events: the threat of force, the display of force, and the use of force. 7 The initial MID dataset included 886 MIDs that occurred from 1816 to 1975, while the most recent published iteration (see Ghosn, Palmer and Bremer (2004)) extends coverage covered through 2001, and the forthcoming MID 4.0 will cover Additionally, whereas the original MID dataset contained three levels of dispute, MID 3.0 contains five categories: (1) No militarized action (2) Threat to use force (3) Display of force (4) Use of force (5) War [following COW guidelines] Each MID contains information regarding the states involved, the type of event, and the specific day of the onset and termination of the dispute. The types of events coded as MIDs vary greatly. For example, the model length of all MIDs is one day, though some MIDs last longer than 10 years. The figure below provides an example of a MID observation: 8 [INSERT TABLE 2 HERE] 6 See Moore (2005) for a more rigorous critique of the COW datasets. 7 See Gochman and Maoz (1984) page 588 for greater detail about these events. 8 This example is from the participant level dataset. A dispute level dataset provides data in a different format, though the information contained is identical. See for the data. 5

14 In Table 2, DispNum is the dispute number, StateAbb reflects the abbreviate name of the states, and HostLev reflects the highest level of hostility reached, on the scale of one to five as listed above. Overall, there are two primary contributions of the MID dataset, neither of which were possible using only the COW datasets. First, it allowed researchers to systematically analyze conflictual inter-state behavior short of war. Second, it enabled the study of conflict escalation, from an initial threat, to the display of force, to the use of force, all the way to large-scale war. Although MID has enabled many interesting research agendas, the data still contain some potential shortcomings. For example, each MID is assigned an initiator and a target. However, it is often difficult to discern which state in an ongoing dyadic dispute made the first threat, and ultimately subjective judgment is often required. Additionally, a dispute between states can last many years, but the MID dataset only provides information on the initiation and termination of the dispute. Thus, MID, like COW, prevents scholars from analyzing conflict dynamics that occur during a dispute. Finally, as discussed in the following paragraph, MIDs are tedious to code, which prevents analysis of political conflict in real- or near real-time. The data-collection process of the early MID efforts closely resembled that of COW, with all three steps in the process being performed without meaningful computational assistance. In the nearly 30 years since the initial release of MIDs, the coding rules and ontology have remained mostly consistent. Additionally, Step 3 (i.e. generating data by applying coding rules and ontology to sources) has likewise unchanged. However, the process by which MID researchers access relevant information (i.e. Step 2), has changed greatly. Originally, MID researchers would manually search newspapers, microfilms, etc. without technological support. In the mid 1990s, MID began to utilize assisted search engines like LexisNexis. MID 4.0 introduced automated text classification techniques, first introduced by Schrodt, Palmer and Hatipoglu (2008) and later implemented D Orazio et al. (2012), to isolate electronic news stories likely to contain relevant information form those containing unimportant information. According to D Orazio et al. (2012), this has saved considerable time and enhanced coding accuracy. However, even with these increases in efficiency, the process of updating the MID database is still tedious, meaning that new version are released sporadically rather than in real time The Scandinavian school. Like MID, the main goal of the Scandinavia school has been to move beyond the coarse structure of COW in order to generate increasingly nuanced political conflict data, though the focus has been primarily on domestic conflict. Wallensteen and Axell 6

15 (1993) introduced Scandinavia school s first dataset, called the Uppsala Conflict Data Program (UCDP) dataset. The UCDP created five different categories of conflict based on the number of battle fatalities, and like COW, UCDP required that fighting occur between the state and an armed rebel group: (1) armed conflicts, >25 total casualties (2) minor armed conflicts, >25 but <1,000 total casualties (3) intermediate conflicts, >1,000 total casualties but <1,000 in a year (4) wars, >1,000 casualties in a year (5) major armed conflicts, all intermediate conflicts and wars The most notable contribution of this initial UCDP dataset was to provide data on armed conflicts and minor armed conflicts, which were not included in COW datasets. The lower threshold of 25 battle deaths for armed conflicts measure meant that a single, small-scale battle between the military and a rebel group occurring in a specific location over the course of a single day would gain includion into the UCDP dataset. Thus, although UCDP provided information about episodic events like major war, it also contained more fine-grained, composite events that start and finish on the same day. Wallensteen and Axell (1993) initially provided data on armed conflicts (as well as larger scale intermediate and major armed conflicts) occurring globally from 1989 to 1992, and subsequent versions were released annually in the Journal of Peace Research. Table 3 provides an example of a UCDP observation. The type columns takes on an integer one through four, corresponding to the four types of conflict listed above. Also, note that the location is at the country level: [INSERT TABLE 3 HERE] In 2002, Gleditsch et al. (2002) back-coded the UCDP to 1946, resulting in a complete dataset of low, intermediate, and major conflicts from Although Gleditsch et al. (2002) improved the UCDP dataset by broadened the temporal coverage, the resulting UCDP/PRIO continued to follow UCDP coding procedure, which meant that data continued to be presented at the binary, state-year measure. Moreover, the UCDP/PRIO dataset continued to require the conflicts occur between a government and rebel actor, meaning that conflict occurring between two rebels groups would not gain inclusion into the dataset. 9 This dataset is referenced under multiple names (Prio-Uppsala, Uppsala/UCDP/PRIO, etc.), but I will refer to it exclusively as UCDP/PRIO. 7

16 In order to provide increasingly comprehensive and detailed information about political conflict beyond what was available in the UCDP/PRIO dataset, Raleigh et al. (2010) released the ACLED dataset with the goal of enabling more micro-level analyses of political conflict. Unlike UCDP/PRIO, which provides data in episodic and composite form, ACLED provides data in atomic form, and focuses exclusively on eight types of violence: 10 (1) Battle - No Change of Location (2) Battle - Rebels Control Location (3) Battle - Government Regains Control (4) Battle - Headquarters of Base Established (5) Non-violent Conflict Event (6) Rioting/Protesting (7) Violence Against Civilians (8) Non-Violent Transfer of Location Control All of the event types above, with the exception number seven, must occur between actors deemed rebels and actors affiliated with the official government, while number seven can clearly involve civilians. This meant that if rebels attacked and killed 25 civilians in a politically motivated act, ACLED would code this event, even though the COW intrastate conflict dataset and UCDP/PRIO would ignore it because a government actor was not involved. Additionally, and perhaps more importantly, ACLED was the first dataset to provide specific sub-state location information regarding where the conflict events occurred. Each event in the ACLED dataset, be it composite or atomic, The original ACLED dataset provided limited temporal coverage for 50 countries. As of February 2013, it provides 75,000 atomic events for 60 countries, with key aspects of an observation from ACLED presented below: 11 Table 4 provides an example observation from the ACLED dataset. [INSERT TABLE 4 HERE] Only a year after the release of ACLED, Melander and Sundberg (2011) released a new geo-coded dataset contains both composite and atomic conflict events called UCDP-GED (see Sundberg and Lindgren (2011) for coding rules). Like ACLED, the UCDP-GED codes events according to 10 See Raleigh et al. (2010) page 656 for a detailed list of the eight types of events coded in the ACLED dataset. 11 ACLED also provide the number of casualties as well as other information. See for more information about ACLED data. 8

17 the same four types discussed above, and provides data in a highly similar format to ACLED, as illustrated in Table 5: 12 [INSERT TABLE 5 HERE] Currently, the UCDP-GED dataset contains approximately 24,000 events for all African countries from 1998 to One major difference is that UCDP-GED contains composite events that occur across multiple days, while ACLED only focuses on specific, atomic events that occur on a single day. Despite the similarities, Eck (2012) has found that considerable differences exist between ACLED and UCD-GED codings of the same countries during the same time periods. To generate their data, the UCDP/PRIO, UCDP-GED, and ACLED project utilize a similar approach to MID, with researching searching various electronic sources of data in order to obtain news stories likely to contain relevant information about political conflict. According to Chojnacki et al. (2012), the UCDP-GED dataset collects data exclusively from five news wires: BBC Monitoring, Reuters News, Agence France Presse, Dow Jones International News, and Xinhua. Additionally, Chojnacki et al. (2012) reports that ACLED varies it sources based on the country in focus. After obtaining sources, the researchers apply the coding rules to the sources, using their best subjective judgement to code new events. Although human-coding has a number of strengths, the primary benefit for ACLED and UCDP-GED has been human s ability to connect events with places. According to Raleigh et al. (2010), this allowed ACLED to be the only dataset at the time of its release to provide geo-coded political conflict data. But, like all human-coded datasets, the process of generating data for these datasets is slow, which limits the spatial and temporal domain of coverage, the volume of total events, and the abilities to produce data in real or near real time More focused extensions. More recently in 2012, scholars released four additional humancoded atomic and composite event data datasets on political conflict. Although these were not created at one of the institutions comprising the Scandinavian school, they fit most cleanly into this section since they use similar data-generating approaches as ACLED and UCDP-GED, but do so for more focused spatial coverage. First, Urdal and Hoelscher (2012) generated of conflict events, focusing exclusively on 55 major cities in Asia and Africa. The dataset contains approximately 4,000 events geo-coded to the city level, and is generated based on human coding of Keesings Record of World Events (KRWE). Second, Daly (2012) introduced a dataset that codes for 4, The UCDP-GED also provides information on number of fatalities, the source of the the article used to code the event, and a number of other attribute. See for the data. 9

18 events across six types conflictual at the municipal-month level for Columbia, from : 2,260 combat events, 1,103 assassinations, 778 ambushes, 405 terrorist attacks, 388 capture of towns, 173 massacres, and 121 attacks on infrastructure. Third, Salehyan et al. (2012) released the Social Conflict in Africa Database (SCAD), which uses keyword searches in the Associated Press and Agence France Presse to code 7,200 instances of protests, riots, strikes, government repression, communal violence, and other forms of unrest for 47 African countries from Fourth, Chojnacki et al. (2012) collected geo-coded conflict events for Congo-Brazzaville, Democratic Republic of Congo, Sierra Leone, and Liberia using the Guardian, New York Times, Washington Post, BBC Monitoring, and LexisNexis keyword searches The Machine-coded school. Unlike the University of Michigan and the Scandinavian schools, the Machine-coded school has not been centered at specific university/universities, but rather decentralized throughout a number of institutions and organizations. Additionally, whereas the previously discussed datasets have all focused specifically on aspects of political conflict, the Machinecoded datasets account for a wider range of politically relevant events, including cooperative acts like sending aid or releasing prisoners, while also providing rigorous coverage of political conflict events. Indeed, the majority of studies utilizing machine-coded event data focus primarily on the conflictual events. Although machine-coded event data has many similarities to ACLED, SCAD, and EDACS, machine-coded event data has evolved on a considerably different tract largely independent of the human-coded datasets. Whereas the most recent human-coded efforts like SCAD and EDACS trace their roots to ACLED and UCDP-GED, which in turn used COW as a foundation, the most recent machine-coded efforts like ICEWS and GDELT stem back to WEIS and COPDAB. Thus, despite similarities, the differences are sufficiently strong to warrant discussion of machine-coded event data collection efforts separately from human-coded. Additionally, since this dissertation utilizes machine-coded event data, I give the history of the evolution of machine-coded event data more attention that given to the University of Michigan or Scandinavian school. In my discussion of the machine-coded school, I first discuss McClelland (1976) s WEIS and Azar (1980) s COPDAB. Although these projects were human-coded, they provided a foundation on which later machine-coded efforts were built. Second, I discuss the rise of modern, machinecoded event data datasets that followed the computer / internet revolution of the late 1980s and 10

19 early 1990s, like the Kansas Event Dataset (KEDS) and the Integrated Conflict Early Warning System (ICEWS) WEIS and COPDAB. In the late 1970s and 1980s, McClelland (1976) s WEIS and Azar (1980) s COPDAB began an alternative type of atomic event data (henceforth referred to as event data ) as a more nuanced alternative to the coarse episodic COW style data. Whereas the episodic COW project generally utilized a war as its unit of analysis, the WEIS and COPDAB projects were interested in capturing the specific, daily-level events (such as attacks, protests, demonstrations, meetings, etc.) that taken together, may form a battle or a way but occur in real-time as independent events. Thus, much like how COW focused heavily on a clearly defined set of rules to define both a domestic and inter-state war, WEIS and COPDAB similarly required robust ontologies and coding rules that would allow them to extract specific atomic events in a [subject-verb-object] format from news sources in a consistent and replicable way. Here, the formal definition of an event provided in Gerner et al. (1994) is useful: An event is an interaction which can be described in a natural language sentence which has as its subject and direct or indirect object an element of a set of actors, and as the verb an element of a set of actions, all of which are transitive verbs, and which can be associated with a specific point in time. Gerner et al. (1994) further specifies that, In event coding, the subject of the sentence is the source of the event, the verb determines the event code, and the object of the verb is the target. Coding ontologies or schemes are the rules by which the source, the object, and the verb presented in natural language in articles are converted into categorical actor and event codes suitable for empirical aggregation and analysis. For WEIS, COPDAB, and all subsequent event data efforts, development of the ontologies was critical, since these form the rules by which the source, the object, and the verb presented in natural language in text are converted into categorical actor and event codes suitable for empirical aggregation and analysis. 13 Since event data projects capture a broad range of events, from meetings to bombings to the provision of aid, they require far more detailed ontologies than the University of Michigan or Scandanavian school datasets. Generally, two ontologies are used in machine-coded event data datasets, one for actors that informs that source and the target, and one for verbs that informs the actions. 14 WEIS and 13 In this instance, WEIS and COPDAB are both the names of the event data projects as well as the name of the ontologies. 14 Other ontologies have been developed to code other characteristics of the sentencefor example, COPDAB and the Protocol for the Assessment of Nonviolent Direct Action (PANDA) coded for political issues and CAMEO and IDEA have ontologies for general agents. 11

20 COPDAB were the first event data ontologies. Reflecting the status of international relations at the time, both followed the realist tradition in assuming that states operated as unitary actors. This meant that all events between individuals were treated as occurring between the states of each individuals respective citizenship. For example, if a group of Pakistani rebels attacks Indian civilians across this border, both WEIS and COPDAB treat this as an attack of Pakistan against India. Thus, officially recognized states in the international system were the only actors in the WEIS and COPDAB ontologies. Consequently, both the WEIS and COPDAB event coding ontologies were also structured to capture important inter-state interactions. The WEIS event ontology is based on 22 distinct cue, or parent categories of actions (such as Consult, Reward, Warn, etc.), which take on 2-digit codes, and 63 sub categories. The sub-categories indicate additional information beyond the parent category. For example, Threaten is one of WEIS s cue categories and its 2-digit code is 17. However, when more information is presented in the article regarding the type of These systems did code a small number of militarized non-state actors such as the Irish Republican Army (IRA) and the Palestinian Liberation Organization (PLO), as well as the United Nations, but the overall focus was on nationstates. COPDAB utilizes a similar verb typology that focuses on capturing interstate events but instead of WEISs 22 cue categories, COPDAB uses 16 and places these on a conflict-cooperation continuum to facilitate empirical analyses. In comparison, recall that COW codes for five types of events (interstate war, intrastate war, non-state war, and extra-state war), MID codes for five types of events (no militarized action, threat to use force, display of force, use of force, and war), UCDP/PRIO codes 4 types of events (armed conflicts, minor armed conflicts, intermediate conflicts, and wars), and ACLED codes for nine (four types of battles, non-violent conflict events, rioting/protesting, violence against civilians, and non-violent transfers of location control). While WEIS and COPDAB were the most commonly used ontologies from the first phase of event data analysis, quite a few additional systems were developed that never gained a foothold. For example, the Behavioral Correlates of War (BCOW) data set coded historical as well as contemporary crises and had more than 100 distinct event codes, including Assume foreign kingship; the Comparative Research on the Events of Nations (CREON) data set [(Harmann et al. (1973)) 12

21 was customized for coding foreign policy behaviors, and the Sherman Facts (SHERFACTS) (Sherman and Neack (1993)) and Computer-Aided System for Analysis of Local Conflicts (CASCON) (Bloomfield and Moulton (1989)) data sets coded crisis behavior using a crisis-phase framework. Although COW and the WEIS/COPDAB projects utilized considerably different coding rules and ontologies, Step 1 and Step 3 of the conflict-data-generating process were quite similar and done almost entirely by human hand. Generally, graduate students would scour books, newspapers, and microfilms in order to procure as many sources as possible that may yield relevant information. Although coders were instructed about what types of articles to gather, they relied on their subjective judgment to determine whether an article was relevant and warranted inclusion into the archive of articles from which events were derived. Step 3 was also quite straightforward and similar. Researchers applied the coding rules and ontologies to the corpus of sources. For WEIS and COPDAB, researchers would manually recorded dozens of relevant events of interest a day, which were then transferred to punch cards and eventually to magnetic tape The computer revolution and rise of machine-coded event data. Unfortunately, the actual WEIS and COPDAB datasets were never transferred to electronic records. Thus, the resulting datasets are inaccessible, and have not been used in the empirical study of conflict in decades. However, the WEIS and COPDAB are nonetheless highly important, since they served as a launching pad for future, machine-coded projects. Throughout the 1980s, all three steps of the data-building process were done entirely by humans, with virtually no computational assistance. Then, in the late 1980s and early 1990s, major technological innovations made possible to rise of machine-coded data. To most efficiently discuss the evolution of machine-coded event data, I divide my discussion into the three component parts of the conflict-data generating process: ontology development, obtaining news stories, and processing. Ontology and coding rules. Although WEIS and COPDAB deserve much credit for spearheading the entry of event data into the mainstream of political science, a number of shortcomings became apparent over time. Gerner et al. (2002) report that the state-centric focus of WEIS and COPDAB made them ill-suited to account for sub-state level events between domestic actors. Additionally, Gerner et. al. [2002] explain that both WEIS and COPDABs verb typologies contained too few event categories: For instance, WEIS has only a single cue category of Military engagement that must encompass everything from a shot fired at a border patrol to the strategic bombing of 13

22 cities...copdab contains just 16 event categories, spanning a single conflict- cooperation continuum that many researchers consider inappropriate. Reacting to these shortcomings, a group of researchers led by Doug Bond constructed the first version of the PANDA dataset in The leading motivation behind PANDA was to more thoroughly account for domestic events, especially non-violent direct action found in protests and demonstrations but overlooked by the WEIS and COPDAB schemes. Ten years later in 1998, Bond et. al. built upon PANDA to create the more comprehensive Integrated Data for Event Analysis (IDEA), adding incorporating codes from Taylor and Jodice (1983) s World Handbook of Social and Political Indicators, WEIS, and MID. Furthermore, IDEA created additional event codes for economic events, biomedical phenomena such as epidemic disease, and various additional jurisprudence and electoral events [see Bond et al. (2003) for a further discussion of PANDA and IDEA]. In building the 10 Million International Dyadic Events dataset, King and Lowe (2004) also utilize the IDEA action ontology. In 2002, Gerner et al. (2002) released the Conflict and Event Mediation Event Observation (CAMEO) coding framework. Like PANDA and IDEA, CAMEO was designed to capture substate events and capture nuanced attributes of the actors. However, there are two differences between CAMEO and IDEA. First, while IDEAs extensions preserved backwards compatibility with multiple earlier systems, CAMEO started only with the WEIS system (plus some of the IDEA extensions) and combined WEIS categories such as WARN/THREATEN and PROMISE/REWARD that were difficult to disambiguate in machine coding. Second, CAMEOs actor codes utilize a hierarchical structure of one or more three-character codes, which reflect the country or nation of origin and as much supplementary information as the article provides regarding region, ethnic/religious group, and domestic role (military, government, etc.). Recently in 2010, the ICEWS projectusing a variety of sources such as the national government lists of the CIA World Factbook [ and lists of IGOs, NGOs, multinational corporations, and militarized groups, built on CAMEOs actor dictionary, eventually collecting over 40,000 names of important political figures in any countries in the world who had a position of prominence anytime from 1990 to This was a considerable improvement over previous 15 The PANDA dataset is no longer publicly available. See Bond et al. (2003) for a discussion of the PANDA ontology from its creators. 14

23 projects like the 10 Million International Dyadic Events dataset, which only included 450 sub-state actors. 16 Obtaining stories. Due largely to technological limitations of the era (i.e. the lack of electronic articles and computational power), the WEIS and COPDAB projects relied on human analysts to physically collect newspaper clippings, press reports, and summary accounts from Western news sources to obtain news stories. Although coders were instructed about what types of articles to gather, they relied on their subjective judgment to determine whether an article was relevant and warranted inclusion into the archive of articles from which events were derived. This manual approach began to be replaced with automated coding with the first iteration of the Kansas Event Data Set (KEDS) project in the late 1980s (see Schrodt (1990) and Schrodt (1994)). By this time, two major computing developments had occurred. First, the rise of the internet and the advent of large-scale data aggregators such as Lexis-Nexis allowed news reports to be obtained in machine-readable form. Second, computational power and natural language processing methods had advanced to the point where processing of large quantities of information was possible using personal computers. In its earliest version, the KEDS project automatically downloaded and archived Reuters leads from the NEXIS (precursor to Lexis-Nexis) service into an electronic database, then coded these using a custom computer program. Following the success of KEDS, other event data programs, such as the PANDA project adopted an automated data collection process. By 2000, virtually all large-scale event data projects in political science relied on automated collection of news stories. In addition to the data collection efforts becoming almost exclusively electronic and automated, the scope of media coverage also increased. However, until recently, academic projects like KEDS and the 10 Million International Dyadic Events dataset with global coverage relied on a small number of sources, including Reuters (or Reuters Business Briefing) and Agence France Presse (AFP) for news content. Only with the creation of the Defense Advanced Research Projects Agency (DARPA)-funded Integrated Conflict Early Warning System (ICEWS; see O Brien (2010)) project in 2009, which draws articles form 29 international and regional news sources, did an event dataset with global coverage attempt to utilize a more comprehensive list of global news outlets. The key difference between the ICEWS event data coding efforts and those of earlier NSF-funded efforts was the scale. As O Brien (2010) notes: 16 This was established by aggregating the raw dataset and counting the number of unique secondary actor codes. 15

24 ...the ICEWS performers used input data from a variety of sources. Notably, they collected 6.5 million news stories about countries in the Pacific Command(PACOM) AOR [area of responsibility] for the period This resulted in a dataset about two orders of magnitude greater than any other with which we are aware. These stories comprise 253 million lines of text and came from over 75 international sources (AP, UPI, and BBC Monitor) as well as regional sources (India Today, Jakarta Post, Pakistan Newswire, and Saigon Times). As the name suggests, the most important innovation of the machine-coded school of conflict data collection was the introduction of computer software that could replace humans and fully automate the Step 3 of the conflict-data generating process. As discussed, in the early stages of event coding, the lack of readily available electronic news stories and sufficient computing power to support machine coded efforts meant that human coding was the only viable coding option. Although human coding was initially the only available way to code events, it has three main shortcomings: it is slow, expensive, and subjective. The average human coder can code around six to ten stories an hour on a sustained basis, and very few people can reliably code more than a few hours a day because the process is so mind-numbingly boring. At that rate, it takes a team of 10 coders at least three person-years to code 80,000 news stories. Paying coders $10 an hour would cost 100,000, and the costs to training, re-training, cross-checking and management would at least double that investment. Additionally, due to the inherently subjective nature of human analytical processes, interoperability between analysts rarely exceeded 70% and often falls in the 30%-40% range (see Mikhaylov, Laver and Benoit (2012) and King and Lowe (2004)) particularly when coding is done across institutions and over long periods of time. By the late 1980s, computational power had advance to the point that it was possible to run automated coding software from personal computers. The KEDS project was the first attempt within academia to use a computer to parse through electronic text and code relevant events into an event data database, relying on dictionary-driven sparse parsing based on the WEIS typology. The sparse parsing relies primarily on simple pattern matching on the text of an article to find specific words (i.e. Israel, attack, bomb) or sets of words ( United Nations Secretary General; promised to provide aid, promised to seek revenge) that match entries in dictionaries corresponding to the actor and event ontologies. In addition, the system knows some basic rules of English grammar: for example a phase of the form Representatives of the US and France will meet with Israeli negotiators 16

25 involves two events US meets Israel and France meets Israel and the passive voice construction A U.S. convoy was attacked by Iraqi insurgents reverses the usual subject-verb-object ordering of English sentences so that this corresponds to Iraq insurgents-attack-usa. Consider the following hypothetical sentence: March 12, 1998 Israeli troops launched offensive attacks against Palestinian insurgents on Monday, in the first of what is expected to be a new wave of counter-terrorism efforts. Using the CAMEO verb typology and actor dictionaries, as well as rules that automatically concatenate the proper nouns Israeli and Palestinian with the generic agents troops and insurgents, the TABARI-derived output for the example is presented in Table 6: [INSERT TABLE 6 HERE] By the late 1990s, machine-coding had become increasingly popular, and almost all time and costs were upfront in the dictionary and software development phase. Because these were open source, they were easily adopted and upgraded. In 2000, the KEDS projects launched the Textual Analysis By Augmented Replacement Instructions (TABARI) software, which became the dominant machine coding system in event data. Subsequent automated-coding software, like the proprietary VRA Reader used to build the King and Lowe (2004) s 10 Million International Dyadic Events datasets, was modeled off of TABARI. Automated event coding has proven to be fast, accurate and replicable, inexpensive, and easily updatable. As of November 2011, TABARI was able to code 26 million stories for the ICEWS project in 6 minutes using a small parallel processing system. Since computers are able to rigidly apply coding rules, results are perfectly replicable. Moreover, because TABARI is open source, it is free to install and is easily manipulated to include customized dictionaries or coding rules, which made possible the creation of the GDELT dataset GDELT. In 2012, Kalev Leetaru released a new, cutting edge event dataset called the Global Database of Events, Location, and Tone (GDELT). This new dataset not only combined the strengths of King and Lowe (2004) dataset (i.e. global coverage) and the ICEWS dataset (detailed sub-state actor dictionaries, but it also uses an advanced natural language processing (NLP) program to provide latitude and longitude coordinates for each event, which not only combines 17

26 the strengths of the 10 Dyadic Events (i.e. global coverage) with ICEWS (i.e. robust sub-state actor coverage), but also provides latitude and longitude coordinates for the events. Thus, a typical GDELT event data observation provides latitude and longitude coordinates, as illustrated in Table 7: [INSERT TABLE 7 HERE] Part of the GDELT coding scheme is still proprietary, but here is what we know. GDELT uses TABARI and the CAMEO ontology to machine-code the entire content of electronic news stories. Additionally, GDELT obtains news stories from four sources: LexisNexis, Agence France Presse, Reuters, Associated Press, and Xinhua. Two key aspects of GDELT remain proprietary. First, it is unclear what GDELT is using for actor dictionaries, though we can be highly confident that they are of similar richness as the dictionaries built for the ICEWS project. Second, the process by which GDELT assign specific latitude and longitude coordinates to each event is still unclear. The final output is 200+ million events from 1979 to February 2013 (and at the time of writing this, it is being updated daily). Each observation contains up to 70 columns of additional information regarding the actors and location of the event. This dataset allows me two major advancements: first, it combines the strengths of the 10 Million International Dyadic Events dataset and the ICEWS project, by providing global event coverage with nuanced sub-state actor coverage. Second, it is the first machine-coded dataset to provide location information for events. Prior to GDELT, ACLED and UCDP-GED were the only large geo-coded political conflict dataset, with ACLED (the larger of the two) providing 75,000 events recorded across 60 countries. 17 Since this dissertation contains the first rigorous analyses of the GDELT data, few tests have been performed to assess the external validity of the data. However, two anecdotes suggest a high degree of external validity. First, I used the GDELT data to calculate a time-series reflecting the number of violent events that occurred per week in Aleppo and Homs during 2011 and I did this by selecting all material conflict events that occurred within the latitude and longitude coordinates surrounding Aleppo and Homs, and then calculating the sum of these event by week. Next, I plotted these time-series and visually cross referenced these values with a ground-truthed database from a Syrian NGO. The GDELT derived time series from both Aleppo and Homs appeared nearly identical to that of the ground-truthed dataset. [INSERT FIGURE 1 HERE] 17 See Appendix for a detailed description of material conflict events. 18

27 [INSERT FIGURE 2 HERE] Second, similar maps and figures reflecting violence in Afghanistan built with GDELT were presented to U.S. government officials, and they were sufficiently similar to maps built with classified ground-truthed datasets as to warrant accusations that the GDELT-derived maps were plagiarized version of the classified ground-truth maps. 3. The Substance At its core, this dissertation is a study of political violence. While all three chapters provide an empirical analysis of political violence, they address three different literatures. In Chapter 2, I focus on the costs of conflict, in Chapter 3, I analyze the causes of conflict, and in Chapter 4, I attempt predict conflict Chapter 2. There is a general assumption within the empirical study of conflict literature that war is costly. The opening sentence of one of the most heavily cited articles on war written in the last 20 years states, Fearon (1995), states: The central puzzle about war, and also the main reason we study it, is that wars are costly but nonetheless recur. Why is it important to understand the costs of conflict? The game theoretic literature provides rigorous theoretical analysis of factors impacting various aspects (i.e. onset, duration, termination) of war. Critical to many of these models is a parameter reflecting the cost of fighting, which affects whether or not actors decide to engage in conflict. For example, Fearon (1995) applies the bargaining model of war framework to demonstrate that all else being equal, two states are more likely to reach a negotiated settlement short of war as their expected costs of fighting increase. Gartzke (1999) builds on Fearon (1995), further stressing the importance of actors expectations about cost in deciding whether to fight or negotiate. Similarly, Mesquita and Siverson (1995) theoretically argue and empirically demonstrate that leaders are leaders are more likely to avoid war when the expected costs of fighting increase. All else being equal, conflict becomes more appealing as the expected costs decrease. Conversely, Wagner (2000) shows that the side facing higher expected costs will be more likely to initiate conflict, and additionally argues that state s increase their odds of reaching a favorable negotiated settlement when they are able to make their opponent believe that his costs of continued fighting will increase. Overall, it is difficult to find any game-theoretic model of war with equilibrium outcomes not affected by costs either observed or expected of fighting. Thus, the game-theoretic literature has strongly 19

28 demonstrated the extent to which costs both expected and observed affect all aspects bargaining both preceding the onset of a conflict and also during fighting. How are the costs of conflict measured? The measurement of some costs, such as government expenditures or damage to infrastructure that results from fighting is straightforward. Most governments maintain detailed accounts of expenditures related to a conflict, and it is fairly easy to appraise how much a bridge or factory would cost to rebuild after being destroyed. However, other costs of fighting are impossible to measure directly and require empirical estimation. For example, much empirical literature interested in analyzing the costs of war have attempted to empirically tests its effects on trade and GDP. For example, Collier (1999) and Kang and Meernik (2005) find that civil wars, on average, have strong negative effects on GDP. Focusing exclusively on civil war in Sri Lanka, Grobar and Gnanaselvam (1993) likewise finds civil war significantly decreased GDP performance. Furthermore, traditional wisdom would suggest that interstate war should lead to decrease in trade, and domestic conflict should lead to decreased in GDP. Any number of anecdotal examples could easily support this belief; the United State and Japan had been vibrant trading partners prior to the outbreak of WWII, and bilateral trade ceased after Russett and Oneal (2001) and Hegre, Oneal and Russett (2010) find support for this example, contending that conflict does in fact lead to lower levels of trade. Conversely, Barbieri and Levy (1999) find that on average, conflict does not impact trade. Although the majority of scholars interested in analyzing the costs of war have focused on its effects on GDP and trade, a smaller yet important literature analyzes the effects of political conflict on other important economic areas, such as commodity prices, government bonds, currencies, and equities. For example, Frey and Kucher (2000), Frey and Kucher (2001), Frey and Waldenstrom (2004), analyze the effects of WWII on U.S. government bond yields, while Ferguson (2008) performs a similar analysis but focuses on the effects of WWI. Eldor and Melnick (2004), Fratianni and Kang (2006), and Bolbol (1999) study the effects of war on currency exchange rates, Eldor and Melnick (2004) do not find that foreign exchange rates on the Israeli foreign exchange market tend to respond to terrorist attacks, whereas Bolbol (1999) demonstrates that the civil war in Lebanon that lasted until 1990 led to devaluation of the Lebanese pound. Rigobon and Sack (2003), Zussman, Zussman and Orregard (2008), Schneider and Troeger (2006) all analyze the effect of terrorist attacks on equity markets, focusing primarily on the effects of conflict in the Middle East on equities traded 20

29 on the Tel Aviv Stock Exchange (TASE), London Stock Exchange, and New York Stock Exchange (NYSE). In Chapter 2, I contribute to the broader literature interested in assessing the costs of conflict by providing an empirical analysis measuring the effects of political violence on financial markets. Drawing on the bargaining model of war framework, understanding whether conflict has a meaningful effect on financial markets is potentially important. Governments of countries that house publicly traded equity markets, such as the NYSE in the UNited States, the TASE in Israel, or the LSE in the United Kingdom, have interests in maintaining strong market performance. Not only do governments in these countries generate tax revenues from corporate profits, meaning that government revenues increase as equity prices rise, but members of government also receive considerable financial contributions from members of the finance sector. Additionally, since financial markets are so interconnected, poor performance of equity markets can increase the costs of government borrowing and adversely affect currency rates. Thus, all else being equal, governments would prefer to avoid the costs incurred with poor market performance. As previously discussed, a number of studies have analyzed the effecters of political conflict on equity markets. Among these, every single study finds that political conflict (primarily operationalized terrorist attacks) has a statistically significant effect on either equity market returns or variance in returns. It is easy to find anecdotal support for these findings, since equity markets have tended to respond negatively to major conflict events (think 9/11 of the London bombings on July 7, 2005). However, it is important to note that these studies tend to analyze the effects of conflict on financial markets located in countries that generally exist devoid of political conflict, like the United States and the United Kingdom. Thus, a logical question emerges: is the consistency of findings suggestions that equity markets do meaningful respond to political conflict reflective of a true relationship that holds across time and space, or it is more a function of biased case selection? To address this, I provide a rigorous empirical analysis of the effects of variation in the level of conflict directly at Israel on variance of returns of the Tel Aviv 100, which is an index comprised of the largest 100 companies traded on the TASE. Israel, unlike other countries that house highly liquid, publicly traded equity markets, regularly experiences high levels of political conflict. Thus, it may be the case the the regularity with which violence occurs in Israel has caused investors to price equities under the assumption that the future business climate in Israel will experience violence. If this is occurring, then equity prices should not 21

30 meaningfully vary when violent events occur, because these were largely expected. Following the logic of the game theoretic models discussed above, whether Israeli equities meaningfully respond to violence directed towards Israel should have considerable effects on conflict dynamics. For example, if we extend Fearon (1995) s bargaining model of war, Israel should be less likely to initiate a conflict with Palestinians if they believe that retaliatory attacks from the Palestinians will negatively affect financial markets, since this would impart additional costs on the Israeli government. On the other hand, if the Israeli government is confident the equities on the TA100 are more or less immune from political conflict, then that is simply one less obstacle to a more hawkish position. The central question that I address in Chapter 3 is: does variance in returns of the TA 100 index meaningfully respond to variation in levels of conflictual events targeted as Israel? To empirically test this, I utilize the GDELT dataset to construct a daily level measure reflecting the number of conflict events initiated by actors out of Israeli, Palestinian-occupied, or Lebanese territory against Israeli actors. Then, I follow common protocol within econometrics literature and utilize a series of generalized autoregressive conditional heteoskedasticity (GARCH) models to test the extent to which variance in the TA100 meaningfully responds to variation in daily conflict. Additionally, I replicate this procedure to analyze the effects of conflict on variance in equity returns of the two largest insurance companies traded on the TA100, Migdal Insurance and Financial Holdings Ltd. (MGDL) and Clal Insurance Enterprises Holdings Ltd. (CLIS). I find that on average, the TA100 does not meaningful respond to violent attacks. However, political conflict achieve weakly significance effects on variance of MGDL and consistent and highly significant effect on CLIS returns Chapter 3. Few, if any topics in international relations have received more attention than the causes of interstate conflict. In fact, a desire to better understand interstate conflict is what motivated the earliest political conflict datasets, from Woods and Baltzly (1915) to to COW. Over the past 50 year scholars have found empirical support suggesting dozens, if not hundreds of different variables meaningfully affect various aspects of interstate conflict. For example, at the dyadic level, the following factors are a sample of some that have been empirically demonstrated affect the likelihood of interstate conflict. In one most consistent finding in all of the empirical study of war, dozens of scholars, including Doyle (1986), Maoz and Abdolali (1989), Maoz and Russett (1992), have found empirical support for the democratic peace theory, which argues that wars do not occur between democracies. Perhaps even more robust than the effects of joint democracy 22

31 is the effect of distance. As the distance between states increases, the likelihood of interstate war decrease, and sharing a border dramatically increases the chances of conflict (see Wesley (1962), Vasquez (1993), Lemke and Reed (2001) and Starr and Thomas (2005)). The effects of dyadic trade levels has also received considerable attention, with many scholars studies like Russett and Oneal (2001), Bennett and Stam (2000), Gartzke, Li and Boehmer (2000), and Oneal (1996) finding that the likelihood of interstate conflict is lower between states that trade with each other, though others like Barbieri and Schneider (1999) find the opposite effect. More subtle factors, such as whether two states shared ethnic groups ( see Davis (1997) and Woodwell (2004)), have also been studied in detail. Additionally, scholars have analyzed domestic level conditions that affect the likelihood of interstate conflict. For example, Mansfield and Snyder (1995), Mansfield and Snyder (2002), and Mansfield and Snyder (2009) focus on domestic democratic transitions, finding that states are more likely to engage in interstate conflict during and after a democratizing movement. Chiozza and Goemans (2004), Wolford (2007), Gelpi and Grieco (2001) and Bak and Palmer (2010) focus on aspects of states leaders, finding that leadership tenure can increase the likelihood of conflict. Furthermore, additional scholars have tested aspects of the diversionary diversionary theories of war, which is a general theory that leaders will often seek out international crises or conflict in order to focus domestic audiences attentions away from domestic issues. For example, Morgan and Anderson (1999),Baker and Oneal (2001), and Kisangani and Pickering (2007) focus on approval ratings, Russett (1990), Fordham (1998b), Fordham (1998a), and DeRouen (2000) test the effects of economic conditions and Hess and Orphanides (1995), Smith (1996), and Tir (2010) analyze the effects of domestic elections on interstate conflict. Given the massive number of studies attempting to isolate conditions that affect interstate conflict, it is somewhat surprising that such little empirical research has analyzed the effects of domestic conflict on interstate conflict. This is even more surprising given the large number of historical examples of domestic conflict influencing interstate conflict. For example, spillover effects from the civli war in Rwanda in 1994 led to broader interstate conflict amongst states in the Great Lakes region. More recently, the ongoing civil war in Syria has led to interstate conflict, as Israel has begun launching intermittent missile attacks against Syrian government forces. 23

32 Despite the historical precedence of domestic conflicts influencing interstate conflicts, relatively little empirical work has attempted to analyze the effect that domestic conflicts can have on interstate conflict. Moreover, the studies that do attempt to empirically test linkages between domestic and interstate conflict tend to focus on either a specific sub-set of all possible types of domestic conflict and interstate conflict. For example, Trumbore (2003) focuses exclusively on the effect of domestic ethnic conflict on MID initiation, Davies (2002) analyzes the effects of riots and protests on MID initiation, and (Elbadawi and Sambanis (2002), Gleditsch (2007), Regan (2000) all analyze the extent to which a domestic conflict makes a state more likely to be the target of an intervention. Although these studies provide a theoretical and empirical foundation, they all exclude certain types of domestic conflict or interstate conflict from their analyses. For example,trumbore (2003) and Davies (2002) focus exclusively on the initiation of interstate conflict, while Elbadawi and Sambanis (2002), Gleditsch (2007), Regan (2000) only measure when a state becomes the target of interstate conflict. Gleditsch, Salehyan and Schultz (2008) largely overcomes the problem of only focusing on limited types of domestic and interstate conflict by analyzing the effects of domestic conflicts according to the comprehensive, UCDP/PRIO coding rules, on involvement in a MID. Gleditsch, Salehyan and Schultz (2008) and all of the studies referenced in the previous paragraph have two things in common. First, they all find similar results to the aforementioned studies regardless of operationalizations of conflict, a country experiencing a domestic conflict is more likely to be involved in an interstate conflict. Second, they all use binary, annual measures of domestic and interstate conflict. The first similarity is positive, since consistent empirical findings of factors influencing interstate conflict are rare in the literature. The second similarity leads to considerable shortcomings in empirically testing for relationship between domestic and interstate conflict. For example, consider tests of the effect of onsets of domestic conflict on the likelihood of an onset of interstate conflict. In this context, binary data is not problematic, since the concept of an onset is conducive to a a yes/no framework. However, the annual aspect of data is more difficult for two reasons. First, it is impossible to tell whether a domestic conflict onset actually precedes an interstate conflict if both onsets occur within the same calendar year. Thus, if scholars do not lag the variable reflecting domestic conflict, they run the risk of conflating interstate conflicts that lead to domestic conflicts (think the civil conflict that followed the U.S. Invasions of Iraq and Afghanistan) with domestic conflict that lead to interstate conflict (think the U.S. attacks on 24

33 Ghadaffi forces in response to ciivl conflict in Libya). As a result, lagging the variable reflecting domestic conflict onset is the preferred method, since this ensures that the domestic conflict onset temporal precedes the interstate conflict onset. However, this approach effectively drop instances of interstate conflict onsets that quickly follow onsets of domestic conflict in the same calendar year, as often occurs. Although the binary nature of the data employed in the extant literature does allow for crude tests of onset, it completely eliminates the possibility of tests of conflict intensity, either in the domestic or interstate conflict. Thus, scholars relying on the binary, annual level conflict data are simply unable to ask interesting questions like, does an onset of interstate conflict become more likely as ongoing domestic conflicts becoming more severe? Or, does an onset of domestic conflict affect the intensity of an ongoing interstate conflict? The central goal of Chapter 3 is to move beyond the binary, annual measures in order to provide more nuanced analyses of the effects of domestic conflict on interstate conflict. To accomplish this, I obviously need more nuanced data, which I am able to build using the GDELT data. With GDELT dataset, I derive monthly, continuous measures reflecting the number of domestic conflict events each month for over 150 countries from 1979 to 2004 and the number of inter-state conflict events per month for all non-directed dyads for the same time period, which results in over 4 million observations. This allows me to make two major advances on the current literature. First, I move beyond the annual level to provide monthly level analyses of the effects of domestic conflict onset on the likelihood of domestic conflict conflict onset. Consistent with the existing literature, I find that onsets of domestic conflict in month t in one or both states comprising a dyad tend to increase the likelihood of interstate conflict onset within that dyad in month t + 1 and month t + 2. Second, for the first time, I am able to provides tests accounting for domestic and interstate conflict intensity. I find that as ongoing domestic conflict becomes more intense in one or both states of a dyad, the likelihood of an interstate conflict onset increase. Moreover, when two states are engaged in an ongoing interstate conflict, that conflict intensity tends to lessen if both states also experience an onset of domestic conflict Chapter 4. In the previous two sections of this introductory chapter, I have cited over 60 studies performing empirical analyses of political conflict, and these are merely a small sample of the thousands of articles that have been published in peer-reviewed journals, presented at academic 25

34 conferences, or submitted as a phd dissertation to collect dust in a back corner of a university library. An obvious, yet rarely asked question, is: what is the ultimate goal of the all of these studies that comprise the subfield of political science that focuses on the quantitative study of conflict? Two sentences from Karl Deutsch s introduction to Wright (1942) s Study of War provide a commonly cited answer: war, to be abolished, must be understood. To be understood, it must be studied. 18 This statement contains two important points. First, we study war so that we can understand it. Second, we wish to understand war so that we can decrease its future occurrence. These seem like legitimate goals with which few scholars of war would likely disagree. But, how do we actually know if our empirical models are enhancing our understanding of war, and how can we use these empirical models to actually prevent the occurrence of war, or at least provide some insight about future war dynamics as to lessen human suffering? Drawing on past scientists/philosophers like Sir Francis Bacon (Bacon (1602)), Sir David Hume (Hume (1748)), Sir Karl Popper (Popper (1934)) and current political scientists like Michael Ward (Ward, Greenhill and Bakke (2010)), Phil Schrodt (Schrodt (2010)), Gary King (King and Zeng (2001), I argue that the answer to both of these questions rests on models that focus on prediction, rather than explanation. Prediction is a vital tool for both of the two big picture goals of the empirical study of conflict. First, to the extent that an empirical model has actually enhanced our understanding of war, that model will be able to better predict war. If it cannot, then it is likely that statistically significant relationships it found either only hold for a limited spatial or temporal range and no longer apply to the current world, or, the relationships were simply noisy anomalies in the data. Either way, it is unlikely that a model unable to enhance predictive accuracy of war actually increases our understand of war, and this model will certainly be unable to contribute to the future prevention of war. Methodological approaches to forecasting conflict can be generally divided into two camps: gametheoretic and data-driven. Game-theoretic approaches generally focus on predicting a single outcome, such as will two sides reach a negotiated settlement during a given series of peace talks (see Bueno de Mesquita and Lalman (1992), Bueno de Mesquita (2002), and Bueno de Mesquita (2009)). To build these style models, researchers must first determine the relevant actors cable of influencing the outcome, second estimate each of each actor s preferences regarding potential outcomes, and lastly mathematically solve for the equilibrium solutions given each actor s preferences. 18 I draw this example from Ward et al. (2012). 26

35 Bueno de Mesquita has used this approach successfully in a number of contexts (see Bueno de Mesquita (2002) and Bueno de Mesquita (2009)). However, it is slow and must be recalculated on a case-by-case basis. For example, Bueno de Mesquita often conducts rigorous interviews with relevant actors, and when this is not possible, spends considerable time reading about actors past behaviors. Thus, game-theoretic approaches are appropriate in some circumstances, such as trying to determine whether Iran will obtain a nuclear weapon, but less so in others, such as building real-time forecasts of local-levels of violence in multiple countries. The data-driven method is a far more common technique to building predictions of political conflict. This approach involves three general steps, which are often immensely complicated in actual practice: first, collect data on an outcome of interest (say, a binary measure of war onset) and ideally additional covariates that may effect the outcome of interest; second, train an empirical model on a subset of the data in order to hopefully identify empirical patterns; third, use patterns found on the training set to build predictions on a hold-out portion of the data in order to determine predictive accuracy. To complicate matters further, an additional division exists among studies using data to attempt to predict political conflict, largely defined as structural and dynamic models. Structural models, like those employed in Gurr and Harff (1996), King and Zeng (2001), Fearon and Laitin (2003), and Goldstone et al. (2010), rely on coarse, annual level data. This means that structural models are only capable of building forecasts of conflict at the state- or -dyad year. Again, this is useful in some contexts, such as theory-testing for academics or defense budget allocation for major-power governments, but not capable of providing sub-annual or sub-state forecasts. The bulk of data-driven approaches utilize dynamic models, focusing on sub-annual level variation in conflict. The vast majority of dynamic forecasting models utilize fine-grained, machine-coded event data, such as Schrodt (1999), Pevehouse and Goldstein (1999), and Shellman (2004). These studies have demonstrated are capable of providing accurate, sub-annual level forecasts of political conflict. However, the lack of geo-location information means that these studies have been unable to generate predictions desegregated to sub-state geographic units. A smaller number of dynamic forecasting models, like Weidmann and Ward (2010), utilize human-coded data that does provide geo-location information, and as a result, are able to build sub-annual level forecasts of levels of violence at the sub-state administrative unit. However, since human-coded datasets provided limited spatial coverage, it is not possible to extend these models to all countries. 27

36 The central goal of Chapter 4 is to build accurate forecasts of future levels of political violence at a sub-state and sub-annual level of temporal nuance, and do so in a way that could be applied to any future conflict occurring in any country in the world in real-time. Since this requires making many predictions in a short period of time with limited (or no) information about the preferences of the actors involved, a game-theoretic approach is not feasible. Thus, I implement a data-driven model. Historically, a tradeoff existed between using machine-coded datasets, which could provide global coverage but no geo-coded information, or human-coded datasets, which contained geo-coding but were difficult (or impossible) to maintain in real-time for a large number of countries. The use of the GDELT dataset allows me to overcome both of these shortcomings. Despite the considerable benefits resulting from the scope and detail of GDELT s 200+ million observations, this also creates technical difficulties. One major current challenge is aggregating GDELT data to sub-state geo-spatial units. Currently, this requires the combination of a number of computational scripts to first pull relevant GDELT observations, and then aggregate these using shape file and Geographic Information software (GIS). Given the time-intensity of this process, I build predictions for a single country to serve as a proof-of-concept for an eventual model with global coverage. I choose to focus on Afghanistan, since it experienced high levels of regionally dispersed violence, and existing studies have demonstrated strong local-level predictive accuracy using human-coded data (see Mangion-Zammit et al. (2012)). Using GDELT and GIS, I calculate the number of conflict events that occur at the district, province, and country level for each month from 2001 April I focus primarily on building forecast at the district-month level (number of districts = 317), since the district is Afghanistan s smaller administrative unit. However, I likewise build forecasts at the province-month (number of province = 32)) and country-month level. This allows me to speak to the effects of geo-spatial aggregation on predictive accuracy. Empirically, I build predictions using an autoregressive fractionally integrated moving average (ARFIMA) model. To assess predictive accuracy, I set aside the final 48 months from April 2009 to April March 2012 as out-of-sample test months, and iteratively build unique forecasts for each geo-spatial unit (district, province, country) for each of these months using a one-month-in-advance framework. This results in 317, 32, and 1 prediction at the district-, province-, and country-month unit of a analysis, respectively, for all 48 out-of-sample months. For each month, I calculate whether the ARFIMA model s prediction generate lower mean absolute error (MAE) across the spatial units relative to a naive model that simply predicts that 28

37 the number of conflict events in month t = month t 1. The ARFIMA model outperforms this naive model in 47 out of 48 months at the district-month level, 42 out of 48 at the province-month level, and 40 out of 48 at the country-month level. Additionally, I experiment with feature building, alternative forecasting algorithms, and the inclusion of exogenous drug price variables, but none of these approaches improve predictive accuracy achieved by the univariate ARFIMA model. 29

38 References Azar, Edward E The Conflict and Peace Data Bank (COPDAB) Project. Journal of Conflict Resolution 24: Bacon, Sir Francis Novum Organum. Forgotten Books. Reprinted in Bak, Dahee and Glenn Palmer Testing the Biden Hypotheses: Leader Tenure, Age, and International Conflict. Foreign Policy Analysis 6(3): Baker, William D. and John R. Oneal Patriotism of Opinion Leadership? The Nature and Origins of the Rally Round the Flag Effect. Journal of Conflict Resolution 45(5): Barbieri, Katherine and Gerald Schneider Globalization and Peace: Assessing New Directions in the Study of Trade and Conflict. Journal of Peace Research 36(4): Barbieri, Katherine and Jack S. Levy Sleeping with the Enemy: the Impact of War on Trade. Journal of Peace Research 36(4): Bennett, D. Scott and Allen C. Stam When (Seemingly) Innocuous Decisions Matter: Research and Estimator Choices in the Analysis of Interstate Dyads. Journal of Conflict Resolution 44: Bloomfield, Lincoln P and Allen Moulton CASCON III: Computer-Aided System for Analysis of Local Conflicts. Cambridge, Mass: MIT Center for International Studies. Bolbol, Ali A Seigniorage, Dollarization and Public Debt: The Lebanese Civil War and Recovery Experience, World Development 27(10): Bond, Doug, Joe Bond, Churl Oh, J. Craig Jenkins and Charles L. Taylor Integrated Data for Events Analysis (IDEA): An Event Typology for Automated Events Data Development. Journal of Peace Research 40(6): Bueno de Mesquita, Bruce Predicting Politics. Columbus, Ohio: Ohio State University Press. Bueno de Mesquita, Bruce The Predictioners Game. New York: Random House. Bueno de Mesquita, Bruce and David Lalman War and Reason. New Haven, CT: Yale University Press. Chiozza, Giacomo and Hein E. Goemans International Conflict and the tenure of Leaders: Is War still Ex Post Inefficient? American Journal of Political Science 48(3):

39 Chojnacki, Sven, Christian Ickler, Michael Spies and John Weisel Event Data on Armed Conflict and Security: New Perspectives, Old Security, and Some Solutions. International Interactions 38: Collier, Paul On the economic consequences of civil war. Oxford Economic Papers 51: Daly, Sarah Zukerman Organizational Legacies of Violence: Conditions favoring insurgency onset in Colombia, Journal of Peace Research 49(3): Davies, Graeme A. M Domestic Strife and the Initiation of International Conflicts: A Directed Dyad Analysis, The Journal of Conflict Resolution 46(5): Davis, David R Ethnicity Matters: Transnational Ethnic Alliances and Foreign Policy Behavior. International Studies Quarterly 41: DeRouen, Karl Jr Presidents and the Diversionary use of Force. International Studies Quarterly 44: D Orazio, Vito, Steven T. Landis, Glenn Palmer and Philip Schrodt Separating the Wheat from the Chaff: Applications of Automated Document Classification to MID.. Presented at the MidWest Political Science Meeting, Available at Doyle, Michael Liberalism and World Politics. American Political Science Review 80: Eck, Kristine In Data we Trust? A comparison of UCDP GED and ACLED conflict events datasets. Conflict and Cooperation 47(1): Elbadawi, Ibrahim and Nicholas Sambanis External interventions and the duration of civil wars. In World Bank. Policy Research Working Paper Series 2433, World Bank. Eldor, Rafi and Rafi Melnick Financial Markets and Terrorism. European Journal of Political Economy 20: Fearon, James D Rationalist Explanations for War. International Organization 49(3): Fearon, James D. and David D. Laitin Ethnicity, Insurgency, and Civil War. American Political Science Review 97(1): Ferguson, Niall Earning from History? Financial Markets and the Approach of World Wars. Brookings Papers on Economic Activity pp

40 Fordham, Benjamin. 1998a. Partisanship, Macroeconomic Policy, and the U.S. use of Force, Journal of Conflict Resolution 42: Fordham, Benjamin. 1998b. The Politics of Threat Perception and the Use of Force: A Political Economy Model of U.S. Used of Force, International Studies Quarterly 42: Fratianni, Michele and Heejoon Kang International terrorism, International Trade, and borders. Volume Research in Global Strategic Management 12: Frey, Bruno S. and Daniel Waldenstrom Markets work in war: World War II reflected in the Zurich and Stockholm bond markets. Financial History Review 11(1): Frey, Bruno S. and Marcel Kucher History as Reflected in Capital Markets: The Case of World War II. Journal of Economic History 60(2): Frey, Bruno S. and Marcel Kucher War and Markets: How Bond Values Reflect the Second World War. Economica 68(271): Gartzke, Erik War is in the error term. International Organization 55(3): Gartzke, Erik, Quan Li and Charles Boehmer Investing in Peace: Economic Interdependence and International Conflict. International Organizations 55: Geller, Daniel S Toward a Scientific Theory of War. In The Scourge of War: New Extensions on an Old Problem. University of Michigan Press pp Gelpi, Christopher and Joseph M. Grieco Attracting Trouble: Democracy, Leadership Tenure, and the Targeting of Militarized Challenges, Journal of Conflict Resolution 45(6): Gerner, Deborah J., Philip A. Schrodt, Omur Yilmaz and Rajaa Abu-Jabr The Creation of CAMEO (Conflict and Mediation Event Observations: An Event Data Framework for a Post Cold War World. Presented at the Annual Meeting of the American Political Science Association. Gerner, Deborah J., Philip A. Schrodt, Ronald A. Francisco and Judith L. Weddle The Machine Coding of Events from Regional and International Sources. International Studies Quarterly 38: Ghosn, Faten, Glenn Palmer and Stuart Bremer The MID3 Data set, 1993:2001: Procedures, Coding Rules, and Description. Conflict management and Peace Science 21: Gleditsch, Kristian Skrede Transnational dimensions of civil war. Journal of Peace Research 44(3):

41 Gleditsch, Kristian Skrede, Idean Salehyan and Kenneth Schultz Fighting at Home, Fighting Abroad: How Civil Wars Lead to International Disputes. Journal of Conflict Resolution 52(4): Gleditsch, Nils Petter, Mikael Eriksson Wallensteen, Margereta Sollenberg and Havard Strand Armed Conflict : A New Dataset*. Journal of Peace Research 39(5): Gochman, Charles S. and Zeev Maoz Militarized Interstate Disputes, : Procedures, Patterns, and Insights. Journal of Conflict Resolution 28(4): Goldstone, Jack A., Robert H. Bates, David L. Epstein, Ted Robert Gurr, Michael B. Lustik, Monty G. Marshall, Jay Ulfelder and Mark Woodward A Global Model for Forecasting Political Instability. American Journal of Political Science 54: Grobar, Lisa Morris and Shiranthi Gnanaselvam The Economic Effects of the Sri Lankan Civil War. Economic Development and Cultural Change 41: Gurr, Ted Robert and Barbara Harff Early Warning of Communal Conflict and Humanitarian Crisis. In Monograph Series on Governance and Conflict Resolution. United Nations Press. Harmann, Charles, Maurice A. East, Margaret G Hermann, Barbara G. Salmore and Stephen A. Salmore CREON: A Foreign Events Data Set. Beverly Hills: Sage Publications. Hegre, Havard, John R. Oneal and Bruce Russett Trade does promote peace: New simultaneous estimates of the reciprocal effects of trade and conflict. Journal of Peace Research 47(6): Hess, Gregory and Athananios Orphanides War Politics: An economic, Rational-Voter Framework. American Economic Review 85: Hume, Sir David An Enquiry Concerning Human Understanding. Available at Kang, Seonjou and James Meernik Civil War Destruction and the Prospects for Economic Growth. Journal of Politics 67: King, Gary and Langche Zeng Improving Forecasts of State Failure. World Politics 53: King, Gary and Will Lowe An Automated Information Extraction Tool for International Conflict Data with Performance as Good as Human Coders: A Rare Events Evaluation Design. International Organization 57(3):

42 Kisangani, Emizet F. and Jeffrey Pickering Diverting with Benevolent Military Force: Reducing Risks and Rising Above Strategic Behavior. International Studies Quarterly 51: Lemke, Doug and William Reed The Relevance of Politically relevant Dyads. Journal of Conflict Resolution 45: Mangion-Zammit, Andrew, Michael Dewar, Visakan Kadirkamanathan and Guido Sanguinetti Point process modeling of the Afghan War Diary. Proceedings of the National Academy of Science 109(31): Mansfield, Edward D. and Jack Snyder Democratization and the danger of War. International Security 20(1):5 38. Mansfield, Edward D. and Jack Snyder Democratic Transitions, Institutional Strength, and War. International Organizations 56(2): Mansfield, Edward D. and Jack Snyder Pathways to War in Democratic Transitions. International Organizations 63(2): Maoz, Zeev and Bruce Russett Alliance, Contiguity, Wealth, and Political Stability: Is the Lack of Conflict Among Democracies a Statistical Artifact? International Interactions 17: Maoz, Zeev and Nasrin Abdolali Regime Type and International Conflict, Journal of Conflict Resolution 33:3 35. McClelland, Charles A World-Event-Interaction-Survey: A Research Project on the Theory and Measurement of International Interaction and Transaction. University of Southern California. Melander, Erik and Ralp Sundberg Climate Change, Environmental Stress, and Violent Conflict Tests Introducing the UCDP Georeferenced Event Dataset. Presented at the annual convention of the International Studies Association, Montreal, March Mesquita, Bruce Bueno de and Randolph M. Siverson War and the Survival of Political Leaders: A Comparative Study of Regime Types and Political Accountability. American Political Science Review 89(4): Mikhaylov, Slava, Michael Laver and Kenneth R. Benoit Coder Reliability and Misclassifications in the Human Coding of Party Manifestos. Political Analysis 20: Moore, Will H A Problem with Peace Science: The Dark Side of COW.. 34

43 Morgan, Clifton T. and Christopher J. Anderson Domestic Support and Diversionary External Conflict in Great Britain, Journal of Politics 61(3): O Brien, Sean Crisis Early Warning and Decision Support: Contemporary Approaches and Thoughts on Future Research. International Studies Review 12(1): Oneal, John R Empirical Support for the Liberal Peace. In Economic Interdependence and International Conflict: New Perspectives on an Enduring Debate, ed. Edward D. Mansfield and Brian M. Pollins. University of Michigan Press pp Pevehouse, Jon C. and Joshua S. Goldstein Serbian Compliance or Defiance in Kosovo? Statistical Analysis and Real-Time Predictions. The Journal of Conflict Resolution 43(4): Popper, Sir Karl The Logic of Scientific Discovery. London: Rutledge Classics. Reprinted in Raleigh, Clionadh, Andrew Linke, Havard Hegre and Joakim Karlsen Introducting ACLED: An Armed Conflict Location Event Dataset. The Journal of Peace Research 47(5): Regan, Patrick M Civil Wars and Foreign Powers: Interventions and intrastate conflict. University of Michigan. Richardson, Lewis F Statistics of Deadly Quarrels. Chicago and Pittsburgh: Quadrangle/Boxwood. Rigobon, Roberto and Brian P. Sack The Effects of War Risk on U.S. Financial Markets. Available at: Russett, Bruce Economic Decline, Electoral Pressure, and the Initiation of Interstate Conflict. In Prisoners of War?, ed. Charles S. Gochman and Ned Sabrosky. D.C. Heath. Russett, Bruce and John R. Oneal Triangulating Peace: Democracy, Interdependence, and International Organization. New York: Norton. Salehyan, Idean, Cullen S. Hendrix, Jesse Hamner, Christina Case, Christpher Linebarger, Emily Stull and Jennifer Williams Social Conflict in Africa: A New Database. International Interactions 38: Sarkees, Merideth Reid and Frank Wayman Resort to War: CQPress. Schneider, Gerald and Vera E. Troeger War and the World Economy: Stock Market Reactions to International Conflict. Journal of Conflict Resolution 50(5):

44 Schrodt, Philip A Parallel Event Sequences in International Relations. Political Behavior 12(2): Schrodt, Philip A Statistical Characteristics of Events Data. International Interactions 20(1-2): Schrodt, Philip A Early Warning of Conflict in Southern Lebanon using Hidden Markov Models. In TThe Understanding and Management of Global Violence: New Approaches to Theory and Research of Protracted Conflict, ed. Harvey Starr. New York: St. Martin s Press pp Schrodt, Philip A Seven Deadly Sins of Contemporary Political Analysis. Presented at the Annual Meeting of the American Political Science Association, Washington. Schrodt, Philip A Precedents, Progress and Prospects in Political Event Data. International Interactions 38(4): Schrodt, Philip A., Glenn Palmer and Mehmet Emre Hatipoglu Automated Detection of reports of militarized interstate disputes using SVM document classification algorithm. Presented at theamerican Political Science Association. Shellman, Stephen M Measuring the Intensity of International Political Interactions Event Data: Two Interval-Like Scales. International Interactions 30(2): Sherman, Frank L. and Laura Neack Imagining the Possibilities: The Possibilities of Isolating the genome of international Conflict from the SHERFACS Dataset. In International Event Data Developments, ed. Richard L. Merrirr, Robert G. Muncaster and Dina A. Zinnes. University of Michigan Press. Singer, David The Correlates of War Project: Interim Report and Rationale. World Politics 24(2): Small, Melvin and David Singer Resort to Arms: International and Civil Wars: New York: Sage. Smith, Alastair Diversionary Foreign Policy in Democratic Systems. International Studies Quarterly 40(1): Sorokin, Pitirim Social and Cultural Dynamics: Fluctuation of Social Relationships, War, and Revolution. New York: American Book Company. Starr, Harvey and Dale G. Thomas The Nature of Border and International Conflict: Revisiting Hypotheses on Territory. international Studies Quarterly 49(1):

45 Sundberg, Ralph and Mathilda Lindgren UCDP GED Codebook Version Uppsala: Department of Peace and Conflict Research, Uppsala University. Taylor, Charles Lewis and David A. Jodice World Handbook of Political and Social Indicators: Third Edition. New Haven, CT: Yale University Press. Tir, Jaroslav Territorial Diversion: Diversionary Theory of War and Territorial Conflict. Journal of Politics 72(2): Trumbore, Peter F Victims or aggressors? Ethno-political rebellion and use of force in militarized interstate disputes. International Studies Quarterly 47(3): Urdal, Henrik and Kristian Hoelscher Explaining Urban Social Disorder and Violence: An Empirical Study of Event Data from Asian and Subsaharan African cities. International Interactions 38: Vasquez, John A The War Puzzle. Cambridge: Cambridge University Press. Wagner, Harrison R Bargaining and War. American Journal of Political Science 44(3): Wallensteen, Peter and Karin Axell Armed Conflict at the End of the Cold War, Journal of Peace Research 30(3): Ward, Michael D., Brian D. Greenhill and Kristin M. Bakke The Perils of Policy by P-Value: Predicting Civil Conflicts. Journal of Peace Research 47(5). Ward, Michael, Nils W. Metternich, Cassy Dorff, Max Gallop, Florian M. Hollenbach, Anna Schultz and Simon Weschle Learning from the past and stepping into the future: the next generation of Crisis prediction.. Weidmann, Nils B. and Michael D. Ward Predicting Conflict in Space and Time. Journal of Conflict Resolution 54(6): Wesley, James Paul Frequency of Wars and Geographic Opportunity. Journal of Conflict Resolution 6(4): Wolford, Scott The Turnover Trap: New Leaders, Reputation, and International Conflict. American Journal of Political Science 51(4): Woods, Frederick Adams and Alexander Baltzly Is War Diminishing? A Study of the prevalence of War in Europe from 1950 to the Present Day. Bostong: Houghton Mifflin. Woodwell, Douglas Unwelcome Neighbors: Shared Ethnicity and International Conflict during the Cold War. International Studies Quarterly 48(1):

46 Wright, Quincy AstudyofWar. Chicago: Reprinted in 1957 by University of Chicago Press. Zussman, Asaf, Noam Zussman and Morten Nielsen Orregard Asset Market Perspectives on the Israeli-Palestinian Conflict. Economica 75:

47 4. Appendix Table 1. Example of a COW interstate war observation WarNum StateName Start End WhereFought BatDeaths 1 Spain 4/7/ /13/ France 4/7/ /13/ Table 2. Example of a MID observation DispNum StateAbb Start End HostLev 2 YUG 5/2/ /25/ AUH 5/2/ /25/ Table 3. Example of a UCDP observation Location SideA SideB 1 Start End Type Albania United Kingdom Albania 10/22/ /31/ Table 4. Example of an ACLED observation Date Actor 1 Actor 2 Event Types Latitude Longitude 9/1/97 Military Forces of Unidentified Armed Battle-No change Djibouti ( ) Group (Djibouti) of territory 39

48 Table 5. Example of a UCDP-GED observation SideA SideB Type Start End Latitude Longitude Government of Algeria AQIM 1 10/6/ /9/ Table 6. Example of a KEDS Event Date Source Target CAMEO code CAMEO event ISRMIL PALINS 190 (Use conventional miltiary force) Table 7. Example of a GDELT Event Data Event Date Source Target Action Latitude Longitude THAMIL THAREB Figure 1. The Number of GDELT-derived Violent Events in Homs and Aleppo from January 2012 through June

49 Figure 2. The Number of Ground-truthed Violent Events in Homs and Aleppo from January 2012 through June

50 CHAPTER 2. USING POLITICAL EVENT DATA TO ANALYZE VARIANCE IN TEL AVIV 100 INDEX RETURNS Introduction Scholars and practitioners alike have long been interested in better understanding the effects of political events on financial markets. Largely after 9/11, researchers began to give considerable attention to how markets respond to political violence. Among the dozens of empirical studies that have emerged analyzing the effects of various forms of political violence (though primarily focusing on terrorism) on financial markets (including commodities, bonds, currencies, and equities), not a single study has failed to reject a null hypothesis that markets do not significantly respond to variations in violence over time. These findings tend to match much of the anecdotal evidence. For example, on the first day of trading following 9/11, the Dow Jones Industrial Average (DJIA) fell 7%. But are these results a function of research design and case selection? Should we expect these results to hold across time and space? At least one major theory the efficient markets hypothesis would suggest that we should only expect to see consistent, meaningful reactions in financial markets to political violence events when these events reveal new information. To put it more directly, traders should only respond to a political violence event if they believe that the event reveals new information about the future profitability of an asset. In some cases, like 9/11, this was clearly the case as it was first large-scale, foreign attack on the contiguous United States in nearly 200 years. But in other countries like Israel, where political violence is common, it is possible that political violence actually reveals no new information since investors are already operating in a climate where political violence is the norm. Consider what occurred in early November, 2012 in Israel, when Hamas militants operating out of the Gaza Strip escalated rocket attacks against targets in southern Israel. In response, Israel launched Operation Pillar of Defense on November 14 with 20 air strikes, killing high-profile Hamas leader Ahmed Jabari. From November 14 to the November 21 cease fire, Hamas launched over 1,000 small rockets into Israeli territory, and Israel carried out approximately 1,500 targeted 42

51 air strikes. All together, November 14 to November 21 was the most conflictual week in the Arab- Israeli conflict in nearly a decade. Meanwhile, Israeli financial markets seemed unaffected. The Tel Aviv 100 (TA100) an index of the 100 largest equities traded on the Tel Aviv Stock Exchange opened on November 14 at $ and closed on November 21 at $ , reflecting a trivial 0.25% gain for the seven day period. Though only an anecdote, this suggests the possibility that in Israel, financial markets may be fairly immune to variation in levels of political violence. Thus, while the extant literature has provided consistent findings that financial markets in stable countries respond significantly to political violence, less is known about how financial markets in conflictual countries respond to violence. Therefore, in this paper, I provide rigorous empirical tests to measure the extent to which variation in the level of violent attacks against Israel affects variance in returns of the TA100. Before proceeding, I address two questions underlying the importance of this paper: why focus on Israel and what is so important about financial markets? First, Israel is an ideal case study because it possesses an uncommon combination of a large, highly liquid financial market and high levels of political violence. Second, understanding the effects of political violence on financial markets is important for both practical and theoretical reasons. For example, from a practical investment perspective, if you are a mutual fund manager wishing to reduce your emerging market fund s volatility, should you avoid markets with political violence, like Israel, India, or Nigeria, based on the expectation that should future violence occur, market volatility will increase? Additionally, for traders, do short term opportunities for profit exist amidst political violence? If we know markets tend to not respond to political violence, but experience an abnormal negative shock following a particularly large attack, is this a strong buying opportunity? Additionally, the effect of violence on financial markets has tremendous influences on conflict dynamics. Imagine a rebel group with political demands that engages in missile attacks, bombings, riots, etc.. In Scenario 1, these actions have considerable negative effects on financial markets. Traders panic, volatility spikes, and prices plummet. In Scenario 2, traders basically ignore these rebel actions because they have been used off and on for the last 50 years and all political risk has already been priced into financial assets. Rebel bargaining power is likely considerably higher in Scenario 1, especially if the target state is a democracy and accountable to the domestic investors and corporations who are incurring the losses. In Scenario 43

52 2, the government would be under far less pressure to act since it is business as usual in the markets even amidst the increases in violence. To address the central question of this paper to what extent does variation in the level of violent attacks against Israel affects variance in returns of the TA100 I utilize the GDELT event data dataset. From this, I build daily level features that reflect the number and severity of violent events committed against Israel by relevant local actors. Next, I employ generalized autoregressive conditional heteoskedasticity (GARCH) models with a number of financial control variable to see whether the political violence events have a significant effect on variation in TA100 returns. Contrary to the existing literature, I find that the TA100 does not meaningful respond to political violence committed against Israel. As an additional robustness check, I additionally test the extent to which political violence events affect variance of daily returns of the two largest insurance companies in the TA100, with the rationale being that if any companies are likely to respond to variation in violence, it is likely to be insurance companies, who s profits fluctuate based on material damages. Using the same GARCH approach, I find that MGDL does not respond but CLIS does. 1. The Literature A number of recent studies across a variety of disciplines have attempted to analyze the effects of various forms of political violence on different types of financial markets. Since political violence is a vague concept, scholars operationalize it in different ways. In general, the extant literature focuses on three different types of political violence: terrorist attacks Though definitions vary between studies and datasets, these tend to be unannounced acts of violence committed against civilians. episodic event history A series of actions that, taken together, make up a broader event, such as the start of a campaign or war. atomic event data The specific actions of a conflict treated as unique events. 44

53 First, the majority of existing literature relating political violence to financial markets focuses on effects of terrorism (Eldor and Melnick (2004), Chen and Siems (2004), Arin, Ciferri and Spagnolo (2008), Kollias et al. (2011), Chesney, Reshetar and Karaman (2010), Johnson and Nedelescu (2005)). Due to both the highly subjective natures of the term terrorism as well as the multidisciplinary backgrounds of the scholars writing on this subject (including finance, economics, and political science), no established benchmarks exist for how to operationalize terrorism. As a result, each of the above cited studies use terrorism datasets consisting of different events, many of which provide limited (or no) justification for why certain acts of terror warranted inclusion while others failed to make it into the dataset. For example, Chen and Siems (2004) select 14 major terrorist/military events that occurred between the sinking of the Liusitania in 1915 and the attacks against the World Trade towers in 2001; Kollias et al. (2011) analyze the effects of 15 and 21 incidents of terror in the United Kingdom and Greece, respectively; Arin, Ciferri and Spagnolo (2008) uses the MIPT terrorism data to analyze variance in equity market returns in Indonesia, Israel, Spain, thailand, Turkey, and the U.K.; and Chesney, Reshetar and Karaman (2010) analyze the effects of 77 major terrorist attacks occurring around the world. In each of these cases, the scholars chose to focus on a small subsection of all possible acts of terror that occurred within their spatial and temporal domains of interest. More importantly, these subsamples of terrorist attacks are not selected at random, but rather, selected by intensity, with only the largest scale events gaining inclusion. As a result, the universal findings that terrorism events negatively affect financial markets do not speak to the effects of terrorism in general, but rather large scale and generally unexpected acts of terror. With this in mind, the results become less insightful since they are both readily obvious and not generalizable to a country like Israel, which is often the target of hundreds of small-scale missile attacks a year. Among the studies analyzing the effects of terrorism on financial markets, Eldor and Melnick (2004) provides the most rigorous research design by analyze the effects of 639 terrorist attacks against Israel from 1990 to 2003 on the Tel Aviv 100 index and Shekel-Dollar exchange rates. However, like the previous studies, Eldor and Melnick (2004) does not provide an explanation of the coding rules used to determine a terrorist attack. Additionally, although 639 seems like a sufficiently large number to eliminate the biasing effects of selecting only the most severe attacks, consider that from December 2011 to November 2012 alone, over 600 rockets were launched into 45

54 southern Israel from Gaza. Thus, it is likely that 639 attacks in a 13 year span represents only a subsection of total attacks that occurred during that time period, potentially biasing results. With regard to the central question of this chapter does variation in levels of political violence have a meaningful effect on variance in TA100 returns? terrorist attacks only represent a small portion of events that comprise the broader concept of political violence. Other forms of violence such as riots, protests, or traditional military exchanges may also be important but tend to be excluded by studies focusing exclusively on terrorism. To this end, a number of studies utilize a more inclusive event history operationalization of political violence. For example, Frey and Kucher (2000) and Frey and Waldenstrom (2004) focus on the effects of major events during WWII on European bond rates; Rigobon and Sack (2003) analyzes the effects of various events preceding the U.S. invasion of Iraq in 2003 including key addresses by George. W. Bush and meetings and activities of U.N. weapons inspectors; and Zussman, Zussman and Orregard (2008) looks at the effects of a broad range of events in Israel on TASE prices, including meetings, cease fires, and key military outcomes. Although these event history studies account for more inclusive forms of political violence than just terrorist attacks, they are still highly subjective and inherently ad hoc. The subjectivity arises from informally choosing select conflict events. For example, Frey and Kucher (2000) provide no justification for only including a handful of key events from WWII and omitting hundreds of others. Additionally, Zussman, Zussman and Orregard (2008) clearly highlights the presence of severe hindsight bias among these event history event studies. They suggest that a cut in interest rates in August, 1998 increased TASE equity returns, but they do not mention that interest rates were cut seven other times in 1998 alone. Unless the interest rate cut in August was a unique case (it was not), it is likely that the true relationship between the August 1998 interest rate cut and the increase in TASE returns was corollary, not causal. Given the ad hoc nature of the event history approaches, it is impossible to differentiate correlation from causality. Schneider and Troeger (2006) overcomes the shortcomings of both the terrorism and the event history studies by operationalizing the concept of political violence with the use of event data. In their study, Schneider and Troeger (2006) build a measure of political conflict using data from King and Lowe (2003) s 10 Million International Dyadic Events datasets, which incorporates events ranging form cooperative meetings and negotiations to conflictual threats, bombings, and artillery attacks. Further, this approach overcomes the shortcomings of the terrorism and event history 46

55 approaches in a number of ways. First, it does not require making judgment calls about whether an attack qualifies as a terrorist act a bombing is simply a bombing, regardless of whether if was intended to instill fear and targeted civilians. Second, unlike the event history approach, event data records atomic, rather than aggregate conflict events. This approach incorporates only the information available in real time, thereby avoiding hindsight bias by retrospectively clustering individual events into an aggregate event. For example, event data does not provide events like Germany invaded France. Instead, it would report that German troops crossed the border, German planes bombed french cities, and german and french troops exchanged gunfire. This also avoids hindsight bias after the fact it may be easy to attribute financial market movements to the aggregate German Invasion but in real time events occur atomically, not cumulatively. Though Schneider and Troeger (2006) make a convincing case for the use of event data and find interesting results, this chapter builds on their work in two important ways. First, whereas Schneider and Troeger (2006) are interested in the effects of levels of violence in Israel on global financial markets (the DJIA and FTSE), I focus on the effects on the domestic TASE. I argue that this is important for a number of reasons. For example, although Israeli policymakers work closely with politicians in the United States (who may alter their positions towards Israel based on the effects of conflict in Israel on U.S. financial markets), it is likely that Israeli policy makers take into much greater consideration the effects of conflict on their own domestic markets. Imagine if the Hamas rocket attacks from December 2011 through November 2012 had dramatic and detrimental effects on the TASE, either in the form of lower returns or greater market volatility. 1 Additionally, it is interesting from a theoretical position because Israel represents one of the only cases of a well-functioning, highly liquid financial market in a highly conflictual region. Given its uniqueness, findings in the extant literature that consistently find that financial markets do respond to political conflict may not hold for Israel. Second, I utilize a newer, more comprehensive event data set and utilize the event data to build more logical measures of political violence. In Section 3, I outline the use of event data in Schneider and Troeger (2006) highlighting shortcomings, introduce the GDELT dataset, and discuss how my aggregation techniques overcome these shortcomings. Before proceeding to Section 3, I first briefly outline theoretical arguments underlying competing hypotheses arguing that the TA100 should and should not respond to violence. 1 All else being equal, many fund managers prefer less volatility, meaning that enhanced volatility in the TA 100 would likely discourage investment. 47

56 2. Brief explanation of competing hypotheses Much of the extant literature analyzing the relationship between political violence and financial markets tend to suggest the similar, intuitive explanation that markets respond negatively to political violence events since these disrupt economies both physically and mentally. Indeed, implicit in some of the most canonical models of political conflict is that all else being equal, violence is costly to a state (see Fearon (1995)). Physically, violence can damage the means of production and the infrastructure needed to transport goods. Mentally, violent events both cause fear among investors and buyers of goods (especially acts of terrorism) and generate feelings of ill will between leaders thereby disrupting trade and other forms of mutually beneficial commerce (see Anderton and Carter (2001) and Anderton and Carter (2003) for empirical evidence suggesting war disrupts interstate trade). Any number of case studies 9/11 world trade attacks, Germany s invasion of France in 1940, July 7th bombings in London support the straightforward argument that financial markets tend to respond significantly and negatively to political violence. This all suggests that the same relationship should hold true in Israel, which leads to the sole hypothesis of this paper: Hypothesis 1: Variation in the level of violence towards Israel will have a statistically significant impact on variance in returns of the TA100. However, theoretical arguments suggesting that political violence has a meaningful effect on financial markets tend to conceptualize violence and consequently, peace, in black and white terms with peace being the natural state of the world, intermittently disrupted by violent events. We see this play out in the majority of the previously discussed terrorism and event history studies, which tend to subjectively analyze the effects of large scale (economically disruptive) and unexpected (unpredictable) instances of political violence. But what about in areas where political violence is common and tends to be low-scale, as is the case in many countries that would serve as interesting case studies for the effects of violence on financial markets (India, Nigeria, Indonesia, etc.)? In these cases, the relationship may not be as straightforward. According to financial theory, the price of an equity reflects investors perceptions of the underlying company s ability to profit in the future. As Robock (1971) argues, the extent to which a future event is likely to affect the price of a financial asset is directly related to extent to which the that event is predictable, because according to the efficient market hypothesis, if the future event is predictable, then the risk of that event occurring at time t + n will be priced into the asset at 48

57 time t (see Fama (1970)). Cosset and Doutriaux de la Rianderie (1985) extends this even further to assert that, events that are either expected or easy to anticipate do not constitute political risk. This line of argument fully supports much of the previous findings in the extant terrorism and event history studies, which find that political violence significantly affects financial markets because these studies tend to focus on rare, unanticipated events. Unlike large-scale terrorist attacks against western industrialized democracies, violence directed towards Israel is frequent and tends to be low scale. Thus, the efficient market hypothesis suggests that equity prices already assume that domestic corporations will exist in a climate of violence in the future. This means that during escalations in violence directed towards Israel (as occurred in November 2012), we should not expect to see markets meaningfully respond. 2 Li and Sacko (2002) provide empirical support for this line of argument in the context of interstate trade. They find that levels of trade between two states tend to decline after a military dispute between countries that do not usually engage in military disputes, but is unaffected when a dispute occurs between states that consistently experience conflict. These empirical findings along with the theoretical arguments of the efficient markets suggest the null that variation in levels of violence against Israel should not significantly affect variance in TA100 returns. Thus, both theoretical and empirical support exists for both Hypothesis 1 as well as a null finding. Null: Because traders efficiently incorporate future risk in today s prices, variation in levels of violence directed against Israel should not have a significant effect on variation in TA100 returns. 3. Data and Research Design 3.1. The Event Data. Recognizing that collecting and utilizing event data is difficult, the critiques I level against the data and data treatment of Schneider and Troeger (2006) are more the result of the improvements in the field across the last 6 years than mistakes made in Nevertheless, I am able to improve on operationalizing political violence with event data in a number of ways. First, in terms of event data quality, Schneider and Troeger (2006) use the King and Lowe (2003) 10 Million International Dyadic Events dataset, which relies exclusively on Reuters and Agence France Press (AFP) newswires. Like most automated coding systems, King and Lowe 2 This assumes that the level of violence is relatively in line with past levels. If a black swan event should occur such as an ICBM exploding in Tel Aviv, then we should expect to see a meaningful market response because this level of attack would have been unexpected and therefore not priced into assets. 49

58 (2003) s approach does not measure the scope or intensity associated with each event (i.e. it is unable to extract information about whether a bombing kills 1 person of kills 1000). Because of the reliance on just the title and lead sentence of two newswires, events tend to only generate one (or a few) article a day, regardless of their importance/scope. Thus, during periods of intensified conflict between Israeli s and Palestinians, it is likely that the event data does not fully capture the variation in actual levels of conflict at the daily level (e.g. the temporal unit of analysis employed in Schneider and Troeger (2006)), since the event data generating process used to build to 10 Million International Dyadic Events dataset is inherently smoothing. Additionally, the 10 Million Dyad Dataset was designed to capture inter-state interactions. As a result, it only recognized 450 sub-state domestic actors meaning that it is likely to miss a large number of conflictual events that target specific individuals or sub-state groups. Second, regarding treatment of the event data, Schneider and Troeger (2006) utilize questionable aggregation approaches. As discussed in the Appendix to this dissertation, studies utilizing event data must, at minimum, clearly specify their aggregation choices regarding actors, actions, and temporal range. Schneider and Troeger (2006) do clearly explain their action (counts of conflictual and cooperational events) and temporal (daily) choices. However, they make no mention of the actors between whom actions must occur in order for the event to make it into the dataset. This both renders replication impossible and inhibits our ability to interpret the empirical findings. Third, Schneider and Troeger (2006) build two measures that reflect political conflict: the count of cooperational events and the count of conflictual events. While this is not problematic, the way that they use the two counts in the time-series models is. First, these two count variables are highly correlated. In the Israel-Palestine conflict, cooperation events (including meetings and verbal dialogue) tend to follow shortly after almost all attacks (see Table 2). Simultaneously estimating two highly corollary variables in the same model can severely inhibit out ability to conduct inferences on coefficients and standard errors. Additionally, and perhaps more importantly, is that Schneider and Troeger (2006) only consider the events that occur at time t and time t 1. However, we know that political events do not occur in a vacuum. Rather, they occur within a broader political climate. Thus, it is not enough to simply account for the number of events occurring at time t and t 1 because investors are likely to interpret these events differently based on recent events. Imagine a missile exploding in Tel Aviv that kills three civilians. We should expect that traders would react differently to this missile attack if it were to occur during a peaceful period than if 50

59 it occurred amidst ongoing fighting and daily missile attacks. Thus, I argue that it is important to construct event data derived measures that reflect both the ongoing political climate as well as more immediate, short-term shocks. Existing studies that have analyzed financial markets with daily level event data have done a poor job at constructing appropriate measures. With these shortcomings in mind, I utilize the GDELT dataset and detail my aggregation choices and variable construction techniques. Following the advice of the Appendix, I describe the actor, action, and temporal choices I make during the aggregation process, and then describe further data manipulation to build meaningful measures of political violence Actors: GDELT contains events occurring between tens of thousands of actors, most of whom have no influence on Israeli equity markets (unless you believe that a rebel group flapping its wings in Mozambique cause a market crash in Israel, which I do not). I focus on events initiated by actors whose primary affiliation is with Israel, Palestine, or Lebanon against an actor whose primary affiliation is with Israel. Like all event data studies, the choice of actors is a subjective call, and all else being equal, parsimony is preferred. Arguments could be made to include actors from the United States, Saudi Arabia, Jordan, Egypt, Syria, etc.. However, from 1992 to the 2012 (i.e. the years of active trading in the TA100 index), the most pressing security concerns for Israel have arisen from agents operating from the Palestinian occupied territories (Gaza and the West bank), Lebanon, and from within Israel itself. I feel confident that focusing strictly on events initiated by actors identified as Lebanese, Palestinian, and Israeli towards Israeli actors captures the large percentage of the political events that affect Israeli financial markets Actions: GDELT uses the 20-cue category CAMEO coding system, which uses 1-4 digit numerical codes to reflect a broad spectrum of relevant political events. As the Appendix illustrates, scholars utilizing CAMEO-coded data tend to aggregate the raw codes into a more meaningful format. Although scaling CAMEO codes is the most popular action aggregation technique within the event data literature, this leads to major shortcomings (see the sum and mean problems in the Appendix. Instead of scaling, I implement a highly simplistic measure of the level of political violence by simply counting the number of events that qualify as material conflict Temporal: GDELT provides the specific day on which every event occurs. Since I use daily level financial data as my dependent variable, I keep the GDELT data in its daily form as opposed to aggregating into more coarse weekly or monthly totals. 51 Daily level temporal aggregation is

60 employed in a number of relevant studies, including Schneider and Troeger (2006), Leblang and Mukherjee (2005), Freeman, Hayes and Stix (2000), and Hammoudeh, Yuan and McAleer (2009) Measuring scope from duplicates. Automated event data extraction has advanced considerably in recent years, but still struggles to extract measures of scope. One way to measure the severity of an event is to count the number of times various news outlets report the same event. This assumes that large-scale events for example, a bombing that kills 100 civilians will receive more media attention than a smaller-scale event, such as a similar bombing that only kills one civilian. Since Schneider and Troeger (2006) use the 10 Million Dyad Dataset which only codes Reuters and AFP stories, even large scale events are likely to receive one or a few stories a day. Thus, they are unable to differentiate between a bombing that kills and a bombing that kills 100. Since the GDELT dataset codes hundreds of local, regional, and international news stories, the same large-scale event can be reported dozens or hundreds of times. In the absence of software able to extract measures of severity from the content of the articles, the number of times that a specific event is reported on a given day is the best approximation of the scope of the event. It is common practice among event data studies to eliminate duplicate entries. However, in this study, I keep all duplicates. This allows me to overcome the shortcoming in Schneider and Troeger (2006) by utilizing event data to better capture variation in the intensity of violent events. This information is especially important when analyzing consistently conflictual daily level data, which often contains minimal day-to-day variation when using the reduced form of the data. In total, my final event dataset consists of 473,197 events across the 5,148 days on which a recorded event occurred from December to August (coded events do not occur on every day) Additional manipulation of daily level count data. The process by which an observation gains inclusion into an event data dataset generally involves two broad components: 1) the event occurs in the real world; 2) the event is reported in open-source, readily accessible electronic news stories. The dramatic increased in the volume of online journalism in past decades has meant that even if the first component were to hold steady, the number of those events being coded into automated event datasets would steadily increase. Thus, a month in the GDELT dataset in 2011 with 100 material conflict events likely experienced less severe conflict in reality than a month in the GDELT 52

61 dataset with 100 material conflict events in The graph below illustrated the steady rise in the total number of events reported to have occurred between ISR-PAL-LEB from 1992 to [INSERT FIGURE 1 HERE] We know with virtual certainty that the large spikes in 2006, 2008, and 2010 did not actually experience 50 times more conflictual events that the most conflictual day from 1992 to 1995, despite the fact that 50 times more accounts of violence are in the GDELT dataset. Thus, left untreated, our data lacks external validity and would almost certainly lead to biased estimated when modeled in a time-series framework. In order to adjust the data to control for changes in the second component (i.e. the amount of online reporting) of the data generating process, I eliminate the increasing time trend through the following process. First, I regress the daily event counts by time. Second, I use the stored regression coefficient and constant term to generate predicted values for the 1991 to 2012 time period. Third, I divide the counts by the fitted values. Minor adjustments needed to be to the fitted values during the first years of the time series made (setting the minimum divisor to equal the fitted value from early 1994), since they were in some instances negative, 1, sufficiently small that adjusted values became disproportionately large and variant to the rest of the time series. Additionally, I experimented with calculating two best-fit lines, one for , and one for to reflect a major shift in Reuters policies around However, this had minimal effect on the final, de-trended values, but it did generate a strange shock at the break. Thus, I use all data from to generate the best fit line. The graph below illustrates the total sum of events after de-trending the data by dividing by fitted value. [INSERT FIGURE 2 HERE] As the graph indicates, considerable variation still exists in the series, but the increasing trend has been removed Converting counts to meaningful information: As previously discussed, political events do not occur in a vacuum. Rather, they occur within a broader political climate. Thus, it is important to construct event data derived measures that reflect both the ongoing political climate as well as more immediate, daily level shocks. Existing studies that have analyzed financial markets with daily level event data have done a poor job at constructing appropriate measures. For example, Schneider and Troeger (2006) simply account for the number of events occurring at time t and 53

62 time t 1, which ignores longer term trends in the level of conflict. Consequently, this approach ignores events occurring more than one day in the past. This lacks considerable external validity, as we know with complete certainty that investors account for a longer temporal lag than one unit. There are two general techniques that are able to account for events beyond a one unit lag. First, it is possible to simply add more temporal components to the time-series model. Though feasible, this complicates both model estimation and interpretation. Second, we can construct new variables that incorporate longer term trends as well as short term shocks. This approach allows for a more parsimonious model (since one variable is able to reflect N number of lags, rather than having to include all N lags in the model) and is easier to substantively interpret. As such, I choose this latter option. Since the event data literature does not provide any additional established techniques for manipulating data to capture trends, I construct my own measures, as listed and defined below: 6 one week MA = 1 7 violence t i i=0 The unweighted average number of violent events that occurred during the prior 7 days 13 two week MA = 1 14 violence t i i=0 The unweighted average number of violent events that occurred during the prior 14 days four week MA = i=0 violence t i The unweighted average number of violent events that occurred during the prior 28 days one week =violence i one week MA i:i 6 The change in the number of events occurring today from the unweighted average number of violent events that occurred during the prior 7 days two week = violence i two week MA i:i 13 The change in the number of events occurring today from the unweighted average number of violent events that occurred during the prior 14 days four week = violence i four week MA i:i 27 The change in the number of events occurring today from the unweighted average number of violent events that occurred during the prior 28 days 3.2. The Dependent variable. The TA100 is an index of the 100 largest companies traded on the Tel Aviv Stock Exchange, with a mean trading volume in 2012 of over 200 million shares per 54

63 day. The TA100 index was first introduced on December 31, 1991 at $100, and closed on November 17 at $1, (see Plot 1). The TA100 is an ideal for the purposes of this study since it reflects companies from an area with considerable variation in the level of conflict, high media attention, and sufficient liquidity to respond to short term shocks. In order to convert the raw TA100 time series into a more appropriate format for time-series analysis, I follow common protocol and calculate the first difference of logged returns. This is commonly employed when modeling financial time series because, as is the case with the TA100, simply taking the first differences generates a series with steadily increasing variances, as apparent in Plot 2. After taking the first difference of logged returns, the variance appears consistent throughout the series (see Plot 3), and a a Dickey-fuller test allows us to reject the null of a unit root. Additionally, I run a Philips-Perron test, which further rejects the null that a unit root exists with the first difference of the logged returns and indicates that this series is non-integrated. [INSERT FIGURE 3 HERE] 4. The GARCH models I choose to model model variance in TA100 returns using a GARCH model for two main reasons. First, it is an empirically justified approach. Like many other high frequency financial time-series, the TA100 index contains high degrees of volatility that tend to cluster rather than follow a random distribution. Notice in plot three that small returns tend to follow small returns (in the absolute value) and large returns tend to follow large returns to a greater extent than if returns were generated randomly. In his seminal study, Engle (1982) provides the first statistical approach able to account for what he deems autoregressive conditional heteroscedasticity (ARCH). To empirically test for what visually appears to be the presence of an ARCH process in the TA100 data, I run a Lagrange multiplier test, which allows me to reject the null hypothesis that no ARCH process exists with nearly 100% confidence. Instead of using an ARCH model, which tends to require a high order of autoregressive error terms, I utilize Bollerslev (1986) s Generalized ARCH, or GARCH model. Second, the majority of recent studies attempting to analyze the effects of exogenous variables (not only political violence but also elections, domestic policies, etc) on daily level financial market data employ a GARCH model or one of its variants. This includes Schneider and Troeger (2006) as well as Leblang and Mukherjee (2005), Freeman, Hayes and Stix (2000), Hammoudeh, Yuan and McAleer (2009), Bernhard and Leblang (2002), Arin, Ciferri and Spagnolo (2008), Dhankar 55

64 and Chakraborty (2007), and Mun (2008). Additionally, Alberg, Shalit and Yosef (2008) focus exclusively on univariate analyses of the TA100 and find that the GARCH model (and its variants) best fit the time series Specifying the model. A GARCH(p,q) model consists of a conditional mean and conditional variance equation, both of which allow for the inclusion of exogenous variables. For consistency, I follow adopt the notation from Leblang and Mukherjee (2005). The conditional mean is: (1) (ln(ta100 t )=λ + ψz t + t where (ln(ta100 t )=ln(ta100 t ) ln(ta100 t 1 ),λ is a constant that is approximately 0 due to the first differencing, Z t is a vector of exogenous variables, ψ is a vector of estimated coefficients, and t is the error term distributed (0,σ 2 t ). The conditional variance is: q p (2) σt 2 = ω + α i 2 t i + β i σt i 2 + δ i I i,t i=1 i=1 where ω is a constant, t i is the lagged error, σ t i is the lagged variance, I i,t is a matrix of exogenous variables, and α i, β i, and δ i are estimated parameters Control variables. Decades of literature on financial markets has uncovered hundreds of factors that influence financial markets. Although the causal direction is often difficult to uncover, we know that statistically significant relationships exist between equity markets and commodity prices (especially oil), inflation rates, trade, other global markets, domestic regime types, etc. However, within the relevant literature interested in the relationship between political events and financial markets, little established precedent exists in terms of appropriate control variables. For example, most studies focusing on the effects of terrorism on equity markets do not control for any other financial variables, which likely leads to considerably underspecified models thereby systematically overestimating the effects of the terrorist events. In two of the most methodologically sophisticated relevant studies, Leblang and Mukherjee (2005) control for trading volume, inflation, and interest 3 It is highly likely that a variant of the GARCH, such as the T-GARCH or M-GARCH, may be a better fit for the data. However, comparing all possible GARCH models far exceeds the scope of this paper. For an application of APARCH, and EGARCH to the TA100 index, see Alberg, Shalit and Yosef (2008). 56

65 rates when testing for the effects of parties on variation in DJIA returns, and Schneider and Troeger (2006) control for major global financial markets when analyzing the effects of political violence on U.S. and British equity markets. Drawing on these studies, I choose two sets of control variables: Control 1: DJIA The first difference of the logged Dow Jones Industrial average, which reflects the performance of the global economy Crude oil The first difference of logged daily Brent crude oil prices, which reflects global commodity markets Inflation The first difference of logged inflation rates, reported monthly by the Israel Central Bureau of Statistics. Monthly values are extended to the daily level by assuming constant rates for every day in a given month. Because Schneider and Troeger (2006) find that variation in the intensity of violence in Israel significantly affects variation in the DJIA, I implement a second set of control variables Control 2 that excludes the DJIA. If variation in levels of violence in Israel does, in fact, affect both variation in the DJIA and TA100, then the GARCH model may be unable to uncover the relationship between variation in violence and the TA100 due to endogeneity issues with the DJIA. If the political violence variables fail to achieve statistical significant with Control 2 variables, we can be highly confident that no true effect exists. 5. Results In Models 1-7 in Table III, I use the Control 1 set of controls in both the mean and variance equation of a GARCH (1,1) model. 4 Additionally, in Model 1-6, I test for the effects of moving averages of the level of violence across different temporal ranges (1 week in Model 1 and Model 2, two weeks in Model 3 and Model 4, and four weeks in Model 5 and Model 6). None of the six event-data derived measures of political violence achieve statistical significance in the variance equation, which suggests that traders are not, on average, highly responsive to variations in the level of violence directed at Israel. Additionally, in Model 7, I omit all measures of violence. The AIC and BIC scores, which provide alternative measures of model fit that penalize for extra parameters, suggest that Model 7 which contains no measures of political violence best fits the data. These findings suggest that we are unable to reject the null hypothesis that the TA100 4 Due to the fat tails of the distribution of returns, I assumes a Student s t, rather than a normal distribution. 57

66 does not meaningfully respond to variation in the level of intensity of violence against Israel. This finding also suggests that equity prices in Israel may already price in future political violence, and to the extent that the level of violence is not dramatically more intense than expected, markets will not take notice. Instead, it appears that the variance in TA100 returns is driven primarily by previous error and variance rates of the previous trading day, as well returns in the DJIA. 5 Second, because Schneider and Troeger (2006) find that violence in Israel affects variance in the DJIA, I rerun all analyses in Table III after omitting the DJIA as a control variable in both the mean and variance equation of a GARCH (1,1), again using Student s t distribution of errors. Model 1 and Model 2 of Table IV suggest that change from one week MA does have a meaningful effect on variation in the TA100 at a modest p=.063 value. However, the AIC and BIC scores in Model 7, which omits all political violence variables, are considerably lower, suggesting that the low level of statistical significance in Model 1 and Model 2 may not actually improve model fit. Further, as Model 3, Model 4, Model 5, and Model 6 in Table IV show, the four additional measures of political violence do not approach statistical significance. This suggests that even in a potentially underspecified model, variation in levels of political violence do not meaningfully impact variance of TA100 prices. Taken together, findings in Table III and Table IV suggest that we are unable to reject the null hypothesis with any meaningful degree of confidence. Consequently, this indicates that the causal argument outlined in Hypothesis 1 is not present. Realizing that it is dangerous to provide ad-hoc explanations of statistical findings, I cautiously argue that these findings suggest that traders already price in future violence so that when it occurs, they tend to not react. This finding has a number of important practical implications. First, and most importantly to conflict dynamics in Israel, it suggests that it is highly difficult for opponents of Israel to adversely affect Israeli equity markets through violence. This likely affects bargaining dynamics, since all else being equal, the costs that Israel incurs from being the target of violence is lower than more stable countries like the United States or England, where political violence is not already priced into assets. Second, it suggests that findings in the extant literature about the effects of political violence on various financial markets may not be generalizable to all countries. Alternatively, financial markets in countries with long histories of political violence, like Israel, may be more resilient to future violence than in more stable countries. Third, from a purely capitalist perspective, this finding suggests that any abrupt changes in Israeli equity markets 5 I also re-run Models 1, Model 2, Model3, Model 4, Model 5, and Model 6 using the one-unit lagged values of the political violence variables. The political violence variables continue to fail to achieve p-values of.1. 58

67 following escalations of attacks may reflect a buying opportunity, since the effects of those attacks (in terms of decreased corporate profitability) are likely already priced into the equity. Thus, abrupt change following violence may indicate exploitable mis-pricings. 6. Robustness checks focusing on insurance equities This chapter, like many other studies interested in the effects of political violence on equity markets, has thus far focused on an index the TA100 as the dependent variable. Although the preceding sections have found almost no empirical support suggesting that variance in TA100 returns is driven by changes in levels of violence directed against Israel, it does not mean that violence has no effect on Israeli financial markets. Rather, it simply suggests that the average effect of variation in levels of violence on the 100 largest companies tends to not be statistically significant. However, it is feasible that certain sectors or companies are more vulnerable to political violence than others. For example, consider that airline and insurance stocks suffered the largest average losses on the DJIA on the first day of trading after both the September 11 attacks as well as the London bombings on Jul 7, In this section, I repeat the research design and empirical testing outlined in Section 3 and Section 4, except instead of focusing on the TA100 index, I analyze the effects of variation in levels of violence against Israel on variance in returns of the two largest insurance companies publicly traded on the TASE: Migdal Insurance and Financial Holdings ltd. (MGDL) and Clal Insurance Enterprise Holdings (CLIS). MGDL is the largest insurance company traded on the TASE with a market cap of over 6 billion USD, with 69.3% of shares are owned by the Eliahu Insurance company and the remaining 30.7% held by public investors. CLIS is the second largest insurance company, though considerably smaller than MGDL with a market cap of 3.3 billion USD. Unlike MGDL, CLIS ownership is more diversified, with the largest shareholder owning only 10.7% of outstanding shares. Daily closing prices were obtained using a Bloomberg Terminal, and were available from June 1997 to January 2010 for MGDL and January 1995 to January 2010 for CLIS. [INSERT FIGURE 4 HERE] As illustrated in the graph, CLIS and MGDL appear to follow a similar general pattern as the TA100 of a general increasing time trend with a sharp decline during the 2008 global recession, with the correlations of.83 and.72 for CLIS:TA100 and MGDL:TA100, respectively. Additionally, 6 see effect.html. 59

68 correlation between CLIS and MGDL daily closing prices is high, at.89, which is common among stocks operating within the same sector. Following the approach in Section 3.2, prior to estimating GARCH models, I first calculate the first difference of the logged daily closing prices for both MGDL and CLIS. Figure 5 and Figure 6 illustrate the raw closing prices, the first differences of closing prices, and then the first difference of the logged prices. As the tables illustrates, the first difference of the logged prices for both equities appear to exhibit desirable stationary processes. [INSERT FIGURE 5 HERE] [INSERT FIGURE 6 HERE] In Table 5, I replicate Table 3, except use the logged first difference of MGDL instead of the TA100. In Model 1, Model 2, and Model 6, none of the variables reflecting the number of conflictual events targeted towards Israel achieve statistical significance, which is consistent with the findings in Table 3 and Table 4. However, in Model 3, Model 4, and Model 5, the variables reflecting the change in the number of conflictual events targeted towards Israel today relative to the moving average of events across the past two and four weeks does achieve moderate levels of statistical significance. The positive coefficients of the statistically significant variables indicates that increase in the level of conflict relative to the moving averages tends to increase variance in MGDL returns. This means that we are unable to reject the null hypothesis that investors do not respond to variation in the level of conflictual acts against Israel with a high degree of confidence. Table 6 replicates Table 5, but analyzes the first difference of logged returns for CLIS instead of MGDL. Across Model 1 through Model 6, the variables reflecting the change in conflictual events relative to the moving averages are consistently signifiant, with four of the five conflict variables generating p-values <.01. The robustness of these findings across the three different lengths of moving averages (i.e. one, two, and four weeks) used to calculate one week, two week, and four week allow us to confidently reject the null of Hypothesis 1. As in Table 5, the coefficients on the statistically significant variables are all positive, indicating that increases in conflict levels tend to increase variance of equity returns. Although these results clearly suggest that investors in CLIS stock do meaningfully responding to variation in the level of attacks against Israel when making trading decisions, the likelihood, AIC, and BIC scores indicate that accounting for political conflict may not actually increase model fit. In Model 2, Model 4, and Model 6, one week, two week, and four week all generate highly 60

69 significant p-values, but the AIC and BIC scores are still considerably higher than for Model 7, which omits all political conflict variables. This indicates that according to AIC and BIC measures, Model 7 actually provides a better fit of the data than Model 1 through Model 7. This tempers our interpretation of the statistical significance of the one week, two week, and four week variables. Although any number of factors could explain why CLIS stock appears to respond more strongly to variation in conflict levels than MGDL, one potential explanatory factor may be differences in the distribution of ownership of MGDL and CLIS stock. Theoretically, Aggarwal and Rao (2005) find that as the percentage of stock owned by institutional investors increases, the variance in returns of the stock tends to decrease. This is because institutional investors tend to have longerterm horizons than individual investors and pay less attention to day-to-day events. The 69.3% of MGDL owned by Assicurazioni Generali S.p.A. are not actively traded, meaning that variation in MGDL is driven exclusively by trading among the remaining 30% of shares. Thus, if all active traders of CLIS and MGDL responded similarly to conflictual events targeting Israel, the effect on CLIS shares would be greater. 7. Conclusion On November 13, 2012, the Haaretz Daily Newspaper (the most prominent Israeli newspaper printed in English) reported that the Tel Aviv Stock exchange ended lower again on Monday amid violence in Gaza. TA-100 dropped.6% to 1,061.38, attributing the decline on mortar shells that fell on Israeli settlements in the Golan Heights. 7 Only six days later on November 19, the TA-100 rose 1.1%, prompting the same newspaper to report, TASE ignores war, cheers low CPI. In that article, Haaretz reports: Israel was bombarded by a barrage of rockets Sunday as Operation Pillar of Defense entered its fifth day and Prime Minister Benjamin Netanyahu told ministers at the weekly cabinet meeting that Israel was prepared to significantly expand its operation in the Gaza Strip. Nevertheless, Hadar Oshart, head of the equities trading desk at Deutsche Bank in Israel, said foreigners had not been deterred by the fighting. The reaction of foreigners has, all told, been restrained. The sense is that the impact of

70 the operation for now is limited and doesn t demand any reassessment by overseas investors of their investments or positions. 8 These two headlines underscore the central question of this paper: does the TA-100 meaningfully respond to variation in the level of violence target at Israel? As of November 2012, the TA-100 had an approximate market cap of $114.5 billion USD. If we are to believe the November 13 Haaretz article, then missile attacks against Israel cost TA-100 investors approximately $87 million USD. Consider the implications of this purported causal argument that the TA-100 meaningfully responds to violence against Israel actually being supported by the data. Politically, this would drastically increase the costs of fighting against Hamas and other opponents of Israeli, likely encouraging more extreme measures by Israeli politicians to prevent future attacks. Economically, this would likely discourage investment in Israel given the vulnerability of equities to relatively common missile attacks. This paper is the first attempt to provide a rigorous and objective analysis of the extent to which the TA-100 responds to violence against Israel. To achieve this, I utilize the GDELT event data data set, which allows me to calculate daily counts reflecting the intensity of attacks committed against Israel. Using this data, I follow common practice and perform a series of multivariate GARCh models to test for the extent to which variance in TA-100 returns is explained by variation in the level of intensity of violent events committed against Israel. I find that on average, variance in returns of TA-100 taken as a whole are not significantly driven by levels of violence committed against Israel. Additionally, I perform similar analyses on the returns of the two largest insurance companies on the TA100, Migdal Insurance and Financial Holdings Ltd. (MGDL) and Clal Insurance Enterprises Holdings Ltd. (CLIS). Variables reflecting changes in the level of conflictual attacks against Israel achieve inconsistent, moderate statistical significance explaining variance in MGDL return, and strong and consistent significance when modeling CLIS returns. This strongly suggests that while the TA100 may not meaningfully respond to variation in attacks against Israel, specific companies that comprise the TA100 do. The findings in this paper warrant two caveats as well as three logical extensions. In terms of caveats, when working with finely grained political event data and financial data, a number of aggregation choices must be made. Although I draw on both theory and the extant literature when

71 making aggregation choices in this chapter, it is feasible the different aggregations may have led to different empirical findings. For example, working with weekly level averages (as opposed to daily level data), may alter findings. Additionally, it is possible that the timing of violent attacks against Israel matters. If this is true, then both the November 13 and November 19 Haaretz articles may actually be correct: investors may have actually responded to the initial attacks on November 13, but by November 19, investors may have already priced in the elevated level of violence and therefore ignored the ongoing attacks. Taken together, the empirical empirical findings along with the two caveats suggest that it i that some equities respond to some operationalizations of violence some of the time. The caveats above suggest at least three useful extensions. First, analyses may be re-run using different temporal aggregations. Although more coarse aggregations (i.e. weekly) may yield meaningful results, I believe that more interesting findings would result from finer-grained temporal analyses. As event data collection becomes increasingly fine-grained, it may be feasible in the near future to obtain data at the hourly or minute level. This data, leverage with second-to-second financial tick data, could allow researchers to test for more immediate effects of political conflict on equity markets. Second, in Section 6, I demonstrate that the same political attacks have different effects on returns of two different specific equities. Repeating similar empirical tests on the other 98 companies that comprise the TA100 could provide more comprehensive insight into the types of companies that then to be affected by political conflict. Third, further analyses of the effects of political violence may vary cross sectionally (i.e. between companies), but as the second caveat suggests, they also may vary over time or by the type of conflictual act. Researchers may be test for changing effects over time testing for the effects of initial conflictual events separately from subsequent violence that follows. Additionally, it would be feasible to test whether certain types of conflictual events have different substantive effects by disaggregating material conflict events into sub-categories, such as attacks targeting specific political leaders or those more indiscriminately targeted as unaffiliated civilians. Hopefully, the research design and results in this chapter can serve as a foundation for future research to test for more nuanced relationships between political violence and equity markets returns. 63

72 References Aggarwal, Raj and Ramesh P. Rao Institutional Ownership and Distribution of Equity Returns. Financial Review 25(2): Alberg, Dima, Harim Shalit and Rami Yosef Estimating stock market volatility using asymmetric GARCH models. Applied Financial Economics 18: Anderton, Charles H. and John R. Carter The Impact of war on Trade: An Interrupted time-series study. Journal of Peace Research 38(4): Anderton, Charles H. and John R. Carter Does War Disrupt Trade? In Globalization and Armed Conflict, ed. Gerald Schneider, Katherine Barbieri and Nils Petter Gleditch. Lanham, MD: Rowman Littlefield pp Arin, Peren K., Davide Ciferri and Nicola Spagnolo The Price of Terror: The effects of terrorism on stock market returns and volatility. Economic Letters 101: Bernhard, William and David Leblang Democratic Processes, Political Risk, and Foreign Exchange Martkets. American Journal of Political Science 46(2): Bollerslev, Tim Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics 31: Chen, Andrew H and Thomas F. Siems The effects of terrorism on global capital markets. European Journal of Political Economy 20: Chesney, Marc, Ganna Reshetar and Mustafa Karaman The Impact of Terrorism on Financial Markets: An Empirical Study. Available at i d = Cosset, Jean-Claude and Bruno Doutriaux de la Rianderie Political Risk and Foreign Exchange Rates: An Efficient-Markets Approach. Journal of International Business Studies 16(3): Dhankar, Raj S. and Madhumita Chakraborty Non-linearities and GARCH Effects in the Emerging Stock Markets of South Asia. Vikalpa 32(3): Eldor, Rafi and Rafi Melnick Financial Markets and Terrorism. European Journal of Political Economy 20: Engle, Robert Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of United Kingdom Inflation. Econometrica 50(4): Fama, Eugene F Efficient Capital Markets: A Review of Theory and Empirical Work. Journal of Finance 25(2):

73 Fearon, James D Rationalist Explanations for War. International Organization 49(3): Freeman, John R., Jude C. Hayes and Helmut Stix Democracy and Markets: The Case of Exchange Rates. American Journal of Political Science 44(3): Frey, Bruno S. and Daniel Waldenstrom Markets work in war: World War II reflected in the Zurich and Stockholm bond markets. Financial History Review 11(1): Frey, Bruno S. and Marcel Kucher History as Reflected in Capital Markets: The Case of World War II. Journal of Economic History 60(2): Hammoudeh, Shawkat M., Yuan Yuan and Michael McAleer Shock and Volatility Spillovers Among Equity Sectors of the Gulf Arab Stock Markets. Quarterly Review of Economics and Finance 49(3): Johnson, Barry R. and Oana M. Nedelescu The Impact of Terrorism on Financial Markets. IMF Working Paper. King, Gary and Will Lowe An Automated Information Extraction Tool For International Conflict Data with Performance as Good as Human Coders: A Rare Events Evaluation Design. International Organization 57(3): Kollias, Christos, Efthalia Manuo, Stephanos Papadamou and Apostolos. Stagiannis Stock Markets and Terrorist attacks: Comparative evidence from a large and small capitalization market. European Journal of Political Economy 27:S64 S77. Leblang, David and Bumba Mukherjee Government Partisanship, Elections, and the Stock Market: Examining American and British Stock Returns, American Journal of Political Science 49(4):2005. Li, Quan and David Sacko The (ir)relevance of militarized interstate disputes for international trade. International Studies Quarterly 46: Mun, Kyung-Chun Effects of Exchange Rate Fluctuations on Equity Market Volatility and Correlations: Evidence from the Asian Financial Crisis. Quarterly Journal of Finance and Accounting 47(3): Rigobon, Roberto and Brian P. Sack The Effects of War Risk on U.S. Financial Markets. Available at: Robock, S Political Risk: Identification and Assessment. Columbia Journal of World Business 6:

74 Schneider, Gerald and Vera E. Troeger War and the World Economy: Stock Market Reactions to International Conflicts. Journal of Conflict Resolution 50(5): Zussman, Asaf, Noam Zussman and Morten Nielsen Orregard Asset Market Perspectives on the Israeli-Palestinian Conflict. Economica 75:

75 8. Appendix Table 1. Correlation Matrix of Counts mater conf mater coop verb conf verb coop mater conf 1.0 mater coop verb conf verb coop Table 2. Correlation Matrix of material conflict MAs 1 week MA 2 week MA 4 week MA 1 week MA week MA week MA

76 Table 3. GARCH models of daily TA100 with DJIA control Model 1 Model 2 Model 3 Model 4 Model 5 Model 6 Model 7 Mean Equation AR(1) (.014) (.013) (.014) (.013) (.014) (.013) (.014) inflation (.006) (.006) (.006) (.006) (.006) (.006) (.006) crude.021**.022**.021**.021**.021**.021**.022** (.009) (.009) (.009) (.009) (.009) (.009) (.010) DJIA.182***.183***.182***.184***.182***.183***.182*** (.019) (.019) (.019) (.019) (.019) (.019) (.019) Variance Equation ARCH(1).245***.247***.244***.247***.244***.246***.244** (.036 ) (.036 ) (.036) (.037 ) (.037) (.037) (.036) GARCH(1).714***.717***.714***.716***.714***.716***.716*** (.032 ) (.032 ) (.032) (.033 ) (.032) (.033) (.035) inflation (25.58 ) (46.97 ) (24.81 ) (45.70 ) (24.42) (45.75) (40.92) crude (15.01 ) ( ) ( 15.09) ( 15.34) ( 14.99) ( 14.99) (13.74) DJIA *** *** *** *** *** *** *** (42.28) (42.91) (41.88) (41.51) (41.81) (41.11) (38.80) dt 1 mater conf (4.46) dt c one mater conf (3.79 ) (1.281) dt two mater conf (4.48) dt c two mater conf (3.80) (1.159) dt four mater conf (4.44) dt c four mater conf (3.800) (.975) Constant *** *** *** 12.56*** *** *** *** (1.06 ) (1.509 ) (1.05 ) (1.58 ) (1.05) (1.44) (1.37) N Log-likelihood AIC BIC df ***,**,*: 1%, 5%, and 10% level. Coefficients with standard errors in (). Distribution = Student s t 68

77 Table 4. GARCH models of daily TA100 without DJIA control Model 1 Model 2 Model 3 Model 4 Model 5 Model 6 Model 7 Mean Equation AR(1) -.027** -.027** -.027* -.027* -.027* -.027* -.027* (.013) (.013) (.013) (.014) (014) (.014) (.014) inflation (.006) (.006) (.006) (.006) (.006) (.006) (.006) crude.033***.033***.034***.034***.034***.034***.034*** (.010) (.010) (.010) (.010) (.010) (.010) (.010) Variance Equation ARCH(1).264***.265***.267***.266***.267***.267***.267*** (.038 ) (.038) (.039) (.039) (.039) (.039) (.039) GARCH(1).682***.677***.665***.669***.669***.663***.666*** (.043) (.046) (.053) (.051) (.052) (.053) (.056) inflation (15.207) (13.76) (11.03) (11.19) (11.48) (10.38) (10.26) crude *** *** *** *** *** *** *** (13.01) (12.281) (11.836) (12.128) ( ) ( ) (12.63) dt 1 mater conf (1.307) dt c one mater conf 1.659* 1.458* (.936) (.784) dt two mater conf.635 (.792) dt c two mater conf (.786) (.818) dt four mater conf.524 (1.023) dt c four mater conf (.812) (1.019) Constant *** *** *** *** *** *** *** () (.967) (.941) (.931) (.890) (.878) (.938) N Log-likelihood AIC BIC df ***,**,*: 1%, 5%, and 10% level. Coefficients with standard errors in (). Distribution = Student s t 69

78 Table 5. GARCH models of daily MGDL with DJIA control Model 1 Model 2 Model 3 Model 4 Model 5 Model 6 Model 7 Mean Equation AR(1) (.022) (.022) (.022) (.022) (.022) (.022) (.022) inflation (.011) (.011) (.012) (.012) (.012) (.011) (.011) crude.035*.035*.037*.036*.037*.035*.034 (.021) (.021) (.021) (.021) (.021) (.021) (.021) DJIA.358***.357***.364***.362***.364***.038***.357*** (.042) (.042) (.042) (.042) (.042) (.042) (.042) Variance Equation ARCH(1).051***.054***.047***.052***.050***.054***.055*** (.011) (.012) (.011) (.012) (.012) (.012) (.012) GARCH(1).920***.915***.932***.921***.927***.913***.912*** (.018) (.019) (.015) (.017) (.016) (.019) (.020) inflation ** * (20.92) (2.084) (1.889) (2.019) (2.071) (2.106) (2.096) crude ** * *** *** *** * ** (10.214) (10.157) (6.701) (4.923) (6.054) (10.422) (9.769) DJIA *** *** *** *** *** *** *** (12.396) (12.073) (10.558) (9.256) (9.537) (11.704) (12.266) dt 1 mater conf (1.136) dt c one mater conf (1.127) (1.026) dt two mater conf (1.976) dt c two mater conf 2.038** 1.274* (.960) (.733) dt four mater conf (1.549) dt c four mater conf 1.532**.651 (.707) (.741) Constant *** *** *** *** *** *** *** (.392) (.401) (.418) (.432) (.382) (.395) (.398) N Log-likelihood AIC BIC df ***,**,*: 1%, 5%, and 10% level. Coefficients with standard errors in (). Distribution = Student s t 70

79 Table 6. GARCH models of daily CLIS with DJIA control Model 1 Model 2 Model 3 Model 4 Model 5 Model 6 Model 7 Mean Equation AR(1).074***.074***.075***.075***.074***.075***.073*** (.021) (.021) (.021) (.021) (.021) (.021) (.021) inflation.027**.027**.026**.026**.026**.026**.027** (.011) (.011) (.011) (.011) (.011) (.011) (.011) crude (.021) (.021) (.021) (.021) (.021) (.021) (.021) DJIA.302***.302***.301***.302***.032***.302***.304*** (.044) (.044) (.044) (.044) (.043) (.044) (.043) Variance Equation ARCH(1).054***.055***.053***.055***.053***.055***.056*** (.011) (.011) (.011) (.011) (.011) (.011) (.011) GARCH(1).917***.916***.917***.914***.916***.914***.918*** (.015) (.015) (.015) (.015) (.015) (.015) (.015) inflation (8.718) (8.493) (9.976) (8.117) (7.972) (7.085) (8.297) crude ** ** ** ** * ** (9.668) (9.668) (9.675) (8.117) (10.423) (9.993) (10.496) DJIA.302*** *** *** *** *** *** *** (.044) (9.727) (9.061) (9.217) (8.847) (9.103) (10.096) dt 1 mater conf (1.083) dt c one mater conf 1.682** 1.643*** (.791) (.533) dt two mater conf (1.258) dt c two mater conf 2.081*** 1.698*** (.581) (.417) dt four mater conf (1.160) dt c four mater conf 1.678*** 1.461*** (.446) (.379) Constant *** *** *** *** *** *** *** (.347) (.330) (.313) (.321) (.304) (.379) (.339) N Log-likelihood AIC BIC df ***,**,*: 1%, 5%, and 10% level. Coefficients with standard errors in (). Distribution = Student s t 71

80 Figure 1. Total number of material conflict events, daily from 1992 to

81 Figure 2. The number of material conflict events with the time trend removed, daily from 1992 to

82 Figure 3. Raw, first-differenced, and logged first-differenced of TA100 prices 74

83 Figure 4. Comparison of closing TA100, CLIS, and MGDL prices 75

84 Figure 5. Raw, first-differenced, and logged first-differenced MGDL prices 76

85 Figure 6. Raw, first-differenced, and logged first-differenced CLIS prices 77

86 CHAPTER 3. EFFECTS OF DOMESTIC CONFLICT ON INTERSTATE CONFLICT: AN EVENT DATA ANALYSIS OF MONTHLY LEVEL ONSET AND INTENSITY 1. Introduction That domestic conflicts can affect interstate conflict is clear. Consider a few examples. In the first months of 2001, fighting between Burmese government troops and domestic rebels intensified, with much of the violence occurring near the border between Burma and Thailand. During this same period, interstate conflict between Burmese and Thai military forces reached their highest levels in decades, as troops from both sides clashed over control of strategic locations near the border and engaged in shelling and small arms fire resulting in scores of civilian deaths. 1 In this case, both the presence and intensity of the domestic conflict in Burma that spread across the border into Thailand led directly to the interstate conflict events that transpired between official military personnel of each state. More recent events in Libya provide another example. After more than a month of unrest in the region, the first substantial demonstrations in Libya occurred on February 15, 2011, leading to approximately 15 deaths by February 17, Within a week, anti-gadaffi rebels had mobilized and the country was engaged in severe revolutionary fighting. Seeking to take advantage of a window of opportunity that the domestic conflict provided to attempt to help extricate Gadaffi from power, United States and British forces began an international campaign against Gaddafi by firing over 100 Tomahawk cruise missiles against Libyas key air defense installations on March 19, Again, the existence of the domestic conflict influenced an interstate conflict, albeit through different mechanisms than though by different mechanisms than the previous example. 3 The central goal of this chapter is to provide a thorough and nuanced analysis of the effects of both the onset and intensity of domestic conflict on interstate conflict an area of research that is surprisingly underdeveloped in the extant literature. One potential reason for the lack of relevant empirical analyses may be due to the coarseness of existing data on both domestic and interstate 1 Thai Army Closes Border with Burma Following Fatal Clashes. The Nation, Thailand. February 12, Accessed via LexisNexis, Keywords: Burma, Thailand, conflict. 2 Arab Capitals Braced for Violence Today as Unrest Spreads. The Guardian, London. February 18, Accessed via LexisNexis, Keywords: Libya, protest, death. 3 Koutsoukis, Jason. Gaddafi Threatens Revenge; Days, not weeks says US Coalition Jets Launch Attack Snipers Fire on Rebels. The Age, Melbourne Australia. March 21, Accessed through LexisNexis, Keywords: Libya, 110 Tomahawk. 78

87 conflicts. In terms of domestic conflict, UCDP/PRIO ( see Gleditsch et al. (2002)) and Correlates of War (COW) datasets (see Sarkees and Wayman (2010)) predominate in the literature. However, scholars primarily use these datasets to provide a binary measure of whether or not conflict/war occurred in a given state-year. 4 According to the UCDP/PRIO dataset, Burma experienced domestic conflict every year from 1997 to 2011, but according to COW, domestic conflict did not reach sufficiently high thresholds to become a war, so every year in that period receives a 0 on the dichotomous scale. Thus, it is impossible to test for the effects of variation in the intensity of domestic conflict in Burma on the onset or intensity of interstate conflict, even though real world examples suggest such relationships might exist. Furthermore, studies analyzing interstate conflict also tend to rely on dichotomous, annual level measures such as militarized interstate dispute (MIDs see Ghosn, Palmer and Bremer (2004)) or COW measures. The dichotomized and annual-level nature of these measures inhibits the ability of researchers to test for more subtle variations in levels of intensity during and between years, yet this is what we expect to see in Libya, as NATO forces varied the intensity of bombings based on the success of the rebels. These shortcomings in existing data all suggest that temporally nuanced measures of both domestic and interstate conflict are needed in order to appropriately test for the range of potential effects that domestic conflict may have on interstate conflict. This chapter will addresses this problem by generating monthly level measures reflecting the number of conflictual events at both the state-month (for domestic events) and the dyad-month (for interstate events) level based on the GDELT dataset. Using this data, I perform numerous empirical tests for the effects of domestic conflict onset and intensity on the onset and intensity of interstate conflict across a range of operationalizations of onset and intensity at the monthly level. To the best of my knowledge, this chapter provides the first empirical test for the effects of domestic conflict intensity on both the likelihood of interstate conflict onset and the intensity of ongoing conflicts. This chapter proceeds in four sections: first, I provide a brief review of relevant literature and from that literature develop my testable hypotheses; second, I explain my use of event data; third, I outline my variable operationalization and research design; fourth, I provide empirical models and results. Lastly, I conclude with a discussion of future extensions. 4 These datasets also provide estimates of the total number of battle fatalities, but those figures reflect the duration of the conflict and cannot be disaggregated to smaller temporal units. As a consequence, in many cases, it is impossible to determine even annual level variation in conflict intensity. 79

88 2. Building Hypotheses from the Literature Although the examples of Burma and Libya are recent, similar cases are pervasive throughout history. For example, over a century ago in 1911, the Russian Bolsheviks were entrenched in civil war against reactionists and lacked sufficient resources to defend Russia s external border. Aware of this weakness, Japan attacked northern Siberia with 70,000 troops in an attempt to acquire Russian territory (see Humphrey (1995)). Despite both the historical occurrence of domestic conflicts affecting interstate conflicts as well as studies calling for more comprehensive analyses of potential relationships (see Sambanis (2002), and Chiozza, Gleditsch and Goemans (2006)), this topic has received relatively little attention in the literature. 5 Since theorizing and testing for a full range of potential relationships between the onset and intensity of domestic conflict on the onset and intensity of interstate conflict exceeds the scope of this chapter, I build four hypotheses derived from the related conflict literature. The most relevant extant empirical studies tend to focus only on the effects of a domestic conflict onset on the likelihood of an interstate conflict onset. For example, Davies (2002) finds that certain contentious domestic events such as protests or riots may increase the likelihood of initiating a MID; Walt (1996) argues through an opportunism framework that states undergoing domestic conflict make more attractive targets for interstate attacks; Trumbore (2003) finds that domestic ethno-political rebellion may increase the likelihood of MID initiation; a number of scholars have illustrated that domestic conflicts increase the likelihood of third-party interventions, which may or may not be welcome by the host government (Elbadawi and Sambanis (2002), Gleditsch (2007), Regan (2000)); and interstate conflict that can result from foreign support of rebels (Schultz (2010)). Gleditsch, Salehyan and Schultz (2008) provide a more comprehensive argument that onsets of domestic conflict increase the likelihood of interstate conflict by outlining the five main mechanisms through which this occurs: 6 Opportunism: Civil wars and insurgencies expose and exacerbate weaknesses in a state s military capabilities and divert resources away from defenses against foreign enemies, thereby increasing the expected utility of attacking a state with domestic conflict 5 This is even more surprising given the large number from which I select a small number of of diversionary war studies analyzing the effects domestic economic (inflation (Mitchell and Prins (2004)), inflation and unemployment (Fordham (2002)), GDP( Bennett and Nordstrom (2000)) and political (regime type, leader approval ratings (Ostrom and Job (1986)), election cycles (Smith (1996))) conditions as well as the presence of a number of studies addressing ways in which interstate conflicts can affect domestic conflicts (Thyne (2006), Akcinaroglu and Raziszewski (2005)). 6 Gleditsch, Salehyan and Schultz (2008) treat opportunism and diversion as one conceptually unique category, I divide them because I believe that they are conceptually unique concepts. 80

89 Diversion: Faced with domestic conflict, a leader may intentionally seek out interstate conflicts in order to divert attention away from domestic issues and generate a rally around the flageffect. Intervention: States can intervene either on the side of the government or the side of the rebels during a domestic conflict. Externalization: During domestic conflicts, rebels and government forces may cross interstate borders in order to find safe havens or more favorable territory from which to launch attacks. Spillover effects: Domestic conflicts often lead to enhanced troop movements near borders, cross-border refugee flows, and regional economic disruptions that can all increase the likelihood of interstate conflict. Overall, the relevant extant literature including all five of Gleditsch, Salehyan and Schultz (2008) s mechanisms is in agreement that the occurrence of domestic conflicts increases the likelihood of an interstate dispute onset and they also provide real-world examples and empirical testing to support this proposition. These theoretical arguments and empirical findings lead to my first hypothesis: Hypothesis 1: The likelihood of interstate conflict onset should increase after an onset of domestic conflict in one or both states comprising a dyad. 7 Although Gleditsch, Salehyan and Schultz (2008) and others provide a clear testable hypothesis for the effects of an onset of domestic conflict on the likelihood of an onset of interstate conflict, their theory and research design do not directly address how fluctuations in the intensity of an ongoing domestic conflict might affect either the likelihood of an onset or variation in intensity of an interstate conflict. Indeed, in many conflict prone countries, such as the case of Burma or the Democratic Republic of the Congo (DRC), levels of domestic conflict are very rarely zero. Despite this, considerable variation tends to exist in the intensity of ongoing domestic conflicts. By focusing exclusively on the effect of domestic conflict onsets, as Gleditsch, Salehyan and Schultz (2008) and others, countries like Burma are necessarily omitted from empirical testing since there is always an ongoing conflict. Therefore, in countries like Burma and the DRC, changes in intensity, rather 7 In all four hypotheses, I focus on the effects of key independent variables occurring in month t 1andmonth t 2 on the dependent variable in month t. This ensures that the independent variables is temporally preceding the dependent variable, which dismisses the possibility of the two events in the case of Hypothesis 1, a domestic conflict onset and an interstate conflict onset occurring in the same month but in reverse order (i.e. the interstate conflict preceding the domestic conflict onset) 81

90 than the occurrence or the onset of domestic conflicts that should matter most. Unfortunately, the literature is sparse with respect to the effects of variation in domestic conflict intensity on interstate conflict. However, following the logic behind Hypothesis 1, we could expect that as the intensity of a domestic conflict increase, the degree of opportunism, diversion, intervention, externalization, and spillover should also increase. This leads to my second testable hypothesis: Hypothesis 2: The likelihood of interstate conflict onset in a dyad should increase after the intensity of ongoing domestic conflict in one or both of the states comprising the dyad increases. This line of reasoning could also be applicable to changes in the predicted levels of ongoing interstate conflicts, as increases in intensity in domestic conflict could lead to increases in the expected intensity of interstate conflict. Despite a dearth of relevant empirical literature, numerous case studies support the argument that the intensity of a domestic conflict has a positive effect on levels of externalization, spillover, and opportunism. For example, more brutal conflicts tend to have higher levels of externalization and spillover, as violence in Darfur, the DRC, and Rwanda illustrate. An estimated 1.8 million refugees have fled Darfur amidst violence that has led to 300,000 deaths; in the DRC, civil violence since 1996 resulted in an estimated 5.4 million deaths and 3.4 million refugees; and in Rwanda, an estimated 2 million Hutus fled after approximately 800,000 deaths. In many cases illustrated by Hutu militiamen fleeing to Zaire and other neighboring states rebels are among the refugees, which has tended to increase interstate conflict. Clearly, these conflicts have broad negative local and regional economic consequences that nearby states would like avoid. Thus, it is logical to assume that externalization and spillover become more pronounced as the severity of domestic conflict increases. Following this line of argument, it is reasonable to suggest that as the intensity of domestic conflict increases, so does the likelihood of an interstate conflict. Hypothesis 3: The intensity of an ongoing interstate conflict should increase after the intensity of ongoing domestic conflicts in one or both of the states comprising the dyad increases. These first three hypotheses all suggest a positive relationship between domestic conflict and interstate conflict, both in terms of onset and intensity. Despite this, a considerably different strain of logic developed in the bargaining model of war literature suggests alternative relationships. Consider a baseline bargaining model of war in which the probability or winning a potential interstate conflict is a function of both states capabilities (Fearon (1995)). 8 If two states in a dyad (State A and State B) both have capabilities totaling 100 units each, the probability that either wins 8 The preponderant measure of capabilities is the COW CINC score. 82

91 the conflict is approximately 50/50 under the assumption that both states are able to commit all capabilities to a potential interstate conflict. However, as Gleditsch, Salehyan and Schultz (2008) articulates, Civil wars and insurgencies(...)divert resources away from defense against foreign enemies. This implies that if State A is engaged in a domestic conflict, it is forced to funnel a portion of the 100 units of total capabilities towards the domestic conflict meaning that its real capabilities to be used in the event of interstate conflict is 100. Consequently, the probability that a domestically conflicted State A wins an interstate conflict with State B is lower than it would be if it were not also fighting in a domestic conflict. Additionally, this rationale is extendable to cases in which the level of ongoing domestic conflict in both (or either) State A and State B varies after a conflict is already initiated. A number of cases exist to provide a clear illustration of two states simultaneous engaged in domestic and interstate conflict such as Angola and the DRC, Burma and Thailand, and India and Pakistan. For example, consider a state with 100 units of capabilities is engaged in both an interstate and a domestic conflict. As the percentage of resources that it chooses to dedicate towards one of the ongoing conflicts increases, the amount of resources available to fight the other conflict decreases, meaning that the probability of winning also decreases. Focusing on enduring rivalries conceptually akin to continuing conflict Bennett and Nordstrom (2000) posit that by reducing or ending interstate conflict with a rival, a state becomes able to free up important resources that may be reallocated to the domestic economy. It is likely that these resources could also be used to reinforce efforts to put down domestic conflict. For example, in 1905, increasing levels of domestic unrest played into Russia s calculi during negotiations to end the Russo-Japanese war: Their (Russian negotiators ) country was in the first throes of a slow revolution that they knew to be unstoppable. At best, the revolution could be postponed if they could negotiate a foreign peace [with Japan] that would enable the Tsar s ministers to deal undistractedly with the war developing in the streets and basements back home. 9 Furthermore, in the post WWII era, the likelihood of losing territory, rents, and control over government is higher in modern domestic conflicts than interstate conflicts due largely to evolving international norms, meaning that it is logical to assume that a state would place greater emphasis on winning the domestic conflict. 10 Thus, if domestic intensity increases in both states and causes 9 Excerpt taken from Theodore Roosevelt biographer Edmund Morris account of negotiations between Russian and Japanese diplomats, overseen by then President Roosevelt. 10 See Zacher (2001) for a discussion of how the territorial integrity norm that has increased the rarity of transfers of territory through interstate war; Collier (1999) for an illustration of the negative effects of civil war on domestic 83

92 them to funnel resources away from the interstate conflicts, it follows that the intensity of the interstate conflict should decrease. These arguments lead to my final hypothesis: Hypothesis 4: The intensity of an ongoing interstate conflict should decrease after the intensity of domestic conflict increases in one or both states of a dyad that are engaged in domestic conflict. In the following section, I outline how I outline my research design, focusing primarily on how I use the GDELT event dataset to comprise measures reflecting onset and changes of intensity of both domestic and interstate conflicts. 3. Research Design As previously mentioned, existing literature interested in the effects of domestic conflict on interstate conflicts has been restricted by its use of annual level measures of conflict. In order to allow for sufficiently nuanced analyses to test my four hypotheses, I utilize the dyad-month unit of analysis for all empirical testing. Since the existing conflict datasets (UCDP, COW, MIDs, etc.) are aggregated to the yearly level, I am required to build my own measures reflecting onsets and changes in intensity of both domestic and interstate conflict. To do so, I utilize the GDELT event data dataset. With this data, I construct domestic and interstate conflict variables for over 4 million dyad-month observations for all countries from 1979 to Constructing Independent Variables. In order to test my four hypotheses, I need independent variables that reflect both onsets of domestic conflicts as well as variation in the intensity of ongoing domestic conflicts. To build these measures, I first use the GDELT data to calculate how many domestic material conflict events occur in each country per month. In order to qualify as domestic material conflict, the event must be between two actors whose primary affiliation is with the same country. Additionally, one actor s secondary affiliation must be with the government, either as a member of government, a member of the armed forces, or a member of the policy. The other actor s secondary affiliation must be as a rebel, separatist, or insurgent Measures of Domestic Conflict Onset. At the most basic level, the concept of an onset assumes that an event that was not previously occurring suddenly begins. In theory, this should mean that an operationalization of domestic conflict onset should require a period devoid of domestic conflict during which an onset may occur. In reality, however, states with a history of economic sectors and GDP; and Le Billon (2001) for a discussion of domestic rebels ability to siphon resource rents from governments. 84

93 domestic conflict are rarely entirely devoid of conflictual events. As a result, current intra-state conflict datasets such as COW and UCDPuse a cutpoint (1,000 fatalities for COW, 25 fatalities for UCDP) and assume any state with fewer than the cutpoint number of deaths is at peace and a state with more than the cutpoint number of fatalities in a given time period is at conflict. To operationalize civil conflict onset with event data, I follow this cutpoint approach. However, unlike COW and UCDP, which both use a single cutpoint, I test across three cutpoints since there is no single theoretically justified cutpoint, regardless of COW or UCDPprocedure. This allows me to conduct robustness checks, which helps ensure that statistically significant findings are not merely a function of a certain cutpoint, but rather are consistent across various cutpoints. In total, I build six binary measures of civil conflict onset - three to reflect domestic conflict onset in only one of the states per dyad-month, and three to reflect onset in both of the states per dyad-month. one domestic 20 A 1 if only one of the states in each dyad-month experiences > 20 domestic material conflict events in month t and both states experienced fewer than 20 domestic material conflict events between the government and rebel groups in month t 1, and 0 otherwise. one domestic 40 identical to one domestic 20, except the cutpoint is set at 40 material conflict events. one domestic 60 identical to one domestic 20, except the cutpoint is set at 60 material conflict events. both domestic 20 A 1 if both of the states in each dyad-month experiences > 20 domestic material conflict events in month t and both states experienced < 20 domestic material conflict events in month t 1, and 0 otherwise. both domestic 40 identical to both domestic 20, except the cutpoint is set at 40 material conflict events. both domestic 60 identical to both domestic 20, except the cutpoint is set at 60 material conflict events Measures of Domestic Conflict Intensity. Empirically testing for the effects of variation in the severity of domestic conflict on the level of interstate conflict at the monthly level is a difficult task that the extant literature has yet to address. Consequently, I am unable to draw upon existing methodological approaches to operationalize variation in the intensity of domestic conflict intensity. Given the dearth of precedent, I build the following six binary measures reflecting changes 85

94 in intensity of ongoing domestic conflicts in an attempt to construct as straightforward measures as possible. As in section 3.1.1, I build variables across the three different cutpoints, which allows me to test for the robustness of findings. one worse 20 A 1 If the state in the dyad experienced >20 domestic material conflict events in month t 1 and more domestic material conflict events in month t than in month t 1. 0 otherwise. both worse 20 A 1 if both states in the dyad experienced >20 domestic material conflict events in month t 1 and more domestic material conflict t events in month t than in month t 1, or, one state in month t 1experienced>20 domestic material conflict events in month t 1 and more domestic material conflict events in month t than month t 1, and the other state experienced <=20 domestic material conflict events in month t 1but >20 domestic material conflict in month t. 0 otherwise. one worse 40 Identical to emphone worse 20, except the cutpoint is set at 40 material conflict events. both worse 40 Identical to emphboth worse 20, except the cutpoint is set at 40 material conflict events. one worse 60 Identical to emphone worse 20, except the cutpoint is set at 60 material conflict events. both worse 60 Identical to emphboth worse 20, except the cutpoint is set at 60 material conflict events Constructing Dependent Variables. To build measures reflecting onsets and variation in intensity of interstate conflicts, I follow a similar process to that used to build domestic conflict measures. First, I construct a measure called interstate material conflict for each dyad-month, which reflects the number of material conflict events occurring each month between two actor s whose primary affiliation is with different states comprising each dyad, and whose secondary continuousaffiliation is with the government, military, or police. For example, consider the Thailand- Vietnam dyad. For an event to qualify as an interstate material conflict event, one actor s primary affiliation is required to be Thailand and the other required to be Vietnam, and both of their secondary affiliations must be either government, military, or police. I require these restrictive secondary commands to maintain consistency with COW definitions of interstate conflict (i.e. that it occurs between official state forces). Whereas COW uses a single cutpopint (1,000 battle 86

95 fatalities) to qualify as an interstate conflict, I again employ three different cutpoints since there is neither theoretical nor empirical justification to chose a single cutpoint. Using the interstate material conflict and the 20, 40, 60 event cutpoints, I build three binary variables reflecting an onset of interstate conflict: interstate 20 A 1 if greater than 20 interstate material conflict events occur in month t and fewer than 20 interstate material conflict events occured in month t 1, and 0 otherwise. interstate 40 identical to interstate 20, except the cutpoint is set at 40 interstate material conflict events interstate 60 identical to interstate 20, except the cutpoint is set at 60 interstate material conflict events To measure the change in intensity of interstate conflict, I build three binary and three continuous variables reflecting the change in intensity of ongoing interstate conflicts. Again, I calculate measures across the three different cutpoints 20, 40, and 60 events in order to facilitate robust checks in Section 4.3. interstate worse 20 A 1 if the dyad experienced greater than 20 interstate material conflict events in month t 1 and more interstate material conflict events in month t than in month t 1. A 0 otherwise. interstate worse 40 A 1 if the dyad experienced greater than 40 interstate material conflict events in month t 1 and more interstate material conflict events in month t than in month t 1. A 0 otherwise. interstate worse 60 A 1 if the dyad experienced greater than 60 interstate material conflict events in month t 1 and more interstate material conflict events in month t than in month t 1. A 0 otherwise. interstate change 20 The (number of interstate material conflict events in month t) -(the number of interstate material conflict events in month t 1), calculated when month t 1 experienced greater than 20 interstate material conflict events. interstate change 40 The (number of interstate material conflict events in month t) -(the number of interstate material conflict events in month t 1), calculated when month t 1 experienced greater than 40 interstate material conflict events. 87

96 11 The control variables are only collected through interstate change 60 The (number of interstate material conflict events in month t) -(the number of interstate material conflict events in month t 1), calculated when month t 1 experienced greater than 60 interstate material conflict events Control Variables. In addition to the event data-derived measures of domestic conflict, I follow Russet and Oneal (2001) and employ their eight baseline variables used to explain MID involvement (Table A5.1, p. 316), which are all aggregated at the yearly level: Non-Contiguity: 0 reflecting a shared land border or fewer than 150 miles of water separating the nearest borders, and 1 indicating non-contiguous borders (Stinnett et al. (2002)). Power Ratio: The ratio of CINC scores between the two states in the dyad, with the lower score serving as the numerator (Singer, Bremer and Stuckey (1972)). Minor Powers: A binary measure taking on 1 if neither of the two states in the dyad are considered major powers. In my dataset, all dyads are comprised of minor powers with the exception of those containing either China or Japan. Log Distance: COW data reflecting distance between capitals in miles, logged (Stinnett et al. (2002)). Democ L: Polity IV data, which reflects the autocracy-democracy score of the lesser democratic state in the dyad on the 21-point, -10 (fully autocratic) to +10 (fully democratic) scale (Marshall and Jaggers (2009)). Depend L: Bilateral trade data, calculated to reflect the percentage of both states in the dyads total trade comprised by the dyadic trade. The dyad receives the lower of the two state scores (Barbieri, Keshk and Pollins (2009)). IGO: A count of that reflects the number of shared dyadic IGO membership (Pevehouse and Nordstrom (2004)). Alliance membership: An ordinal measure of 0, 1, or 2, reflecting the highest degree of dyadic alliance (Gibler and Sarkees (2004)). To construct my complete dataset, I first use EUGene (see Bennett and Stam (2000)) to build a dyad-year time-series cross section dataset with the eight annual level controls from Russet and Oneal (2001) from 1979 to 2004 for all possible dyads. 11 Next, since my analysis focuses on monthly level variation, I must convert these yearly scores to the monthly level. To do so, I assume that the value of the control variables in each month is the same as their yearly total. For example, if

97 trade the Depend L score for the China-Japan dyad in 2004 is.42, I set the Depend L score for all 12 months in the Japan-China dyad at.42. Lastly, I merge the event data derived domestic/interstate conflict onset/intensity variables. The final dataset is a time-series cross section dataset at the dyad-month level with over 4 million observations. With this data, I use logistic and ordinary least squares (OLS) regression to test all four hypotheses. I perform all empirical tests twice, once lagging the domestic conflict variables by one month, and then a second time lagging these variables two months. This is useful for two reasons. First, using lags of my domestic conflict variables helps ensure that the domestic conflict onsets actually precede the interstate conflicts. If I were to use unlagged measures, a domestic conflict onset could occur after an interstate conflict onset but within the same calendar month, meaning that it would have been impossible for onset of domestic conflict to have caused the onset of interstate conflict. Second, the four hypotheses suggest that a relationship between domestic conflict and interstate conflict may exist, but do not suggest how quickly the relationships unfold. Thus, testing for a one- and two-month lags allow the empirical models to capture effects that occur quickly as well as ones that take longer (i.e up to two months) to unfold. 4. Empirical Tests 4.1. Tests of the effects of domestic conflict onset on the likelihood of interstate conflict onset. My first empirical test addresses Hypothesis 1, analyzing whether an onset of domestic conflict affects the likelihood of an onset of interstate conflict. [INSERT TABLE 1 HERE] In Table 1, I test for the effects of onsets of domestic conflict in month t 1 on the likelihood of an onset of interstate conflict in month t. Since I am interested in modeling onsets, both states in the dyad must have experienced an absence of domestic conflict in the previous month. Additionally, the dyad must not have experienced an interstate conflict in the previous month. I run three separate logistic regressions, which all account for the same set of Russet and Oneal (2001)control variables, but utilize difference cutpoints 20, 40, and 60 conflictual events needed to qualify as a domestic and interstate conflict. To further explain the use of cutpoints, each model in Table 1 corresponds to a single logistic regression. In Model 1, the dependent variable is interstate 20 and the key independent variables reflecting domestic conflict onsets are one domestic 20 and 89

98 both domestic 20, meaning that Model 1 assumes that both a state with fewer than 20 domestic material conflict events and a dyad with fewer than 20 interstate material conflict events are at peace, but once the 20 material conflict cutpoint is passed, the state and dyad are treated as being at conflict. 12 In all three models, an onset of domestic conflict in one or both of the two states comprising each dyad in month t 1 increases the likelihood of an interstate conflict onset in month t at statistically significant levels. Additionally, the relative size of coefficients in Table 1 suggest that dyads in which both states experiencing an onset of domestic conflict in month t 1 have a higher likelihood of an interstate conflict onset in month t than dyads in which only one state experiences a domestic conflict onset. To facilitate interpretation of the empirical findings in Table 1, I calculate the marginal effects of each of the key domestic conflict variables. To do so, I build three average dyads according to the eight control variables (by taking the mean of the continuous measures and the mode of the binary measures) one with no onsets of domestic conflict in month t 1, one with one onset of domestic conflict in t-1, and one with two onsets of domestic conflict in month t 1. Then, I use the mean and standard errors of coefficient estimates from the logistic regressions to calculate the mean predicted probabilities with 95% confidence intervals reflecting the likelihood of an onset of interstate conflict onset. I repeat this across the three cutpoints 20, 40, and 60 to serve as a robustness check. 13 [INSERT FIGURE 1 HERE] Figure 1 is divided into three columns, cutpoint 20, cutpoint 40, and cutpoint 60, which correspond to Model 1, Model 2, and Model 3 of Table 1, respectively. Along the x-axis in each column are labels neither, one, and both, which reflect whether neither, one, or both of the states comprising each dyad experienced an onset of domestic conflict in month t 1. The y-axis reflects the predicted probability of an onset of interstate conflict occurring in month t. The box plots reflect the mean and 95% confidence intervals of the estimates. Though difficult to visually recognize due to the small predicted probabilities, the 95% confidence interval do not overlap from the neither (i.e. neither of the two states in the dyad experience an onset of domestic conflict in 12 I perform additional robustness check would be mixing cutpoints, meaning use interstate 20 as a dependent variable but use one domestic 40 and both domestic 40 or one domestic 60 and both domestic 60 as the key independent variables. Results are consistent. 13 I also calculate marginal effects for a hypothetical dangerous dyad by assuming either Minor power =0 or Non contiguity=0. Findings are consistent with those reported in Figure 1. 90

99 month t 1) to the one (i.e. one of the two states in the dyad experiences an onset of domestic conflict in month t 1) across any of the three cutpoints. This reflects the statistically significant coefficients in Table 1, which strongly suggest that all else being equal, a dyad with one domestic conflict onset in month t 1 is more likely to experience an onset of interstate conflict in month t. Additionally, Figure 1 clearly illustrates that dyads in which both states experience an onset of domestic conflict are considerable more likely to experience an onset of interstate conflict than dyads in which either one or neither states experienced an onset of domestic conflict. Note that the large confidence intervals around the both box plots is due to the relatively small number of observations, as presented in Table 1. [INSERT TABLE 9 HERE] In Table 9, I report the odds ratio, which reflects how much more likely a dyad is to experience an onset of interstate conflict in month t following a month in which either one or both of the states experiences an onset of domestic conflict in month t 1 relative to a the likelihood of an interstate conflict onset in month t when neither of the two states experienced an onset of domestic conflict in month t 1. As Table 9 indicates, a dyad month following one domestic conflict onset in month t 1 is between 2.38 and 3.85 times more likely to experience an onset of interstate conflict in month t across the three cutpoints, relative to a dyad-month with no domestic conflict onsets in month t 1. Taken together, both the consistency of findings across cutpoints as well as the high degree of statistical significance in Table 1 provide strong support for Hypothesis 1. We can say with a high degree of confidence that domestic conflict onsets in one or (especially) both states in a dyad dramatically increases the likelihood that that dyad will experience an onset of interstate conflict in the following month. In Table 2, I repeat this process but lag the domestic conflict variables two months, as opposed to the one-month lag used in Table 1, Figure 1, and Table 9. The results using two-month lags are largely consistent with the findings when using a one-month lag. [INSERT TABLE 2 HERE] As Table 2 illustrates, an onset of domestic conflict in one or both states comprising each dyad occurs in month t 2, the likelihood of an interstate conflict onset in month t increases. 14 This 14 In Model 3 of Table 2, a standard logistic regression omits the both domestic 60. To overcome this problem, I instead implement a Firth Logit, as recommended by Zorn (2005). 91

100 relationship holds across all three cut points and consistently achieves >95% confidence. Although it seems logical to test for the effects of domestic conflict onset in month t 1 and month t 2 simultaneously in the same model, this is not possible, since by definition, if a conflict onset occurs in month t 2, month t 1 is dropped from the empirical model because, an onset can not occur after an onset occurred in the previous month. [INSERT FIGURE 2 HERE] Figure 2 provides the marginal effects of onsets of domestic conflict in one or both states in each dyad in month t 2 (as opposed to month t 1 in Figure 1) on the likelihood of interstate conflict onset in month t, calculated with the same approach used to build Figure 1, as detailed above. 15 As in Figure 1, dyads that experience an onset of domestic conflict in one or both states are more likely to experience an onset of interstate conflict. Additionally, the likelihood of an interstate conflict onset is greater when both states experience an onset of domestic conflict in month t 2 than when only one state experiences an onset. Furthermore, like in Figure 1, the large confidence intervals across all three columns when both states experience an onset of domestic conflict is the result of only a small number of dyads, as reported in Table 2. [INSERT TABLE 10 HERE] Table 10 is identical to Table 9, except it reflects the marginal effects of a two-month, rather than one-month lag. As Table 10 indicates, dyads in which one state experiences an onset of domestic conflict in month t 2 is between 2.4 and 3.9 times more likely to experience an onset of interstate conflict in month t. Furthermore, the likelihood of interstate conflict becomes between 4.4 and 26.8 times more likely when both states experience an onset of domestic conflict in month t Tests of the effects of domestic conflict intensity on the likelihood of interstate conflict onset. In Table 3, I test for the effects of whether increasing intensity of ongoing domestic conflicts in month t 1 increases the likelihood of an onset of an interstate conflict in month t. Since I am interested in the effects of domestic conflict intensity on interstate conflict onset, at least one of the two states in the dyad must have experienced a domestic conflict in month t 1 and the dyad must not have experienced an interstate conflict in month t 1 gain inclusion into the regression. 15 Note that in Figure 2, I cap the upper bound on the both boxplot in the cutpoint 60columnat.0005inordertofacilitateinterpretation.T heactualupperboundis

101 As in Table 3, I utilize logistic regression while accounting for the eight Russet and Oneal (2001) controls and test across the three different cutpoint values. [INSERT TABLE 3 HERE] In both Model 1 and Model 2 of Table 3, the key variables reflecting increasing intensity in domestic conflicts one worse 20 and both worse 20 in Model 1 and one worse 40 and both worse 40 in Model 2 achieve statistical significance at 95% confidence, suggesting that the likelihood of an interstate conflict onset increases as the level of domestic conflict intensity in one or both states comprising the dyad increases. However, in Model 3, only one worse 60 achieves relatively weak statistical significance, with the estimated effect of both worse 60 being statistically indistinguishable from zero. [INSERT FIGURE 2 HERE] Figure 2 presents the estimated marginal effects of increases in intensity of domestic conflicts across the three cutpoints on the likelihood of an interstate conflict. The box plots calculated following the same procedure used in Figure 1 visually represents the empirical findings in Table 2. As the cutpoint 20 and cutpoint 40 columns indicate, the likelihood of an interstate conflict onset increases slightly as the intensity of domestic conflict increase in one of the states, and dramatically increases in months following an increase in domestic conflict intensity in both states. Based on these estimated marginal effects, when the cutpoint = 20, the likelihood that a dyad experiences an onset of interstate conflict in month t is approximately 50% greater when one state experiences an increase in intensity of a domestic conflict in month t 1, and 70% greater when both states experience increasing intensity of domestic conflict in month t 1. In the third column, labeled cutpoint 60, the mean predicted probability of interstate conflict onset when one and both states experienced an increase in intensity in ongoing domestic conflict is within the 95% confidence interval of the predicted probability of an onset of interstate conflict when neither of the two states comprising the dyad experienced increasing intensity in a domestic conflict, which indicates that increasing intensity in domestic conflicts does not have statistically significant impact on the likelihood of interstate conflict. [INSERT TABLE 11 HERE] Table 11 follows the same procedure used to build Table 9, this time reporting how much more likely an interstate onset becomes between a dyad in month t as one or both of the states comprising 93

102 the dyad experience a worsening domestic conflict in month t 1. Column 1 and Column 2 indicate that across the first two cutpoints, a dyad becomes between 1.28 and 1.92 times more likely to experience an interstate conflict onset in month t as one or both of the states comprising the dyad experience a more severe domestic conflict in month t 1. As reflective of the findings in Table 3 and Figure 2, the marginal effects at cutpoint 60 do not achieve statistical significance at a meaningful level, which reduces our overall confidence in the strong findings across cutpoint 20 and cutpoint 40. In Table 4, I test for whether changes in domestic conflict intensity during month t 2 affect the likelihood of an interstate conflict during month t. [INSERT TABLE 4 HERE] The findings are similar to those in Table 3, but the statistical significance is more consistent across cut points and the marginal effects are even stronger. Using 1-month lag in Table 3, one worse 60 is weakly significant and one worse 60 fails to achieve a p-value of.1. However, in Table 4, both one worse 60 one worse 60 generate strong statistical significance at the.01 level. [INSERT FIGURE 3 HERE] Figure 3 visualizes the results in Table 4, demonstrating that across all cut points, an onset of interstate conflict becomes considerably more likely in month t when one or both states comprising the dyad experience an increasingly intense domestic conflict in month t 1. Unlike in Table 3 and Figure 2, these results hold when the cutpoint=60. [INSERT TABLE 12 HERE] Table 12 presents how much more likely an interstate conflict onset during month t becomes as one or both of the states in each dyad experience an increasingly intense domestic conflict during month t 2. Interestingly, all of the marginal effects are stronger than they are in Table 10, which reflects the marginal effects at a one-month lag. For example, at cutpoint=40, an interstate conflict onset in month t becomes 128% more likely as both states experience increasing intense domestic conflicts in month t 1, but 220% more likely when both states experiencing increasingly intense domestic conflicts in month t 2. These findings, interpreted jointly with the results in Table 10, are interesting for a number of reasons. First, it suggests that it takes interstate dynamics varying amounts of time to respond to domestic conditions. For example, Table 10 demonstrates that states tend to meaningfully respond to events occurring in the previous month, but Table 11 shows 94

103 that interstate dynamics respond even stronger to events occurring two months ago. Though it is outside the focus of this chapter to rigorously analyze why this is the case, it seams feasible that domestic institutional processes may slow response times to domestic crises, meaning that states foreign policies often take longer than one-month to respond to important events. Second, the tests of the one-month lag highlight the importance of using different cutpoints as a robustness check. Since the empirical findings are not consistent across all three cutpoints (like they were in Model 1 and Figure 1), we must be less confident about the strength of findings. With that in mind, Table 2 and Figure 2 still provide general support for Hypothesis 2 that the likelihood of an onset of interstate conflict increase in months following increases in intensity in domestic conflicts Tests of the effects of domestic conflict intensity on interstate conflict intensity. Thus far, I have analyzed the effects of domestic conflict onset on the likelihood of interstate conflict onset and the effects of domestic conflict intensity on the likelihood of an interstate conflict onset. Finally, in this section, I address whether changes in domestic conflict intensity affect the intensity of ongoing interstate conflicts. In Table 3, I test for the effects of whether one or both states experienced more intense domestic conflicts in month t 1 on whether the ongoing interstate conflict becomes more intense in month t than in month t 1. Given this focus, I only model dyad-months in which at least one of the two states experienced greater than the cutpoint number of material conflict events in month t 1 and a >cutpoint number of interstate material conflict events in month t 1. Of the over 4 million dyad-month observations in my dataset, only 3,791, 1,074, and 458 dyad-months meet this requirement across the three cutpoints, respectively. As in Table 1 through Table 4, I run a series of logistic regressions while accounting for eight Russet and Oneal (2001) controls. [INSERT TABLE 5 HERE] As Table 5 reflects, none of six measures reflecting whether one or both countries in each dyad experienced more intense domestic conflict achieved statistical significance. In Table 6, I repeat the analysis using a two-month lag of the domestic conflict variables. [INSERT TABLE 6 HERE] Results are similar to Table 5, with the single exception that one worse 60 achieves moderate statistical significance. However, it seems unlikely that this relationship is robust based on the lack of consistency across Model 1 and Model 2 in Table Rather, it is more likely that the

104 statistical significance of one worse 60 in Model 3 is simply fitting noise in the dataset. Overall, a joint interpretation of the results in Table 5 and Table 6 jointly provide suggest that the causal mechanisms purported in Hypothesis 3 and Hypothesis 4 may not be present in the data, though I provide further testing of these hypotheses in Table 7 and Table 8 below. [INSERT TABLE 7 HERE] In Table 7, I provide an additional test of the effects of changes in the intensity of ongoing domestic conflicts on changes in the intensity of ongoing interstate conflicts, but this time I utilize the three continuous measures that reflect changing intensity of interstate conflict (interstate change 20, interstate change 40, and interstate change 60 ) as dependent variables. Since my dependent variables are now continuous, I utilize a basic OLS regression. In Model 1 and Model 2 of Table 4, dyads in which both states experienced worsening domestic conflict in month t 1 tended to engage in fewer interstate material conflict events in month t. Interpreting the coefficients, Model 1 suggests that as both states experience more intense domestic conflicts in month t 1, these two states tend to engage in approximately 4.6 fewer interstate material conflict events in month t. When the cutpoint shifts to 40 in Model 2, we should expect to see over 7 fewer interstate material conflict events in month t if both states experienced more intense domestic conflict in month t 1. However, the failure of both worse 60 to achieve statistical significance in Model 3 tempers our confidence in the the significant and negative relationship between two countries experiences more intense domestic conflict and the dyad experience less intense interstate conflict. [INSERT TABLE 8 HERE] As in Section 4.1 and Section 4.2, I rerun the empirical models in Table 7 using a two-month lag of the domestic conflict variables, with results presented in Table 8. Although results are not perfectly consistent across Model 1, Model 2, and Model 3 in Table 8, a joint interpretation of the three models suggests that increases in intensity of domestic conflicts in month t 2tendsto decrease the intensity of ongoing interstate conflicts in month t. Additionally, unlike in Section 4.3, the lack of consistency in Table 7 and Table 8 makes it difficult to compare the substantive effects of the one- and two-month lag. One potentially interesting comparison is that in Table 7, none of the three one worse variables achieve statistical significance, but two of the both worse variables do. In Table 8, this is reversed, with only one of the three both worse variables achieving a p-value.1 but all three of the one worse variables generating p-values.05. While this may suggest that interstate conflict dynamics may respond more rapidly to changes in domestic conflict intensity in both states 96

105 than to changes in domestic conflict intensity occurring in one state, the lack of robust empirical support across cut points prevents me from asserting this relationship with meaningful confidence. Overall, across the 12 empirical tests run in Table 5, Table 6, Table 7, and Table 8, I find no support for Hypothesis 3 increases in intensity of domestic conflict should increase the intensity of ongoing interstate conflicts. Conversely, Table 7 and Table 8 provide some support for Hypothesis 4 increase in intensity of domestic conflicts should lead to decreases in intensity of ongoing interstate conflicts. 5. Conclusion In the real world, we know that the levels of domestic and interstate conflict can fluctuate rapidly. One month, Rwanda is at relative domestic peace, the next three months, it experiences a horrific genocide, and the next month it returns to relative peace. Similarly in the interstate conflict context, one month India and Pakistan are at peace, the next month there is an escalation in violence, and the following month they return to peace. Also based on real-world observations, we have a strong expectation that often times, domestic conflicts tend to affect interstate relations. Given these observations, a number of interesting questions emerge regarding potential relationships between domestic conflict and interstate conflict. Do onsets of domestic conflict in a state increase the likelihood that it will engage in interstate conflict with a neighbor? If two states are engaged in interstate conflict are also fighting domestic conflicts, do increases in intensity of the domestic conflicts tend to lead to increases in intensity in the interstate conflict as well? Despite the massive number of quantitative studies of both domestic and interstate conflict, studies testing for relationships between domestic and interstate conflict are scarce, primarily due to a lack of appropriate data. For example, Gleditsch, Salehyan and Schultz (2008) is the most comprehensive study to date analyzing the relationship between domestic and interstate conflict, but their use of state-level binary measures of both domestic and interstate conflict (as is ubiquitous throughout the related literature) inhibit their ability to both construct measures of conflict intensity and test for sub-annual level variation that tends to be ubiquitous in all conflicts. The key advancement of this study is my use of the GDELT event data to build measures reflecting levels of domestic and interstate conflict at the monthly level for all countries and dyads in the world. This allows me to test four hypotheses that the extant literature has theorized to be 97

106 true but never before been able to test empirically. Based on various logistic and OLS regressions, I find strong support for Hypothesis 1, moderate support for Hypothesis 2, no support for Hypothesis 3, and moderate support for Hypothesis 4. Additionally, increases in domestic conflict intensity in both states in month t 1 tending to leader to less intense interstate conflicts in month t. 16 Overall, I believe that this study has two major takeaways as well as two clear paths for future research. First, we now have sufficiently nuanced data to move beyond coarse, yearly level binary measures of conflict. With event data, we can build monthly (or even weekly or daily) level measures that are able to capture the sub-annual variation in the level of domestic and interstate conflicts. This should allow researchers to test for a host of theoretically expected relationships that have heretofore been difficult or impossible to empirically test given a lack of data. Second, the empirical evidence seems to suggest that if you are interested in analyzing interstate conflict onsets, you should account for whether one or both states in the dyad recently experienced an onset of interstate conflict or increasingly intense conflicts if one or both states have ongoing domestic conflicts. It is possible that by accounting for various measures of domestic conflicts, inferences drawn on other variables may change. In terms of future research, this chapter has provided some preliminary answers to hypotheses framed in a basic do these relationships exist? format. For some proposed relationships, including the one suggested in Hypothesis 1, my empirical results provide strongly suggests that the answer is yes. The next logical test is to test for hypotheses that propose why these relationships exist. Again, the major obstacle to asking these more difficult why questions has been a lack of data. However, as I attempt to highlight throughout this chapter, the 200 million (and counting) events in the GDELT dataset make it possible to ask increasingly nuanced questions, and I am confident that moving forward, scholars will be able to use GDELT to isolate specific causal mechanisms such as intervention or diversion that may be responsible for the strong effect that domestic conflicts seems to have on interstate conflicts. Additionally, this chapter provides an initial framework for analyzing whether the effects of domestic conflict events on interstate conflict dynamics change as the amount of time since the occurrence of the domestic events increases. For example, I find some support suggesting that domestic conflict onsets in month t tend to increase the likelihood of an interstate conflict onset more in month t + 2 than in month t + 1. Future studies could focus more heavily on this and 16 strong>moderate>weak>no. 98

107 similar findings and potentially isolate factors, such as institutional design or regime type, that may affect the time that amount of time that elapses before an interstate conflict reflects changes occurring at domestic levels. 99

108 References Akcinaroglu, Seden and Raziszewski Expectation, Rivalries, and Civil War Duration. International Interactions 31(4): Barbieri, Katherin, Omar M.G. Keshk and Brian M. Pollins Tradeoffs in Trade Data: Do Out Assumptions Affect Our Resul. Conflict Management and Peace Science 21(2): Bennett, D. Scott and Allan Stam EUGene: A Conceptual Manual. International Interactions 26: Bennett, D. Scott and Timothy Nordstrom Foreign Policy Substitutability and Internal Economic Problems in Enduring Rivalries. Journal of Conflict Resolution 44(1): Chiozza, Giacomo, Kristian Gleditsch and Hein E. Goemans Civil War, interstate conflict, and tenure. Paper Presented at the Polarization and Conflict Workshop, Nicosia, Cyprus. Collier, Paul On the Economic Consequences of Civil War. Oxford Economic Papers 51: Davies, Graeme A. M Domestic Strife and the Initiation of International Conflicts: A Directed Dyad Analysis, The Journal of Conflict Resolution 46(5): Elbadawi, Ibrahim and Nicholas Sambanis External interventions and the duration of civil wars. In World Bank. Policy Research Working Paper Series 2433, World Bank. Fearon, James D Rationalist explanations for war. International Organization 49(03): Fordham, Benjamin O Another Look at Parties, Voters, and the Use of Force Abroad. Journal of Conflict Resolution 46(4): Ghosn, Faten, Glenn Palmer and Stuart Bremer The MID3 Data Set, : Procedures, Coding Rules, and Description. Conflict Management and Peace Science 21: Gibler, Douglas M. and Meredith Sarkees Measuring Alliances: The Correlates of War Formal Interstate Alliance Data set, Journal of Peace Research 41(2): Gleditsch, Kristian Skrede Transnational dimensions of civil war. Journal of Peace Research 44(3): Gleditsch, Kristian Skrede, Idean Salehyan and Kenneth Schultz Fighting at Home, Fighting Abroad: How Civil Wars Lead to International Disputes. Journal of Conflict Resolution 52(4):

109 Gleditsch, Nils Petter, Peter Wallensteen, Mikael Eriksson, Margareta Sollenberg and Hvard Strand Armed Conflict : A New Dataset. Journal of Peace Research 39(5): Humphrey, Leonard The Way of the Heavenly Sword: The Japanese Army in the 1920s. Palo Alto, CA: Stanford University Press. Le Billon, Phillippe The Political Ecology of War: Natural Resources and Armed Conflicts. Political Gepography 20(5): Marshall, Monty G. and Keith Jaggers Polity IV Project: Political Regime Characteristics and Transitions, The Polity IV Dataset, retrieved from [ Mitchell, Sara McLaughlin and Brandon C. Prins Rivalry and Diversionary Use of Force. Journal of Conflict Resolution 48(6): Ostrom, Charles W. and Brian Job The President and the Political Use of Force. American Political Science Review 80(2): Pevehouse, Jon C. and Kevin Nordstrom, Timothy an Warnke The COW-2 International Organization Dataset Version 2.0. Conflict Management and Peace Science 21(2): Regan, Patrick M Civil Wars and Foreign Powers: Interventions and intrastate conflict. University of Michigan. Russet, Bruce and John Oneal Triangulating Peace. Toronto: W. W. Norton and Company. Sambanis, Nicholas A Review of Recent Advances and Future Directions in the Quantitative Literature on Civil War. Defense Economics 13(1): Sarkees, Merideth Reid and Frank Wayman Resort to War: CQPress. Schultz, Kenneth The Enforcement Problem in Coercive Bargaining: Interstate Conflict Over Rebel Support in Civil Wars. Defense Economics 13(1): Singer, David J., Stuart Bremer and John Stuckey Capability Distribution, Uncertainty, and Major Power War, In Peace, War, and Numbers, ed. Bruce Russett. Beverly Hills: Sage pp Smith, Alastair Diversionary Foreign Policy in Democratic Systems. International Studies Quarterly 40(1): Stinnett, Douglas M., Jaroslav Tir, Philip Schafer, Paul F. Diehl and Charles Gochman The Correlates of War Project Direct Contiguity Data, Version 3. Conflict Management and Peace Science 19(2):

110 Thyne, Clayton L Cheap Signals with Costly Consequences: The Effect of Interstate Relations on Civil War. Journal of Conflict Resolution 50(6): Trumbore, Peter F Victims or aggressors? Ethno-political rebellion and use of force in militarized interstate disputes. International Studies Quarterly 47(3): Walt, Stephen Revolution and War. Ithica, NY: Cornell University Press. Zacher, Mark W The Territorial Integrity Norm: International Boundaries and the Use of Force. International Organization 55: Zorn, Christophe A Solution to Separation in Binary Response Models. Political Analysis 13:

111 6. Appendix Table 1. The Effects of Lagged Domestic Conflict Onset on Interstate Conflict Onset with 1-month Lag Model 1 Model 2 Model 3 Variable (interstate 20 ) (interstate 40 ) (interstate 60 ) log distance -1.11*** -1.10*** -.98*** IGO count.03***.31***.04*** Alliance.35***.36***.51*** Minor Powers -1.28*** -1.55*** -2.31*** Power Ratio.29**.55*** 1.10*** Depend L ** Democ L -.08*** -.12*** -.17*** Non-Contiguity 4.55*** 4.33*** 3.51*** l.one domestic 20.99*** l.both domestic *** l.one domestic *** l.both domestic *** l.one domestic *** l.both domestic *** Constant -4.25*** -5.06*** -6.23*** N 3,989,306 4,313,227 4,427,683 # of Interstate onsets # of one domestic onsets 147,798 65,794 36,473 # of both domestic onsets 2, Coefficients with p-values reflected by: ***(.01), **(.05), *(.10) 103

112 Table 2. The Effects of Lagged Domestic Conflict Onset on Interstate Conflict Onset with 2-month Lag Model 1 Model 2 Model 3 Variable (interstate 20 ) (interstate 40 ) (interstate 60 ) log distance -1.14*** -1.10*** -1.02*** IGO count.03***.03***.04*** Alliance.31***.377***.49*** Minor Powers -1.30*** -1.60*** -2.42*** Power Ratio.45***.58** 1.14*** Depend L * Democ L -.08*** -.11*** -.17*** Non-Contiguity 4.96*** 4.35*** 3.89*** l2.one domestic 20.84*** l2.both domestic *** l2.one domestic *** l2.both domestic *** l2.one domestic *** l2.both domestic ** Constant -4.22*** -4.99*** -6.10*** N 3,976,179 4,297,435 4,410,831 # of Interstate onsets # of one domestic onsets 147,377 65,471 36,126 # of both domestic onsets 2, Coefficients with p-values reflected by: ***(.01), **(.05), *(.10) 104

113 Table 3. The Effects of Changes in Intensity of 1-month Lagged Ongoing Domestics Conflicts on Interstate Conflict Onset Model 1 Model 2 Model 3 Variable (interstate 20 ) (interstate 40 ) (interstate 60 ) log distance -.12*** -.19*** -.25*** IGO count.03***.03***.04*** Alliance.05*.15**.31** Minor Powers -.87*** -.78*** -.97*** Power Ratio.45**.47***.60* Depend L.46**.82**.06 Democ L -.04*** -.06*** -.07*** Non-Contiguity l.one worse 20.30*** l.both worth 20.76*** l.one worse 40.35*** l.both worse *** l.one worse 60.30* l.both worse Constant -2.57*** -3.88*** -4.79*** N 580, , ,008 # of Interstate onsets # of one worse 187,679 88,530 49,320 # of both worses 8,438 1, Coefficients with p-values reflected by: ***(.01), **(.05), *(.10) 105

114 Table 4. The Effects of Changes in Intensity of 2-month Lagged Ongoing Domestics Conflicts on Interstate Conflict Onset Model 1 Model 2 Model 3 Variable (interstate 20 ) (interstate 40 ) (interstate 60 ) log distance -1.6*** -.14*** -.16 IGO count.03***.05***.07*** Alliance.11***.33***.93*** Minor Powers -1.25*** -1.45*** -1.53*** Power Ratio.38***.49*.26 Depend L.42** Democ L -.05*** -.08*** -.10*** Non-Contiguity l2.one worse 20.64*** l2.both worth 20.83*** l2.one worse 40.84*** l2.both worse 40.79*** l2.one worse 60.55*** l2.both worse *** Constant -3.19*** -5.50*** -8.62*** N 589, , ,598 # of Interstate onsets 1, # of one worse 186,350 87,821 48,965 # of both worses 8,471 1, Coefficients with p-values reflected by: ***(.01), **(.05), *(.10) 106

115 Table 5. The Effects of Changes in Intensity of 1-month Lagged Ongoing Domestics Conflicts on Whether an Ongoing Interstate Conflict Becomes more Intense Model 1 Model 2 Model 3 Variable (interstate 20 ) (interstate 40 ) (interstate 60 ) log distance.10* IGO count Alliance.26***.35***.22 Minor Powers Power Ratio ** -.67 Depend L ** -.34 Democ L Non-Contiguity -.82** l.one worse l.both worth l.one worse l.both worse l.one worse l.both worse Constant -2.09*** -2.34*** -.87*** N , # of more intense interstate conflicts # of one worse 1, # of both worses Coefficients with p-values reflected by: ***(.01), **(.05), *(.10) 107

116 Table 6. The Effects of Changes in Intensity of 2-month Lagged Ongoing Domestics Conflicts on Whether an Ongoing Interstate Conflict Becomes more Intense Model 1 Model 2 Model 3 Variable (interstate 20 ) (interstate 40 ) (interstate 60 ) log distance.11* IGO count Alliance.23***.45***.36 Minor Powers Power Ratio *.72 Depend L Democ L Non-Contiguity -.96** l2.one worse l2.both worth l2.one worse l2.both worse l2.one worse ** l2.both worse Constant -1.92*** -2.54*** -.91*** N 3, # of more intense interstate conflicts # of one worse 1, # of both worses Coefficients with p-values reflected by: ***(.01), **(.05), *(.10) 108

117 Table 7. The Effects of Changes in Intensity of 1-month Lagged Ongoing Domestics Conflicts on Changes in Interstate Conflict Intensity Model 1 Model 2 Model 3 Variable (interstate 20 ) (interstate 40 ) (interstate 60 ) log distance IGO count.11**.34**.23 Alliance 1.55*** Minor Powers Power Ratio *** ** Depend L Democ L Non-Contiguity l.one worse l.both worth *** l.one worse l.both worse * l.one worse l.both worse Constant *** *** *** N , # of one worse 1, # of both worses Coefficients with p-values reflected by: ***(.01), **(.05), *(.10) Table 8. The Effects of Changes in Intensity of 2-month Lagged Ongoing Domestics Conflicts on Changes in Interstate Conflict Intensity Model 1 Model 2 Model 3 Variable (interstate 20 ) (interstate 40 ) (interstate 60 ) log distance IGO count Alliance 1.19** Minor Powers Power Ratio ** ** Depend L Democ L Non-Contiguity l2.one worse ** l2.both worth *** l2.one worse *** l2.both worse l2.one worse ** l2.both worse Constant *** *** -6.30*** N 3, # of one worse 1, # of both worses Coefficients with p-values reflected by: ***(.01), **(.05), *(.10) 109

118 Table 9. How much more likely is an interstate conflict onset in month t relative to neither state experiencing an onset of domestic conflict in month t 1 (cutpoint 20 ) (cutpoint 40 ) (cutpoint 60 ) one 2.38x* 3.78x* 3.85x* both 2.81x* 4.56x* 4.46x* * indicates statistically significant at 95% confidence Table 10. How much more likely is an interstate conflict onset in month t relative to neither state experiencing an onset of domestic conflict in month t 2 (cutpoint 20 ) (cutpoint 40 ) (cutpoint 60 ) one 2.29x* 3.67x* 7.45x* both 4.41x* 8.68x* 24.76x* * indicates statistically significant at 95% confidence Table 11. How many times more likely is an interstate conflict onset in month t relative to neither state experiencing a more intense ongoing domestic conflict in month t 1 (cutpoint 20 ) (cutpoint 40 ) (cutpoint 60 ) one 1.33x* 1.39x* 1.31x both 1.92x* 1.28x*.90x * indicates statistically significant at 95% confidence Table 12. How many times more likely is an interstate conflict onset in month t relative to neither state experiencing a more intense ongoing domestic conflict in month t-2 (cutpoint 20 ) (cutpoint 40 ) (cutpoint 60 ) one 1.90x* 2.32x* 1.73x* both 2.28x* 2.20x* 4.19x* * indicates statistically significant at 95% confidence 110

119 cutpoint_20 cutpoint_40 cutpoint_60 Predict probablity of an interstate conflict onset in month t neither one both neither both one Whether 'neither', 'one', or both states experienced neither one both a domestic conflict onset in month t-1 Figure 1. The Effects of Domestic Conflict Onset in Month t-1 on the Likelihood of Interstate Conflict Onset in Month t 111

120 cutpoint_20 cutpoint_40 cutpoint_60 5e-04 Predict probablity of an interstate conflict onset in month t 4e-04 3e-04 2e-04 1e-04 0e+00 one neither both neither both one Whether 'neither', 'one', or both states experienced a neither both one domestic conflict onset in month t-2 (with ``both'' for cutpoint_60 capped) Figure 2. The Effects of Domestic Conflict Onset in Month t-2 on the Likelihood of Interstate Conflict Onset in Month t 112

121 cutpoint_20 cutpoint_40 cutpoint_60 Predict probablity of an interstate conflict onset in month t neither both one neither both one Whether 'neither', 'one', or both states experienced a more severe domestic conflict in month t-1 than month t-2 neither one both Figure 3. The Effects of an Increasingly Severe Domestic Conflict in Month t-1 on the Likelihood of Interstate Conflict Onset in Month t 113

122 cutpoint_20 cutpoint_40 cutpoint_ Predict probablity of an interstate conflict onset in month t neither both one neither both one Whether 'neither', 'one', or both states experienced a more severe domestic conflict in month t-2 than month t-3 neither one both Figure 4. The Effects of an Increasingly Severe Domestic Conflict in Month t-2 on the Likelihood of Interstate Conflict Onset in Month t 114

123 CHAPTER 3. PREDICTING FUTURE LEVELS OF VIOLENCE IN AFGHANISTAN DISTRICTS 1. Introduction For centuries, key pillars of the philosophy of science like Francis Bacon and David Hume, have stressed that scientific progress occurs through the development of consistently accurate, replicable, and falsifiable predictive models. Building on these argument, numerous scholars of political conflict, including Choucri (1974), Singer and Wallace (1979), Beck, King and Zeng (2000), Bueno de Mesquita (2002), and Ward, Greenhill and Bakke (2010), have similarly stressed the importance of predictive models for two main reasons. First, as Beck, King and Zeng (2000), Weidmann and Ward (2010), and others convincingly argue, predictions are vital for the development of theories about the causes of violence, since the most rigorous way to test whether an empirical model is actually reflecting a real-world data generating process, or simply fitting noise, is to measure its forecast accuracy. 1 Second, accurate conflict forecasts can be tremendously useful in the real world they can help peacekeepers allocate scarce resources, inform Non-governmental Organizations (NGOs) on potential hot-spots to avoid, and even provide speculative investment opportunities. Although the majority of empirical studies of conflict continue to focus on explanation primarily in the form of interpreting coefficients and standard errors established through in-sample testing a smaller though considerable number papers and projects exist with the explicit goal of building dynamic forecasts of future levels of violence. Likewise, the goal of this chapter is to build a forecasting model, though not for theory-building or hypothesis-testing, but rather to create a proof of concept tool for real-time, policy relevant decision making. Extant empirical forecasting studies focusing on domestic conflict range tremendously in terms of data, methods, and scope. The most coarse studies build forecasts at that state-year level using primarily structural variables like GDP per capita, ethnic diversity, and infant mortality (see Gurr and Harff (1996), King and Zeng (2001), Fearon and Laitin (2003), and Goldstone et al. (2010)), which are useful in some contexts but unable to build predictions beyond the state-year unit of analysis. The majority of studies attempting to build empirical forecasts of violence use 1 I use prediction and forecast interchangeably throughout this chapter. 115

124 more fine grained, event data coded at the daily and sometimes local level, as these data allow scholars to capture more dynamic patterns of violence and ultimately build more detailed forecasts than those using state-year, structural data. Historically, scholars building empirical forecasting models of violence have used either machine-coded (like KEDS, ICEWS, 10 Million International Dyadic Events Dataset) or human-coded event data datasets (like ACLED, KOSVED, SCAD, etc. ) built form open source text, with the majority of scholars utilizing the machine-coded option. Recently, however, WikiLeaks has provided an alternative data set of conflict events that previously required security clearance from the United States Government to access, but have subsequently been illegally obtained and distributed to the public. The logical question, then, is which of these sources of data is more appropriate for this study? Given the goal of this chapter, an ideal dataset would contain the following five key attributes: (1) Broad spatial coverage: Global coverage is preferable to one with country or region specific coverage as it would enable a forecasting model to be built for any global location. (2) Density: Predictive algorithms tend to perform better with more data, meaning that many fine-grained events is preferable to fewer larger scale events. (3) Geo-coding: Sub-state, geo-spatial predictions require sub-state, geo-coded events. (4) Accuracy: The data should accurately reflect the events as they occur in reality in order to build relevant predictions. (5) Future availability in real-time: If the data are not accessible in the future in real or near real-time, then it becomes highly difficult to build actionable predictions. As discussed in Chapter 1 in greater detail, the GDELT dataset provides greater spatial coverage, event density, and prospects for future availability in real-time than either the human coded datasets or the WikiLeaks datasets. Accuracy is likely greater for the WikiLeaks dataset since it is based on first hand accounts, and ongoing debate exists regarding the accuracy of human-coded and machine-coded datasets suggesting that neither may have a clear advantage (see King and Lowe (2004), O Brien (2010), O Loughlin et al. (2010), Schrodt (2012), Chojnacki et al. (2012), and Eck (2012) for discussions of the accuracy of machine-coded and human-coded event data datasets). However, because the ultimate goal is building policy relevant predictions in real (or near real) time, the fifth attribute is a necessary condition that neither the human-coded nor WikiLeaks dataset meet. Thus, GDELT is the most appropriate dataset. 116

125 This is the first study to ever use open-source, machine-coded event data to build forecasts of political violence at a sub-state level of geospatial aggregation. Since the process of aggregating conflict events into sub-state units based on latitude and longitude is currently time and computationally intensive, doing so on a global scale exceeds the scope of a dissertation chapter. Thus, I focus on forecasting conflict in sub-state geospatial units in a single country: Afghanistan. I choose Afghanistan for two reasons. First, there is dense political violence across a long time-frame ( ) with considerable variation at local levels. Second, Mangion-Zammit et al. (2012) have demonstrated the ability to build forecasts with the WikiLeaks data, meaning that to the extent it is possible at all to build temporally and geo-spatially nuanced forecasts of political violence using open source, machine-coded event data, it should be feasible in Afghanistan. Although I focus primarily on building predictions one-month in advance at the district-month unit of analysis (Afghanistan s smallest administrative unit, N=317), I also build forecasts at the province-month (N=32) and the country-month (N=1) level, which provides a rudimentary test of the effects of geo-spatial aggregation on forecast accuracy. Empirically, I use an autoregressive fractionally integrated moving average (ARFIMA) model, which builds forecasts of levels of material conflict one-month-in-advance that consistently outperforms a naive model assuming that the level of violent in a location during a month will be the same as it was in the same location in the previous montnh. The ARFIMA model performance decrease relative to the naive model at each additional level of geo-spatial aggregation, suggesting further justification for the use of fine-grained geo-spatial analyses. Additionally, I implement two logical extensions to the univariate ARFIMA model, first by building and modeling additional features, and second by incorporating exogenous drug price data to ARFIMA model, though neither enhance predictive accuracy. The remainder of this chapter provides a review of relevant literature, details my research design and ARFIMA forecasting model, discusses two logical extensions, and lastly concludes. 2. Literature Review To facilitate this review of relevant literature, I organize studies that forecast domestic political violence into the three general types of data that they use: machine-coded, human-coded, and WikiLeaks Machine-coded data. Although a large number of studies utilize machine-coded event data (see Appendix), a much smaller subset of these studies build forecasts: Schrodt and Gerner (1997) 117

126 use discriminant analysis to predict conflict phases in the Levant, Schrodt (1999) uses HMMs to forecast conflict in southern Lebanon, Pevehouse and Goldstein (1999) use time-series to predict events in the Serbia-Kosovo conflict, Schrodt and Gerner (2000) forecast unique clusters of conflict in the Levant from 1979 to 1997, Schrodt (2000) uses HMM s to forecast conflict dynamics in the Levant form 1979 to 1997, Bond et al. (2004) forecast conflict in Indonesia, Shellman (2004b) forecasts conflicts between government and dissident actors in Chile and Venezuela, Brandt and Freeman (2005) use Bayesian time-series to forecast dynamics between the United States, Israel, and Palestine, Schrodt (2006) forecasts conflict in the Balkans using HMMs, Shearer (2006) uses HMMs to forecast conflict between Israel and Palestine, Bagozzi (2011) uses zero-inflated count models and D Orazio, Yonamine and Schrodt (2011) use sequence analysis to forecast domestic conflict in 29 Asian countries, and Brandt, Freeman and Schrodt (2011) employ Markov Switching Bayesian Vector Autoregression (MS-BVAR) for forecast domestic and inter-state conflict in the Levant in Although these and other scholars demonstrate the ability to generate accurate forecasts of when and between whom conflict will occur in the future using open-source, machine-coded event data, they have been unable to predict where this conflict will occur at a sub-state level since none of the relevant machine-coded event data datasets provided geo-location information prior to GDELT Human-coded data. A number of geo-located, human-coded event data datasets exist that could allow researchers to build forecasts of violence at specific sub-state geographic units. For example, the Armed Conflict Location and Event Dataset (ACLED), which provides over 75,000 geo-coded violent events with (both atomic and composite) for approximately 60 countries, including all of Africa, and other, conflict-prone countries throughout the world (see Raleigh et al. (2010)), Daly (2012) provides a dataset with 7,729 geo-coded acts of violence in Colombia from , Schneider, Bussman and Ruhe (2012) presents the Konstanz One-Sided Violence Event Dataset (KOSVED) with 21,458 attacks against civilians in Bosnia, Urdal and Hoelscher (2012) introduces a dataset of 4,003 events occurring in 55 major cities in Asia and sub-saharan Africa from 1960 to 2008, and Salehyan et al. (2012) introduce the Social Conflict in Africa Database (SCAD), which contains 7,200 events of political unrest occurring in 47 African countries from Despite the geospatial nuance of these datasets, it is somewhat surprising that only Weidmann and Ward (2010) uses one of the aforementioned datasets (ACLED) in order to build predictions, whereas dozens of other articles dimly focus on explanation. 118 Weidmann and Ward (2010) use

127 ACLED s Bosnia dataset in order to build a model that predicts a binary measure of whether a given municipality-month in Bosnia. In total, 4,796 municipality months exists (109 municipalities form March 1992 to October 1995), of which 301 experienced an ACLED conflict event and are treated as a 1. They build a model based on exogenous variables (population, ethnic diversity, borders, and mountains) as well as various endogenous lags of the dependent variable, and utilize a Markov Chain Mote Carlo (MCMC) technique to estimate a logistic regression which is then used to calculate predictions in a rigorous out-of-sample framework, which I discuss in greater detail in Section 4.2. Despite making major theoretical and empirical contributions to the study of political violence, the fact that the only study to build out-of-sample forecasts using human-coded event data (e.g. Weidmann and Ward (2010)) did so for a conflict that ended five years prior to the release of the study underscores the slow, tedious nature of building human-coded datasets that makes them extremely difficult to update sufficiently close to real time as to build policy-relevant forecast actually for the future WikiLeaks data. On July 25, 2010, WikiLeaks publicly released the majority of classified documents comprising both the Afghan War Diary (containing 91,731 documents) and the Iraq War Log (containing 391,832), which contain classified documents that provide a highly detailed account of events occurring in Afghanistan and Iraq from January 2004 through December Additionally, in 2010, the United States government declassified subsections of the Afghan War Diary and the Iraq war log, called Significant Acts (SIGACT). Although both the WikiLeaks and SIGACT datasets have become difficult to obtain, a number of academic studies have been published that empirically model these data for both Iraq and Afghanistan. Like studies discussed in Section 2.2, the majority of studies using the WikiLeaks and SIGACT data focus on explanation, rather than prediction. For example, Berman et al. (2011) analyze the effects of sub-state level unemployment data for 297 district-quarters (3 quarters for 99 districts) for Iraq and 2,160 district-months (6 months for between 363 and 365 districts) for Afghanistan on levels of violence using the SIGACT data; Weidmann and Salehyan (2011) use the SIGACT data to analyze the effects of the U.S. surge in Iraq on levels of violence in 85 neighborhoods in Baghdad; O Loughlin et al. (2010) use hotspot and cluster analysis to compare the Afghan War Diaries data to ACLED s Afghanistan data; Linke, Witmer and O Loughlin (2012) model violence dynamics between the U.S-led coalition forces and 119

128 insurgent by analyzing 301,374 violent events aggregated at the three-day, 30-by-30 second gridcell level, and although the authors do assess their model s predictive accuracy, this is done only using in-sample findings as opposed to a proper in-sample/out-of-sample break, meaning that the model is not actually building predictions. Among studies drawing on the WikiLeaks or SIGACT datasets, Mangion-Zammit et al. (2012) is the only to actually build out-of-sample forecasts. To do so, Mangion-Zammit et al. (2012) first use the WikiLeaks data to calculate the number of violent events at the province-month level in Afghanistan from 2004 to 2009, which serves as the in-sample training set. Second, they construct and train a point-process model on the training data. Third, they build future predictions at the province-year level for 2010, based purely on information from Since WikiLeaks only provides data through 2009, Mangion-Zammit et al. (2012) evaluate their model s predictive accuracy based on data provided by the Afghan NGO Safety Office (ANSO), and find that 62.5% of actual levels of violence fall within 95% confidence intervals of predicted levels. Although these studies apply innovative methods to address interesting questions, they highlight two major shortcomings to working with WikiLeaks-style of data. First, even when it can be acquired, it does not provide real or near-real time updates. As a result, Mangion-Zammit et al. (2012) needed to use a different data source to obtain data from 2010 since WikiLeaks only covered Second, all of the studies discussed in Section 2.3 focus on either Iraq or Afghanistan since WikiLeaks only provided dense data for those countries, which clearly means that WikiLeaks data is unsuitable to build predictions for any other states in the world. The research design I outline in the following sections using the GDELT dataset not only overcome the shortcomings WikiLeaks-style data, but also those of the extant literature relying on human-coded and pre-gdelt machine-coded datasets. In the following section, I outline how I use GDELT to build state- and sub-state levels of political conflict in Afghanistan and discuss my forecasting approach. 3. Research Design 3.1. Constructing material conflict counts. As previously mentioned, Afghanistan is spatially divided into 32 provinces and 317 sub-provincial-level districts. Using the GDELT data in conjunction with GIS software, I calculate the number of material conflict events that occur from February 1, 2001 through April 30, 2012 between all actors in each month at three (country, province, and 120

129 district) geo-spatial levels of analysis. To accomplish this, I first select all material conflict events for which either the source or target actor s primary affiliation (i.e. the first three characters of their actor identification) was with Afghanistan. I use a version of the GDELT data that has duplicate entries eliminated, as my goal in this chapter is to forecast actual the occurrence of events, rather than the perception or intensity of events. This step generates 139,915 material conflict events, each of which contains a specific latitude and longitude coordinate reflecting where the event occured. Next, using shape files and GIS software, I calculate the the number of events that occur within each district and province in each month. I choose to use the month as my level of temporal aggregation because this provides sufficient variation throughout the time-series while reducing the level of noise that is present at daily or weekly levels. Largely for those reasons, the monthly level aggregation is the most commonly used in the relevant literature, employed by Goldstein (1991), Schrodt (1997), Schrodt and Gerner (1997), Schrodt and Gerner (2000), Schrodt and Gerner (2001), Shellman (2004a), Shellman (2004b), Gleditsch and Beardsley (2004), Schrodt (2007), Brandt, Colaresi and Freeman (2008), Weidmann and Ward (2010), Ward, Greenhill and Bakke (2010), Shellman, Hatfield and Mills (2010), Brandt, Freeman and Schrodt (2011), D Orazio, Yonamine and Schrodt (2011),and Mangion-Zammit et al. (2012). District- and province-months with no material conflict events are assigned a 0. This results in 43,746 district months, 4,352 province months, and 136 country months. 2 [INSERT FIGURE 1 HERE] Figure 1 provides a visual overview of the data, illustrating changes in the number of material conflict events from 2001 to 2012 that occur in each district-year. 4. Forecasting Approach In this section, I outline my forecasting approaches using the univariate data comprised solely of the counts of material conflict events. To facilitate discussion, I detail my forecasting approach as applied to the district-month level-of-analysis, though the approach is identical at the provincemonth and country-month levels-of-analysis as well. Since the structure of the data is time-series cross sectional at highly nuanced unit of analysis i.e Afghani districts I am unable to find 2 This was done with substantial assistant form John Bieler as well as Josh Steven, who completed all geo-spatial aggregation using GIS. 121

130 appropriate exogenous variables to help predict future levels of material conflict. 3 As such, the district-month dataset contains 317 univariate time-series of the count of material conflict events at the district-month level, and I reflect the number of material conflict events occurring in a dingle district month with the notation District it. Since accurate forecasts are so useful across academia, government, and private sectors, there are many different empirical approaches to building forecasts. No one-size-fits all model exists, and it is impossible to know ahead of time which algorithm will generate the greatest degree of predictive accuracy. Due primarily to the large number of observations and amount of information (i.e. location, actors, date, etc.) contained in most event data datasets, including machine-coded, human-coded, and WikiLeaks data, researchers have applied a large number of different forecasting models. D Orazio, Yonamine and Schrodt (2011) report that models forecasting domestic conflict largely fall into three general categories: time series (Shellman (2004a), Shellman (2007), Harff and Gurr (2001)), vector auto regression (VAR) (Pevehouse and Goldstein (1999), Goldstein (1992), Freeman (1989), Brandt, Freeman and Schrodt (2011)), and HMMs (Schrodt (1999), Bond et al. (2004), Shearer (2006), Schrodt (2000), and Schrodt (2006), Petroff, Bond and Bond (2012)). Additionally, other studies using event data have employed additional methods, such as linear models (Weidmann and Ward (2010), Fearon and Laitin (2003), Gurr and Harff (1996)), clustering algorithms (Schrodt and Gerner (2000), and point-process modeling (Mangion-Zammit et al. (2012)). Even after choosing a base algorithm, a number of choices must still be made regarding tuning parameters. For example. In addition, a number of techniques, like bagging and boosting can be applied to most of these algorithms (see Schrodt, Yonamine and Bagozzi (2012) for a discussion of these techniques in the context of political violence forecasting). As if that did not provide enough choices, a number of approaches combine multiple algorithms into model averaging methods, such as bayesian model averaging (BMA) (Montgomery, Hollenbach and Ward (2012)). Despite the nearly infinite number of plausible forecasting approaches, the structure of my data is highly constraining for two main reasons. First, it is a univariate time series, meaning that it does not contain exogenous covariates. Most of the methods above specifically designed for datasets with many covariates and are less relevant for my data. Second, my data is temporal. This 3 Exogenous variables on employment and drug prices exist for select districts for select months, but neither variable is available with sufficient coverage to include in an empirical forecasting model at the district-month level. I discuss this further in Section

131 restricts how I am able to divide my training and test set, since that training set must exclusively contain observations that preceded the test set. This greatly inhibits re-sampling techniques like boosting as a way of enhancing predictive accuracy. In the following section, I outline a forecasting model that achieves highly accurate predictions using a univariate time-series, discuss my out-ofsample forecasting framework, and detail how I build a benchmark to assist with evaluating forecast accuracy The ARFIMA model. To build forecasts with the univariate time-series, I implement an Autoregressive Fractionally Integrated Moving Average (ARFIMA) model, which models all univariate time-series (317 at the district-level, 32 at the province level, and 1 at the country-level) independently of each other. Though this is the first time an ARFIMA model has been used to forecast political conflict, a number of studies have demonstrated its ability to generate more accurate and consistent forecasts than other time-series models across various substantive fields. For example, Siew, Chin and Wee (2008) demonstrate that an ARFIMA model consistently outperforms a traditional ARIMA model in forecasting air pollution rates, Chu (2009) generates more accurate forecasts of tourism levels in Asia with an ARFIMA model than with seasonal ARIMA (SARIMA) models, Barkoulas and Baum (2006) illustrates how ARFIMA models outperform other autoregressive models in forecasting U.S. monetary indices, and Bhardwaj and Swanson (2006) show that the ARFIMA model outperforms both ARIMA models and GARCH models in forecasting returns in the S&P500. To introduce the ARFIMA model, first consider an ARIMA (p,d,q) model for a univariate time series X(x t,x t 2,x t 3,...,x t n )withd=0, which we can write as: p q (1) x t = ω + + β i x t i + α i t 1 i=1 i=1 where ω is a constant, x t i is the lagged dependent variable, t i is the lagged error, t is the current error, and β i and α i are estimated parameters. When a time-series is non-stationary, firstdifferencing or integrating the series can help achieve stationarity. This generates a new time series, x t, calculated by the following formula: (2) x t = x t x t 1 123

132 Thus, we can convert the ARIMA(p,d,q) model with d=0 to an ARIMA(p,d,q) model with d=1 by replacing the x characters with x, as done in the following formula: p q (3) x t = ω + + β i x t i + α i t + t i i=1 i=1 Although the ARIMA(p,d,q) model is among the most commonly used time-series models and has been used successfully to forecast with event data (see Shellman (2007)), it is rigid in that d must be an integer. The key innovation of the ARFIMA model is that it allows for d to take on any real number, which need not be an integer (hence the name fractionally integrated ). Thus, when d = 0, the ARFIMA model becomes an ARMA model, and when d=any positive integer, the ARFIMA is a simply an ARIMA model. Mathematically, Granger and Joyeux (1980) demonstrates that by allowing d<1, the ARFIMA model is able to efficiently account for a long memory process, which occurs when the time-series tends to revert to a historical mean. Parke (1999) provides a thorough explanation of the fractional integration process, and demonstrates how a key innovation of the ARFIMA model is that it allows the effects of past errors on current observations to vary, whereas AR, MA, and ARMA models force this past errors to have uniform effects across the duration of the time-series. Importantly, the ARFIMA model is capable of accounting for the long memory process even without increasing the number of p and q lags. To implement a flexible ARFIMA(p,d,q) model, I utilize the arfima package in r, which automatically establishes values for the p, d, and q parameters of a univariate time series by determining the estimates for these parameters that maximize the likelihood function. This means that the researcher does not need to pre-specify the number of autoregressive components, moving average components, or degree of fractional integration. I treat each cross-section as a unique time-series, meaning that I train and build forecasts with the ARFIMA model one district and one province at a time through a looping function. 4 The forecast function in the arfima package allows the user to build a prediction N units into the future and provides a mean prediction along with 95% confidence intervals. To establish predictions, I use the mean of the one-month-ahead prediction rounded to the nearest integer. Figure 2 demonstrates the use of the arfima package to build a prediction of the number of material conflict events in Bughran province in April, 2009 using data 4 Many districts have long periods of consecutive months with 0 material conflict events, which causes the arfima package to crash. To allow the arfima package to properly converge, I generate a random number from a uniform distribution from 0 to.1 for each district-month, and add that value to the count of material conflict events. 124

133 from February 2001 through March The prediction in Figure 2 provide the mean (the circle) as well as 90 and 95% confidence intervals, indicated by the light and darker vertical shading. [INSERT FIGURE 2 HERE] 4.2. Out-of-sample framework. In order to calculate out-of-sample performance accuracy of the ARFIMA model, I utilize the same approach implemented by Weidmann and Ward (2010), which I implement on my data according to the steps outlined below, using the district-level model as an example: Train the model on an initial in-sample set containing all data from February 2001 until April Predict (and store) the number of material conflict events for May 2008 (i.e. a one-monthahead out-of-sample forecast. Incorporate May 2008 into the in-sample set. Retrain the model on this new in-sample set, which now includes all data from February 2001 to May Predict (and store) the number of material conflict events for June Repeat until a final prediction is made for April 2012 (i.e. the last month in the data set), using a model trained on February 2001 through March This results in 48 out-of-sample, one-month-ahead forecasts for each of the 317 municipalities. At the province-month level, this approach yields 48 out-of-sample, one-month-in-advance forecasts for each of the 32 provinces, and at the country-month level, this results in 48 one-month-in-advance forecasts for Afghanistan as a whole Establishing a benchmark. Since this is the first paper to build nuanced predictions of political conflict in Afghanistan at the monthly level, no existing appropriate benchmark of predictive accuracy exists. Without an appropriate benchmark, it is difficult to assert whether an alternative predictive model is performing well. The literature provides two plausible approaches to assessing how well a predictive model is performing in the absence of other models attempting to predict the same outcome. First, Gurr and Lichbach (1986) provides a strong theoretical argument called the conflict persistence model, which suggests that in the absence of an existing benchmark, it is logical to build a naive model that assumes conflict in the future will be the same in a given location as it is today. Second, Mangion-Zammit et al. (2012) reports the percentage of times that 125

134 the true number of violent events fall within the 95% and 99% confidence intervals of predicted levels of violence. I choose to follow Gurr and Lichbach (1986) s approach, and construct a naive model that predicts the number of material conflict events in District it = District it 1, for three reasons. First, Mangion-Zammit et al. (2012) s approach tells actually tells us little about a model s predictive accuracy because it does not penalize for large confidence intervals. Imagine that the true number of violence events occurring in District it is 75. Now, consider two models. Model 1 generates a prediction for the number of violent events in District it with 95% confidence intervals at 12 and 162, while Model 2 s prediction for District it has 95% confidence intervals at 68 and 74. Mangion-Zammit et al. (2012) s approach would report that Model 1 is accurate and Model 2 is inaccurate, when in reality, it is difficult to imagine a scenario in which we would prefer Model 1 s prediction to that of Model 2. Second, and directly related to the first point, is that Gurr and Lichbach (1986) approach generates a specific point prediction as a benchmark, which creates greater flexibility in assessing model performance. For example, Gurr and Lichbach (1986) s approach allows me to calculate Mean Absolute Error (as detailed below), which is impossible using Mangion-Zammit et al. (2012) s approach. Lastly, in many forecasting contexts (especially predicting civil conflict at the state-year level), the Gurr and Lichbach (1986) approach achieves almost perfect accuracy countries at peace tend to stay at peace and countries at conflict tend to stay at conflict. This naive approach often works so well that it occasionally outperforms far more sophisticated forecasting models. For example, Montgomery, Hollenbach and Ward (2012) introduce Bayesian Model Averaging (BMA) approach, and demonstrate how they are able to leverage the predictions of three separate models in order to build accurate forecasts that outperform all of the three component models. Montgomery, Hollenbach and Ward (2012) report that their BMA technique outperforms all of the three component models, accurately predicting 13 of 35 conflict onsets ( 1 s ) and all 313 of the 313 non-onsets ( 0 s ) in their dataset. While these may appear strong at first, Gurr and Lichbach (1986) s naive benchmark approach accurately predicts 33 of the 35 conflict onsets and 310 of the 313 non-onsets, which is a dramatic improvement over the not only the BMA, but also the three component predictive models. Based on this, I assume that any model that consistently outperforms the naive t=t-1 assumption to be accurate. 126

135 4.4. Calculating accuracy. For each of the 48 months that iteratively serve as the out-of-sample test, I calculate the error rates for the naive model (naive error) and the ARFIMA model (arfima error rate), which reflect the MAE across the N cross-sections (N=317 for the district-month model, N=32 for the province-month model, and N=1 for the country-month model) according to the Formula (4) and Formula (5). (4) naive error m = N naive prediction i,m true count i,m i=1 N (5) arfima error m = N naive prediction i,m true count i,m i=1 N These formulas result in a naive error and arfima error rate for the district-level, province-level, and country-level models for each of the 48 months that serve as the test-month allowing me to determine the extent to which the ARFIMA model outperforms the naive model across the three levels of geo-spatial aggregation (district, province, and country) in the following section. 5. Results Table 1 provides the arfima error rate, naive error rate, and a TRUE/FALSE label indicating whether the ARFIMA forecasts are more accurate on average across all 317 districts for the given month. [INSERT TABLE 1 HERE] As Table 1 indicates, the ARFIMA model outperforms the naive model in 47 out of 48 of the out-of-sample months. Additionally, the ARFIMA model reduces the sum of the 48 monthly MAE s by over 16%. Taken together, these are highly impressive finding, especially when considering that naive models (that assume t=t-1) of conflict tend to perform well in forecasting. 5 5 A potential critique of these results is that I do not perform any rigorous external validity check, meaning that I may simply be predicting the event-data generating process, rather than actual levels of violence. I believe that this is not overly problematic for two main reasons. First, many other forecasting studies likewise rely exclusively on event data and do not perform rigorous external validity checks, which has set a precedent that this is generally accepted practice. Second, the anecdotal story discussed in Chapter 1 serves as an informal external validity check that suggests the GDELT data is accurate. 127

136 [INSERT TABLE 2 HERE] Table 2 provides the arfima error rate, naive error rate, and a TRUE/FALSE label calculated from province-level geo-spatial aggregations, meaning that each of the 48 arfima error and naive error rates reflect their respective means across the 32 provinces. At the province-month level, the ARFIMA does not perform as well as at the district-month level, but it still outperforms the naive model in 40 of the 48, or approximately 83% months that serve as the test month. Furthermore, the ARFIMA model reduces the sum of the 48 month MAE by approximately 13%. Even though the ARFIMA performs slightly worse at the province-level than the district-level, it still achieves a respectable level of enhanced accuracy relative the the naive benchmark. [INSERT TABLE 3 HERE] Table 3 replicates Table 1 and Table 2, except it reflects the arfima error rate, naive error rate, and the TRUE/FALSE label based on a single country-level forecast per month. Table 3 illustrates that at the country-month level, the ARFIMA still outperforms the naive model, but does so at a lower margin than at the district-month or province-month level. Of the 48 months that test sample, the ARFIMA model outperforms the naive model 30 times, or 62.5%. Additionally, the ARFIMA model generates a lower sum of MAE s, but only by approximately 1%, which suggests that the increase in predictive accuracy of the ARFIMA model at the country-month level may be largely meaningless. Across the district-, province-, and country-month forecasts, the key aspect of the ARFIMA model is that it tends to build forecasts that are between the naive model forecast and a longer term moving average. Exactly how much the ARFIMA model shifts forecasts away from the naive forecasts and towards the longer term moving average varies based from by month and by crosssection, but in effect, the ARFIMA acts like a smoothing function. Figure 2 visually demonstrates this. The last observed number of material conflict events is approximately 280 in month 99, meaning that the naive model would predict 280 events for the month 100. However, we can see that the average number of material conflict events in the previous months is less than 280, so the mean ARFIMA forecast (represented by the black dot) is less than 280. To the extent that the ARFIMA model outperforms the naive model, it suggests that levels of future violence tend to exhibit mean reverting characteristics. 128

137 6. Future directions Although the ARFIMA model outlined above largely accomplishes the goal of this paper, in this section I provide preliminary analysis of two logical extensions for the finding in the previous section: first, building features from the univariate time-series to allow for other types of predictive algorithms; second, incorporating exogenous information, such as drug prices Building features and implementing a stacking method. A common approach when building forecasting models is to manipulate existing data in order to build additional features, or covariates, which may uncover meaningful patterns in the data that are hidden in other variables. In many contexts across disciplines, building additional features leads to enhanced predictive accuracy. Note that building features can also decrease predictive accuracy because the additional dimensionality increases the likelihood of over fitting a model. To overcome this, I employ the same out-of-sample predictive framework as previously outline in Section 4.2. Just like there there is no definitive way to pick the best forecasting algorithm, there are no rules for constructing features. As such, I build 11 new features below, all from the univariate time series, in an attempt to enhance predictive accurate beyond the univariate ARFIMA model outlined in the previous section. 2 month MA = (count t + count t 1 )/2 3 month MA = (count t + count t 1 + count t 2 )/3 4 month MA = (count t + count t 1 + count t 2 + count t 3 )/4 5 month MA = (count t + count t 1 + count t 2 + count t 3 + count t 4 )/5 6 month MA = (count t + count t 1 + count t 2 + count t 3 + count t 4 + count t 5 )/6 2 month MA = count t 2 month MA 3 month MA = count t 3 month MA 4 month MA = count t 4 month MA 5 month MA = count t 5 month MA 6 month MA = count t 6 month MA monthly sum = the sum of all material conflict events occurring across all spatial units each month With these additional covariates, I build a number of additional predictive models following the general approach in Section 4.2. Using the glm package in r, I build predictions using linear models 129

138 comprised of various combinations of the 11 additional covariates above (all lagged one-unit) as well as a one-unit lag of the dependent variable, trying both gaussian and poisson distributions. I am unable to find a linear combinations of the covariates above (including the lagged dependent variable) capable of outperforming the naive benchmark at the district-month level in more than 35 out of the 48 district-months that serve as the out-of-sample set. Motivated by the enhanced predictive accuracy of the approach in Montgomery, Hollenbach and Ward (2012), I also implement a stacking approach. 6 To build a stacking prediction, I build use two component models, Model 1 and Model 2, which are specified below and estimated using the glm package in r with a gaussian distribution. 7 (6) Model 1 ˆ District it = β 0 + β 1 2 month MA i(t 1) + β 2 3 month MA i(t 1) + β 3 4 month MA i(t 1) β 4 5 month MA2 i(t 1) + β 5 6 month MA i(t 1) + β 7 monthly sum i(t 1) + β 8 District i(t 1) (7) Model 2 ˆ District it = β 0 + β 1 2 month MA i(t 1) + β 2 3 month MA i(t 1) + β 3 4 month MA i(t 1) + β 4 5 month MA i(t 1) + β 5 6 month MA i(t 1) + β 7 District i(t 1) Using these two models, I build an ensemble forecasting model according to the six steps below: 8 (1) Estimate two models on the same in-sample set as in Section 4.2, which contains all data from February 2001 until April 2008, and generate predictions for these in-sample months and store coefficient estimates (2) Train the Ensemble model using the glm function in R on the in-sample predictions from Model 1 and Model 2 according to the formula below, and store coefficient estimates: 6 I follow the stacking approach suggested by Hastie, Tibshirani and Friedman (2009) on pages Although the dependent variable is a count, predictions made with the glm package using the gaussian distribution consistently outperforms those build with the poisson distribution. 8 This is conceptually similar to Montgomery2012, but install of updated posteriors, I simply weight each component model based on OLS. 130

139 (8) Ensemble = ˆ District it = β 0 + β 1 Model 1 it + β 2 Model 2 it (3) Build predictions for May 2008 (i.e. one-month ahead out-of-sample forecast) for Model 1 and Model 2 by matrix multiplying the coefficient estimates from Step 1 and the covariates for May 2008, which have been lagged one-month to simulate an actual prediction. (4) Calculate and store an Ensemble prediction by matrix multiplying the predicted values for Model 1 and Model 2 by their coefficient estimates from the Ensemble model trained on the in-sample set in Step 3. (5) Incorporate May 2008 into the in-sample set. (6) Repeat Step 1 through Step 4. (7) Repeat Step 1 through Step 6 until a final prediction is made for April 2012 (i.e. the last month in the data set), using a model trained on February 2001 through March This Ensemble model outperforms the naive benchmark in 33 out of 48 months. Although this is not a terrible result, it does not approach the accuracy of the more straightforward, univariate ARFIMA model discussed in the previous section. However, given the large number of predictive algorithms and the infinite number of features that can be built from a univariate time-series, scholars in the future may be able to build on my ensemble approach and build a model that eventually outperforms the predictive accuracy of my straightforward ARFIMA model Incorporating drug prices. In addition to building features from the univariate time-series as performed in the previous section, another way of potentially improving forecast accuracy is to incorporate exogenous variables. Although a large number of studies have found empirical relationships between many exogenous variables and political conflict, most operate at a state-year level of analysis. Finding relevant exogenous variables at sub-annual and sub-state levels is far more difficult. Even studies that do utilize fine-grained exogenous variables, like Weidmann and Ward (2010) and Berman et al. (2011) face considerable limitations. For example, Weidmann and Ward (2010) analyze future violence at the municipality-month unit of analysis as a function of past violence as well as a set of exogenous variables comprised of population, ethnic diversity, terrain, and whether the municipality is on an international border. 9 I tried additional algorithms, including a number of random forest variations as well as additional combinations of component models within various ensembles, and none enhanced predictive accuracy beyond the ARFIMA model. 131

140 However, these exogenous variables vary cross-sectional (i.e. between municipalities) but not temporally (i.e. from month-to-month for the same municipality), which reduces the extent to which they can improve predictive accuracy. Additionally, Berman et al. (2011) collect unemployment statistics at the province-month level for Afghanistan, Iraq, and the Philippines that do vary at a province-month unit of analysis, but the difficulty in collecting such data limit their temporal domain to just six months in the case of Afghanistan, which also inhibits their effectiveness at enhancing predictive models. Therefore, an ideal set of exogenous variables would vary at a fine grained unit of analysis and span a long temporal range, but these are difficult to collect, especially for conflict-prone countries like Afghanistan. For Afghanistan, one potential source of an exogenous variables come from the Afghanistan Opium Survey 2012, which is published by the United Nations Office on Drugs and Crime (UN- ODC). 10 This document provides considerable information at the district-level regarding opium and cannabis prices as well a dataset containing average opium prices at the country-month unit of analysis from September 2004 through March 2012, as illustrated below in Figure 3. Unfortunately, similarly complete time-series data are not publicly provided at the province- or district-month level. [INSERT FIGURE 3 HERE] Given the number of empirical studies that either theoretically suggest or empirically demonstrate relationships between drug prices and conflict (see Palmer (1994), Buhaug and Gates (2002), Ross (2003), Ross (2004), and Collier, Hoeffler and Soderbom (2004)) it seems reasonable that the addition of opium prices as an exogenous variable may enhance predictive accuracy at the countrymonth unit of analysis. To test this, I repeat the six steps outlined in Section 4.2 in order to compare the predictive accuracy of the naive model with the original univariate ARFIMA model outlined in Section 4 and Section 6.2 as well as the ARFIMA model that includes the exogenous opium data, which I call the ARFIMA opium model. Since the opium price data spans a smaller temporal range than my GDELT-derived data on political violence, I set September 2004 through March 2010 as the initial in-sample training set, and use April 2010 through March 2012 as the outof-sample test months. As Table 3 indicates, the ARFIMA model outperforms the Naive model in 18 of the 24 months that serve as the out-of-sample test months. Interestingly, the ARFIMA opium 10 This document is available at: report 2012.pdf 132

141 model only outperforms the Naive model in 17 out of 24 months. Although this suggests that the inclusion of the drug price data may not actually enhance predictive accuracy, it does not rule out the possibility that more nuanced data on drug prices at the province- or district-level of analysis could lead to more accurate predictions. 7. Conclusion This chapter is the first to build temporally and geo-spatially nuanced forecasts of future levels of violence relying exclusively on open-source, machine coded event data. The release of the GDELT dataset made this chapter possible. Before GDELT, the leading open-source, machinecoded datasets did not provide location information, and the hand-coded datasets that did provide location information were too sparse for rigorous empirical forecasting. The Afghan War Diary that was released as past of WikiLeaks provided a notable exception, but this data is not only of questionable legality but also unlikely to be replicable for future conflicts, meaning that forecasting models built from WikiLeaks data may lack real-world applicability moving forward. 11 Using nothing but GDELT data, I build an ARFIMA model capable of providing forecasts at the district month level that nearly always outperform a naive model that simply assumes that the level of conflict tomorrow will be the same as it is today. My empirical findings suggests three major takeaways: First, it appears that it is feasible to build accurate and nuanced predictions at a sub-state level using only open source, machine-coded event data. Second, the level of forecast accuracy decreased as the degree of geo-spatial aggregation increases: forecasts at the districtmonth (N=317), province-month (N=32) and country-month (N=1) level outperform their naive benchmarks in 47 out of 48, 40 out of 48, and 30 out of 48 month, respectively. It appears that patterns in violence that are discernible at fine-grained levels of geo-spatial aggregation (i.e. the district-level in Afghanistan) become increasing noisy a higher levels of geo-spatial aggregation. This strong suggests that researcher attempting to build empirical forecasts of violence should use as finely grained geo-spatial aggregations as possible. Third, the fact that the ARFIMA model tends to outperform the naive model suggests that patterns of violence tend to be mean reverting. This means that when we see a major spike in violence during a specific period of time in a specific 11 Standard questioning when applying to positions require top-secret clearance is whether you have accessed and used Wikileaks data. 133

142 sub-state location, we should expect violence in the following time period to be more subdued. Conversely, when we see a sudden drop in the level of violence, we should expect a rebound-effect. Moving forward, a number of logical extensions to this chapter exist. First, researchers could use the GDELT data to further explore whether the mean-reversion properties present in the levels of violence in Afghanistan hold across other countries. Mean-reversion properties, as first identified by Galton (1886) in his seminal analysis of human heights, are common and influential across other substantive fields like biology and economics. Determining whether local levels of violence in other states also tend to be mean-reverting could be a major theoretical advancement to the study of conflict dynamics. Second, Section 6.1 provides a basic framework for building additional features from the univariate time series and using these features to construct alternative forecasting algorithms to the ARFIMA model. Although my attempts at enhancing predictive accuracy through this approach were unsuccessful, other scholars find greater success by building additional features and experimenting with other predictive algorithms. Similarly, the inclusion of additional exogenous variables, such as drug prices at finer grained spatial coverage than the country-level data modeling in Section 6.2, terrain, or measures of reflecting potential geo-spatial correlation (i.e. a count of the number of conflictual events occurring in neighboring districts or provinces) may also be helpful. Third, since GDELT provides event data for all countries in the world (as opposed to WikiLeaks, which only provides detailed data for Afghanistan) researcher could apply a similar forecasting model to that outlined in this chapter to build geo-spatially and temporally nuanced forecasts of future levels of violence any number of countries with ongoing domestic conflicts, like India or the Democratic Republic of the Congo. Lastly, since the GDELT data is updated daily, the forecasting approach outlined in this chapter could be implemented in near real-time. This could provide real-world guidance to a host of potential benefactors, ranging from military leaders hoping to more efficiently allocate resources, to Afghani businessmen trying to identify the safest routes to transport goods. Overall, I hope that this chapters seres as a foundation for further forecasting efforts at fine-grained temporal and geo-spatial scales. 134

143 References Bagozzi, Benjamin E Forecasting Civil Conflict with Zero-Inflated Count Models. Available at: Barkoulas, John and Christopher F. Baum Long-memory forecasting of US Monetary Indices. Journal of Forecasting 25: Beck, Nathaniel, Gary King and Langche Zeng Improving Quantitative Studies of international Conflict: A Conjecture. American Political Science Review 94(1): Berman, Eli, Michael Callen, Joseph H. Felter and Jacob N. Shapiro Do Working Men Rebel? Insurgency and Unemployment in Afghanistan, Iraq, and the Philippines. Journal of Conflict Resolution 55(4): Bhardwaj, Geetesh and Norman R. Swanson An Empirical Investigation of the Usefulness of ARFIMA models for predicting macroeconomic and financial time series. Journal of Econometrics 131: Bond, Joe, Vladimir Petroff, Sean O Brien and Doug Bond Forecasting Turmoil in Indonesia: An Application of Hidden Markov Models. Presented at the International Studies Association Meetings, Montreal. Brandt, Patrick T. and John R. Freeman Advances in Baysian time Series Modeling and the Study of Politics: Theory testing, Forecasting, and Policy Analysis. Political Analysis 14:1 36. Brandt, Patrick T., John R. Freeman and Philip Schrodt Real Time, Time Series Forecasting of Inter- and intra-state Political Conflict. Conflict Management and Peace Science 28(1): Brandt, Patrick T., Michael P. Colaresi and John R. Freeman The Dynamics of Reciprocity, Accountability and Credibility. Journal of Conflict Resolution 52(3): Bueno de Mesquita, Bruce Predicting Politics. Columbus, Ohio: Ohio State University Press. Buhaug, Halvard and Scott Gates The Geography of Civil War. Journal of Peace Research 39(4): Chojnacki, Sven, Christian Ickler, Michael Spies and John Wiesel Event Data on Armed Conflict and Security: New Perspectives, Old Challenges, and Some Solutions. International Interactions 38(4): Choucri, Nazli Forecasting in International Relations: Problems and Prospects. International Interactions 1:

144 Chu, Fong-Lin Forecasting Tourism Demand with ARMA-based Methods. Tourism Management 30: Collier, Paul, AAnke Hoeffler and Mans Soderbom On the Duration of Civil War. Journal of Peace Research 41(3): Daly, Sarah Zukerman Organizational Legacies of Violence: Conditions favoring insurgency onset in Colombia, Journal of Peace Research 49(3): D Orazio, Vito, James E. Yonamine and Philip A. Schrodt Predicting Intra-state Conflict Onset: An Event Data Approach Using Euclidean and Levenshtein Distance Measures. Presented at the annual Midwest Political Science Association meeting, Chicago. Eck, Kristine In Data we Trust? A comparison of UCDP GED and ACLED conflict events datasets. Conflict and Cooperation 47(1): Fearon, James D. and David D. Laitin Ethnicity, Insurgency, and Civil War. American Political Science Review 97(1): Freeman, John R Systematic Sampling, Temporal Aggregation, and the Study of Political Relationships. Political Analysis 1: Galton, Francis Regression Towards Mediocrity in Hereditary Stature. Journal of the Anthropological Institute of Great Britain and Ireland 15: Gleditsch, Kristian Skrede and Kyle Beardsley Noisy Neighbors: Third-Party Actors in Central American Conflicts. Journal of Conflict Resolution 48(3): Goldstein, Joshua S Reciprocity in Superpower Relations: An Empirical Analysis. Journal of Conflict Resolution 36: Goldstein, Joshua S A Conflict-Cooperation Scale for WEIS Events Data. Journal of Conflict Resolution 36: Goldstone, Jack A., Robert H. Bates, David L. Epstein, Ted Robert Gurr, Michael B. Lustik, Monty G. Marshall, Jay Ulfelder and Mark Woodward A Global Model for Forecasting Political Instability. American Journal of Political Science 54(1): Granger, Clive William and Roselyne Joyeux An Introduction to Long-Memory Time Series Models and Fractional Differencing. Journal of Time Series Analysis 1(1): Gurr, Ted Robert and Barbara Harff Early Warning of Communal Conflict and Humanitarian Crisis. In Monograph Series on Governance and Conflict Resolution. United Nations Press. 136

145 Gurr, Ted Robert and Mark Irving Lichbach Forecasting Internal Conflict: A Competitive Evaluation of Empirical Theories. Comparative Political Studies 19(3):1 37. Harff, Barbara and Ted Robert Gurr Systematic Early Warning of Humanitarian Emergencies. Journal of Peace Research 35(5): Hastie, Trevor, Robert Tibshirani and Jerome Friedman The Elements of Statistical Learning, Second Edition. New York: Springer. King, Gary and Langche Zeng Improving Forecasts of State Failure. World Politics 53: King, Gary and Will Lowe An Automated Information Extraction Tool for International Conflict Data with Performance as Good as Human Coders: A Rare Events Evaluation Design. International Organization 57(3): Linke, Andrew M., Frank D Witmer and John O Loughlin Space-Time Granger Analysis of the War in Iraq: A Study of Coalition and Insurgent Action-Reaction. International Interactions 38: Mangion-Zammit, Andrew, Michael Dewar, Visakan Kadirkamanathan and Guido Sanguinetti Point process modeling of the Afghan War Diary. Proceedings of the National Academy of Science 109(31): Montgomery, Jacob, Florian Hollenbach and Michael D. Ward Improving Predictions Using Bayesian Model Averaging. Political Analysis 20(3): O Brien, Sean Crisis Early Warning and Decision Support: Contemporary Approaches and Thoughts on Future Research. International Studies Review 12(1): O Loughlin, John, Frank D.W. Witmer, Andrew M. Linke and Nancy Thorwardson Peering into the Fog of War: The Geography of WikiLeaks Afghanistan War Logs, Eurasian Geography and Economics 51(4): Palmer, David Scott Peru, Drugs, and the Shining Path. In Drug Trafficking in the Americas, ed. Bruce M. Bagley and Wiliam O. Walker III. North-South Center Press pp Parke, William R What is Fractional Integration. The Review of Economics and Statistics 81(4): Petroff, Vladimir, Joe Bond and Doug Bond Using Hidden Markov Models to predict terror before it hits (again). In Handbook on computational approaches to counterterrorism, ed. V.S. Subrahmanian. Springer. 137

146 Pevehouse, Jon C. and Joshua S. Goldstein Serbian Compliance or Defiance in Kosovo? Statistical Analysis and Real-Time Predictions. The Journal of Conflict Resolution 43(4): Raleigh, Clionadh, Andrew Linke, Havard Hegre and Joakim Karlsen Introducting ACLED: An Armed Conflict Location Event Dataset. The Journal of Peace Research 47(5): Ross, Michael What do we know about Natural Resources and Civil War. Journal of Peace Research 41(3): Ross, Michael L Oil, Drugs, and Diamonds: The Varying Role of Natural Resources in Civil War. In The Political Economy of Armed Conflict: Beyond Greed and Grievance, ed. Karen Ballentine and Jake Sherman. Lynne Rienner pp Salehyan, Idean, Cullen S. Hendrix, Jesse Hamner, Christina Case, Christpher Linebarger, Emily Stull and Jennifer Williams Social Conflict in Africa: A New Database. International Interactions 38: Schneider, Gerald, Margit Bussman and Constantine Ruhe The Dynamics of Mass Killings: Testing Time-series models of one-sided violence in the Bosnian Civil War. Journal of Peace Research 49(3): Schrodt, Philip A Early Warning of Conflict in Southern Lebanon using Hidden Markov Models. Presented at the annual meeting of the American Political Science Association, Washington D.C. Schrodt, Philip A Early Warning of Conflict in Southern Lebanon using Hidden Markov Models. In TThe Understanding and Management of Global Violence: New Approaches to Theory and Research of Protracted Conflict, ed. Harvey Starr. New York: St. Martin s Press pp Schrodt, Philip A Pattern Recognition of International Crises using Hidden Markov Models. In Political Complexity: Nonlinear Models of Politics, ed. Diana Richards. Ann Arbor: University of Michigan Press pp Schrodt, Philip A Forecasting Conflict in the Balkans using Hidden Markov Models. In Programming for Peace: Computer-Aided Methods for International Conflict Resolution and Prevention, ed. Robert Trappl. Dordrecht, Netherlands: Kluwer Academic Publishers pp Schrodt, Philip A Response to BBN evaluations of TABARI.. 138

147 Schrodt, Philip A. and Deborah J. Gerner Empirical Indicators of Crisis Phase in the Middle East, Journal of Conflict Resolution 25(4): Schrodt, Philip A. and Deborah J. Gerner Cluster-Based Early Warning Indicators for Political Change in the Contemporary Levant. American Political Science Review 94(4): Schrodt, Philip A. and Deborah J. Gerner Analyzing the Dynamics of International Mediation Processes in the Middle East and the Former Yugoslavia. Presented at the annual meeting of the International Studies Association, Chicago. Schrodt, Philip A., James Yonamine and Benjamin E. Bagozzi Data-based Computational Approached to Forecasting Political Violence. In Handbook on computational approaches to counterterrorism, ed. V.S. Subrahmanian. Springer. Schrodt, Phillip A Inductive Event Data Scaling using Item Response Theory. Presented at the Summer Meeting of the Society of Political Methodology. Available at Shearer, Robert Forecasting Israeli-Palestinian Conflict with Hidden Markov Models. Available at Shellman, Stephen. 2004a. Time Series Intervals and Statistical Inference: The Effects of Temporal Aggregation on Event Data Analysis. Political Analysis 12(1): Shellman, Stephen Process Matters: Conflict and Cooperation in Sequential Government- Dissident Interactions. Security Studies 15(4): Shellman, Stephen, Clare Hatfield and Maggie Mills Dissagregating Actors in Intrastate Conflict. Journal of Peace Research 47(1). Shellman, Stephen M. 2004b. Measuring the Intensity of International Political Interactions Event Data: Two Interval-Like Scales. International Interactions 30(2): Siew, Lim Ying, Lim Ying Chin and Pauline Mah Jin Wee ARIMA and Integrated ARFIMA Models for Forecasting Air Pollution Index in Shah Alam, Selangor. The Malaysian Journal of Analytics Sciences 12(1): Singer, David and Michael David Wallace To Auger Well: Early Warning Indicators in World Poliics. Beverly Hills, CA: Sage Press. Urdal, Henrik and Kristian Hoelscher Explaining Urban Social Disorder and Violence: An Empirical Study of Event Data from Asian and Subsaharan African cities. International 139

148 Interactions 38: Ward, Michael D., Brian D. Greenhill and Kristin M. Bakke The Perils of Policy by P-Value: Predicting Civil Conflicts. Journal of Peace Research 47(5). Weidmann, Nils B. and Idean Salehyan Violence and Ethnic Segregation: A Computational Model Applied to Baghdad. Available at Weidmann, Nils B. and Michael D. Ward Predicting Conflict in Space and Time. Journal of Conflict Resolution 54(6):

149 8. Appendix Table 1. Assessing Accuracy at the District Level m month arfima error naive error arfima error < naive error 1 May TRUE 2 June TRUE 3 July TRUE 4 August TRUE 5 September TRUE 6 October TRUE 7 November TRUE 8 December TRUE 9 January FALSE 10 February TRUE 11 March TRUE 12 April TRUE 13 May TRUE 14 June TRUE 15 July TRUE 16 August TRUE 17 September TRUE 18 October TRUE 19 November TRUE 20 December TRUE 21 January TRUE 22 February TRUE 23 March TRUE 24 April TRUE 25 May TRUE 26 June TRUE 27 July TRUE 28 August TRUE 29 September TRUE 30 October TRUE 31 November TRUE 32 December TRUE 33 January TRUE 34 February TRUE 35 March TRUE 36 April TRUE 37 May TRUE 38 June TRUE 39 July TRUE 40 August TRUE 41 September TRUE 42 October TRUE 43 November TRUE 44 December TRUE 45 January TRUE 46 February TRUE 47 March TRUE 48 April TRUE Total: May Apr TRUE, 1 FALSE 141

150 Table 2. Assessing Accuracy at the Province Level Level m month arfima error naive error arfima error < naive error 1 May TRUE 2 June FALSE 3 July TRUE 4 August TRUE 5 September TRUE 6 October FALSE 7 November TRUE 8 December TRUE 9 January FALSE 10 February TRUE 11 March TRUE 12 April TRUE 13 May TRUE 14 June TRUE 15 July FALSE 16 August TRUE 17 September TRUE 18 October FALSE 19 November TRUE 20 December FALSE 21 January TRUE 22 February TRUE 23 March TRUE 24 April FALSE 25 May TRUE 26 June TRUE 27 July TRUE 28 August TRUE 29 September TRUE 30 October TRUE 31 November TRUE 32 December TRUE 33 January TRUE 34 February TRUE 35 March TRUE 36 April TRUE 37 May TRUE 38 June TRUE 39 July TRUE 40 August TRUE 41 September TRUE 42 October TRUE 43 November TRUE 44 December TRUE 45 January TRUE 46 February TRUE 47 March TRUE 48 April FALSE Total: May Apr , , TRUE, 8 FALSE 142

151 Table 3. Assessing Accuracy at the Country Level Level m month arfima error naive error arfima error < naive error 1 May TRUE 2 June FALSE 3 July FALSE 4 August TRUE 5 September TRUE 6 October FALSE 7 November TRUE 8 December TRUE 9 January FALSE 10 February FALSE 11 March FALSE 12 April FALSE 13 May FALSE 14 June TRUE 15 July FALSE 16 August FALSE 17 September TRUE 18 October TRUE 19 November TRUE 20 December FALSE 21 January TRUE 22 February TRUE 23 March ,035 1,087 TRUE 24 April TRUE 25 May TRUE 26 June TRUE 27 July TRUE 28 August ,037 1,028 FALSE 29 September TRUE 30 October TRUE 31 November TRUE 32 December FALSE 33 January TRUE 34 February TRUE 35 March TRUE 36 April FALSE 37 May FALSE 38 June TRUE 39 July FALSE 40 August FALSE 41 September TRUE 42 October TRUE 43 November TRUE 44 December TRUE 45 January TRUE 46 February TRUE 47 March TRUE 48 April ,759 2,737 FALSE Total: May Apr ,439 16, TRUE, 18 FALSE 143

152 Figure 1. The Number of Material Conflict events per Afghani District from 2001 to

153 Figure 2. One-month Forecast of the of Material Conflict Events in Bughran District using arfima package, with mean, 90%, and 95% confidence intervals. 145

154 Figure 3. Average Farm-Gate Prices for Dry Opium in Afghanistan, September 2004-March

Definitions, sources and methods for Uppsala Conflict Data Program Battle-Death estimates

Definitions, sources and methods for Uppsala Conflict Data Program Battle-Death estimates Uppsala Conflict Data Program (UCDP) Department of Peace and Conflict Research, Uppsala University This document