Migration Systems in Europe: Evidence From Harmonized Flow Data

Similar documents
Labor Market Laws and Intra-European Migration

OECD ECONOMIC SURVEY OF LITHUANIA 2018 Promoting inclusive growth

Curing Europe s Growing Pains: Which Reforms?

Index for the comparison of the efficiency of 42 European judicial systems, with data taken from the World Bank and Cepej reports.

GDP per capita in purchasing power standards

The global and regional policy context: Implications for Cyprus

Asylum Trends. Appendix: Eurostat data

Asylum Trends. Appendix: Eurostat data

Asylum Trends. Appendix: Eurostat data

Asylum Trends. Appendix: Eurostat data

European patent filings

Gender pay gap in public services: an initial report

Migration, Mobility and Integration in the European Labour Market. Lorenzo Corsini

Big Government, Small Government and Corruption: an European Perspective. Alina Mungiu-Pippidi Hertie School of Governance

Asylum Trends. Appendix: Eurostat data

Standard Note: SN/SG/6077 Last updated: 25 April 2014 Author: Oliver Hawkins Section Social and General Statistics

Asylum Trends. Appendix: Eurostat data

Statistical Modeling of Migration Attractiveness of the EU Member States

Asylum Trends. Appendix: Eurostat data

Integrated Modeling of European Migration

Measuring Social Inclusion

Italy Luxembourg Morocco Netherlands Norway Poland Portugal Romania

9 th International Workshop Budapest

TRIPS OF BULGARIAN RESIDENTS ABROAD AND ARRIVALS OF VISITORS FROM ABROAD TO BULGARIA IN AUGUST 2016

TRIPS OF BULGARIAN RESIDENTS ABROAD AND ARRIVALS OF VISITORS FROM ABROAD TO BULGARIA IN FEBRUARY 2017

TRIPS OF BULGARIAN RESIDENTS ABROAD AND ARRIVALS OF VISITORS FROM ABROAD TO BULGARIA IN MARCH 2016

TRIPS OF BULGARIAN RESIDENTS ABROAD AND ARRIVALS OF VISITORS FROM ABROAD TO BULGARIA IN MAY 2017

TRIPS OF BULGARIAN RESIDENTS ABROAD AND ARRIVALS OF VISITORS FROM ABROAD TO BULGARIA IN AUGUST 2015

Labour mobility within the EU - The impact of enlargement and the functioning. of the transitional arrangements

The economic outlook for Europe and Central Asia, including the impact of China

8193/11 GL/mkl 1 DG C I

TRIPS OF BULGARIAN RESIDENTS ABROAD AND ARRIVALS OF VISITORS FROM ABROAD TO BULGARIA IN SEPTEMBER 2015

TRIPS OF BULGARIAN RESIDENTS ABROAD AND ARRIVALS OF VISITORS FROM ABROAD TO BULGARIA IN DECEMBER 2016

EuCham Charts. October Youth unemployment rates in Europe. Rank Country Unemployment rate (%)

3-The effect of immigrants on the welfare state

The effect of migration in the destination country:

Internationalization in Tertiary Education: Intra-European Students Mobility

Globalisation and flexicurity

2nd Ministerial Conference of the Prague Process Action Plan

Appendix to Sectoral Economies

Annex 1. Technical notes for the demographic and epidemiological profile

THE EUROPEAN COURT OF HUMAN RIGHTS IN FACTS & FIGURES

Introduction: The State of Europe s Population, 2003

IMMIGRATION IN THE EU

VISA POLICY OF THE REPUBLIC OF KAZAKHSTAN

Shaping the Future of Transport

Migration Report Central conclusions

Supplementary figures

Impact Of Economic Freedom On Economic Development: A Nonparametric Approach To Evaluation

SPACE I 2015 Facts & Figures

The impact of international patent systems: Evidence from accession to the European Patent Convention

Measuring flows of international migration

Fertility rate and employment rate: how do they interact to each other?

European Union Passport

WILL CHINA S SLOWDOWN BRING HEADWINDS OR OPPORTUNITIES FOR EUROPE AND CENTRAL ASIA?

ASYLUM IN THE EU Source: Eurostat 4/6/2013, unless otherwise indicated ASYLUM APPLICATIONS IN THE EU27

WORLDWIDE DISTRIBUTION OF PRIVATE FINANCIAL ASSETS

Table A.1. Jointly Democratic, Contiguous Dyads (for entire time period noted) Time Period State A State B Border First Joint Which Comes First?

TECHNICAL BRIEF August 2013

2016 Europe Travel Trends Report

Table A.2 reports the complete set of estimates of equation (1). We distinguish between personal

SPACE I 2016 Facts & Numbers

DANMARKS NATIONALBANK

3.1. Importance of rural areas

Social capital and social cohesion in a perspective of social progress: the case of active citizenship

The EU on the move: A Japanese view

LMG Women in Business Law Awards - Europe - Firm Categories

Size and Development of the Shadow Economy of 31 European and 5 other OECD Countries from 2003 to 2013: A Further Decline

ERGP REPORT ON CORE INDICATORS FOR MONITORING THE EUROPEAN POSTAL MARKET

International migration data as input for population projections

Improving the accuracy of outbound tourism statistics with mobile positioning data

Evolution of the European Union, the euro and the Eurozone Sovereign Debt Crisis

Europe in Figures - Eurostat Yearbook 2008 The diversity of the EU through statistics

BULGARIAN TRADE WITH EU IN JANUARY 2017 (PRELIMINARY DATA)

International Goods Returns Service

wiiw Workshop Connectivity in Central Asia Mobility and Labour Migration

BULGARIAN TRADE WITH EU IN THE PERIOD JANUARY - MARCH 2016 (PRELIMINARY DATA)

WHO Global Code of Practice on the International Recruitment of Health Personnel. Findings of the first round of reporting.

Eurostat Yearbook 2006/07 A goldmine of statistical information

Migration Challenge or Opportunity? - Introduction. 15th Munich Economic Summit

Asylum Levels and Trends: Europe and non-european Industrialized Countries, 2003

This refers to the discretionary clause where a Member State decides to examine an application even if such examination is not its responsibility.

Migration Report Central conclusions

Plan for the cooperation with the Polish diaspora and Poles abroad in Elaboration

GALLERY 5: TURNING TABLES INTO GRAPHS

Identification of the respondent: Fields marked with * are mandatory.

Context Indicator 17: Population density

Estimating Global Migration Flow Tables Using Place of Birth Data

Options for Romanian and Bulgarian migrants in 2014

Romania's position in the online database of the European Commission on gender balance in decision-making positions in public administration

INTRODUCTION TO THE COURSE; ECONOMICS OF MIGRATION, INTRODUCTION, TRENDS AND CONCEPTS

DETERMINANTS OF INTERNATIONAL MIGRATION: A SURVEY ON TRANSITION ECONOMIES AND TURKEY. Pınar Narin Emirhan 1. Preliminary Draft (ETSG 2008-Warsaw)

The Construction Industry in Central and Eastern Europe Bucharest, May 19 th 2014

The European health report Dr Claudia Stein Director Division of Information, Evidence, Research and Innovation (DIR)

Territorial indicators for policy purposes: NUTS regions and beyond

Council of Europe Annual Penal Statistics SPACE I & SPACE II Facts, figures and tendencies. Marcelo F. Aebi & Natalia Delgrande

THE VALUE HETEROGENEITY OF THE EUROPEAN COUNTRIES POPULATION: TYPOLOGY BASED ON RONALD INGLEHART S INDICATORS

Second EU Immigrants and Minorities, Integration and Discrimination Survey: Main results

INVESTING IN AN OPEN AND SECURE EUROPE Two Funds for the period

Generating Executive Incentives: The Role of Domestic Judicial Power in International Human Rights Court Effectiveness

Transcription:

Demography (2012) 49:1307 1333 DOI 10.1007/s13524-012-0117-9 Migration Systems in Europe: Evidence From Harmonized Flow Data Jack DeWaard & Keuntae Kim & James Raymer Published online: 12 July 2012 # Population Association of America 2012 Abstract Empirical tests of migration systems theory require consistent and complete data on international migration flows. Publicly available data, however, represent an inconsistent and incomplete set of measurements obtained from a variety of national data collection systems. We overcome these obstacles by standardizing the available migration reports of sending and receiving countries in the European Union and Norway each year from 2003 2007 and by estimating the remaining missing flows. The resulting harmonized estimates are then used to test migration systems theory. First, locating thresholds in the size of flows over time, we identify three migration systems within the European Union and Norway. Second, examining the key determinants of flows with respect to the predictions of migration systems theory, our results highlight the importance of shared experiences of nation-state formation, geography, and accession status in the European Union. Our findings lend support to migration systems theory and demonstrate that knowledge of migration systems may improve the accuracy of migration forecasts toward managing the impacts of migration as a source of social change in Europe. Keywords International migration. Migration systems. Harmonization Introduction Migration systems theory (MST) situates international migration against a backdrop of the ties shared between sending and receiving countries (Kritz et al. 1992; Mabogunje 1970). It is a theoretically encompassing perspective (Massey et al. J. DeWaard (*) Department of Sociology, Center for Demography and Ecology, University of Wisconsin Madison, 8128 William H. Sewell Hall, 1180 Observatory Drive, Madison, WI 53706, USA e-mail: jdewaard@ssc.wisc.edu K. Kim Department of Sociology, Center for Demography and Ecology, University of Wisconsin Madison, Madison, WI, USA J. Raymer Division of Social Sciences, University of Southampton, Southampton, England

1308 J. DeWaard et al. 1998), but efforts to substantiate MST empirically are few. If international migration were perfectly measureable, migration systems might be identified by examining the matrices of in-flows, out-flows, and net-flows between all countries as they evolved through time (Zlotnik 1992:20). Publicly available migration data, however, lack a consistent metric given diverse national data collection systems and timing criteria used to validate migrations. Discrepancies between the migration reports of sending and receiving countries are well documented (Bilsborrow et al. 1997; Kupiszewska and Nowok 2008; Lemaitre 2005; Poulain et al. 2006). For example, Sweden and the United Kingdom (UK) each use a one-year timing criterion to validate the migrations of nationals and foreigners, but rely on different data collection systems (a population register and a passenger survey, respectively). Other countries (e.g., Romania) count only permanent moves as migrations and use separate registers for nationals and foreigners, tracking only their respective emigrations and immigrations. These sorts of problems render publicly available migration data inconsistent and the identification of migration systems tenuous (Zlotnik 1992). Funded by Eurostat and coordinated by the Netherlands Interdisciplinary Demographic Institute, the MIgration MOdeling for Statistical Analysis (MIMOSA) project addressed these issues by standardizing the available migration reports of sending and receiving countries and estimating missing flows for countries neither collecting nor providing these data to Eurostat. 1 Harmonized estimates were developed for flows among 31 countries in the European Union (EU) and European Free Trade Association (EFTA) each year from 2002 to 2007 using an optimization procedure (de Beer et al. 2010; Raymer et al. 2011). 2 In the early stages of the MIMOSA project, one method of standardization considered involved ranking countries by the quality of their migration data, and calculating country-specific immigration and emigration adjustment ratios to scale the reports of countries with less reliable data (van der Erf and van der Gaag 2007). Missing flows were then estimated in a regression framework using covariate information (Raymer and Abel 2008). Although the MIMOSA project ultimately settled on an optimization procedure for standardization, the strategy for estimating missing flows remained the same (Raymer et al. 2011). In this article, our methodological contribution is to extend the method proposed by van der Erf and van der Gaag (2007). Because relative data quality cannot be fully known a priori (Poulain et al. 2006), our approach to standardizing the available migration reports of sending and receiving countries incorporates uncertainty with respect to country rank. We then estimate the remaining missing flows using a technique new to this area of research, the k-nearest neighbor (knn) algorithm. These two steps yield harmonized estimates of flows among EU-27 countries and Norway each year from 2003 to 2007. Our empirical contribution uses these harmonized estimates to test MST. Broadly defined as a group of countries that exchange relatively large numbers of migrants (Kritz and Zlotnik 1992:2), migration systems have typically been identified through 1 The methodology and estimates are available online (www.nidi.knaw.nl/en/projects/230211/). 2 We use the term harmonize to mean both standardization of available migration data and estimation of the remaining missing flows. We distinguish these and the methods associated with each throughout this article.

Migration Systems in Europe 1309 two approaches. The first attempts to locate thresholds in the size of flows, where any submatrix whose entries remained above the threshold during five or ten years would indicate the potential existence of a system (Zlotnik 1992:20). The second attempts to isolate the key ties shared by sending and receiving countries that influence the size of flows (Boyd 1989). With respect to the latter, the MIMOSA estimates are potentially unsuited for this task because the project relied on covariate information, which overlaps the indicators required to test MST (Fawcett 1989; Zlotnik 1992). Our use of the knn algorithm overcomes this issue, estimating missing flows exclusively from the standardized estimates developed in the first step of the harmonization process. International migration has captured the attention of analysts and policy makers on topics ranging from population aging and public pensions (Bongaarts 2004) to labor market pressures associated with EU expansion into Central and Eastern Europe (Bauer and Zimmermann 1999). Knowledge of migration systems can help unpack these issues by improving migration forecasts, thereby enhancing the ability to determine the impact of migration as a catalyst for social change (Bijak 2006). Background MST is an encompassing perspective, combining elements of neoclassical economics, the new economics of migration, world systems theory, bifurcated labor market theory, and social capital theory (Jennissen 2004). Viewed by Massey et al. (1998) as international labor markets, migration systems are characterized by the unique set of ties shared by sending and receiving countries (Bonifazi et al. 2008; Boyd 1989; Kritz et al. 1992). The first stylized account of MST detailed three such linkages (Fawcett 1989). Relational ties include historical and cultural similarities, and their implications for the integration of migrants in receiving countries. Regulatory ties include congruent migration policies rooted in shared economic and political memberships (e.g., the EU). Tangible ties include capital and trade flows, and encompass the notions of economic and relative economic advantage (Greenwood and McDowell 1991). Summarizing these linkages, Zlotnik (1992:20) argued that shared geography, comparable levels of development... [and] cultural affinity are the essential features of migration systems, further distilled by Andrienko and Guriev (2004:2) as geography, initial conditions and legacies. Migration stems from push factors at origin, pull factors at destination, and shared community ties linking sending and receiving countries (Greenwood 1997; van Tubergen et al. 2004:705). MST highlights the third type of ties. Empirical work has demonstrated the importance of such relational linkages as shared colonial histories and common language(s) for the size of flows (Kim and Cohen 2010; Pedersen et al. 2008). Geographic isolation serves to strengthen regional and national expressions of solidarity, indicated by the positive association between country contiguity and the size of flows (Karemera et al. 2000). And tangible linkages, such as relative economic advantage, increase flows when, for example, the GDP per capita ratio of sending to receiving countries favors destinations (Greenwood and McDowell 1991; Leblang et al. 2009). Despite the encompassing potential of MST, few efforts have substantiated its claims empirically because of problems with publicly available migration data. A

1310 J. DeWaard et al. product of different national data collection systems and timing criteria used to validate migrations, these data represent an inconsistent and incomplete set of measurements and are unsuited for cross-national comparison (Lemaitre 2005). As a consequence, research on MST has relied heavily on data on birthplace-specific migrant stocks (Zlotnik 1992), which is problematic because these confound mortality and naturalization with migration (Massey et al. 1998:112) and the distinction between past and recent migrants (Rogers 2008). Labor force surveys are likewise inadequate, given insufficient sample sizes to capture flows between all pairs of sending and receiving countries (Nowok et al. 2006:212). Country-specific data on immigration and emigration flows offer a more promising avenue but are not immune to problems. To illustrate, we present in Fig. 1 the migration reports of sending and receiving countries in the EU-15 in 2003, obtained from Eurostat s New Cronos database (Kupiszewska and Nowok 2008:43 45). Of 420 possible reports, only 72 flows are reported by both sending and receiving countries, and rarely do these agree. 3 The rest are reported by only one country or are missing. The problems of inconsistent data reflect different conventions used to track and measure migration (Kupiszewska and Nowok 2008; Poulain et al. 2006). A population registration system can be compared against many alternatives: for example, separate registration systems for nationals and foreigners in Slovenia, residence permits for foreigners in Hungary, and passenger surveys in Ireland and the UK. Timing criteria also vary, ranging from none specified to permanence, with 3-, 6-, and 12-month variants in between. Ideally, migration reports should reflect a single timing criterion: for example, the 12-month long-term criterion recommended by the United Nations (1998). Finally, additional problems arise because persons have fewer incentives to deregister when migrating abroad, resulting in emigration reports that are downwardly biased (Kupiszewska and Nowok 2008). Together, these problems raise questions about the adequacy of publicly available migration data in empirical work, including on MST. In this article, we address these issues by extending a method to harmonize data on migration flows developed by van der Erf and van der Gaag (2007). Their approach assumes that countries can be ranked by the quality of their migration data. The migration reports of countries with less reliable data are then scaled to reporting conventions of countries with more reliable data using a set of immigration and emigration adjustment ratios. However, our assessment of the available information by which to rank countries suggests that relative data quality cannot be fully known a priori. We assume that only groups of countries can be ranked by relative data quality. Within each group, we permit country rank to vary randomly over 10,000 permutations. By averaging immigration and emigration adjustment ratios across permutations, we account for uncertainty in country rank in the standardization process. We then use the knn algorithm to estimate the missing flows without the use of covariate information. Harmonized (i.e., standardized and complete) estimates are developed for flows among EU-27 countries and Norway each year from 2003 to 2007 and are suitable for testing MST. 3 420 0 15 sending countries 14 receiving countries 2 reports per flow (i.e., sender and receiver).

Migration Systems in Europe 1311 Sending Receiving Country Country AUT BEL DEU DNK ESP FIN FRA GBR GRC IRL ITA LUX NLD PRT SWE E 177 4422 100 300 196 426 668 340 68 852 44 295 180 396 AUT I 13456 262 554 92 7 510 33 333 E/I 0.33 0.38 0.54 2.13 6.29 0.58 5.45 1.19 E BEL I 267 4291 587 3037 184 804 5348 105 399 E/I E 15976 4623 2712 16236 2380 19060 15550 18106 2415 33802 1510 8616 8880 3786 DEU I 12239 E/I 1.31 3221 13746 0.84 1.18 807 2.95 436 3.46 7921 1.09 645 13.77 2872 1.32 DNK E I 231 203 511 2540 2693 1720 764 403 371 1333 4317 229 264 782 131 14 609 474 174 58 3786 2872 E/I 1.14 0.94 2.25 1.09 9.36 1.28 3.00 1.32 E 93 647 2109 130 102 2474 2335 38 487 801 89 600 627 164 ESP I 615 14647 1665 68 28 2794 473 1234 E/I 0.15 0.14 0.08 1.50 3.18 0.21 1.33 0.13 E 76 245 761 397 792 284 1070 56 110 210 57 217 26 3428 FIN I 251 2204 421 802 2 362 15 3395 E/I 0.30 0.35 0.94 0.99 28.50 0.60 1.73 1.01 E FRA I 741 18133 1488 8847 312 987 2919 458 931 E/I E GBR I 1180 13197 3707 34117 914 37 5872 947 3022 E/I GRC E I 465 12959 278 273 63 5 882 15 585 E/I E IRL I E/I 138 2046 306 1649 146 0 3 615 37 230 E ITA I E/I 1460 23702 895 5796 209 68 1661 312 473 E 22 1119 747 119 73 33 1254 171 22 44 208 97 521 74 LUX I 57 1728 196 89 34 166 10 78 E/I 0.39 0.43 0.61 0.82 0.97 0.58 52.10 0.95 E 470 9284 9822 430 3365 292 3373 7022 482 459 1274 150 666 648 NLD I 655 13015 820 3567 239 25 264 707 E/I 0.72 0.75 0.52 0.94 1.22 6.00 2.52 0.92 PRT E I 0 330 0 955 7699 0 170 0 5505 0 56 849 2187 0 0 0 770 512 0 1619 0 143 E/I 0.00 0.12 0.00 0.00 0.00 1.50 0.00 0.00 E 238 411 1580 2585 1356 3386 946 3676 510 205 441 66 499 92 SWE I E/I 474 0.50 3397 0.47 2705 0.96 1537 0.88 3438 0.98 11 6.00 638 0.78 31 2.97 Fig. 1 Double count matrix of migration flows among EU-15 countries: 2003. E 0 emigration flow reported by sending country; I 0 immigration flow reported by receiving country; and E/I 0 emigration flow/immigration flow. AUT 0 Austria; BEL 0 Belgium; DEU 0 Germany; DNK 0 Denmark; ESP 0 Spain; FIN 0 Finland; FRA 0 France; GBR 0 United Kingdom; GRC 0 Greece; IRL 0 Ireland; ITA 0 Italy; LUX 0 Luxembourg; NLD 0 Netherlands; PRT 0 Portugal; and SWE 0 Sweden. Source: Kupiszewska and Nowok (2008) Harmonization of Flow Data Methodology Data on migration flows within Europe are publicly available from Eurostat s New Cronos database. 4 We use the reports of sending and receiving countries classified by next and previous country of residence. The harmonization method detailed herein is divided into two steps: (1) standardization of available flow reports and (2) estimation of missing flows. 4 Retrieved online (http://appsso.eurostat.ec.europa.eu/nui/show.do?dataset0migr_imm5prv&lang0en and http://appsso.eurostat.ec.europa.eu/nui/show.do?dataset0migr_emi3nxt&lang0en).

1312 J. DeWaard et al. Standardization of Available Flow Reports The starting point for standardizing the available migration reports of sending and receiving countries is to organize them into two matrices reported immigration and reported emigration for each year. Elements of these matrices correspond to flows between pairs of sending and receiving countries. We illustrate the method developed by van der Erf and van der Gaag (2007) in Fig. 2, using a hypothetical example of flows among six countries (A, B, C, D, E, and F) in a single year. Countries A and B are assumed to have reliable data. Country C is assumed to have less reliable data than A and B, but more reliable data than D. Countries E and F provide no data. Relative data quality is reflected in the order in which countries are listed in the matrices in Step 1. First, we identify the most reliable immigration reports. In Step 1 of Fig. 2, these are the immigration reports of Countries A and B, and are considered fixed with immigration adjustment ratios of 1.00. Second, the emigration reports of Countries A and B are adjusted in Step 2 of Fig. 2. The emigration adjustment ratio for Country A is obtained by dividing the fixed immigration report of B (20) by the corresponding emigration report of A (15), for a ratio of 20 / 15 0 1.33. This ratio for Country B is 100 / 80 0 1.25. The emigration reports of Countries A and B are then standardized using these ratios. 5 We proceed iteratively in Step 3 of Fig. 2 and adjust the migration reports of Country C. The immigration adjustment ratio for Country C is obtained by dividing the sum of the two standardized flows to this point (27 and 31) by the corresponding immigration reports of C (25 and 30), for a ratio of 1.05. This ratio is used to standardize the immigration reports of Country C from D, E, and F. 6 The emigration adjustment ratio for Country C is calculated by dividing the sum of the two standardized flows (125 and 30) by the corresponding emigration reports of C (120 and 30), for a ratio of 1.03. The emigration reports of Country C to D, E, and F are then standardized using this ratio. 7 Following standardization in Step 4 of Fig. 2, immigration from Country E to D is reduced slightly from 25 to 24, whereas emigration from D to E increases substantially from 40 to 153. The adjustment ratios used to produce these results are 0.94 and 3.83, respectively. This method is unable to estimate the missing flows between Countries E and F because data on these flows were not reported. Having illustrated the method developed by van der Erf and van der Gaag (2007), a key assumption is that countries can be ranked by the relative quality of their migration data, which presumes sufficient a priori knowledge of data quality. Country rank is important because those with more reliable reports are treated as the standard to which less-reliable reports are benchmarked. Relative data quality, however, cannot be fully known apriori. The available taxonomies detailing the sets of conventions used to track and measure migration permit only informed guesses (Kupiszewska and Nowok 2008; Poulain et al. 2006). To overcome this problem, 5 For emigration flows from Country A to C, D, E, and F, the standardized figures are 20 1.33 0 27, 175 1.33 0 233, 35 1.33 0 47, and 40 1.33 0 53, respectively. The standardized figures for Country B to C, D, E, and F are 25 1.25 0 31, 40 1.25 0 50, 65 1.25 0 81, and 100 1.25 0 125, respectively. 6 The standardized flows are 55 1.05 0 58, 65 1.05 0 68, and 100 1.05 0 105, respectively. 7 The standardized flows are 90 1.03 0 93, 75 1.03 0 77, and 45 1.03 0 46, respectively.

Migration Systems in Europe 1313 Immigration Reported by Receiving Country Receiving Country Emigration Reported by Sending Country Receiving Country Standardized Estimates Receiving Country Sending Step Country A B C D E F A B C D E F A B C D E F 1 A 20 25 250 15 20 175 35 40 20 B 100 30 50 80 25 40 65 100 100 C 125 30 100 120 30 90 75 45 125 30 D 85 10 55 20 5 15 40 30 85 10 E 300 5 65 25 300 5 F 175 45 100 125 175 45 2 A 20 25 250 15 20 175 35 40 20 27 233 4 7 53 B 100 30 50 80 25 40 65 100 100 31 50 81 125 C 125 30 100 120 30 90 75 45 125 30 D 85 10 55 20 5 15 40 30 85 10 E 300 5 65 25 300 5 F 175 45 100 125 175 45 3 A 20 25 250 15 20 175 35 40 20 27 233 4 7 53 B 100 30 50 80 25 40 65 100 100 31 50 81 125 C 125 30 100 120 30 90 75 45 125 30 93 77 46 D 85 10 55 20 5 15 40 30 85 10 58 E 300 5 65 2 5 3 00 5 68 F 175 45 100 125 175 45 105 4 A 20 25 250 15 20 175 35 40 20 27 233 47 53 B 100 30 50 80 25 40 65 100 100 31 50 81 125 C 125 30 100 120 30 90 75 45 125 30 93 77 46 D 85 10 55 20 5 15 40 30 85 10 58 153 115 5 E 300 5 65 2 00 5 68 24 F 175 45 100 125 175 45 105 118 3 Immigration (Receiving Country) Adjustment Ratios Country Ste p Calculation S tep A 1 Baseline = 1.00 2 B 1 Baseline = 1.00 2 C 3 (27 + 31) / (25 + 30) = 1.05 3 D 4 (233 + 50 + 93) / (250 + 50 + 100) = 0.94 4 Emigration (Sending Country) Adjustment Ratios Calculation 20 / 15 = 1.33 100 / 80 = 1.25 (125 + 30) / (120 + 30) = 1.03 (85 + 10 + 58) / (20 + 5 + 15) = 3.83 Fig. 2 Illustration of standardization method. Shading 0 reported flows to be standardized at each step. Italics 0 reported and standardized flows used to calculate adjustment ratios. Boldface 0 standardized flows resulting from each step 0 reported flow adjustment ratio (e.g., D to E 0 40 3.83 0 153). Source: Raymer and Abel (2008); van der Erf and van der Gaag (2007)

1314 J. DeWaard et al. we therefore extend the preceding method by treating country rank and thus the order of countries in the initial immigration and emigration matrices (hereafter, rank-order ) as a permutation problem. We analyze the migration reports of EU-27 countries plus Norway each year from 2003 2007. Absent information on data quality, there are 28! combinations by which to rank-order these countries. Because this is computationally unmanageable, we proceed first by combining the data over the 2003 2007 period. Using the information provided by Poulain et al. (2006:222 227), we then assign each country to one of four groups, rank-ordered by the comprehensiveness of the data collection system and proximity to a 12-month timing criterion (United Nations 1998). From most to least reliable, these groups include (1) Nordic countries, (2) non-nordic countries with reliable data, (3) non-nordic countries with semi-reliable data, and (4) non- Nordic countries with unreliable data. 8 No assumptions are made about relative data quality within each group. Instead, we generate 10,000 permutations, preserving the rank-order of the four groups but permitting the rank-order of countries within each group to vary randomly. For each permutation, we implement the standardization procedure detailed in Fig. 2 and calculate immigration and emigration adjustment ratios for each country. We then average each ratio across all permutations and apply these to standardize the migration reports of sending and receiving countries for each year, one at a time. We use the harmonic average because in theory, each permutation should produce estimates of the same flow. Average adjustment ratios should reflect this by mitigating the impact of large values and aggravating the impact of small ones. These ratios produce smoother, less fluctuating patterns over time, and therefore represent a more conservative approach. Estimation of Missing Flows With the available migration reports of sending and receiving countries standardized to the conventions of Nordic countries each year from 2003 to 2007, we estimate the remaining missing flows for countries that neither collect nor provide these data to Eurostat using the knn algorithm. This algorithm uses neighboring observations (defined herein) to impute and smooth data points. The basic steps are (1) locating observations in a defined space; (2) setting the parameter k, the number of nearest neighbors; (3) calculating the distance to each neighbor for each observation; (4) adjusting each observation using an inverse distance weighted average of neighboring observations; and (5) repeating until convergence (Cover and Hart 1967). We implement three variants of the knn algorithm and illustrate our approach in Fig. 3. InStep1,we extendthe examplein Fig.2 by adding two matrices of hypothetical flows, yielding three years of standardized estimates among six countries. Flows with Countries A, B, and C as either the sending or receiving country are complete at all time 8 Nordic countries: Denmark, Finland, Norway, and Sweden. Non-Nordic countries with reliable data: Austria, Germany, Netherlands, and Spain. Non-Nordic countries with semi-reliable data: Cyprus, Czech Republic, Italy, Latvia, Lithuania, Luxembourg, Poland, Slovakia, Slovenia, and the United Kingdom. Non-Nordic countries with unreliable data: Ireland, Portugal, and Romania (emigration only). Countries with missing data: Belgium, Bulgaria, Estonia, France, Greece, Hungary, and Malta.

Migration Systems in Europe 1315 Sending Country Standardized Estimates at Time 1 Receiving Country Standardized Estimates at Time 2 Receiving Country Standardized Estimates at Time 3 Receiving Country Step A B C D E F A B C D E F A B C D E F 1 A 20 27 233 47 53 22 29 257 51 58 30 40 350 70 80 B 100 31 50 81 125 110 34 55 89 138 150 47 75 122 188 C 125 30 93 77 46 138 33 102 85 52 188 45 140 116 69 D 85 10 58 153 115 95 94 11 64 168 105 128 15 87 229 173 143 E 300 5 68 24 330 6 75 26 450 8 103 35 F 175 45 105 118 193 50 116 130 263 68 158 177 178 196 268 2 Missing Flow k Neighbors Euclidean Distance (d ij ) Distance Weight (w ij ) Components of Estimated Flow (105, 196) 1 (95, 178) 1 / 20.59 = 0.049 115 [0.049 / (0.049 + 0.012)] = 92.38 2 (143, 268) 1 / 81.41 = 0.012 173 [0.012 / (0.049 + 0.012)] = 34.03 3 A 20 27 233 47 53 280 22 29 257 51 58 308 30 40 350 70 80 420 B 100 31 50 81 125 181 110 34 55 89 138 199 150 47 75 122 188 272 C 125 30 93 77 46 248 138 33 102 85 52 273 188 45 140 116 69 373 D 85 10 58 153 115 153 94 11 64 168 126 169 128 15 87 229 173 230 E 300 5 68 24 397 330 6 75 26 437 450 8 103 35 596 F 175 45 105 118 443 193 50 116 130 489 263 68 158 177 666 310 60 116 376 358 339 342 66 127 414 393 374 466 90 174 565 537 510 4 Coordinates Defining Plane a For Flows From Country E to F For Flows From Country F to E (397, 510); (596, 510); (596, 339); (397, 339) (443, 537); (666, 537); (666, 358); (443, 358) 5 Estimation of Missing Flows b knn c = 8 Flow (397, 376) 24 (443, 376) 118 (437, 342) 330 (437, 414) 26 (489, 342) 193 (489, 414) 130 (420, 510) 80 (596, 466) 450 Estimated Flow Missing Flows From Country E to F Missing Flows From Country F to E (397, 339) (437, 374) (596, 510) (443, 358) (489, 393) (666, 537) knn c = 4 Flow d ij w ij d ij w ij d ij w ij d ij w ij d ij w ij d ij w ij 37.00 0.027 40.05 0.025 239.91 0.004 (443, 376) 118 18.00 0.056 49.04 0.020 275.05 0.004 59.03 0.017 6.32 0.158 203.38 0.005 (489, 414) 130 72.47 0.014 21.00 0.048 215.54 0.005 40.11 0.025 32.00 0.031 231.31 0.004 (596, 466) 450 187.28 0.005 129.53 0.008 99.70 0.010 85.00 0.012 40.00 0.025 185.73 0.005 (666, 466) 263 247.78 0.004 191.46 0.005 71.00 0.014 92.05 0.011 61.06 0.016 199.18 0.005 118.70 0.008 65.60 0.015 143.75 0.007 172.54 0.006 137.06 0.007 176.00 0.006 236.07 0.004 183.70 0.005 44.00 0.023 151 135 252 Estimated Flow 148 167 282 Fig. 3 Illustration of knn algorithm for estimating missing flows. Shading 0 flows to be estimated at each step. Italics 0 standardized flows used to calculate Euclidean distance and corresponding weights. Boldface 0 estimated flows resulting from knn algorithm. a Listed clockwise from upper left to lower left. b Only the results of these calculations are shown. Refer to Step 2 for the general mechanics. c Nearest neighbors are defined as standardized flows from Step 3 falling within (or on the border) of the plane defined in Step 4

1316 J. DeWaard et al. points. Two sets of flows EtoF,andFtoE are missing at all time points. And one set of flows, D to F, is partially complete during the period because one flow is missing at Time 2. Our approach to estimating missing flows begins by imputing flows for pairs of sending and receiving countries with partially complete data. The matrices in Step 1 of Fig. 3 contain one partial row sum and one partial column sum. These sums represent total flows from Country D to A and B (row sum) and from Countries A and B to D (column sum). As we did in Fig. 2,we assume that Countries A and B have the most reliable data, thereby providing a consistent metric to implement the knn algorithm. The partial row and column sums can be viewed as x and y coordinates, respectively, to locate the flows from Country D to F in twodimensional space. We display these coordinates in Step 2 of Fig. 3. 9 We then calculate the Euclidean distance between the missing flow at Time 2 and each flow at Times 1 and 3. These distances are 20.59 and 81.41, respectively; thus, parameter k 0 2. 10 The inverse of each distance is then used as a weight to estimate the missing flow at Time 2, which is 126. 11 The matrices in Step 3 of Fig. 3 now contain pairs of countries with either complete or missing data over the three-year period. Because this approach is potentially sensitive to period fluctuations, we smooth flows for pairs of countries with complete data over the period (not shown). This step is similar to the last; however, the partial row and column sums now include flows from and to Countries A, B, C, and D. As before, parameter k 0 2 because each pair of countries has three years of data and thus two neighbors per flow. We then calculate the appropriate Euclidean distances and weights. Because these flows are complete over the period, the flow being smoothed is assigned a weight of 0.50. Smoothing thus retains 50 % of the original standardized estimate. The remaining 50 % is a distance weighted average of neighboring flows. The last step is to estimate flows between sending and receiving countries missing data at all time points. In Step 3 of Fig. 3, we again calculate partial row and column sums to include flows from and to Countries A, B, C, and D. Unlike in Step 2, however, we cannot use neighboring observations to impute missing flows because these data are missing at all time points. Instead, we use information from other pairs of countries with complete data. Treating the partial row and column sums in Step 3 as x and y coordinates, respectively, we define a plane in two-dimensional space for each pair of sending and receiving countries missing data over the period. The coordinates defining each plane are given in Step 4 of Fig. 3. Flows from Country E to F have x and y coordinates at Times 1, 2, and 3 of (397, 339), (437, 374), and (596, 510), respectively. The coordinates for flows from Country F to E are (443, 358), (489, 393), and (666, 537), respectively. The corresponding planes are thus bound by the points (397, 510), (596, 510), (596, 339), and (397, 339) for flows from Country E to F; and by (443, 537), (666, 537), (666, 358), and (443, 358) for flows from Country F to E. We define as k-nearest neighbors those flows displayed in the matrices in Step 3 of Fig. 3 that occupy the aforementioned planes. Were one to plot these 84 flows using the partial row and columns sums, one would identify eight and four neighbors (k 0 8; 9 The x and y coordinates are (95, 178), (105, 196), and (143, 268) at Times 1, 2, and 3, respectively. 10 In our analysis, 1 k 4 because pairs of sending and receiving countries with partially complete data have between one and four years of valid data from 2003 to 2007. 11 126 ¼ 115 ½0:049=ð0:049 þ 0:012ÞŠ þ173 ½0:012=ð0:049 þ 0:012ÞŠ.

Migration Systems in Europe 1317 k 0 4), respectively. 12 We then proceed to calculate the appropriate Euclidean distances and inverse distance weights in Step 5 of Fig. 3. 13 Missing flows are estimated as a distance weighted average of neighboring flows. These estimates are displayed at the bottom of Fig. 3. The standardization and estimation procedures detailed herein yield a harmonized set of flows among EU-27 countries and Norway each year from 2003 to 2007. The estimates are consistent because they are standardized to the conventions of Nordic countries. They are complete because they rely on information from neighboring observations to estimate missing flows and for data smoothing. Results Average immigration and emigration adjustment ratios are displayed in Table 1. These ratios were generated from 10,000 random permutations of countries rank ordered in the initial immigration and emigration matrices, and are the first to be developed with a corresponding measure of dispersion. The immigration adjustment ratios for Nordic countries are equal to 1.00, reflecting our treatment of these migration reports as the standard at the outset. The corresponding emigration adjustment ratios are slightly greater than 1.00. The ratios for countries in the remaining three groups exhibit considerable variation. For example, the emigration adjustment ratio for Spain is 4.94, and reflects the average level of scaling required to standardize Spain s emigration reports to the conventions of Nordic countries, Austria, Germany, and the Netherlands. In Table 2, we assess our harmonized estimates relative to those developed by the MIMOSA project (de Beer et al. 2010; Raymer et al. 2011). 14 Our total estimate of migration among countries in the EU-27 and Norway is 7.9 million persons over the 2003 2007 period. The corresponding MIMOSA estimate is 7.7 million. Means and standard deviations each year and over the period are also largely consistent. The largest discrepancy occurs in 2003, for which our mean estimate of migration (1,895) among these 28 countries is 13 % higher than the corresponding MIMOSA estimate (1,754). The congruence of our estimates and the MIMOSA estimates is due to similar immigration and emigration adjustment ratios (de Beer et al. 2010:471 473), with the two sets of estimates picking up a positive trend in the volume of migration flows over the period. To further examine whether our estimates are reasonable, we display in Fig. 4 the emigration and immigration reports for selected sending and receiving countries, respectively, and corresponding harmonized estimates developed in this and the MIMOSA projects. For flows from Denmark to Sweden, the MIMOSA estimates are identical to the latter s immigration reports, whereas our estimates are slightly lower given smoothing prior to estimating missing flows. 12 In our analysis, the maximum value of k is 125. 13 To save space, we show only the results of these calculations, which can be replicated by expanding the equation in Step 2. For example, the missing flow from Country F to E at Time 1 is estimated as follows: 148 ¼ 118 ½0:056=ð0:056þ0:014þ0:005þ0:004ÞŠþ130½0:014=ð0:056þ0:014þ0:005 þ 0:004ÞŠþ 450 ½0:005=ð0:056 þ 0:014 þ 0:005 þ 0:004ÞŠþ263 ½0:004=ð0:056 þ 0:014 þ 0:005 þ 0:004ÞŠ. 14 Abel (2010) and Poulain (1993, 1999) developed harmonized migration estimates, but for fewer sending and receiving countries.

1318 J. DeWaard et al. Table 1 Average immigration and emigration adjustment ratios: 2003 2007 Immigration Emigration Harmonic Mean SD Harmonic Mean SD Nordic countries Denmark 1.000 (0.000) 1.005 (0.000) Finland 1.000 (0.000) 1.006 (0.000) Norway 1.000 (0.000) 1.005 (0.000) Sweden 1.000 (0.000) 1.038 (0.000) Non-Nordic Countries: Reliable Migration Data Austria 0.855 (0.156) 1.132 (0.303) Germany 0.707 (0.149) 0.867 (0.089) Netherlands 0.952 (0.255) 1.025 (0.130) Spain 0.842 (0.050) 4.940 (1.517) Non-Nordic Countries: Semi-Reliable Migration Data Cyprus 0.729 (0.332) 3.630 (1.224) Czech Republic 2.250 (1.131) 4.119 (5.927) Italy 2.742 (0.559) 3.701 (0.655) Latvia 3.378 (0.892) 5.674 (0.699) Lithuania 2.475 (0.618) 2.492 (0.551) Luxembourg 5.228 (0.662) 1.884 (0.342) Poland 23.295 (3.909) 9.129 (1.597) Slovakia 16.893 (24.573) 37.104 (15.994) Slovenia 5.234 (0.633) 2.328 (0.471) United Kingdom 1.187 (0.171) 1.345 (0.259) Non-Nordic Countries: Unreliable Migration Data Belgium a a a a Bulgaria b a a a a Estonia a a a a France a a a a Greece a a a a Hungary a a a a Ireland 1.431 (0.210) 1.898 (0.215) Malta a a a a Portugal 8.712 (1.240) 30.046 (2.685) Romania a a 32.661 (4.199) a Immigration and/or emigration data not collected or provided to Eurostat. b Bulgaria s immigration and emigration reports were recoded as missing following adjustment factors of 445.27 and 101.00, respectively, which are too large for these reports to be considered reliable (Raymer and Abel 2008). The emigration and immigration reports of Austria and Germany, respectively, lack the agreement evident among Nordic countries. Excluding 2007, our estimates fall between the reports of Austria and Germany and are lower than the MIMOSA

Migration Systems in Europe 1319 Table 2 Summary estimates of harmonized migration flows among EU countries and Norway: 2003 2007 Current Estimates MIMOSA Estimates Total Flows 2003 1,432,788 1,325,652 2004 1,488,348 1,437,402 2005 1,507,935 1,479,873 2006 1,673,923 1,641,810 2007 1,789,933 1,819,282 2003 2007 7,892,926 7,704,019 Mean 2003 1,895.2 1,753.5 2004 1,968.7 1,901.3 2005 1,994.6 1,957.5 2006 2,214.2 2,171.7 2007 2,367.6 2,406.5 2003 2007 2,088.1 2,038.1 Standard Deviation 2003 6,996.1 6,695.2 2004 7,345.3 7,223.5 2005 7,403.0 7,194.6 2006 8,382.7 7,859.6 2007 8,890.0 8,568.5 2003 2007 7,833.7 7,535.5 N 3,780 3,780 Source: MIMOSA estimates (de Beer et al. 2010). project s estimates. We selected this example to show that even minor discrepancies in adjustment ratios can produce notable differences in harmonized estimates. We calculated emigration and immigration adjustment ratios for Austria of 1.13 and 0.86, respectively; those from the MIMOSA project are 1.35 and 1.17, respectively (de Beer et al. 2010:473). Likewise, our emigration and immigration adjustment ratios for Germany are 0.87 and 0.71, respectively, compared with 0.71 and 0.81 in the MIMOSA project. The bottom panels in Fig. 4 display reported and harmonized flows from Belgium to the Netherlands and from Portugal to Spain. The emigration reports of Belgium and Portugal are not collected by or provided to Eurostat. The immigration reports of the Netherlands and the two sets of harmonized estimates pick up the same positive trend in flows from Belgium. In contrast, Spain s reported immigration from Portugal is substantially higher than the two sets of harmonized estimates. Here, our estimates in 2003 and 2004 are similar to those from the MIMOSA project but diverge thereafter. The divergence is due to the fact that the MIMOSA project imputed these flows, whereas we used emigration data covering only foreigners to calculate the adjustment ratios.

1320 J. DeWaard et al. Migration Flow (000s) Denmark to Sweden 24 20 16 12 8 4 2003 2004 2005 2006 2007 Austria to Germany Migration Flow (000s) 24 20 16 12 8 4 2003 2004 2005 2006 2007 Belgium to the Netherlands Migration Flow (000s) 24 20 16 12 8 4 2003 2004 2005 2006 2007 Portugal to Spain Migration Flow (000s) 24 20 16 12 8 4 2003 2004 2005 2006 2007 Emigration report of sending country MIMOSA estimate Immigration report of receiving country Current project estimate Fig. 4 Repeated and harmonized migration flows for selected pairs of sending and receiving countries: 2003 2007. Migration flow in unit of persons. Source: Immigration and emigration reports (Eurostat); MIMOSA estimates (de Beer et al. 2010)

Migration Systems in Europe 1321 Identification of Migration Systems Methodology Zlotnik (1992) raised the possibility of locating thresholds in migration flow data by which to identify migration systems. We examine this claim in exploratory fashion using the harmonized estimates developed herein and the tools of cluster analysis. Like the knn algorithm, cluster analysis is an iterative process that begins by treating each flow as its own cluster and subsequently allocates flows among k clusters until a convergence criterion is satisfied. We employ average linkage clustering (Rabe-Hesketh and Everett 2006), which maximizes the average distance between k clusters. Averaging is performed over all possible pairs of flows to maximize the distance function: D XY ¼ 1 N X N Y X N X i¼1 X N Y j¼1 dðx i y j Þ : ð1þ D XY is the average distance between clusters X and Y. The number of observations in each cluster is denoted by N X and N Y, respectively, and d(x i y i ) is the distance between observation x i in cluster X and y j in cluster Y. Eq. (1) can be expanded to accommodate k clusters. To illustrate this method, designate the flow from Portugal to Spain in 2003 as cluster X, which in Fig. 4 is x i 0 8,235 persons. Cluster Y is composed of the remaining flows, y j (j 0 1,2,..., 3,779). 15 Average linkage clustering calculates the absolute distance between the flow in cluster X and each flow in cluster Y, and records the average. At each iteration, one flow from cluster Y is reallocated to X, and the average distance recorded. Clusters X and Y are identified (in our case, as migration systems) when the average distance between X and Y is maximized. Cluster analysis lacks a likelihood-based goodness-of-fit measure for determining the optimal number of clusters; thus, a set of stopping rules is typically employed. Milligan and Cooper (1985) examined 30 such rules and found via Monte Carlo simulations that the Calinski and Duda-Hart Indexes perform best in the analysis of continuous data. Large values of the Calinski Index, combined with large values of the Duda-Hart Index and corresponding small pseudo t-squared ratios, jointly determine the optimal number of clusters and are employed in our analysis (Milligan and Cooper 1985; Rabe-Hesketh and Everett 2006). We perform average linkage clustering on a natural logarithmic transformation of our harmonized estimates after excluding flows equal to 0 (n 0 20). We exclude these flows because migration systems require some volume of migration to be identified (Zlotnik 1992). Given the exploratory nature of this analysis, this fixes our attention on 3,760 flows among EU-27 countries and Norway each year from 2003 to 2007. 16 15 N Y 0 3,779 0 28 sending countries 27 receiving countries 5 years of data 1 flow in cluster X. 16 3,760 0 28 sending countries 27 receiving countries 5 years of data 20 zero flows.

1322 J. DeWaard et al. Results From exploratory cluster analysis, we find evidence for three migration systems among countries in the EU-27 and Norway from 2003 to 2007 after imposing two additional constraints from the literature on MST. 17 First, flows between pairs of sending and receiving countries must fall within the same cluster during the five-year period (Zlotnik 1992:20). Second, countries must exchange (i.e., send and receive) flows with all or most countries sharing the same cluster to be considered part of the migration system (Zlotnik 1992:39). After we impose these restrictions, the clusters identified earlier coalesce into three more or less geographically distinct migration systems: flows primarily among (1) five countries in the core of Europe (France, Germany, Italy, Spain, and the UK); (2) 14 countries located largely at the periphery (Bulgaria, Cyprus, the Czech Republic, Estonia, Greece, Hungary, Latvia, Lithuania, Luxembourg, Malta, Poland, Romania, Slovakia, and Slovenia); and (3) eight countries in the intermediate region (Austria, Belgium, Denmark, Finland, Ireland, the Netherlands, Norway, and Sweden). Of the 28 countries considered in our analysis, only Portugal could not be assigned to one of these three systems; despite exchanging large flows with Spain, migration between Portugal and France, Germany, Italy, and the UK is not sufficiently large to warrant inclusion in the core migration system. The three migration systems, displayed in Fig. 5, are consistent with Salt s (2001:31) conclusion that migration systems are, to some extent, geographically discrete. Although Massey et al. (1998:110) invoked the institutional ties shared by countries for example, the Treaties of Rome to argue for a single migration system (see also, Salt 1989), our evidence for the three aforementioned systems is derived purely from our harmonized estimates of migration flows and the set of criteria for identifying migration systems provided by MST. Flows among the five countries in the core migration system are considerably large (Castles and Miller 2003). The lower threshold with respect to the size of flows defining this system is 6,722 persons per year, which is reasonable given that countries with larger populations tend to produce larger flows in both absolute (Kim and Cohen 2010) and relative (DeWaard and Raymer 2012) terms. At the other extreme, flows among periphery countries are quite small but nonetheless consistent over the period, with an upper bound of 175 persons per year. Despite concerns about the implications of East-West migration in Europe (Bauer and Zimmermann 1999), Kaczmarczyk and Okólski (2005:4) noted that periphery migration does not necessarily spill over the region s boundaries, especially to the West, but to a large extent [is] contained within them. Although these flows are small, their consistency over the period supports this idea and thus supports viewing these flows as a distinct migration system. These systems are more or less geographically discrete, but they are not closed. Castles and Miller (2003), for example, argued that Italy and Spain are relatively 17 The Calinski Index for the three-cluster solution is 6,743.92; the Duda-Hart Index and its corresponding pseudo t-squared ratio are 0.387 and 443.80, respectively. Relative to a four-cluster (or higher-cluster) solution, with values of 4,634.51 on the Calinski Index and 0.335 and 3,252.41 for the Duda-Hart Index and pseudo t-squared ratio, respectively, the stopping rules employed suggest three optimal clusters (Milligan and Cooper 1985; Rabe-Hesketh and Everett 2006).

Migration Systems in Europe 1323 Migration Systems Periphery System Intermediate System Core System Sweden Finland Norway Ireland United Kingdom Spain Portugal Denmark Latvia Lithuania Belarus Netherlands Poland Germany Belgium Czech Republic Luxembourg Ukraine Slovakia Austria France Switzerland Hungary Moldova Croatia Romania Italy Serbia Bosnia & Herzegovina Bulgaria Albania Macedonia Greece Estonia Russia Georgia Armenia Azerbaijan Turkey Cyprus Malta Fig. 5 European migration systems: 2003 2007. Only labels (not shapes) are shown for Malta and Cyprus. Source: Authors construction recent migrant destinations for persons seeking entrance into the core of Europe through what Calavita (2003:347) terms the back door (see also, Zolberg 2006:22). Although our data do not permit examination of these patterns, they do provide an indirect glimpse of a related feature of migration systems: namely, step, return, and circular migration. Typically described within the framework of cumulative causation, these patterns obtain when migration tends to sustain itself (Massey et al. 1998:45). Informed by MST, the two restrictions imposed at the outset of our analysis effectively require that flows between sending and receiving countries in a migration system be consistent over time. To the extent that patterns of step, return, and circular migration require consistency, the countries that make up the three migration systems cited herein are likely candidates for these processes. Despite the exploratory nature of our analysis, the three systems identified represent an important step in the evolution of MST. Identifying migration systems is a hard task, considering the complexity of... economic, social and political

1324 J. DeWaard et al. interactions between sending and receiving countries (Bonifazi et al. 2008:123). Restricting or, more aptly, reorienting one s efforts to the harmonized data themselves represents a viable starting point for empirical assessments of MST toward resolving debates on the existence, quantity, and character of migration systems in Europe (Massey et al. 1998; Salt 1989, 2001; Zlotnik 1992). Determinants of Migration Flows Methodology Several recent efforts have attempted to identify the key determinants of international migration flows (Cohen et al. 2008; Kim and Cohen 2010; Mayda 2005; Pedersen et al. 2008); however, none have connected their work to MST or used harmonized flow data. Using a gravity model approach (Greenwood 1997), we aim to identify the key ties shared by sending and receiving countries that influence the size of flows. Although we consider the relevant push and pull factors of sending and receiving countries, respectively, our focus is with the shared community ties that link countries (van Tubergen et al. 2004:705). Recalling Fawcett s (1989) trichotomy, shared relational ties include historical similarities and intersections of sending and receiving countries with respect to nation-state formation and past colonial relationships. Shared official or national language has also been widely cited as a relevant relational linkage (Kim and Cohen 2010; Mayda 2005). Regulatory ties include geographic isolation, typically measured by country contiguity or shared region (Pedersen et al. 2008), as well as economic and political memberships. Raymer et al. (2011), for example, considered new accession status to the EU as a relevant predictor of flows. Finally, tangible ties are usually taken to mean the volume of trade flows between countries (Pedersen et al. 2008). Thinking more broadly, however, these also include features of relative economic well-being and standards of living in sending and receiving countries (Greenwood and McDowell 1991). Descriptions of the measures used to capture the above linkages are provided in the Appendix table. To take one example, consider the indicator of shared national or colonial origins. We combined two variables to express whether sending and receiving countries were ever the same country or in a colonial relationship for a period of 75 years or more, up to and including the nineteenth century, or for 25 50 years during the twentieth century, thereby capturing such historical features as the Austro-Hungarian Empire and the division of Czechoslovakia in 1993. We also consider the following push and pull factors in sending and receiving countries, respectively: population size, percentage of the population living in urban areas, old-age dependency ratio, change in labor force participation from the prior year, and social expenditures per household head. Additionally, as is typical in gravity models, we include a measure of the geographic distance between sending and receiving countries. Our unit of analysis is the harmonized migration flow between a pair of sending and receiving countries in a single year. Using the natural logarithm of these flows as