Classifier Evaluation and Selection. Review and Overview of Methods

Size: px
Start display at page:

Download "Classifier Evaluation and Selection. Review and Overview of Methods"

Transcription

1 Classifier Evaluation and Selection Review and Overview of Methods

2 Things to consider Ø Interpretation vs. Prediction Ø Model Parsimony vs. Model Error Ø Type of prediction task: Ø Decisions Interested only in resulting classification Ø Rankings Interested in ranking individuals by their true likelihood of an outcome Ø Estimates Interested in predicting probabilities or a continuous outcome accurately

3 Model Fit Statistics Summary Prediction Type Model Fit Statistics Decisions Accuracy/ Misclassification Profit/Loss KS-Statistic Rankings ROC Index (concordance statistic) Gini Coefficient Estimates Average Squared error SBC/Likelihood MAPE R "

4 Confusion Matrix Metrics from Confusion Matrix: 1. Accuracy: Proportion of total predictions that were correct 2. Precision/ Positive Predictive Value: Proportion of predicted positive that were actually positive 3. Negative Predictive Value: Proportion of predicted negative that were actually negative 4. Sensitivity/Recall: Proportion of actual positive cases correctly identified 5. Specificity: Proportion of actual negative cases which are correctly identified

5 Kolmogorov-Smirnov (KS) Statistic 100% 90% 80% 80% of negative observations have predicted probability <48% 70% 60% 50% 40% 30% 20% 10% 0% Cumulative NEG % Cumulative POS % 0% 16% 32% 48% 64% 80% 100% Predicted Probability from Model 25% of positive observations have predicted probability <48%

6 Kolmogorov-Smirnov (KS) Statistic 100% 90% 80% 70% 60% 50% 40% Max Distance: Kolmogorov-Smirnov (KS) Statistic 30% 20% 10% 0% Cumulative NEG % Cumulative POS % 0% 16% 32% 48% 64% 80% 100% Predicted Probability from Model

7

8 ROC Charts Each point on ROC curve corresponds to fraction of cases, ordered by decreasing predicted value. The x,y coordinates assume we predict that fraction of cases positive.

9 ROC Charts For example, this point might represent the 40% of cases with the highest predicted probabilities.

10 ROC Charts 70% of the actual positive outcome cases are captured => True Positive Rate = 0.7

11 ROC Charts ~10% of the actual negative outcome cases are captured => False Positive Rate = 0.1

12 Gini Coefficient Gini = 2*Shaded Area = 2*(AUC-0.5)

13 ROC Charts for Decision Trees p=3/4 p=1/3 p $%&'(( = 1 TPR = 0 FPR = < p $%&'(( < 3 4 TPR = 0.6 FPR = 0.2 p $%&'(( < 1 3 TPR = 1 FPR = 1

14 ROC Charts for Decision Trees

15

16 Response/Gain Charts 100% 90% Cumulative % Responders 80% 70% 60% 50% 40% 30% 20% 10% 0% 0% 18% 36% 54% 72% 90% Percentile of Modeled Values

17 Response/Gain Charts Cumulative % Responders 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% 0% 18% 36% 54% 72% 90% Percentile of Modeled Values Of top 18% of observations by predicted probability, 90% are responders (positive outcomes)

18 Response/Gain Charts Cumulative % Responders 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% 0% 18% 36% 54% 72% 90% Percentile of Modeled Values Overall population response rate is ~27%

19 Lift Chart While it s great to know what percent of responders you should get using the top p% of observations scored by the model, it s even better to know how this compares to random selection. Lift = % Responders from Model % Responders from Random Selection

20 Cumulative Lift At a depth of ~20%, we have a lift of almost If we target the top 20% of customers as scored by our model, we ll get 3.5 times as many responders than we would if we randomly targeted customers. 0% 18% 36% 54% 72% 90%

21

22 Average Squared Error (ASE) M J 1 nl D D yf " GH y GH HKL GKL Ø For class targets, let L be the number of levels in the target. Ø This objective function sets y GH = 1 if observation i takes level j of the target and 0 otherwise. Ø Computes sum of squared error with probabilities.

23 Average Squared Error (ASE) M J 1 nl D D yf " GH y GH HKL GKL Example: Name P(red) P(blue) P(none) Actual JimBob BLUE BillyBob NONE

24 Average Squared Error (ASE) M J 1 nl D D yf " GH y GH HKL GKL Example: Name P(red) P(blue) P(none) Actual JimBob BLUE BillyBob NONE " " " " " " 2 3

25 Average Squared Error (ASE) M J 1 nl D D yf " GH y GH HKL GKL Example: Name P(red) P(blue) P(none) Actual JimBob BLUE BillyBob NONE " " " " " " 2 3

26

27 Decisions: Accounting for Profit/Loss (or other external evaluation metrics)

28 Decisions in SAS EM Ø Enter information about profit/loss into the decisions on a dataset panel Ø Enterprise miner calculates the most profitable or least costly decision for each obs. Ø Click Build when first opening prompt, then open decisions tab.

29 Decisions in SAS EM Ø Decision and Cost Matrices do not affect: Ø Estimating parameters in the regression node Ø Learning weights in the neural network node Ø Growing decision trees Ø Fit statistics Ø Residuals, error functions, misclassification rate Ø Decision and Cost Matrices do affect: Ø Choice of models in regression node Ø Pruning trees in decision tree node

30

31 Undersampling/ Oversampling and Prior Probabilities Can be accounted for automatically in SAS EM

32 Undersampling and Prior Probabilities Ø Say you have a rare event as target (<10% of data) Ø Fraud Ø Catastrophic failure Ø 10%+ single day change in value of stock market index Ø May have trouble modelling because a model is accurate for classifying everything as nonevent! Ø Potential Solution: Create a biased sample Ø Under-represent the common events in the training data. Ø Keep all rare events and only an equal number of common events

33 Undersampling and Prior Probabilities Ø Models provide posterior probabilities for events. Ø The accuracy of the posterior probabilities rely on a representative sample. Ø If we bias our sample, must adjust the posterior probabilities to account for this.

34 Undersampling and Prior Probabilities Ø Let l = l L, l ",, l J be the levels of the target variable Ø Let i = 1,2,, n index the observations in the data Ø Let OldPost(i, l) be the posterior probability from the model on oversampled data Ø Let OldPrior(l) be the proportion of target level in the oversampled data Ø Let Prior(l) be the correct proportion of target level in true population NewPost i, l = Prior(l) OldPost(i, l) OldPrior(l) J Prior(l H ) HKL OldPost(i, l H ) OldPrior(l H )

35 Ø Priors are also adjusted in the decisions on a dataset panel. Entering Priors into SAS EM Ø Click Build when first opening the prompt, then click priors tab.

36 Undersampling and Prior Probabilities Ø In SAS EM, accounting for priors has no effect on: Ø Estimating parameters in logistic regression Ø Learning weights in Neural Network Ø Fit statistics like misclassification rate and average squared error Ø Growing decision trees Ø Priors do affect: Ø Pruning decision trees Ø Net Effects: Ø Increasing a prior probability increases the posterior probability Ø Decreasing a prior decreases the posterior probability Ø Changing prior will have more noticeable effect if the original posterior is near 0.5 than if it is near 0 or 1.

37 Oversampling Ø Instead of undersampling the common events, we can replicate the rare events in our data. Ø We have to be careful to do this after the training/validation split so that we don t have the same observation in both training and validation set. Ø OR, use a hybrid technique like SMOTE (Chawla, 2002) that creates new data points like the rare events (not exact replicates) as well as undersamples the common events

38

39 Using the Model Comparison Node in SAS EM

40

41 Cutoff Node Ø Cutoff node used to specify a cutoff probability other than 0.5 when you have decision factors. Ø Currently, the model comparison node does not use the cutoff probability from the cutoff node. Ø Most of the assessment statistics are not affected anyway, aside from misclassification rate.

42 Self Study: Using Enterprise Miner to Determine a Custom Probability Cutoff Profit/Loss or other Decisions

43 Average Profit on Pred_Yes Ø EM can use a decision matrix to compute the average profit per observation. Ø This calculation assumes that you have some level of profit/loss for every person in the data and want to average over every person in the data. Ø What if you only stand to profit/lose from those observations which you predict positive? i.e. nothing ventured, nothing gained (or lost). Ø Then you d want to take the profit from the model and average it only over those who were predicted positive. Ø EM cannot use a decision matrix to compute an average profit per positive prediction. Ø But we can do it quite easily with the program editor and dataset explorer!

44 Open Results from Cutoff Node

45 Open Model Diagnostics Table

46 Save Model Diagnostics Table

47 Save Model Diagnostics Table

48 Open Program Editor

49 Write Program to Calculate Avg. Profit

50 Run Program

51 Check Log

52 Open Explorer

53 Navigate to Dataset and Open

54 Sort by Average Profit Find largest for validation data

Random Forests. Gradient Boosting. and. Bagging and Boosting

Random Forests. Gradient Boosting. and. Bagging and Boosting Random Forests and Gradient Boosting Bagging and Boosting The Bootstrap Sample and Bagging Simple ideas to improve any model via ensemble Bootstrap Samples Ø Random samples of your data with replacement

More information

Support Vector Machines

Support Vector Machines Support Vector Machines Linearly Separable Data SVM: Simple Linear Separator hyperplane Which Simple Linear Separator? Classifier Margin Objective #1: Maximize Margin MARGIN MARGIN How s this look? MARGIN

More information

JUDGE, JURY AND CLASSIFIER

JUDGE, JURY AND CLASSIFIER JUDGE, JURY AND CLASSIFIER An Introduction to Trees 15.071x The Analytics Edge The American Legal System The legal system of the United States operates at the state level and at the federal level Federal

More information

Practice Questions for Exam #2

Practice Questions for Exam #2 Fall 2007 Page 1 Practice Questions for Exam #2 1. Suppose that we have collected a stratified random sample of 1,000 Hispanic adults and 1,000 non-hispanic adults. These respondents are asked whether

More information

Statistical Analysis of Corruption Perception Index across countries

Statistical Analysis of Corruption Perception Index across countries Statistical Analysis of Corruption Perception Index across countries AMDA Project Summary Report (Under the guidance of Prof Malay Bhattacharya) Group 3 Anit Suri 1511007 Avishek Biswas 1511013 Diwakar

More information

Supplementary Materials A: Figures for All 7 Surveys Figure S1-A: Distribution of Predicted Probabilities of Voting in Primary Elections

Supplementary Materials A: Figures for All 7 Surveys Figure S1-A: Distribution of Predicted Probabilities of Voting in Primary Elections Supplementary Materials (Online), Supplementary Materials A: Figures for All 7 Surveys Figure S-A: Distribution of Predicted Probabilities of Voting in Primary Elections (continued on next page) UT Republican

More information

SIMPLE LINEAR REGRESSION OF CPS DATA

SIMPLE LINEAR REGRESSION OF CPS DATA SIMPLE LINEAR REGRESSION OF CPS DATA Using the 1995 CPS data, hourly wages are regressed against years of education. The regression output in Table 4.1 indicates that there are 1003 persons in the CPS

More information

Overview. Ø Neural Networks are considered black-box models Ø They are complex and do not provide much insight into variable relationships

Overview. Ø Neural Networks are considered black-box models Ø They are complex and do not provide much insight into variable relationships Neural Networks Overview Ø s are considered black-box models Ø They are complex and do not provide much insight into variable relationships Ø They have the potential to model very complicated patterns

More information

Remittances and Poverty. in Guatemala* Richard H. Adams, Jr. Development Research Group (DECRG) MSN MC World Bank.

Remittances and Poverty. in Guatemala* Richard H. Adams, Jr. Development Research Group (DECRG) MSN MC World Bank. Public Disclosure Authorized Public Disclosure Authorized Public Disclosure Authorized Public Disclosure Authorized Remittances and Poverty in Guatemala* Richard H. Adams, Jr. Development Research Group

More information

Response to the Report Evaluation of Edison/Mitofsky Election System

Response to the Report Evaluation of Edison/Mitofsky Election System US Count Votes' National Election Data Archive Project Response to the Report Evaluation of Edison/Mitofsky Election System 2004 http://exit-poll.net/election-night/evaluationjan192005.pdf Executive Summary

More information

Analysis of Categorical Data from the California Department of Corrections

Analysis of Categorical Data from the California Department of Corrections Lab 5 Analysis of Categorical Data from the California Department of Corrections About the Data The dataset you ll examine is from a study by the California Department of Corrections (CDC) on the effectiveness

More information

List of Tables and Appendices

List of Tables and Appendices Abstract Oregonians sentenced for felony convictions and released from jail or prison in 2005 and 2006 were evaluated for revocation risk. Those released from jail, from prison, and those served through

More information

Lab 3: Logistic regression models

Lab 3: Logistic regression models Lab 3: Logistic regression models In this lab, we will apply logistic regression models to United States (US) presidential election data sets. The main purpose is to predict the outcomes of presidential

More information

Cluster Analysis. (see also: Segmentation)

Cluster Analysis. (see also: Segmentation) Cluster Analysis (see also: Segmentation) Cluster Analysis Ø Unsupervised: no target variable for training Ø Partition the data into groups (clusters) so that: Ø Observations within a cluster are similar

More information

Differences Lead to Differences: Diversity and Income Inequality Across Countries

Differences Lead to Differences: Diversity and Income Inequality Across Countries Illinois State University ISU ReD: Research and edata Master's Theses - Economics Economics 6-2008 Differences Lead to Differences: Diversity and Income Inequality Across Countries Michael Hotard Illinois

More information

Identifying Factors in Congressional Bill Success

Identifying Factors in Congressional Bill Success Identifying Factors in Congressional Bill Success CS224w Final Report Travis Gingerich, Montana Scher, Neeral Dodhia Introduction During an era of government where Congress has been criticized repeatedly

More information

Understanding factors that influence L1-visa outcomes in US

Understanding factors that influence L1-visa outcomes in US Understanding factors that influence L1-visa outcomes in US By Nihar Dalmia, Meghana Murthy and Nianthrini Vivekanandan Link to online course gallery : https://www.ischool.berkeley.edu/projects/2017/understanding-factors-influence-l1-work

More information

Analyzing the Power Consumption Behavior of a Large Scale Data Center

Analyzing the Power Consumption Behavior of a Large Scale Data Center Analyzing the Power Consumption Behavior of a Large Scale Data Center KASHIF NIZAM KHAN, AALTO UNIVERSITY, FINLAND. SANJA S., TAPIO N., JUKKA K. N., SEBASTIAN V. A. & OLLI-PEKKA L. 1 Outline Ø Motivation

More information

Classification of posts on Reddit

Classification of posts on Reddit Classification of posts on Reddit Pooja Naik Graduate Student CSE Dept UCSD, CA, USA panaik@ucsd.edu Sachin A S Graduate Student CSE Dept UCSD, CA, USA sachinas@ucsd.edu Vincent Kuri Graduate Student CSE

More information

Learning from Small Subsamples without Cherry Picking: The Case of Non-Citizen Registration and Voting

Learning from Small Subsamples without Cherry Picking: The Case of Non-Citizen Registration and Voting Learning from Small Subsamples without Cherry Picking: The Case of Non-Citizen Registration and Voting Jesse Richman Old Dominion University jrichman@odu.edu David C. Earnest Old Dominion University, and

More information

Preliminary Effects of Oversampling on the National Crime Victimization Survey

Preliminary Effects of Oversampling on the National Crime Victimization Survey Preliminary Effects of Oversampling on the National Crime Victimization Survey Katrina Washington, Barbara Blass and Karen King U.S. Census Bureau, Washington D.C. 20233 Note: This report is released to

More information

Corruption and business procedures: an empirical investigation

Corruption and business procedures: an empirical investigation Corruption and business procedures: an empirical investigation S. Roy*, Department of Economics, High Point University, High Point, NC - 27262, USA. Email: sroy@highpoint.edu Abstract We implement OLS,

More information

Psychological Factors

Psychological Factors Psychological Factors Consumer Decision Making e.g., Impulsiveness, openness e.g., Buying choices Personalization 1. 2. 3. Increase click-through rate predictions Enhance recommendation quality Improve

More information

Volume 35, Issue 1. An examination of the effect of immigration on income inequality: A Gini index approach

Volume 35, Issue 1. An examination of the effect of immigration on income inequality: A Gini index approach Volume 35, Issue 1 An examination of the effect of immigration on income inequality: A Gini index approach Brian Hibbs Indiana University South Bend Gihoon Hong Indiana University South Bend Abstract This

More information

Migration and Tourism Flows to New Zealand

Migration and Tourism Flows to New Zealand Migration and Tourism Flows to New Zealand Murat Genç University of Otago, Dunedin, New Zealand Email address for correspondence: murat.genc@otago.ac.nz 30 April 2010 PRELIMINARY WORK IN PROGRESS NOT FOR

More information

Report for the Associated Press. November 2015 Election Studies in Kentucky and Mississippi. Randall K. Thomas, Frances M. Barlas, Linda McPetrie,

Report for the Associated Press. November 2015 Election Studies in Kentucky and Mississippi. Randall K. Thomas, Frances M. Barlas, Linda McPetrie, Report for the Associated Press November 2015 Election Studies in Kentucky and Mississippi Randall K. Thomas, Frances M. Barlas, Linda McPetrie, Annie Weber, Mansour Fahimi, & Robert Benford GfK Custom

More information

Supplementary Materials for Strategic Abstention in Proportional Representation Systems (Evidence from Multiple Countries)

Supplementary Materials for Strategic Abstention in Proportional Representation Systems (Evidence from Multiple Countries) Supplementary Materials for Strategic Abstention in Proportional Representation Systems (Evidence from Multiple Countries) Guillem Riambau July 15, 2018 1 1 Construction of variables and descriptive statistics.

More information

ANNUAL SURVEY REPORT: REGIONAL OVERVIEW

ANNUAL SURVEY REPORT: REGIONAL OVERVIEW ANNUAL SURVEY REPORT: REGIONAL OVERVIEW 2nd Wave (Spring 2017) OPEN Neighbourhood Communicating for a stronger partnership: connecting with citizens across the Eastern Neighbourhood June 2017 TABLE OF

More information

Educated Preferences: Explaining Attitudes Toward Immigration In Europe. Jens Hainmueller and Michael J. Hiscox. Last revised: December 2005

Educated Preferences: Explaining Attitudes Toward Immigration In Europe. Jens Hainmueller and Michael J. Hiscox. Last revised: December 2005 Educated Preferences: Explaining Attitudes Toward Immigration In Jens Hainmueller and Michael J. Hiscox Last revised: December 2005 Supplement III: Detailed Results for Different Cutoff points of the Dependent

More information

John Parman Introduction. Trevon Logan. William & Mary. Ohio State University. Measuring Historical Residential Segregation. Trevon Logan.

John Parman Introduction. Trevon Logan. William & Mary. Ohio State University. Measuring Historical Residential Segregation. Trevon Logan. Ohio State University William & Mary Across Over and its NAACP March for Open Housing, Detroit, 1963 Motivation There is a long history of racial discrimination in the United States Tied in with this is

More information

EXAMINATION 3 VERSION B "Wage Structure, Mobility, and Discrimination" April 19, 2018

EXAMINATION 3 VERSION B Wage Structure, Mobility, and Discrimination April 19, 2018 William M. Boal Signature: Printed name: EXAMINATION 3 VERSION B "Wage Structure, Mobility, and Discrimination" April 19, 2018 INSTRUCTIONS: This exam is closed-book, closed-notes. Simple calculators are

More information

Skill Classification Does Matter: Estimating the Relationship Between Trade Flows and Wage Inequality

Skill Classification Does Matter: Estimating the Relationship Between Trade Flows and Wage Inequality Skill Classification Does Matter: Estimating the Relationship Between Trade Flows and Wage Inequality By Kristin Forbes* M.I.T.-Sloan School of Management and NBER First version: April 1998 This version:

More information

Kakuma Refugee Camp: Household Vulnerability Study

Kakuma Refugee Camp: Household Vulnerability Study Kakuma Refugee Camp: Household Vulnerability Study Dr. Helen Guyatt Flavia Della Rosa Jenny Spencer Dr. Eric Nussbaumer Perry Muthoka Mehari Belachew Acknowledgements Commissioned by WFP, UNHCR and partners

More information

Introduction to Path Analysis: Multivariate Regression

Introduction to Path Analysis: Multivariate Regression Introduction to Path Analysis: Multivariate Regression EPSY 905: Multivariate Analysis Spring 2016 Lecture #7 March 9, 2016 EPSY 905: Multivariate Regression via Path Analysis Today s Lecture Multivariate

More information

WP 2015: 9. Education and electoral participation: Reported versus actual voting behaviour. Ivar Kolstad and Arne Wiig VOTE

WP 2015: 9. Education and electoral participation: Reported versus actual voting behaviour. Ivar Kolstad and Arne Wiig VOTE WP 2015: 9 Reported versus actual voting behaviour Ivar Kolstad and Arne Wiig VOTE Chr. Michelsen Institute (CMI) is an independent, non-profit research institution and a major international centre in

More information

Happiness and economic freedom: Are they related?

Happiness and economic freedom: Are they related? Happiness and economic freedom: Are they related? Ilkay Yilmaz 1,a, and Mehmet Nasih Tag 2 1 Mersin University, Department of Economics, Mersin University, 33342 Mersin, Turkey 2 Mersin University, Department

More information

Analysis of the Reputation System and User Contributions on a Question Answering Website: StackOverflow

Analysis of the Reputation System and User Contributions on a Question Answering Website: StackOverflow Analysis of the Reputation System and User Contributions on a Question Answering Website: StackOverflow Dana Movshovitz-Attias Yair Movshovitz-Attias Peter Steenkiste Christos Faloutsos August 27, 2013

More information

Online Appendix for The Contribution of National Income Inequality to Regional Economic Divergence

Online Appendix for The Contribution of National Income Inequality to Regional Economic Divergence Online Appendix for The Contribution of National Income Inequality to Regional Economic Divergence APPENDIX 1: Trends in Regional Divergence Measured Using BEA Data on Commuting Zone Per Capita Personal

More information

TECHNICAL APPENDIX. Immigrant Earnings Growth: Selection Bias or Real Progress. Garnett Picot and Patrizio Piraino*

TECHNICAL APPENDIX. Immigrant Earnings Growth: Selection Bias or Real Progress. Garnett Picot and Patrizio Piraino* TECHNICAL APPENDIX Immigrant Earnings Growth: Selection Bias or Real Progress Garnett Picot and Patrizio Piraino* * Picot, Statistics Canada, Analytical Studies Branch, dgpicot@reogers.com. Piraino, School

More information

The Impact of Unionization on the Wage of Hispanic Workers. Cinzia Rienzo and Carlos Vargas-Silva * This Version, December 2014.

The Impact of Unionization on the Wage of Hispanic Workers. Cinzia Rienzo and Carlos Vargas-Silva * This Version, December 2014. The Impact of Unionization on the Wage of Hispanic Workers Cinzia Rienzo and Carlos Vargas-Silva * This Version, December 2014 Abstract This paper explores the role of unionization on the wages of Hispanic

More information

The Cook Political Report / LSU Manship School Midterm Election Poll

The Cook Political Report / LSU Manship School Midterm Election Poll The Cook Political Report / LSU Manship School Midterm Election Poll The Cook Political Report-LSU Manship School poll, a national survey with an oversample of voters in the most competitive U.S. House

More information

The Mexican Migration Project weights 1

The Mexican Migration Project weights 1 The Mexican Migration Project weights 1 Introduction The Mexican Migration Project (MMP) gathers data in places of various sizes, carrying out its survey in large metropolitan areas, medium-size cities,

More information

The Impact of Unionization on the Wage of Hispanic Workers. Cinzia Rienzo and Carlos Vargas-Silva * This Version, May 2015.

The Impact of Unionization on the Wage of Hispanic Workers. Cinzia Rienzo and Carlos Vargas-Silva * This Version, May 2015. The Impact of Unionization on the Wage of Hispanic Workers Cinzia Rienzo and Carlos Vargas-Silva * This Version, May 2015 Abstract This paper explores the role of unionization on the wages of Hispanic

More information

Issue Importance and Performance Voting. *** Soumis à Political Behavior ***

Issue Importance and Performance Voting. *** Soumis à Political Behavior *** Issue Importance and Performance Voting Patrick Fournier, André Blais, Richard Nadeau, Elisabeth Gidengil, and Neil Nevitte *** Soumis à Political Behavior *** Issue importance mediates the impact of public

More information

CALTECH/MIT VOTING TECHNOLOGY PROJECT A

CALTECH/MIT VOTING TECHNOLOGY PROJECT A CALTECH/MIT VOTING TECHNOLOGY PROJECT A multi-disciplinary, collaborative project of the California Institute of Technology Pasadena, California 91125 and the Massachusetts Institute of Technology Cambridge,

More information

Violent Conflict and Inequality

Violent Conflict and Inequality Violent Conflict and Inequality work in progress Cagatay Bircan University of Michigan Tilman Brück DIW Berlin, Humboldt University Berlin, IZA and Households in Conflict Network Marc Vothknecht DIW Berlin

More information

VoteCastr methodology

VoteCastr methodology VoteCastr methodology Introduction Going into Election Day, we will have a fairly good idea of which candidate would win each state if everyone voted. However, not everyone votes. The levels of enthusiasm

More information

Migrant Wages, Human Capital Accumulation and Return Migration

Migrant Wages, Human Capital Accumulation and Return Migration Migrant Wages, Human Capital Accumulation and Return Migration Jérôme Adda Christian Dustmann Joseph-Simon Görlach February 14, 2014 PRELIMINARY and VERY INCOMPLETE Abstract This paper analyses the wage

More information

Attenuation Bias in Measuring the Wage Impact of Immigration. Abdurrahman Aydemir and George J. Borjas Statistics Canada and Harvard University

Attenuation Bias in Measuring the Wage Impact of Immigration. Abdurrahman Aydemir and George J. Borjas Statistics Canada and Harvard University Attenuation Bias in Measuring the Wage Impact of Immigration Abdurrahman Aydemir and George J. Borjas Statistics Canada and Harvard University November 2006 1 Attenuation Bias in Measuring the Wage Impact

More information

Research and strategy for the land community.

Research and strategy for the land community. Research and strategy for the land community. To: Northeastern Minnesotans for Wilderness From: Sonia Wang, Spencer Phillips Date: 2/27/2018 Subject: Full results from the review of comments on the proposed

More information

Magruder s American Government

Magruder s American Government Presentation Pro Magruder s American Government C H A P T E R 8 Mass Media and Public Opinion 200 by Prentice Hall, Inc. S E C T I O N The Formation of Public Opinion 2 3 Chapter 8, Section What is Public

More information

Appendix: Uncovering Patterns Among Latent Variables: Human Rights and De Facto Judicial Independence

Appendix: Uncovering Patterns Among Latent Variables: Human Rights and De Facto Judicial Independence Appendix: Uncovering Patterns Among Latent Variables: Human Rights and De Facto Judicial Independence Charles D. Crabtree Christopher J. Fariss August 12, 2015 CONTENTS A Variable descriptions 3 B Correlation

More information

Probabilistic Latent Semantic Analysis Hofmann (1999)

Probabilistic Latent Semantic Analysis Hofmann (1999) Probabilistic Latent Semantic Analysis Hofmann (1999) Presenter: Mercè Vintró Ricart February 8, 2016 Outline Background Topic models: What are they? Why do we use them? Latent Semantic Analysis (LSA)

More information

Supporting Information for Do Perceptions of Ballot Secrecy Influence Turnout? Results from a Field Experiment

Supporting Information for Do Perceptions of Ballot Secrecy Influence Turnout? Results from a Field Experiment Supporting Information for Do Perceptions of Ballot Secrecy Influence Turnout? Results from a Field Experiment Alan S. Gerber Yale University Professor Department of Political Science Institution for Social

More information

Towards Tackling Hate Online Automatically

Towards Tackling Hate Online Automatically Towards Tackling Hate Online Automatically Nikola Ljubešić 1, Darja Fišer 2,1, Tomaž Erjavec 1 1 Department of Knowledge Technologies, Jožef Stefan Institute, Ljubljana 2 Department of Translation, University

More information

School Choice & Segregation

School Choice & Segregation School Choice & Segregation by Martin Söderström a and Roope Uusitalo b May 20, 2004 Preliminary draft Abstract This paper studies the effects of school choice on segregation. Segregation is measured along

More information

Is Corruption Anti Labor?

Is Corruption Anti Labor? Is Corruption Anti Labor? Suryadipta Roy Lawrence University Department of Economics PO Box- 599, Appleton, WI- 54911. Abstract This paper investigates the effect of corruption on trade openness in low-income

More information

Part 2: Risk Analysis and Scenario-Based Planning

Part 2: Risk Analysis and Scenario-Based Planning Preparedness Package for Refugee Emergencies 1 Part 2: Risk Analysis and Scenario-Based Planning Review of the preparedness process Refugee Emergency CP Process 2-4 Weeks Scenario-Based Contingency Plan

More information

Children's Referendum Poll

Children's Referendum Poll Children's Referendum Poll 18 th Oct 2012 Prepared for the National Youth Council of Ireland Job No: 52012 (1) Methodology and Weighting 1003 interviews were conducted between the 15 th 17 th October among

More information

ARTNeT Trade Economists Conference Trade in the Asian century - delivering on the promise of economic prosperity rd September 2014

ARTNeT Trade Economists Conference Trade in the Asian century - delivering on the promise of economic prosperity rd September 2014 ASIA-PACIFIC RESEARCH AND TRAINING NETWORK ON TRADE ARTNeT CONFERENCE ARTNeT Trade Economists Conference Trade in the Asian century - delivering on the promise of economic prosperity 22-23 rd September

More information

Labour Market Success of Immigrants to Australia: An analysis of an Index of Labour Market Success

Labour Market Success of Immigrants to Australia: An analysis of an Index of Labour Market Success Labour Market Success of Immigrants to Australia: An analysis of an Index of Labour Market Success Laurence Lester NILS 17 August 2007 Macquarie University Research Seminar Series Plan Introduction The

More information

Appendix: Supplementary Tables for Legislating Stock Prices

Appendix: Supplementary Tables for Legislating Stock Prices Appendix: Supplementary Tables for Legislating Stock Prices In this Appendix we describe in more detail the method and data cut-offs we use to: i.) classify bills into industries (as in Cohen and Malloy

More information

Online Appendix: The Effect of Education on Civic and Political Engagement in Non-Consolidated Democracies: Evidence from Nigeria

Online Appendix: The Effect of Education on Civic and Political Engagement in Non-Consolidated Democracies: Evidence from Nigeria Online Appendix: The Effect of Education on Civic and Political Engagement in Non-Consolidated Democracies: Evidence from Nigeria Horacio Larreguy John Marshall May 2016 1 Missionary schools Figure A1:

More information

Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner. Abstract

Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner. Abstract Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner Abstract For our project, we analyze data from US Congress voting records, a dataset that consists

More information

! = ( tapping time ).

! = ( tapping time ). AP Statistics Name: Per: Date: 3. Least- Squares Regression p164 168 Ø What is the general form of a regression equation? What is the difference between y and ŷ? Example: Tapping on cans Don t you hate

More information

Supplementary Material for Preventing Civil War: How the potential for international intervention can deter conflict onset.

Supplementary Material for Preventing Civil War: How the potential for international intervention can deter conflict onset. Supplementary Material for Preventing Civil War: How the potential for international intervention can deter conflict onset. World Politics, vol. 68, no. 2, April 2016.* David E. Cunningham University of

More information

Identity Theft. What does a victim look like?

Identity Theft. What does a victim look like? Identity Theft What does a victim look like? Mehmet Hondur Benjama Kounthongkul Patcharaporn Makarasara Brenda Martineau Sophie Shuklin http://www.youtube.com/watch?v=0cfo7prezya Outline Project Goals/Research

More information

Deep Learning and Visualization of Election Data

Deep Learning and Visualization of Election Data Deep Learning and Visualization of Election Data Garcia, Jorge A. New Mexico State University Tao, Ng Ching City University of Hong Kong Betancourt, Frank University of Tennessee, Knoxville Wong, Kwai

More information

RBS SAMPLING FOR EFFICIENT AND ACCURATE TARGETING OF TRUE VOTERS

RBS SAMPLING FOR EFFICIENT AND ACCURATE TARGETING OF TRUE VOTERS Dish RBS SAMPLING FOR EFFICIENT AND ACCURATE TARGETING OF TRUE VOTERS Comcast Patrick Ruffini May 19, 2017 Netflix 1 HOW CAN WE USE VOTER FILES FOR ELECTION SURVEYS? Research Synthesis TRADITIONAL LIKELY

More information

To What Extent Are Canadians Exposed to Low-Income?

To What Extent Are Canadians Exposed to Low-Income? To What Extent Are Canadians Exposed to Low-Income? by René Morissette* and Marie Drolet** No. 146 11F0019MPE No. 146 ISSN: 1200-5223 ISBN: 0-660-18061-8 Price: $5.00 per issue, $25.00 annually Business

More information

Gender preference and age at arrival among Asian immigrant women to the US

Gender preference and age at arrival among Asian immigrant women to the US Gender preference and age at arrival among Asian immigrant women to the US Ben Ost a and Eva Dziadula b a Department of Economics, University of Illinois at Chicago, 601 South Morgan UH718 M/C144 Chicago,

More information

Do two parties represent the US? Clustering analysis of US public ideology survey

Do two parties represent the US? Clustering analysis of US public ideology survey Do two parties represent the US? Clustering analysis of US public ideology survey Louisa Lee 1 and Siyu Zhang 2, 3 Advised by: Vicky Chuqiao Yang 1 1 Department of Engineering Sciences and Applied Mathematics,

More information

Dimension Reduction. Why and How

Dimension Reduction. Why and How Dimension Reduction Why and How The Curse of Dimensionality As the dimensionality (i.e. number of variables) of a space grows, data points become so spread out that the ideas of distance and density become

More information

Supporting Information Political Quid Pro Quo Agreements: An Experimental Study

Supporting Information Political Quid Pro Quo Agreements: An Experimental Study Supporting Information Political Quid Pro Quo Agreements: An Experimental Study Jens Großer Florida State University and IAS, Princeton Ernesto Reuben Columbia University and IZA Agnieszka Tymula New York

More information

Errata Summary. Comparison of the Original Results with the New Results

Errata Summary. Comparison of the Original Results with the New Results Errata for Karim and Beardsley (2016), Explaining Sexual Exploitation and Abuse in Peacekeeping Missions: The Role of Female Peacekeepers and Gender Equality in Contributing Countries, Journal of Peace

More information

A comparative analysis of subreddit recommenders for Reddit

A comparative analysis of subreddit recommenders for Reddit A comparative analysis of subreddit recommenders for Reddit Jay Baxter Massachusetts Institute of Technology jbaxter@mit.edu Abstract Reddit has become a very popular social news website, but even though

More information

IV. Labour Market Institutions and Wage Inequality

IV. Labour Market Institutions and Wage Inequality Fortin Econ 56 Lecture 4B IV. Labour Market Institutions and Wage Inequality 5. Decomposition Methodologies. Measuring the extent of inequality 2. Links to the Classic Analysis of Variance (ANOVA) Fortin

More information

CHAPTER 5 SOCIAL INCLUSION LEVEL

CHAPTER 5 SOCIAL INCLUSION LEVEL CHAPTER 5 SOCIAL INCLUSION LEVEL Social Inclusion means involving everyone in the society, making sure all have equal opportunities in work or to take part in social activities. It means that no one should

More information

Guns and Butter in U.S. Presidential Elections

Guns and Butter in U.S. Presidential Elections Guns and Butter in U.S. Presidential Elections by Stephen E. Haynes and Joe A. Stone September 20, 2004 Working Paper No. 91 Department of Economics, University of Oregon Abstract: Previous models of the

More information

Comparison of the Psychometric Properties of Several Computer-Based Test Designs for. Credentialing Exams

Comparison of the Psychometric Properties of Several Computer-Based Test Designs for. Credentialing Exams CBT DESIGNS FOR CREDENTIALING 1 Running head: CBT DESIGNS FOR CREDENTIALING Comparison of the Psychometric Properties of Several Computer-Based Test Designs for Credentialing Exams Michael Jodoin, April

More information

Voter Turnout, Income Inequality, and Redistribution. Henning Finseraas PhD student Norwegian Social Research

Voter Turnout, Income Inequality, and Redistribution. Henning Finseraas PhD student Norwegian Social Research Voter Turnout, Income Inequality, and Redistribution Henning Finseraas PhD student Norwegian Social Research hfi@nova.no Introduction Motivation Robin Hood paradox No robust effect of voter turnout on

More information

Is there a Strategic Selection Bias in Roll Call Votes. in the European Parliament?

Is there a Strategic Selection Bias in Roll Call Votes. in the European Parliament? Is there a Strategic Selection Bias in Roll Call Votes in the European Parliament? Revised. 22 July 2014 Simon Hix London School of Economics and Political Science Abdul Noury New York University Gerard

More information

1/12/12. Introduction-cont Pattern classification. Behavioral vs Physical Traits. Announcements

1/12/12. Introduction-cont Pattern classification. Behavioral vs Physical Traits. Announcements Announcements Introduction-cont Pattern classification Biometrics CSE 190 Lecture 2 Sign up for the course. Web page is up: http://www.cs.ucsd.edu/classes/wi12/ cse190-c/ HW0 posted. Intro to Matlab How

More information

The transition of corruption: From poverty to honesty

The transition of corruption: From poverty to honesty February 26 th 2009 Kiel and Aarhus The transition of corruption: From poverty to honesty Erich Gundlach a, *, Martin Paldam b,1 a Kiel Institute for the World Economy, P.O. Box 4309, 24100 Kiel, Germany

More information

Chapter. Estimating the Value of a Parameter Using Confidence Intervals Pearson Prentice Hall. All rights reserved

Chapter. Estimating the Value of a Parameter Using Confidence Intervals Pearson Prentice Hall. All rights reserved Chapter 9 Estimating the Value of a Parameter Using Confidence Intervals 2010 Pearson Prentice Hall. All rights reserved Section 9.1 The Logic in Constructing Confidence Intervals for a Population Mean

More information

UTS:IPPG Project Team. Project Director: Associate Professor Roberta Ryan, Director IPPG. Project Manager: Catherine Hastings, Research Officer

UTS:IPPG Project Team. Project Director: Associate Professor Roberta Ryan, Director IPPG. Project Manager: Catherine Hastings, Research Officer IPPG Project Team Project Director: Associate Professor Roberta Ryan, Director IPPG Project Manager: Catherine Hastings, Research Officer Research Assistance: Theresa Alvarez, Research Assistant Acknowledgements

More information

A Profile of the Gauteng Province: Demographics, Poverty, Income, Inequality and Unemployment from 2000 till 2007

A Profile of the Gauteng Province: Demographics, Poverty, Income, Inequality and Unemployment from 2000 till 2007 Background Paper Series Background Paper 2009:1(7) A Profile of the Gauteng Province: Demographics, Poverty, Income, Inequality and Unemployment from 2000 till 2007 Elsenburg February 2009 Overview The

More information

Public Opinions towards Gun Control vs. Gun Ownership. Society today is witnessing a major increase in violent crimes involving guns.

Public Opinions towards Gun Control vs. Gun Ownership. Society today is witnessing a major increase in violent crimes involving guns. 1 May 5, 2016 Public Opinions towards Gun Control vs. Gun Ownership Society today is witnessing a major increase in violent crimes involving guns. From mass shootings to gang violence, almost all of the

More information

Europeans support a proportional allocation of asylum seekers

Europeans support a proportional allocation of asylum seekers In the format provided by the authors and unedited. SUPPLEMENTARY INFORMATION VOLUME: 1 ARTICLE NUMBER: 0133 Europeans support a proportional allocation of asylum seekers Kirk Bansak, 1,2 Jens Hainmueller,

More information

Online Appendix 1: Treatment Stimuli

Online Appendix 1: Treatment Stimuli Online Appendix 1: Treatment Stimuli Polarized Stimulus: 1 Electorate as Divided as Ever by Jefferson Graham (USA Today) In the aftermath of the 2012 presidential election, interviews with voters at a

More information

Corruption and quality of public institutions: evidence from Generalized Method of Moment

Corruption and quality of public institutions: evidence from Generalized Method of Moment Document de travail de la série Etudes et Documents E 2008.13 Corruption and quality of public institutions: evidence from Generalized Method of Moment Gbewopo Attila 1 University Clermont I, CERDI-CNRS

More information

Intersections of political and economic relations: a network study

Intersections of political and economic relations: a network study Procedia Computer Science Volume 66, 2015, Pages 239 246 YSC 2015. 4th International Young Scientists Conference on Computational Science Intersections of political and economic relations: a network study

More information

Appendix to Non-Parametric Unfolding of Binary Choice Data Keith T. Poole Graduate School of Industrial Administration Carnegie-Mellon University

Appendix to Non-Parametric Unfolding of Binary Choice Data Keith T. Poole Graduate School of Industrial Administration Carnegie-Mellon University Appendix to Non-Parametric Unfolding of Binary Choice Data Keith T. Poole Graduate School of Industrial Administration Carnegie-Mellon University 7 July 1999 This appendix is a supplement to Non-Parametric

More information

Case Study: Get out the Vote

Case Study: Get out the Vote Case Study: Get out the Vote Do Phone Calls to Encourage Voting Work? Why Randomize? This case study is based on Comparing Experimental and Matching Methods Using a Large-Scale Field Experiment on Voter

More information

GENDER EQUALITY IN THE LABOUR MARKET AND FOREIGN DIRECT INVESTMENT

GENDER EQUALITY IN THE LABOUR MARKET AND FOREIGN DIRECT INVESTMENT THE STUDENT ECONOMIC REVIEWVOL. XXIX GENDER EQUALITY IN THE LABOUR MARKET AND FOREIGN DIRECT INVESTMENT CIÁN MC LEOD Senior Sophister With Southeast Asia attracting more foreign direct investment than

More information

Raymundo Miguel Campos-Vázquez. Center for Economic Studies, El Colegio de México, and consultant to the OECD. and. José Antonio Rodríguez-López

Raymundo Miguel Campos-Vázquez. Center for Economic Studies, El Colegio de México, and consultant to the OECD. and. José Antonio Rodríguez-López INTERNATIONAL COLLABORATIVE INITIATIVE FOR TRADE AND EMPLOYMENT (ICITE) ICITE REGIONAL CONFERENCE, SANTIAGO, CHILE SESSION 2, PAPER 4 TRADE AND OCCUPATIONAL EMPLOYMENT IN MEXICO SINCE NAFTA Raymundo Miguel

More information

Uncovering patterns among latent variables: human rights and de facto judicial independence

Uncovering patterns among latent variables: human rights and de facto judicial independence 605343RAP0010.1177/2053168015605343Research & PoliticsCrabtree and Fariss research-article2015 Research Article Uncovering patterns among latent variables: human rights and de facto judicial independence

More information

Split Decisions: Household Finance when a Policy Discontinuity allocates Overseas Work

Split Decisions: Household Finance when a Policy Discontinuity allocates Overseas Work Split Decisions: Household Finance when a Policy Discontinuity allocates Overseas Work Michael Clemens and Erwin Tiongson Review of Economics and Statistics (Forthcoming) Marian Atallah Presented by: Mohamed

More information

Candidate Faces and Election Outcomes: Is the Face-Vote Correlation Caused by Candidate Selection? Corrigendum

Candidate Faces and Election Outcomes: Is the Face-Vote Correlation Caused by Candidate Selection? Corrigendum Quarterly Journal of Political Science, 2010, 5: 99 105 Corrigendum Candidate Faces and Election Outcomes: Is the Face-Vote Correlation Caused by Candidate Selection? Corrigendum Matthew D. Atkinson, Ryan

More information

Impact of Human Rights Abuses on Economic Outlook

Impact of Human Rights Abuses on Economic Outlook Digital Commons @ George Fox University Student Scholarship - School of Business School of Business 1-1-2016 Impact of Human Rights Abuses on Economic Outlook Benjamin Antony George Fox University, bantony13@georgefox.edu

More information