Support Vector Machines

Size: px
Start display at page:

Download "Support Vector Machines"

Transcription

1 Support Vector Machines

2 Linearly Separable Data

3 SVM: Simple Linear Separator hyperplane

4 Which Simple Linear Separator?

5 Classifier Margin

6 Objective #1: Maximize Margin MARGIN MARGIN

7 How s this look? MARGIN MARGIN

8 Objective #2: Minimize Misclassifications MARGIN MARGIN

9 Support Vectors SUPPORT VECTORS

10 Not Linearly Separable

11 SVM w/ Soft Margin

12 The model Ø A hyperplane in R " can be represented by a vector w with n elements, plus a bias term, w % which lifts it away from the origin. Ø w % + w T x = 0 (equation of the decision boundary itself) Ø Any observation, x, above the hyperplane has Ø w % + w T x > 0 Ø Any observation, x, below the hyperplane has Ø w % + w T x < 0

13 The input Ø Input data and a class target. Ø For best results, input data should be centered and standardized/normalized Ø Can be either a linear scaling or a statistical scaling Ø You will frequently need to enter and tune other parameters for regularization and kernels. Ø (more on this later)

14 The output Ø The output will typically be a set of parameters (i.e. a vector, w, plus an intercept w % ) For a new example, x: Ø If w % + w T x < 0 then predict target = 1 Ø If w % + w T x > 0 then predict target = +1 The above formulation changes when kernels are used, and it is best to use the model as an output object.

15 Nonlinear SVMs The Kernel Trick

16 Not Linearly Separable

17 Create Additional Variables?

18 2 2 z = x +y

19 New Data is Linearly Separable!

20 Another view The last trick seems difficult in this case! Not immediately clear what transformation will make this data linearly separable.

21 Kernels l 2 Ø Suppose we add two points, which we ll call landmarks. Ø Now suppose we create two new variables, f 2 and f 3, which measure the similarity of each point to those landmarks. l 1 l

22 Kernels l 2 Ø f 2 is some measure of similarity (proximity) to l 2. Ø It takes large values near l 2 and small values far from l 2. l 1 l

23 Kernels l 2 Ø f 3 is some measure of similarity (proximity) to l 3. Ø It takes large values near l 3 and small values far from l 3. l 1 l

24 Kernels l 2 Ø Let s ignore our previous variables (the axis shown) and instead use f 2 and f 3. Ø Suppose the blue target is +1 and the red target is - 1. Ø Consider the SVM model f(x) = f 2-100f 3 l 1 l When f 2 or f 3 >.5 (i.e. when points are close to l 2 or l 3 ) the prediction is negative (red). When f 2 and f 3 <.5 (i.e. when points are far from l 2 and l 3 ) the prediction is positive (blue).

25 Kernels l 2 Ø Next natural question How do we choose the landmarks? Ø You could choose a modest number of landmarks (using clustering or other methodology). Ø In practice, a kernel uses every data point as a landmark. Ø Essentially computes a similarity matrix to use as the data. l 1 l

26 Summary of Kernels Ø Kernels are similarity functions that measure some kind of proximity between data points. Ø Number of data points becomes number of variables Ø So this is not good for large datasets! SAS has trouble running a kernel method with 50K data points! Ø SVMs can use kernels in a very efficient way (similarity matrix never explicitly computed/stored). Ø Kernels can improve the performance of SVMs in many situations.

27 Choosing Kernels Ø Kernels embed data in a higher dimensional space (implicitly) Ø Cannot typically know ahead of time which kernel function will work best Ø Can try several, take best performer on validation data

28 Popular Kernels Ø Linear (è NO kernel) Ø Radial Basis Functions (RBFs) Ø Gaussian in particular is most common and usually default Ø exp < = > = exp γ x D x E 3 2 Ø γ = is hyper parameter controlling shape of function. Ø Some packages want you to specify gamma (γ). Some ask you to specify sigma (σ). Ø Overwhelmingly THE most popular option when kernel needed. Ø NOT good for text classification. Typically linear is best for text

29 RBF/Gaussian Kernel exp < exp < σ = 1 σ = 0.5

30 Kernels l 2 Ø The circles shown are meant to represent contours of those Gaussian functions. l 1 l

31 RBF/Gaussian Kernel exp < exp < σ = 1 σ = 0.5

32 Tuning σ (or equivalently, γ) Ø This hyperparameter controls the influence of each training observation. Ø A larger value of σ (equivalently, a smaller value of γ) means that basis functions are wider the influence of a single point reaches far. Ø Smoother decision boundary => Reduce potential for overfitting. Ø A smaller value of σ (equivalently, a larger value of γ) means that basis functions are slimmer the influence of a single point is more local. Ø More localized/jagged decision boundary => Overfitting more likely Ø Consider: if σ were small enough, every point might be identified individually!

33 Ø Polynomial Other Kernels Ø ax J D x E + c L where a and c are constants and d is degree of polynomial Ø much less popular Ø Sigmoid Ø tanh ax J D x E + c where a and c are constants Ø much less popular

34 What kernels can do

35 What kernels can do

36 Regularization Ø As with most machine learning algorithms, a regularization penalty is built in to most packages. Ø Rather than specifying a λ as we would in most algorithms, SVMs are generally coded to expect C = 2 Q Ø C controls the tradeoff between a smooth decision boundary (bias/underfitting) and classifying training points correctly (variance/overfitting). Ø Larger C aims to classify all points correctly. Ø Smaller C aims to make decision surface more smooth.

37 Tuning Hyperparameters Ø How do we choose the specific values of the hyperparameters σ (or γ) and C? Ø One option is a grid search. See how the algorithm performs for all combinations of σ and C within a certain range: high CV accuracy low CV accuracy

38 Extensions of SVMs Multiclass classification Regression

39 Multiclass Classification with SVM Ø Most straightforward approach: One vs. All method 1. Starting with k classes 2. Train one SVM for each class, separating the points in that class (code as +1) from all other points (code as -1). 3. For SVM on class i, result is a set of parameters w i 4. To classify a new data point d, compute w T i d and place d in the class for which w T i d is largest. Ø This is still an ongoing research issue: how to define a larger objective function efficiently to avoid several binary classifiers. Ø New methods/packages constantly being developed. Ø Most existing packages can handle multiclass targets.

40 Support Vector Regression Ø The methodology behind SVMs has been extended to the regression problem. Ø Essentially, the data is imbedded in a very high dimensional space via kernels and then a regression hyperplane is determined via optimization.

41 Creating an SVM in SAS EM In my experience, this algorithm does not work as effectively as those implemented in R or Python. You also don t have the flexibility of hyperparameter tuning via cross validation.

42 SVM in SAS EM Under the HPDM tab, find HP SVM node

43 SVM in SAS EM The parameter C is called the Penalty and is listed under the option panel Train

44 SVM in SAS EM To use SVM with kernels, change the optimization method to Active Set and click the ellipses for more options.

45 SVM in SAS EM See the various options for the kernel used and the parameters. The parameter for the RBF kernel is gamma not sigma.

Overview. Ø Neural Networks are considered black-box models Ø They are complex and do not provide much insight into variable relationships

Overview. Ø Neural Networks are considered black-box models Ø They are complex and do not provide much insight into variable relationships Neural Networks Overview Ø s are considered black-box models Ø They are complex and do not provide much insight into variable relationships Ø They have the potential to model very complicated patterns

More information

Random Forests. Gradient Boosting. and. Bagging and Boosting

Random Forests. Gradient Boosting. and. Bagging and Boosting Random Forests and Gradient Boosting Bagging and Boosting The Bootstrap Sample and Bagging Simple ideas to improve any model via ensemble Bootstrap Samples Ø Random samples of your data with replacement

More information

CS 229 Final Project - Party Predictor: Predicting Political A liation

CS 229 Final Project - Party Predictor: Predicting Political A liation CS 229 Final Project - Party Predictor: Predicting Political A liation Brandon Ewonus bewonus@stanford.edu Bryan McCann bmccann@stanford.edu Nat Roth nroth@stanford.edu Abstract In this report we analyze

More information

Classifier Evaluation and Selection. Review and Overview of Methods

Classifier Evaluation and Selection. Review and Overview of Methods Classifier Evaluation and Selection Review and Overview of Methods Things to consider Ø Interpretation vs. Prediction Ø Model Parsimony vs. Model Error Ø Type of prediction task: Ø Decisions Interested

More information

Probabilistic Latent Semantic Analysis Hofmann (1999)

Probabilistic Latent Semantic Analysis Hofmann (1999) Probabilistic Latent Semantic Analysis Hofmann (1999) Presenter: Mercè Vintró Ricart February 8, 2016 Outline Background Topic models: What are they? Why do we use them? Latent Semantic Analysis (LSA)

More information

Do two parties represent the US? Clustering analysis of US public ideology survey

Do two parties represent the US? Clustering analysis of US public ideology survey Do two parties represent the US? Clustering analysis of US public ideology survey Louisa Lee 1 and Siyu Zhang 2, 3 Advised by: Vicky Chuqiao Yang 1 1 Department of Engineering Sciences and Applied Mathematics,

More information

Instructors: Tengyu Ma and Chris Re

Instructors: Tengyu Ma and Chris Re Instructors: Tengyu Ma and Chris Re cs229.stanford.edu Ø Probability (CS109 or STAT 116) Ø distribution, random variable, expectation, conditional probability, variance, density Ø Linear algebra (Math

More information

Classification of Short Legal Lithuanian Texts

Classification of Short Legal Lithuanian Texts Classification of Short Legal Lithuanian Texts Vytautas Mickevičius 1,2 Tomas Krilavičius 1,2 Vaidas Morkevičius 3 1 Vytautas Magnus University, 2 Baltic Institute of Advanced Technologies, 3 Kaunas University

More information

Cluster Analysis. (see also: Segmentation)

Cluster Analysis. (see also: Segmentation) Cluster Analysis (see also: Segmentation) Cluster Analysis Ø Unsupervised: no target variable for training Ø Partition the data into groups (clusters) so that: Ø Observations within a cluster are similar

More information

Deep Learning and Visualization of Election Data

Deep Learning and Visualization of Election Data Deep Learning and Visualization of Election Data Garcia, Jorge A. New Mexico State University Tao, Ng Ching City University of Hong Kong Betancourt, Frank University of Tennessee, Knoxville Wong, Kwai

More information

A comparative analysis of subreddit recommenders for Reddit

A comparative analysis of subreddit recommenders for Reddit A comparative analysis of subreddit recommenders for Reddit Jay Baxter Massachusetts Institute of Technology jbaxter@mit.edu Abstract Reddit has become a very popular social news website, but even though

More information

P(x) testing training. x Hi

P(x) testing training. x Hi ÙÑÙÐ Ø Ú ÈÖÓ Ø ± Ê Ú Û Ó Ä ØÙÖ ½ Ç Ñ³ Ê ÞÓÖ Ì ÑÔÐ Ø ÑÓ Ð Ø Ø Ø Ø Ø Ð Ó Ø ÑÓ Ø ÔÐ Ù Ð º Ë ÑÔÐ Ò P(x) testing training Ø ÒÓÓÔ Ò x ÓÑÔÐ Ü ØÝ Ó h ÓÑÔÐ Ü ØÝ Ó H ¼ ¾¼ ½¼ ¼ ¹½¼ ÒÓÓÔ Ò ÒÓ ÒÓÓÔ Ò ÙÒÐ ÐÝ Ú ÒØ Ò

More information

JUDGE, JURY AND CLASSIFIER

JUDGE, JURY AND CLASSIFIER JUDGE, JURY AND CLASSIFIER An Introduction to Trees 15.071x The Analytics Edge The American Legal System The legal system of the United States operates at the state level and at the federal level Federal

More information

Popularity Prediction of Reddit Texts

Popularity Prediction of Reddit Texts San Jose State University SJSU ScholarWorks Master's Theses Master's Theses and Graduate Research Spring 2016 Popularity Prediction of Reddit Texts Tracy Rohlin San Jose State University Follow this and

More information

Predicting Congressional Votes Based on Campaign Finance Data

Predicting Congressional Votes Based on Campaign Finance Data 1 Predicting Congressional Votes Based on Campaign Finance Data Samuel Smith, Jae Yeon (Claire) Baek, Zhaoyi Kang, Dawn Song, Laurent El Ghaoui, Mario Frank Department of Electrical Engineering and Computer

More information

Constraint satisfaction problems. Lirong Xia

Constraint satisfaction problems. Lirong Xia Constraint satisfaction problems Lirong Xia Spring, 2017 Project 1 Ø You can use Windows Ø Read the instruction carefully, make sure you understand the goal search for YOUR CODE HERE Ø Ask and answer questions

More information

Automated Classification of Congressional Legislation

Automated Classification of Congressional Legislation Automated Classification of Congressional Legislation Stephen Purpura John F. Kennedy School of Government Harvard University +-67-34-2027 stephen_purpura@ksg07.harvard.edu Dustin Hillard Electrical Engineering

More information

Tengyu Ma Facebook AI Research. Based on joint work with Rong Ge (Duke) and Jason D. Lee (USC)

Tengyu Ma Facebook AI Research. Based on joint work with Rong Ge (Duke) and Jason D. Lee (USC) Tengyu Ma Facebook AI Research Based on joint work with Rong Ge (Duke) and Jason D. Lee (USC) Users Optimization Researchers function f Solution gradient descent local search Convex relaxation + Rounding

More information

Thinkwell s Homeschool Microeconomics Course Lesson Plan: 31 weeks

Thinkwell s Homeschool Microeconomics Course Lesson Plan: 31 weeks Thinkwell s Homeschool Microeconomics Course Lesson Plan: 31 weeks Welcome to Thinkwell s Homeschool Microeconomics! We re thrilled that you ve decided to make us part of your homeschool curriculum. This

More information

Appendix to Non-Parametric Unfolding of Binary Choice Data Keith T. Poole Graduate School of Industrial Administration Carnegie-Mellon University

Appendix to Non-Parametric Unfolding of Binary Choice Data Keith T. Poole Graduate School of Industrial Administration Carnegie-Mellon University Appendix to Non-Parametric Unfolding of Binary Choice Data Keith T. Poole Graduate School of Industrial Administration Carnegie-Mellon University 7 July 1999 This appendix is a supplement to Non-Parametric

More information

Automatic Thematic Classification of the Titles of the Seimas Votes

Automatic Thematic Classification of the Titles of the Seimas Votes Automatic Thematic Classification of the Titles of the Seimas Votes Vytautas Mickevičius 1,2 Tomas Krilavičius 1,2 Vaidas Morkevičius 3 Aušra Mackutė-Varoneckienė 1 1 Vytautas Magnus University, 2 Baltic

More information

Classification of posts on Reddit

Classification of posts on Reddit Classification of posts on Reddit Pooja Naik Graduate Student CSE Dept UCSD, CA, USA panaik@ucsd.edu Sachin A S Graduate Student CSE Dept UCSD, CA, USA sachinas@ucsd.edu Vincent Kuri Graduate Student CSE

More information

Research and strategy for the land community.

Research and strategy for the land community. Research and strategy for the land community. To: Northeastern Minnesotans for Wilderness From: Sonia Wang, Spencer Phillips Date: 2/27/2018 Subject: Full results from the review of comments on the proposed

More information

Lab 3: Logistic regression models

Lab 3: Logistic regression models Lab 3: Logistic regression models In this lab, we will apply logistic regression models to United States (US) presidential election data sets. The main purpose is to predict the outcomes of presidential

More information

Migration and Tourism Flows to New Zealand

Migration and Tourism Flows to New Zealand Migration and Tourism Flows to New Zealand Murat Genç University of Otago, Dunedin, New Zealand Email address for correspondence: murat.genc@otago.ac.nz 30 April 2010 PRELIMINARY WORK IN PROGRESS NOT FOR

More information

Introduction to Path Analysis: Multivariate Regression

Introduction to Path Analysis: Multivariate Regression Introduction to Path Analysis: Multivariate Regression EPSY 905: Multivariate Analysis Spring 2016 Lecture #7 March 9, 2016 EPSY 905: Multivariate Regression via Path Analysis Today s Lecture Multivariate

More information

CS 229: r/classifier - Subreddit Text Classification

CS 229: r/classifier - Subreddit Text Classification CS 229: r/classifier - Subreddit Text Classification Andrew Giel agiel@stanford.edu Jonathan NeCamp jnecamp@stanford.edu Hussain Kader hkader@stanford.edu Abstract This paper presents techniques for text

More information

Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner. Abstract

Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner. Abstract Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner Abstract For our project, we analyze data from US Congress voting records, a dataset that consists

More information

Statistical Analysis of Corruption Perception Index across countries

Statistical Analysis of Corruption Perception Index across countries Statistical Analysis of Corruption Perception Index across countries AMDA Project Summary Report (Under the guidance of Prof Malay Bhattacharya) Group 3 Anit Suri 1511007 Avishek Biswas 1511013 Diwakar

More information

PROJECTING THE LABOUR SUPPLY TO 2024

PROJECTING THE LABOUR SUPPLY TO 2024 PROJECTING THE LABOUR SUPPLY TO 2024 Charles Simkins Helen Suzman Professor of Political Economy School of Economic and Business Sciences University of the Witwatersrand May 2008 centre for poverty employment

More information

Coalitional Game Theory

Coalitional Game Theory Coalitional Game Theory Game Theory Algorithmic Game Theory 1 TOC Coalitional Games Fair Division and Shapley Value Stable Division and the Core Concept ε-core, Least core & Nucleolus Reading: Chapter

More information

Identifying Factors in Congressional Bill Success

Identifying Factors in Congressional Bill Success Identifying Factors in Congressional Bill Success CS224w Final Report Travis Gingerich, Montana Scher, Neeral Dodhia Introduction During an era of government where Congress has been criticized repeatedly

More information

the notion that poverty causes terrorism. Certainly, economic theory suggests that it would be

the notion that poverty causes terrorism. Certainly, economic theory suggests that it would be he Nonlinear Relationship Between errorism and Poverty Byline: Poverty and errorism Walter Enders and Gary A. Hoover 1 he fact that most terrorist attacks are staged in low income countries seems to support

More information

Probabilistic earthquake early warning in complex earth models using prior sampling

Probabilistic earthquake early warning in complex earth models using prior sampling Probabilistic earthquake early warning in complex earth models using prior sampling Andrew Valentine, Paul Käufl & Jeannot Trampert EGU 2016 21 st April www.geo.uu.nl/~andrew a.p.valentine@uu.nl A case

More information

Parties, Candidates, Issues: electoral competition revisited

Parties, Candidates, Issues: electoral competition revisited Parties, Candidates, Issues: electoral competition revisited Introduction The partisan competition is part of the operation of political parties, ranging from ideology to issues of public policy choices.

More information

Migrant Wages, Human Capital Accumulation and Return Migration

Migrant Wages, Human Capital Accumulation and Return Migration Migrant Wages, Human Capital Accumulation and Return Migration Jérôme Adda Christian Dustmann Joseph-Simon Görlach February 14, 2014 PRELIMINARY and VERY INCOMPLETE Abstract This paper analyses the wage

More information

Immigrants Inflows, Native outflows, and the Local Labor Market Impact of Higher Immigration David Card

Immigrants Inflows, Native outflows, and the Local Labor Market Impact of Higher Immigration David Card Immigrants Inflows, Native outflows, and the Local Labor Market Impact of Higher Immigration David Card Mehdi Akhbari, Ali Choubdaran 1 Table of Contents Introduction Theoretical Framework limitation of

More information

Computational Inelasticity FHLN05. Assignment A non-linear elasto-plastic problem

Computational Inelasticity FHLN05. Assignment A non-linear elasto-plastic problem Computational Inelasticity FHLN05 Assignment 2016 A non-linear elasto-plastic problem General instructions A written report should be submitted to the Division of Solid Mechanics no later than 1 November

More information

σ IηIη Andrew Askew Florida State University

σ IηIη Andrew Askew Florida State University σ IηIη Andrew Askew Florida State University Ø As I have doubtless said previously, photon identification is difficult. There are many reasons for this, but foremost among them is that you have few direct

More information

PROJECTION OF NET MIGRATION USING A GRAVITY MODEL 1. Laboratory of Populations 2

PROJECTION OF NET MIGRATION USING A GRAVITY MODEL 1. Laboratory of Populations 2 UN/POP/MIG-10CM/2012/11 3 February 2012 TENTH COORDINATION MEETING ON INTERNATIONAL MIGRATION Population Division Department of Economic and Social Affairs United Nations Secretariat New York, 9-10 February

More information

Combining national and constituency polling for forecasting

Combining national and constituency polling for forecasting Combining national and constituency polling for forecasting Chris Hanretty, Ben Lauderdale, Nick Vivyan Abstract We describe a method for forecasting British general elections by combining national and

More information

Read My Lips : Using Automatic Text Analysis to Classify Politicians by Party and Ideology 1

Read My Lips : Using Automatic Text Analysis to Classify Politicians by Party and Ideology 1 Read My Lips : Using Automatic Text Analysis to Classify Politicians by Party and Ideology 1 Eitan Sapiro-Gheiler 2 June 15, 2018 Department of Economics Princeton University 1 Acknowledgements: I would

More information

The Analytics of the Wage Effect of Immigration. George J. Borjas Harvard University September 2009

The Analytics of the Wage Effect of Immigration. George J. Borjas Harvard University September 2009 The Analytics of the Wage Effect of Immigration George J. Borjas Harvard University September 2009 1. The question Do immigrants alter the employment opportunities of native workers? After World War I,

More information

Announcements. HW3 Due tonight HW4 posted No class Thursday (Thanksgiving) 2017 Kevin Jamieson

Announcements. HW3 Due tonight HW4 posted No class Thursday (Thanksgiving) 2017 Kevin Jamieson Announcements HW3 Due tonight HW4 posted No class Thursday (Thanksgiving) 2017 Kevin Jamieson 1 Mixtures of Gaussians Machine Learning CSE546 Kevin Jamieson University of Washington November 20, 2016 Kevin

More information

! = ( tapping time ).

! = ( tapping time ). AP Statistics Name: Per: Date: 3. Least- Squares Regression p164 168 Ø What is the general form of a regression equation? What is the difference between y and ŷ? Example: Tapping on cans Don t you hate

More information

IMMIGRATION REFORM, JOB SELECTION AND WAGES IN THE U.S. FARM LABOR MARKET

IMMIGRATION REFORM, JOB SELECTION AND WAGES IN THE U.S. FARM LABOR MARKET IMMIGRATION REFORM, JOB SELECTION AND WAGES IN THE U.S. FARM LABOR MARKET Lurleen M. Walters International Agricultural Trade & Policy Center Food and Resource Economics Department P.O. Box 040, University

More information

Classification and Regression Approaches to Predicting United States Senate Elections. Rohan Sampath, Yue Teng

Classification and Regression Approaches to Predicting United States Senate Elections. Rohan Sampath, Yue Teng Classification and Regression Approaches to Predicting United States Senate Elections Rohan Sapath, Yue Teng Abstract The United States Senate is arguably the finest deocratic institution for debate and

More information

Generalized Scoring Rules: A Framework That Reconciles Borda and Condorcet

Generalized Scoring Rules: A Framework That Reconciles Borda and Condorcet Generalized Scoring Rules: A Framework That Reconciles Borda and Condorcet Lirong Xia Harvard University Generalized scoring rules [Xia and Conitzer 08] are a relatively new class of social choice mechanisms.

More information

Drug Trafficking Organizations and Local Economic Activity in Mexico

Drug Trafficking Organizations and Local Economic Activity in Mexico RESEARCH ARTICLE Drug Trafficking Organizations and Local Economic Activity in Mexico Felipe González* Department of Economics, University of California, Berkeley, California, United States of America

More information

Improved Boosting Algorithms Using Confidence-rated Predictions

Improved Boosting Algorithms Using Confidence-rated Predictions Improved Boosting Algorithms Using Confidence-rated Predictions ÊÇÊÌ º ËÀÈÁÊ schapire@research.att.com AT&T Labs, Shannon Laboratory, 18 Park Avenue, Room A279, Florham Park, NJ 7932-971 ÇÊÅ ËÁÆÊ singer@research.att.com

More information

Deep Classification and Generation of Reddit Post Titles

Deep Classification and Generation of Reddit Post Titles Deep Classification and Generation of Reddit Post Titles Tyler Chase tchase56@stanford.edu Rolland He rhe@stanford.edu William Qiu willqiu@stanford.edu Abstract The online news aggregation website Reddit

More information

The Trade Effects of Skilled versus Unskilled Migration

The Trade Effects of Skilled versus Unskilled Migration The Trade Effects of Skilled versus Unskilled Migration Peter H. Egger ETH Zurich Maximilian von Ehrlich University of Bern September 29, 2014 Douglas R. Nelson Tulane University Abstract In this paper,

More information

An Investigation into a Circuit Based Supply Chain Analyzer for FPGAs

An Investigation into a Circuit Based Supply Chain Analyzer for FPGAs An Investigation into a Circuit Based Supply Chain Analyzer for FPGAs FPL-2016 9/1/2016 Jacob Couch 1 John Arkorian Staff Researchers 1 jacob.couch@jhuapl.edu What is the problem anyways? How can FPGAs

More information

The cost of ruling, cabinet duration, and the median-gap model

The cost of ruling, cabinet duration, and the median-gap model Public Choice 113: 157 178, 2002. 2002 Kluwer Academic Publishers. Printed in the Netherlands. 157 The cost of ruling, cabinet duration, and the median-gap model RANDOLPH T. STEVENSON Department of Political

More information

Political Economics II Spring Lectures 4-5 Part II Partisan Politics and Political Agency. Torsten Persson, IIES

Political Economics II Spring Lectures 4-5 Part II Partisan Politics and Political Agency. Torsten Persson, IIES Lectures 4-5_190213.pdf Political Economics II Spring 2019 Lectures 4-5 Part II Partisan Politics and Political Agency Torsten Persson, IIES 1 Introduction: Partisan Politics Aims continue exploring policy

More information

The Costs of Remoteness, Evidence From German Division and Reunification by Redding and Sturm (AER, 2008)

The Costs of Remoteness, Evidence From German Division and Reunification by Redding and Sturm (AER, 2008) The Costs of Remoteness, Evidence From German Division and Reunification by Redding and Sturm (AER, 2008) MIT Spatial Economics Reading Group Presentation Adam Guren May 13, 2010 Testing the New Economic

More information

A Vote Equation and the 2004 Election

A Vote Equation and the 2004 Election A Vote Equation and the 2004 Election Ray C. Fair November 22, 2004 1 Introduction My presidential vote equation is a great teaching example for introductory econometrics. 1 The theory is straightforward,

More information

MIPAS Temperature and Pressure Validation by RO Data

MIPAS Temperature and Pressure Validation by RO Data MIPAS and Validation by RO Data Marc Schwaerz and Gottfried Kirchengast Wegener Center (WEGC), Graz, Austria MIPAS Quality Working Group Meeting 40, November 3, 2015 Outline 1 2 Validation and Reference

More information

CHAPTER FIVE RESULTS REGARDING ACCULTURATION LEVEL. This chapter reports the results of the statistical analysis

CHAPTER FIVE RESULTS REGARDING ACCULTURATION LEVEL. This chapter reports the results of the statistical analysis CHAPTER FIVE RESULTS REGARDING ACCULTURATION LEVEL This chapter reports the results of the statistical analysis which aimed at answering the research questions regarding acculturation level. 5.1 Discriminant

More information

Remittances and the Brain Drain: Evidence from Microdata for Sub-Saharan Africa

Remittances and the Brain Drain: Evidence from Microdata for Sub-Saharan Africa Remittances and the Brain Drain: Evidence from Microdata for Sub-Saharan Africa Julia Bredtmann 1, Fernanda Martinez Flores 1,2, and Sebastian Otten 1,2,3 1 RWI, Rheinisch-Westfälisches Institut für Wirtschaftsforschung

More information

THE EVALUATION OF OUTPUT CONVERGENCE IN SEVERAL CENTRAL AND EASTERN EUROPEAN COUNTRIES

THE EVALUATION OF OUTPUT CONVERGENCE IN SEVERAL CENTRAL AND EASTERN EUROPEAN COUNTRIES ISSN 1392-1258. ekonomika 2015 Vol. 94(1) THE EVALUATION OF OUTPUT CONVERGENCE IN SEVERAL CENTRAL AND EASTERN EUROPEAN COUNTRIES Simionescu M.* Institute for Economic Forecasting of the Romanian Academy

More information

SIMPLE LINEAR REGRESSION OF CPS DATA

SIMPLE LINEAR REGRESSION OF CPS DATA SIMPLE LINEAR REGRESSION OF CPS DATA Using the 1995 CPS data, hourly wages are regressed against years of education. The regression output in Table 4.1 indicates that there are 1003 persons in the CPS

More information

(a) Draw side-by-side box plots that show the yields of the two types of land. Check for outliers before making the plots.

(a) Draw side-by-side box plots that show the yields of the two types of land. Check for outliers before making the plots. 1. In hilly areas, farmers often contour their fields to reduce the erosion due to water flow. This might have the unintended effect of changing the yield since the rows may not be aligned in an east-west

More information

Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012

Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012 Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012 Abstract In this paper we attempt to develop an algorithm to generate a set of post recommendations

More information

Using Poole s Optimal Classification in R

Using Poole s Optimal Classification in R Using Poole s Optimal Classification in R January 22, 2018 1 Introduction This package estimates Poole s Optimal Classification scores from roll call votes supplied though a rollcall object from package

More information

VOTING ON INCOME REDISTRIBUTION: HOW A LITTLE BIT OF ALTRUISM CREATES TRANSITIVITY DONALD WITTMAN ECONOMICS DEPARTMENT UNIVERSITY OF CALIFORNIA

VOTING ON INCOME REDISTRIBUTION: HOW A LITTLE BIT OF ALTRUISM CREATES TRANSITIVITY DONALD WITTMAN ECONOMICS DEPARTMENT UNIVERSITY OF CALIFORNIA 1 VOTING ON INCOME REDISTRIBUTION: HOW A LITTLE BIT OF ALTRUISM CREATES TRANSITIVITY DONALD WITTMAN ECONOMICS DEPARTMENT UNIVERSITY OF CALIFORNIA SANTA CRUZ wittman@ucsc.edu ABSTRACT We consider an election

More information

JudgeIt II: A Program for Evaluating Electoral Systems and Redistricting Plans 1

JudgeIt II: A Program for Evaluating Electoral Systems and Redistricting Plans 1 JudgeIt II: A Program for Evaluating Electoral Systems and Redistricting Plans 1 Andrew Gelman Gary King 2 Andrew C. Thomas 3 Version 1.3.4 August 31, 2010 1 Available from CRAN (http://cran.r-project.org/)

More information

Wage Rigidity and Spatial Misallocation: Evidence from Italy and Germany

Wage Rigidity and Spatial Misallocation: Evidence from Italy and Germany Wage Rigidity and Spatial Misallocation: Evidence from Italy and Germany Tito Boeri 1 Andrea Ichino 2 Enrico Moretti 3 Johanna Posch 2 1 Bocconi 2 European University Institute 3 Berkeley 10 April 2018

More information

CS269I: Incentives in Computer Science Lecture #4: Voting, Machine Learning, and Participatory Democracy

CS269I: Incentives in Computer Science Lecture #4: Voting, Machine Learning, and Participatory Democracy CS269I: Incentives in Computer Science Lecture #4: Voting, Machine Learning, and Participatory Democracy Tim Roughgarden October 5, 2016 1 Preamble Last lecture was all about strategyproof voting rules

More information

Hoboken Public Schools. College Algebra Curriculum

Hoboken Public Schools. College Algebra Curriculum Hoboken Public Schools College Algebra Curriculum College Algebra HOBOKEN PUBLIC SCHOOLS Course Description College Algebra reflects the New Jersey learning standards at the high school level and is designed

More information

The Shadow Value of Legal Status --A Hedonic Analysis of the Earnings of U.S. Farm Workers 1

The Shadow Value of Legal Status --A Hedonic Analysis of the Earnings of U.S. Farm Workers 1 The Shadow Value of Legal Status --A Hedonic Analysis of the Earnings of U.S. Farm Workers 1 June, 3 rd, 2013 Sun Ling Wang 2 Economic Research Service, U.S. Department of Agriculture Daniel Carroll Employment

More information

Wage Trends among Disadvantaged Minorities

Wage Trends among Disadvantaged Minorities National Poverty Center Working Paper Series #05-12 August 2005 Wage Trends among Disadvantaged Minorities George J. Borjas Harvard University This paper is available online at the National Poverty Center

More information

RECOMMENDED CITATION: Pew Research Center, May, 2017, Partisan Identification Is Sticky, but About 10% Switched Parties Over the Past Year

RECOMMENDED CITATION: Pew Research Center, May, 2017, Partisan Identification Is Sticky, but About 10% Switched Parties Over the Past Year NUMBERS, FACTS AND TRENDS SHAPING THE WORLD FOR RELEASE MAY 17, 2017 FOR MEDIA OR OTHER INQUIRIES: Carroll Doherty, Director of Political Research Jocelyn Kiley, Associate Director, Research Bridget Johnson,

More information

Was This Review Helpful to You? It Depends! Context and Voting Patterns in Online Content

Was This Review Helpful to You? It Depends! Context and Voting Patterns in Online Content Was This Review Helpful to You? It Depends! Context and Voting Patterns in Online Content Ruben Sipos Dept. of Computer Science Cornell University Ithaca, NY rs@cs.cornell.edu Arpita Ghosh Dept. of Information

More information

Supplementary Tables for Online Publication: Impact of Judicial Elections in the Sentencing of Black Crime

Supplementary Tables for Online Publication: Impact of Judicial Elections in the Sentencing of Black Crime Supplementary Tables for Online Publication: Impact of Judicial Elections in the Sentencing of Black Crime Kyung H. Park Wellesley College March 23, 2016 A Kansas Background A.1 Partisan versus Retention

More information

Hoboken Public Schools. AP Statistics Curriculum

Hoboken Public Schools. AP Statistics Curriculum Hoboken Public Schools AP Statistics Curriculum AP Statistics HOBOKEN PUBLIC SCHOOLS Course Description AP Statistics is the high school equivalent of a one semester, introductory college statistics course.

More information

Category-level localization. Cordelia Schmid

Category-level localization. Cordelia Schmid Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object

More information

On the Determinants of Global Bilateral Migration Flows

On the Determinants of Global Bilateral Migration Flows On the Determinants of Global Bilateral Migration Flows Jesus Crespo Cuaresma Mathias Moser Anna Raggl Preliminary Draft, May 2013 Abstract We present a method aimed at estimating global bilateral migration

More information

Congressional Gridlock: The Effects of the Master Lever

Congressional Gridlock: The Effects of the Master Lever Congressional Gridlock: The Effects of the Master Lever Olga Gorelkina Max Planck Institute, Bonn Ioanna Grypari Max Planck Institute, Bonn Preliminary & Incomplete February 11, 2015 Abstract This paper

More information

Pivoted Text Scaling for Open-Ended Survey Responses

Pivoted Text Scaling for Open-Ended Survey Responses Pivoted Text Scaling for Open-Ended Survey Responses William Hobbs September 28, 2017 Abstract Short texts such as open-ended survey responses and tweets contain valuable information about public opinions,

More information

Chapter Five: Forces. Ø 5.1 Forces. Ø 5.2 Friction. Ø 5.3 Forces and Equilibrium

Chapter Five: Forces. Ø 5.1 Forces. Ø 5.2 Friction. Ø 5.3 Forces and Equilibrium Chapter Five: Forces Ø 5.1 Forces Ø 5.2 Friction Ø 5.3 Forces and Equilibrium 5.1 The cause of forces Ø A force is a push or pull, or an action that has the ability to change motion. Ø Forces can increase

More information

Climate Change Around the World

Climate Change Around the World Climate Change Around the World Per Krusell Institute for International Economic Studies, NBER, CEPR Joint with Anthony A. Smith, Jr. Yale University, NBER World Congress Montréal Août, 215 The project

More information

Schooling and Cohort Size: Evidence from Vietnam, Thailand, Iran and Cambodia. Evangelos M. Falaris University of Delaware. and

Schooling and Cohort Size: Evidence from Vietnam, Thailand, Iran and Cambodia. Evangelos M. Falaris University of Delaware. and Schooling and Cohort Size: Evidence from Vietnam, Thailand, Iran and Cambodia by Evangelos M. Falaris University of Delaware and Thuan Q. Thai Max Planck Institute for Demographic Research March 2012 2

More information

Do Individual Heterogeneity and Spatial Correlation Matter?

Do Individual Heterogeneity and Spatial Correlation Matter? Do Individual Heterogeneity and Spatial Correlation Matter? An Innovative Approach to the Characterisation of the European Political Space. Giovanna Iannantuoni, Elena Manzoni and Francesca Rossi EXTENDED

More information

SocialSecurityEligibilityandtheLaborSuplyofOlderImigrants. George J. Borjas Harvard University

SocialSecurityEligibilityandtheLaborSuplyofOlderImigrants. George J. Borjas Harvard University SocialSecurityEligibilityandtheLaborSuplyofOlderImigrants George J. Borjas Harvard University February 2010 1 SocialSecurityEligibilityandtheLaborSuplyofOlderImigrants George J. Borjas ABSTRACT The employment

More information

Vote Compass Methodology

Vote Compass Methodology Vote Compass Methodology 1 Introduction Vote Compass is a civic engagement application developed by the team of social and data scientists from Vox Pop Labs. Its objective is to promote electoral literacy

More information

Use and abuse of voter migration models in an election year. Dr. Peter Moser Statistical Office of the Canton of Zurich

Use and abuse of voter migration models in an election year. Dr. Peter Moser Statistical Office of the Canton of Zurich Use and abuse of voter migration models in an election year Statistical Office of the Canton of Zurich Overview What is a voter migration model? How are they estimated? Their use in forecasting election

More information

Quant 101 Learn2Quant HK, 14 September Vinesh Jha CEO, ExtractAlpha

Quant 101 Learn2Quant HK, 14 September Vinesh Jha CEO, ExtractAlpha Quant 101 Learn2Quant HK, 14 September 2017 Vinesh Jha CEO, ExtractAlpha Data-driven investment is the future ExtractAlpha confidential. Do not copy or distribute. 2 Agenda Ø Quick intro to ExtractAlpha

More information

Understanding factors that influence L1-visa outcomes in US

Understanding factors that influence L1-visa outcomes in US Understanding factors that influence L1-visa outcomes in US By Nihar Dalmia, Meghana Murthy and Nianthrini Vivekanandan Link to online course gallery : https://www.ischool.berkeley.edu/projects/2017/understanding-factors-influence-l1-work

More information

Final Review. Chenyang Lu. CSE 467S Embedded Compu5ng Systems

Final Review. Chenyang Lu. CSE 467S Embedded Compu5ng Systems Final Review Chenyang Lu CSE 467S Embedded Compu5ng Systems OS: Basic Func2ons Ø OS controls resources: q who gets the CPU; q when I/O takes place; q how much memory is allocated; q power management. Ø

More information

Migration With Endogenous Social Networks in China

Migration With Endogenous Social Networks in China Migration With Endogenous Social Networks in China Jin Zhou (University of Western Ontario) May 2015 Abstract Numerous empirical studies have documented a strong association between social networks and

More information

Hierarchical Item Response Models for Analyzing Public Opinion

Hierarchical Item Response Models for Analyzing Public Opinion Hierarchical Item Response Models for Analyzing Public Opinion Xiang Zhou Harvard University July 16, 2017 Xiang Zhou (Harvard University) Hierarchical IRT for Public Opinion July 16, 2017 Page 1 Features

More information

Split Decisions: Household Finance when a Policy Discontinuity allocates Overseas Work

Split Decisions: Household Finance when a Policy Discontinuity allocates Overseas Work Split Decisions: Household Finance when a Policy Discontinuity allocates Overseas Work Michael Clemens and Erwin Tiongson Review of Economics and Statistics (Forthcoming) Marian Atallah Presented by: Mohamed

More information

Data Assimilation in Geosciences

Data Assimilation in Geosciences Data Assimilation in Geosciences Alberto Carrassi The Nordic Centre of Excellence for ensemble-based data assimilation Laurent Bertino (Lead), Alberto Carrassi (Co-Lead), Colin Grudzien (PD), Patrick Raanes

More information

Women and Power: Unpopular, Unwilling, or Held Back? Comment

Women and Power: Unpopular, Unwilling, or Held Back? Comment Women and Power: Unpopular, Unwilling, or Held Back? Comment Manuel Bagues, Pamela Campa May 22, 2017 Abstract Casas-Arce and Saiz (2015) study how gender quotas in candidate lists affect voting behavior

More information

School Quality and Returns to Education of U.S. Immigrants. Bernt Bratsberg. and. Dek Terrell* RRH: BRATSBERG & TERRELL:

School Quality and Returns to Education of U.S. Immigrants. Bernt Bratsberg. and. Dek Terrell* RRH: BRATSBERG & TERRELL: Forthcoming, Economic Inquiry School Quality and Returns to Education of U.S. Immigrants Bernt Bratsberg and Dek Terrell* RRH: BRATSBERG & TERRELL: SCHOOL QUALITY AND EDUCATION RETURNS OF IMMIGRANTS JEL

More information

Determinants and Effects of Negative Advertising in Politics

Determinants and Effects of Negative Advertising in Politics Department of Economics- FEA/USP Determinants and Effects of Negative Advertising in Politics DANILO P. SOUZA MARCOS Y. NAKAGUMA WORKING PAPER SERIES Nº 2017-25 DEPARTMENT OF ECONOMICS, FEA-USP WORKING

More information

IPSA International Conference Concordia University, Montreal (Quebec), Canada April 30 May 2, 2008

IPSA International Conference Concordia University, Montreal (Quebec), Canada April 30 May 2, 2008 IPSA International Conference Concordia University, Montreal (Quebec), Canada April 30 May 2, 2008 Yuri A. Polunin, Sc. D., Professor. Phone: +7 (495) 433-34-95 E-mail: : polunin@expert.ru polunin@crpi.ru

More information

Wind power integration and consumer behavior: a complementarity approach

Wind power integration and consumer behavior: a complementarity approach 1 Wind power integration and consumer behavior: a complementarity approach 8 th Annual Trans-Atlantic INFRADAY Conference on Energy November 7 th, 2014 Ali Daraeepour, Duke University Dr. Jalal Kazempour,

More information

The Effects of Housing Prices, Wages, and Commuting Time on Joint Residential and Job Location Choices

The Effects of Housing Prices, Wages, and Commuting Time on Joint Residential and Job Location Choices The Effects of Housing Prices, Wages, and Commuting Time on Joint Residential and Job Location Choices Kim S. So, Peter F. Orazem, and Daniel M. Otto a May 1998 American Agricultural Economics Association

More information