Support Vector Machines
|
|
- Angela Melton
- 6 years ago
- Views:
Transcription
1 Support Vector Machines
2 Linearly Separable Data
3 SVM: Simple Linear Separator hyperplane
4 Which Simple Linear Separator?
5 Classifier Margin
6 Objective #1: Maximize Margin MARGIN MARGIN
7 How s this look? MARGIN MARGIN
8 Objective #2: Minimize Misclassifications MARGIN MARGIN
9 Support Vectors SUPPORT VECTORS
10 Not Linearly Separable
11 SVM w/ Soft Margin
12 The model Ø A hyperplane in R " can be represented by a vector w with n elements, plus a bias term, w % which lifts it away from the origin. Ø w % + w T x = 0 (equation of the decision boundary itself) Ø Any observation, x, above the hyperplane has Ø w % + w T x > 0 Ø Any observation, x, below the hyperplane has Ø w % + w T x < 0
13 The input Ø Input data and a class target. Ø For best results, input data should be centered and standardized/normalized Ø Can be either a linear scaling or a statistical scaling Ø You will frequently need to enter and tune other parameters for regularization and kernels. Ø (more on this later)
14 The output Ø The output will typically be a set of parameters (i.e. a vector, w, plus an intercept w % ) For a new example, x: Ø If w % + w T x < 0 then predict target = 1 Ø If w % + w T x > 0 then predict target = +1 The above formulation changes when kernels are used, and it is best to use the model as an output object.
15 Nonlinear SVMs The Kernel Trick
16 Not Linearly Separable
17 Create Additional Variables?
18 2 2 z = x +y
19 New Data is Linearly Separable!
20 Another view The last trick seems difficult in this case! Not immediately clear what transformation will make this data linearly separable.
21 Kernels l 2 Ø Suppose we add two points, which we ll call landmarks. Ø Now suppose we create two new variables, f 2 and f 3, which measure the similarity of each point to those landmarks. l 1 l
22 Kernels l 2 Ø f 2 is some measure of similarity (proximity) to l 2. Ø It takes large values near l 2 and small values far from l 2. l 1 l
23 Kernels l 2 Ø f 3 is some measure of similarity (proximity) to l 3. Ø It takes large values near l 3 and small values far from l 3. l 1 l
24 Kernels l 2 Ø Let s ignore our previous variables (the axis shown) and instead use f 2 and f 3. Ø Suppose the blue target is +1 and the red target is - 1. Ø Consider the SVM model f(x) = f 2-100f 3 l 1 l When f 2 or f 3 >.5 (i.e. when points are close to l 2 or l 3 ) the prediction is negative (red). When f 2 and f 3 <.5 (i.e. when points are far from l 2 and l 3 ) the prediction is positive (blue).
25 Kernels l 2 Ø Next natural question How do we choose the landmarks? Ø You could choose a modest number of landmarks (using clustering or other methodology). Ø In practice, a kernel uses every data point as a landmark. Ø Essentially computes a similarity matrix to use as the data. l 1 l
26 Summary of Kernels Ø Kernels are similarity functions that measure some kind of proximity between data points. Ø Number of data points becomes number of variables Ø So this is not good for large datasets! SAS has trouble running a kernel method with 50K data points! Ø SVMs can use kernels in a very efficient way (similarity matrix never explicitly computed/stored). Ø Kernels can improve the performance of SVMs in many situations.
27 Choosing Kernels Ø Kernels embed data in a higher dimensional space (implicitly) Ø Cannot typically know ahead of time which kernel function will work best Ø Can try several, take best performer on validation data
28 Popular Kernels Ø Linear (è NO kernel) Ø Radial Basis Functions (RBFs) Ø Gaussian in particular is most common and usually default Ø exp < = > = exp γ x D x E 3 2 Ø γ = is hyper parameter controlling shape of function. Ø Some packages want you to specify gamma (γ). Some ask you to specify sigma (σ). Ø Overwhelmingly THE most popular option when kernel needed. Ø NOT good for text classification. Typically linear is best for text
29 RBF/Gaussian Kernel exp < exp < σ = 1 σ = 0.5
30 Kernels l 2 Ø The circles shown are meant to represent contours of those Gaussian functions. l 1 l
31 RBF/Gaussian Kernel exp < exp < σ = 1 σ = 0.5
32 Tuning σ (or equivalently, γ) Ø This hyperparameter controls the influence of each training observation. Ø A larger value of σ (equivalently, a smaller value of γ) means that basis functions are wider the influence of a single point reaches far. Ø Smoother decision boundary => Reduce potential for overfitting. Ø A smaller value of σ (equivalently, a larger value of γ) means that basis functions are slimmer the influence of a single point is more local. Ø More localized/jagged decision boundary => Overfitting more likely Ø Consider: if σ were small enough, every point might be identified individually!
33 Ø Polynomial Other Kernels Ø ax J D x E + c L where a and c are constants and d is degree of polynomial Ø much less popular Ø Sigmoid Ø tanh ax J D x E + c where a and c are constants Ø much less popular
34 What kernels can do
35 What kernels can do
36 Regularization Ø As with most machine learning algorithms, a regularization penalty is built in to most packages. Ø Rather than specifying a λ as we would in most algorithms, SVMs are generally coded to expect C = 2 Q Ø C controls the tradeoff between a smooth decision boundary (bias/underfitting) and classifying training points correctly (variance/overfitting). Ø Larger C aims to classify all points correctly. Ø Smaller C aims to make decision surface more smooth.
37 Tuning Hyperparameters Ø How do we choose the specific values of the hyperparameters σ (or γ) and C? Ø One option is a grid search. See how the algorithm performs for all combinations of σ and C within a certain range: high CV accuracy low CV accuracy
38 Extensions of SVMs Multiclass classification Regression
39 Multiclass Classification with SVM Ø Most straightforward approach: One vs. All method 1. Starting with k classes 2. Train one SVM for each class, separating the points in that class (code as +1) from all other points (code as -1). 3. For SVM on class i, result is a set of parameters w i 4. To classify a new data point d, compute w T i d and place d in the class for which w T i d is largest. Ø This is still an ongoing research issue: how to define a larger objective function efficiently to avoid several binary classifiers. Ø New methods/packages constantly being developed. Ø Most existing packages can handle multiclass targets.
40 Support Vector Regression Ø The methodology behind SVMs has been extended to the regression problem. Ø Essentially, the data is imbedded in a very high dimensional space via kernels and then a regression hyperplane is determined via optimization.
41 Creating an SVM in SAS EM In my experience, this algorithm does not work as effectively as those implemented in R or Python. You also don t have the flexibility of hyperparameter tuning via cross validation.
42 SVM in SAS EM Under the HPDM tab, find HP SVM node
43 SVM in SAS EM The parameter C is called the Penalty and is listed under the option panel Train
44 SVM in SAS EM To use SVM with kernels, change the optimization method to Active Set and click the ellipses for more options.
45 SVM in SAS EM See the various options for the kernel used and the parameters. The parameter for the RBF kernel is gamma not sigma.
Overview. Ø Neural Networks are considered black-box models Ø They are complex and do not provide much insight into variable relationships
Neural Networks Overview Ø s are considered black-box models Ø They are complex and do not provide much insight into variable relationships Ø They have the potential to model very complicated patterns
More informationRandom Forests. Gradient Boosting. and. Bagging and Boosting
Random Forests and Gradient Boosting Bagging and Boosting The Bootstrap Sample and Bagging Simple ideas to improve any model via ensemble Bootstrap Samples Ø Random samples of your data with replacement
More informationCS 229 Final Project - Party Predictor: Predicting Political A liation
CS 229 Final Project - Party Predictor: Predicting Political A liation Brandon Ewonus bewonus@stanford.edu Bryan McCann bmccann@stanford.edu Nat Roth nroth@stanford.edu Abstract In this report we analyze
More informationClassifier Evaluation and Selection. Review and Overview of Methods
Classifier Evaluation and Selection Review and Overview of Methods Things to consider Ø Interpretation vs. Prediction Ø Model Parsimony vs. Model Error Ø Type of prediction task: Ø Decisions Interested
More informationProbabilistic Latent Semantic Analysis Hofmann (1999)
Probabilistic Latent Semantic Analysis Hofmann (1999) Presenter: Mercè Vintró Ricart February 8, 2016 Outline Background Topic models: What are they? Why do we use them? Latent Semantic Analysis (LSA)
More informationDo two parties represent the US? Clustering analysis of US public ideology survey
Do two parties represent the US? Clustering analysis of US public ideology survey Louisa Lee 1 and Siyu Zhang 2, 3 Advised by: Vicky Chuqiao Yang 1 1 Department of Engineering Sciences and Applied Mathematics,
More informationInstructors: Tengyu Ma and Chris Re
Instructors: Tengyu Ma and Chris Re cs229.stanford.edu Ø Probability (CS109 or STAT 116) Ø distribution, random variable, expectation, conditional probability, variance, density Ø Linear algebra (Math
More informationClassification of Short Legal Lithuanian Texts
Classification of Short Legal Lithuanian Texts Vytautas Mickevičius 1,2 Tomas Krilavičius 1,2 Vaidas Morkevičius 3 1 Vytautas Magnus University, 2 Baltic Institute of Advanced Technologies, 3 Kaunas University
More informationCluster Analysis. (see also: Segmentation)
Cluster Analysis (see also: Segmentation) Cluster Analysis Ø Unsupervised: no target variable for training Ø Partition the data into groups (clusters) so that: Ø Observations within a cluster are similar
More informationDeep Learning and Visualization of Election Data
Deep Learning and Visualization of Election Data Garcia, Jorge A. New Mexico State University Tao, Ng Ching City University of Hong Kong Betancourt, Frank University of Tennessee, Knoxville Wong, Kwai
More informationA comparative analysis of subreddit recommenders for Reddit
A comparative analysis of subreddit recommenders for Reddit Jay Baxter Massachusetts Institute of Technology jbaxter@mit.edu Abstract Reddit has become a very popular social news website, but even though
More informationP(x) testing training. x Hi
ÙÑÙÐ Ø Ú ÈÖÓ Ø ± Ê Ú Û Ó Ä ØÙÖ ½ Ç Ñ³ Ê ÞÓÖ Ì ÑÔÐ Ø ÑÓ Ð Ø Ø Ø Ø Ø Ð Ó Ø ÑÓ Ø ÔÐ Ù Ð º Ë ÑÔÐ Ò P(x) testing training Ø ÒÓÓÔ Ò x ÓÑÔÐ Ü ØÝ Ó h ÓÑÔÐ Ü ØÝ Ó H ¼ ¾¼ ½¼ ¼ ¹½¼ ÒÓÓÔ Ò ÒÓ ÒÓÓÔ Ò ÙÒÐ ÐÝ Ú ÒØ Ò
More informationJUDGE, JURY AND CLASSIFIER
JUDGE, JURY AND CLASSIFIER An Introduction to Trees 15.071x The Analytics Edge The American Legal System The legal system of the United States operates at the state level and at the federal level Federal
More informationPopularity Prediction of Reddit Texts
San Jose State University SJSU ScholarWorks Master's Theses Master's Theses and Graduate Research Spring 2016 Popularity Prediction of Reddit Texts Tracy Rohlin San Jose State University Follow this and
More informationPredicting Congressional Votes Based on Campaign Finance Data
1 Predicting Congressional Votes Based on Campaign Finance Data Samuel Smith, Jae Yeon (Claire) Baek, Zhaoyi Kang, Dawn Song, Laurent El Ghaoui, Mario Frank Department of Electrical Engineering and Computer
More informationConstraint satisfaction problems. Lirong Xia
Constraint satisfaction problems Lirong Xia Spring, 2017 Project 1 Ø You can use Windows Ø Read the instruction carefully, make sure you understand the goal search for YOUR CODE HERE Ø Ask and answer questions
More informationAutomated Classification of Congressional Legislation
Automated Classification of Congressional Legislation Stephen Purpura John F. Kennedy School of Government Harvard University +-67-34-2027 stephen_purpura@ksg07.harvard.edu Dustin Hillard Electrical Engineering
More informationTengyu Ma Facebook AI Research. Based on joint work with Rong Ge (Duke) and Jason D. Lee (USC)
Tengyu Ma Facebook AI Research Based on joint work with Rong Ge (Duke) and Jason D. Lee (USC) Users Optimization Researchers function f Solution gradient descent local search Convex relaxation + Rounding
More informationThinkwell s Homeschool Microeconomics Course Lesson Plan: 31 weeks
Thinkwell s Homeschool Microeconomics Course Lesson Plan: 31 weeks Welcome to Thinkwell s Homeschool Microeconomics! We re thrilled that you ve decided to make us part of your homeschool curriculum. This
More informationAppendix to Non-Parametric Unfolding of Binary Choice Data Keith T. Poole Graduate School of Industrial Administration Carnegie-Mellon University
Appendix to Non-Parametric Unfolding of Binary Choice Data Keith T. Poole Graduate School of Industrial Administration Carnegie-Mellon University 7 July 1999 This appendix is a supplement to Non-Parametric
More informationAutomatic Thematic Classification of the Titles of the Seimas Votes
Automatic Thematic Classification of the Titles of the Seimas Votes Vytautas Mickevičius 1,2 Tomas Krilavičius 1,2 Vaidas Morkevičius 3 Aušra Mackutė-Varoneckienė 1 1 Vytautas Magnus University, 2 Baltic
More informationClassification of posts on Reddit
Classification of posts on Reddit Pooja Naik Graduate Student CSE Dept UCSD, CA, USA panaik@ucsd.edu Sachin A S Graduate Student CSE Dept UCSD, CA, USA sachinas@ucsd.edu Vincent Kuri Graduate Student CSE
More informationResearch and strategy for the land community.
Research and strategy for the land community. To: Northeastern Minnesotans for Wilderness From: Sonia Wang, Spencer Phillips Date: 2/27/2018 Subject: Full results from the review of comments on the proposed
More informationLab 3: Logistic regression models
Lab 3: Logistic regression models In this lab, we will apply logistic regression models to United States (US) presidential election data sets. The main purpose is to predict the outcomes of presidential
More informationMigration and Tourism Flows to New Zealand
Migration and Tourism Flows to New Zealand Murat Genç University of Otago, Dunedin, New Zealand Email address for correspondence: murat.genc@otago.ac.nz 30 April 2010 PRELIMINARY WORK IN PROGRESS NOT FOR
More informationIntroduction to Path Analysis: Multivariate Regression
Introduction to Path Analysis: Multivariate Regression EPSY 905: Multivariate Analysis Spring 2016 Lecture #7 March 9, 2016 EPSY 905: Multivariate Regression via Path Analysis Today s Lecture Multivariate
More informationCS 229: r/classifier - Subreddit Text Classification
CS 229: r/classifier - Subreddit Text Classification Andrew Giel agiel@stanford.edu Jonathan NeCamp jnecamp@stanford.edu Hussain Kader hkader@stanford.edu Abstract This paper presents techniques for text
More informationLearning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner. Abstract
Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner Abstract For our project, we analyze data from US Congress voting records, a dataset that consists
More informationStatistical Analysis of Corruption Perception Index across countries
Statistical Analysis of Corruption Perception Index across countries AMDA Project Summary Report (Under the guidance of Prof Malay Bhattacharya) Group 3 Anit Suri 1511007 Avishek Biswas 1511013 Diwakar
More informationPROJECTING THE LABOUR SUPPLY TO 2024
PROJECTING THE LABOUR SUPPLY TO 2024 Charles Simkins Helen Suzman Professor of Political Economy School of Economic and Business Sciences University of the Witwatersrand May 2008 centre for poverty employment
More informationCoalitional Game Theory
Coalitional Game Theory Game Theory Algorithmic Game Theory 1 TOC Coalitional Games Fair Division and Shapley Value Stable Division and the Core Concept ε-core, Least core & Nucleolus Reading: Chapter
More informationIdentifying Factors in Congressional Bill Success
Identifying Factors in Congressional Bill Success CS224w Final Report Travis Gingerich, Montana Scher, Neeral Dodhia Introduction During an era of government where Congress has been criticized repeatedly
More informationthe notion that poverty causes terrorism. Certainly, economic theory suggests that it would be
he Nonlinear Relationship Between errorism and Poverty Byline: Poverty and errorism Walter Enders and Gary A. Hoover 1 he fact that most terrorist attacks are staged in low income countries seems to support
More informationProbabilistic earthquake early warning in complex earth models using prior sampling
Probabilistic earthquake early warning in complex earth models using prior sampling Andrew Valentine, Paul Käufl & Jeannot Trampert EGU 2016 21 st April www.geo.uu.nl/~andrew a.p.valentine@uu.nl A case
More informationParties, Candidates, Issues: electoral competition revisited
Parties, Candidates, Issues: electoral competition revisited Introduction The partisan competition is part of the operation of political parties, ranging from ideology to issues of public policy choices.
More informationMigrant Wages, Human Capital Accumulation and Return Migration
Migrant Wages, Human Capital Accumulation and Return Migration Jérôme Adda Christian Dustmann Joseph-Simon Görlach February 14, 2014 PRELIMINARY and VERY INCOMPLETE Abstract This paper analyses the wage
More informationImmigrants Inflows, Native outflows, and the Local Labor Market Impact of Higher Immigration David Card
Immigrants Inflows, Native outflows, and the Local Labor Market Impact of Higher Immigration David Card Mehdi Akhbari, Ali Choubdaran 1 Table of Contents Introduction Theoretical Framework limitation of
More informationComputational Inelasticity FHLN05. Assignment A non-linear elasto-plastic problem
Computational Inelasticity FHLN05 Assignment 2016 A non-linear elasto-plastic problem General instructions A written report should be submitted to the Division of Solid Mechanics no later than 1 November
More informationσ IηIη Andrew Askew Florida State University
σ IηIη Andrew Askew Florida State University Ø As I have doubtless said previously, photon identification is difficult. There are many reasons for this, but foremost among them is that you have few direct
More informationPROJECTION OF NET MIGRATION USING A GRAVITY MODEL 1. Laboratory of Populations 2
UN/POP/MIG-10CM/2012/11 3 February 2012 TENTH COORDINATION MEETING ON INTERNATIONAL MIGRATION Population Division Department of Economic and Social Affairs United Nations Secretariat New York, 9-10 February
More informationCombining national and constituency polling for forecasting
Combining national and constituency polling for forecasting Chris Hanretty, Ben Lauderdale, Nick Vivyan Abstract We describe a method for forecasting British general elections by combining national and
More informationRead My Lips : Using Automatic Text Analysis to Classify Politicians by Party and Ideology 1
Read My Lips : Using Automatic Text Analysis to Classify Politicians by Party and Ideology 1 Eitan Sapiro-Gheiler 2 June 15, 2018 Department of Economics Princeton University 1 Acknowledgements: I would
More informationThe Analytics of the Wage Effect of Immigration. George J. Borjas Harvard University September 2009
The Analytics of the Wage Effect of Immigration George J. Borjas Harvard University September 2009 1. The question Do immigrants alter the employment opportunities of native workers? After World War I,
More informationAnnouncements. HW3 Due tonight HW4 posted No class Thursday (Thanksgiving) 2017 Kevin Jamieson
Announcements HW3 Due tonight HW4 posted No class Thursday (Thanksgiving) 2017 Kevin Jamieson 1 Mixtures of Gaussians Machine Learning CSE546 Kevin Jamieson University of Washington November 20, 2016 Kevin
More information! = ( tapping time ).
AP Statistics Name: Per: Date: 3. Least- Squares Regression p164 168 Ø What is the general form of a regression equation? What is the difference between y and ŷ? Example: Tapping on cans Don t you hate
More informationIMMIGRATION REFORM, JOB SELECTION AND WAGES IN THE U.S. FARM LABOR MARKET
IMMIGRATION REFORM, JOB SELECTION AND WAGES IN THE U.S. FARM LABOR MARKET Lurleen M. Walters International Agricultural Trade & Policy Center Food and Resource Economics Department P.O. Box 040, University
More informationClassification and Regression Approaches to Predicting United States Senate Elections. Rohan Sampath, Yue Teng
Classification and Regression Approaches to Predicting United States Senate Elections Rohan Sapath, Yue Teng Abstract The United States Senate is arguably the finest deocratic institution for debate and
More informationGeneralized Scoring Rules: A Framework That Reconciles Borda and Condorcet
Generalized Scoring Rules: A Framework That Reconciles Borda and Condorcet Lirong Xia Harvard University Generalized scoring rules [Xia and Conitzer 08] are a relatively new class of social choice mechanisms.
More informationDrug Trafficking Organizations and Local Economic Activity in Mexico
RESEARCH ARTICLE Drug Trafficking Organizations and Local Economic Activity in Mexico Felipe González* Department of Economics, University of California, Berkeley, California, United States of America
More informationImproved Boosting Algorithms Using Confidence-rated Predictions
Improved Boosting Algorithms Using Confidence-rated Predictions ÊÇÊÌ º ËÀÈÁÊ schapire@research.att.com AT&T Labs, Shannon Laboratory, 18 Park Avenue, Room A279, Florham Park, NJ 7932-971 ÇÊÅ ËÁÆÊ singer@research.att.com
More informationDeep Classification and Generation of Reddit Post Titles
Deep Classification and Generation of Reddit Post Titles Tyler Chase tchase56@stanford.edu Rolland He rhe@stanford.edu William Qiu willqiu@stanford.edu Abstract The online news aggregation website Reddit
More informationThe Trade Effects of Skilled versus Unskilled Migration
The Trade Effects of Skilled versus Unskilled Migration Peter H. Egger ETH Zurich Maximilian von Ehrlich University of Bern September 29, 2014 Douglas R. Nelson Tulane University Abstract In this paper,
More informationAn Investigation into a Circuit Based Supply Chain Analyzer for FPGAs
An Investigation into a Circuit Based Supply Chain Analyzer for FPGAs FPL-2016 9/1/2016 Jacob Couch 1 John Arkorian Staff Researchers 1 jacob.couch@jhuapl.edu What is the problem anyways? How can FPGAs
More informationThe cost of ruling, cabinet duration, and the median-gap model
Public Choice 113: 157 178, 2002. 2002 Kluwer Academic Publishers. Printed in the Netherlands. 157 The cost of ruling, cabinet duration, and the median-gap model RANDOLPH T. STEVENSON Department of Political
More informationPolitical Economics II Spring Lectures 4-5 Part II Partisan Politics and Political Agency. Torsten Persson, IIES
Lectures 4-5_190213.pdf Political Economics II Spring 2019 Lectures 4-5 Part II Partisan Politics and Political Agency Torsten Persson, IIES 1 Introduction: Partisan Politics Aims continue exploring policy
More informationThe Costs of Remoteness, Evidence From German Division and Reunification by Redding and Sturm (AER, 2008)
The Costs of Remoteness, Evidence From German Division and Reunification by Redding and Sturm (AER, 2008) MIT Spatial Economics Reading Group Presentation Adam Guren May 13, 2010 Testing the New Economic
More informationA Vote Equation and the 2004 Election
A Vote Equation and the 2004 Election Ray C. Fair November 22, 2004 1 Introduction My presidential vote equation is a great teaching example for introductory econometrics. 1 The theory is straightforward,
More informationMIPAS Temperature and Pressure Validation by RO Data
MIPAS and Validation by RO Data Marc Schwaerz and Gottfried Kirchengast Wegener Center (WEGC), Graz, Austria MIPAS Quality Working Group Meeting 40, November 3, 2015 Outline 1 2 Validation and Reference
More informationCHAPTER FIVE RESULTS REGARDING ACCULTURATION LEVEL. This chapter reports the results of the statistical analysis
CHAPTER FIVE RESULTS REGARDING ACCULTURATION LEVEL This chapter reports the results of the statistical analysis which aimed at answering the research questions regarding acculturation level. 5.1 Discriminant
More informationRemittances and the Brain Drain: Evidence from Microdata for Sub-Saharan Africa
Remittances and the Brain Drain: Evidence from Microdata for Sub-Saharan Africa Julia Bredtmann 1, Fernanda Martinez Flores 1,2, and Sebastian Otten 1,2,3 1 RWI, Rheinisch-Westfälisches Institut für Wirtschaftsforschung
More informationTHE EVALUATION OF OUTPUT CONVERGENCE IN SEVERAL CENTRAL AND EASTERN EUROPEAN COUNTRIES
ISSN 1392-1258. ekonomika 2015 Vol. 94(1) THE EVALUATION OF OUTPUT CONVERGENCE IN SEVERAL CENTRAL AND EASTERN EUROPEAN COUNTRIES Simionescu M.* Institute for Economic Forecasting of the Romanian Academy
More informationSIMPLE LINEAR REGRESSION OF CPS DATA
SIMPLE LINEAR REGRESSION OF CPS DATA Using the 1995 CPS data, hourly wages are regressed against years of education. The regression output in Table 4.1 indicates that there are 1003 persons in the CPS
More information(a) Draw side-by-side box plots that show the yields of the two types of land. Check for outliers before making the plots.
1. In hilly areas, farmers often contour their fields to reduce the erosion due to water flow. This might have the unintended effect of changing the yield since the rows may not be aligned in an east-west
More informationRecommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012
Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012 Abstract In this paper we attempt to develop an algorithm to generate a set of post recommendations
More informationUsing Poole s Optimal Classification in R
Using Poole s Optimal Classification in R January 22, 2018 1 Introduction This package estimates Poole s Optimal Classification scores from roll call votes supplied though a rollcall object from package
More informationVOTING ON INCOME REDISTRIBUTION: HOW A LITTLE BIT OF ALTRUISM CREATES TRANSITIVITY DONALD WITTMAN ECONOMICS DEPARTMENT UNIVERSITY OF CALIFORNIA
1 VOTING ON INCOME REDISTRIBUTION: HOW A LITTLE BIT OF ALTRUISM CREATES TRANSITIVITY DONALD WITTMAN ECONOMICS DEPARTMENT UNIVERSITY OF CALIFORNIA SANTA CRUZ wittman@ucsc.edu ABSTRACT We consider an election
More informationJudgeIt II: A Program for Evaluating Electoral Systems and Redistricting Plans 1
JudgeIt II: A Program for Evaluating Electoral Systems and Redistricting Plans 1 Andrew Gelman Gary King 2 Andrew C. Thomas 3 Version 1.3.4 August 31, 2010 1 Available from CRAN (http://cran.r-project.org/)
More informationWage Rigidity and Spatial Misallocation: Evidence from Italy and Germany
Wage Rigidity and Spatial Misallocation: Evidence from Italy and Germany Tito Boeri 1 Andrea Ichino 2 Enrico Moretti 3 Johanna Posch 2 1 Bocconi 2 European University Institute 3 Berkeley 10 April 2018
More informationCS269I: Incentives in Computer Science Lecture #4: Voting, Machine Learning, and Participatory Democracy
CS269I: Incentives in Computer Science Lecture #4: Voting, Machine Learning, and Participatory Democracy Tim Roughgarden October 5, 2016 1 Preamble Last lecture was all about strategyproof voting rules
More informationHoboken Public Schools. College Algebra Curriculum
Hoboken Public Schools College Algebra Curriculum College Algebra HOBOKEN PUBLIC SCHOOLS Course Description College Algebra reflects the New Jersey learning standards at the high school level and is designed
More informationThe Shadow Value of Legal Status --A Hedonic Analysis of the Earnings of U.S. Farm Workers 1
The Shadow Value of Legal Status --A Hedonic Analysis of the Earnings of U.S. Farm Workers 1 June, 3 rd, 2013 Sun Ling Wang 2 Economic Research Service, U.S. Department of Agriculture Daniel Carroll Employment
More informationWage Trends among Disadvantaged Minorities
National Poverty Center Working Paper Series #05-12 August 2005 Wage Trends among Disadvantaged Minorities George J. Borjas Harvard University This paper is available online at the National Poverty Center
More informationRECOMMENDED CITATION: Pew Research Center, May, 2017, Partisan Identification Is Sticky, but About 10% Switched Parties Over the Past Year
NUMBERS, FACTS AND TRENDS SHAPING THE WORLD FOR RELEASE MAY 17, 2017 FOR MEDIA OR OTHER INQUIRIES: Carroll Doherty, Director of Political Research Jocelyn Kiley, Associate Director, Research Bridget Johnson,
More informationWas This Review Helpful to You? It Depends! Context and Voting Patterns in Online Content
Was This Review Helpful to You? It Depends! Context and Voting Patterns in Online Content Ruben Sipos Dept. of Computer Science Cornell University Ithaca, NY rs@cs.cornell.edu Arpita Ghosh Dept. of Information
More informationSupplementary Tables for Online Publication: Impact of Judicial Elections in the Sentencing of Black Crime
Supplementary Tables for Online Publication: Impact of Judicial Elections in the Sentencing of Black Crime Kyung H. Park Wellesley College March 23, 2016 A Kansas Background A.1 Partisan versus Retention
More informationHoboken Public Schools. AP Statistics Curriculum
Hoboken Public Schools AP Statistics Curriculum AP Statistics HOBOKEN PUBLIC SCHOOLS Course Description AP Statistics is the high school equivalent of a one semester, introductory college statistics course.
More informationCategory-level localization. Cordelia Schmid
Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object
More informationOn the Determinants of Global Bilateral Migration Flows
On the Determinants of Global Bilateral Migration Flows Jesus Crespo Cuaresma Mathias Moser Anna Raggl Preliminary Draft, May 2013 Abstract We present a method aimed at estimating global bilateral migration
More informationCongressional Gridlock: The Effects of the Master Lever
Congressional Gridlock: The Effects of the Master Lever Olga Gorelkina Max Planck Institute, Bonn Ioanna Grypari Max Planck Institute, Bonn Preliminary & Incomplete February 11, 2015 Abstract This paper
More informationPivoted Text Scaling for Open-Ended Survey Responses
Pivoted Text Scaling for Open-Ended Survey Responses William Hobbs September 28, 2017 Abstract Short texts such as open-ended survey responses and tweets contain valuable information about public opinions,
More informationChapter Five: Forces. Ø 5.1 Forces. Ø 5.2 Friction. Ø 5.3 Forces and Equilibrium
Chapter Five: Forces Ø 5.1 Forces Ø 5.2 Friction Ø 5.3 Forces and Equilibrium 5.1 The cause of forces Ø A force is a push or pull, or an action that has the ability to change motion. Ø Forces can increase
More informationClimate Change Around the World
Climate Change Around the World Per Krusell Institute for International Economic Studies, NBER, CEPR Joint with Anthony A. Smith, Jr. Yale University, NBER World Congress Montréal Août, 215 The project
More informationSchooling and Cohort Size: Evidence from Vietnam, Thailand, Iran and Cambodia. Evangelos M. Falaris University of Delaware. and
Schooling and Cohort Size: Evidence from Vietnam, Thailand, Iran and Cambodia by Evangelos M. Falaris University of Delaware and Thuan Q. Thai Max Planck Institute for Demographic Research March 2012 2
More informationDo Individual Heterogeneity and Spatial Correlation Matter?
Do Individual Heterogeneity and Spatial Correlation Matter? An Innovative Approach to the Characterisation of the European Political Space. Giovanna Iannantuoni, Elena Manzoni and Francesca Rossi EXTENDED
More informationSocialSecurityEligibilityandtheLaborSuplyofOlderImigrants. George J. Borjas Harvard University
SocialSecurityEligibilityandtheLaborSuplyofOlderImigrants George J. Borjas Harvard University February 2010 1 SocialSecurityEligibilityandtheLaborSuplyofOlderImigrants George J. Borjas ABSTRACT The employment
More informationVote Compass Methodology
Vote Compass Methodology 1 Introduction Vote Compass is a civic engagement application developed by the team of social and data scientists from Vox Pop Labs. Its objective is to promote electoral literacy
More informationUse and abuse of voter migration models in an election year. Dr. Peter Moser Statistical Office of the Canton of Zurich
Use and abuse of voter migration models in an election year Statistical Office of the Canton of Zurich Overview What is a voter migration model? How are they estimated? Their use in forecasting election
More informationQuant 101 Learn2Quant HK, 14 September Vinesh Jha CEO, ExtractAlpha
Quant 101 Learn2Quant HK, 14 September 2017 Vinesh Jha CEO, ExtractAlpha Data-driven investment is the future ExtractAlpha confidential. Do not copy or distribute. 2 Agenda Ø Quick intro to ExtractAlpha
More informationUnderstanding factors that influence L1-visa outcomes in US
Understanding factors that influence L1-visa outcomes in US By Nihar Dalmia, Meghana Murthy and Nianthrini Vivekanandan Link to online course gallery : https://www.ischool.berkeley.edu/projects/2017/understanding-factors-influence-l1-work
More informationFinal Review. Chenyang Lu. CSE 467S Embedded Compu5ng Systems
Final Review Chenyang Lu CSE 467S Embedded Compu5ng Systems OS: Basic Func2ons Ø OS controls resources: q who gets the CPU; q when I/O takes place; q how much memory is allocated; q power management. Ø
More informationMigration With Endogenous Social Networks in China
Migration With Endogenous Social Networks in China Jin Zhou (University of Western Ontario) May 2015 Abstract Numerous empirical studies have documented a strong association between social networks and
More informationHierarchical Item Response Models for Analyzing Public Opinion
Hierarchical Item Response Models for Analyzing Public Opinion Xiang Zhou Harvard University July 16, 2017 Xiang Zhou (Harvard University) Hierarchical IRT for Public Opinion July 16, 2017 Page 1 Features
More informationSplit Decisions: Household Finance when a Policy Discontinuity allocates Overseas Work
Split Decisions: Household Finance when a Policy Discontinuity allocates Overseas Work Michael Clemens and Erwin Tiongson Review of Economics and Statistics (Forthcoming) Marian Atallah Presented by: Mohamed
More informationData Assimilation in Geosciences
Data Assimilation in Geosciences Alberto Carrassi The Nordic Centre of Excellence for ensemble-based data assimilation Laurent Bertino (Lead), Alberto Carrassi (Co-Lead), Colin Grudzien (PD), Patrick Raanes
More informationWomen and Power: Unpopular, Unwilling, or Held Back? Comment
Women and Power: Unpopular, Unwilling, or Held Back? Comment Manuel Bagues, Pamela Campa May 22, 2017 Abstract Casas-Arce and Saiz (2015) study how gender quotas in candidate lists affect voting behavior
More informationSchool Quality and Returns to Education of U.S. Immigrants. Bernt Bratsberg. and. Dek Terrell* RRH: BRATSBERG & TERRELL:
Forthcoming, Economic Inquiry School Quality and Returns to Education of U.S. Immigrants Bernt Bratsberg and Dek Terrell* RRH: BRATSBERG & TERRELL: SCHOOL QUALITY AND EDUCATION RETURNS OF IMMIGRANTS JEL
More informationDeterminants and Effects of Negative Advertising in Politics
Department of Economics- FEA/USP Determinants and Effects of Negative Advertising in Politics DANILO P. SOUZA MARCOS Y. NAKAGUMA WORKING PAPER SERIES Nº 2017-25 DEPARTMENT OF ECONOMICS, FEA-USP WORKING
More informationIPSA International Conference Concordia University, Montreal (Quebec), Canada April 30 May 2, 2008
IPSA International Conference Concordia University, Montreal (Quebec), Canada April 30 May 2, 2008 Yuri A. Polunin, Sc. D., Professor. Phone: +7 (495) 433-34-95 E-mail: : polunin@expert.ru polunin@crpi.ru
More informationWind power integration and consumer behavior: a complementarity approach
1 Wind power integration and consumer behavior: a complementarity approach 8 th Annual Trans-Atlantic INFRADAY Conference on Energy November 7 th, 2014 Ali Daraeepour, Duke University Dr. Jalal Kazempour,
More informationThe Effects of Housing Prices, Wages, and Commuting Time on Joint Residential and Job Location Choices
The Effects of Housing Prices, Wages, and Commuting Time on Joint Residential and Job Location Choices Kim S. So, Peter F. Orazem, and Daniel M. Otto a May 1998 American Agricultural Economics Association
More information