Instructors: Tengyu Ma and Chris Re

Similar documents
Support Vector Machines

Cluster Analysis. (see also: Segmentation)

Overview. Ø Neural Networks are considered black-box models Ø They are complex and do not provide much insight into variable relationships

Tengyu Ma Facebook AI Research. Based on joint work with Rong Ge (Duke) and Jason D. Lee (USC)

Do two parties represent the US? Clustering analysis of US public ideology survey

Towards Tackling Hate Online Automatically

Probabilistic Latent Semantic Analysis Hofmann (1999)

Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner. Abstract

Classification of posts on Reddit

Introduction to Path Analysis: Multivariate Regression

CS 229 Final Project - Party Predictor: Predicting Political A liation

Essential Questions Content Skills Assessments Standards/PIs. Identify prime and composite numbers, GCF, and prime factorization.

Tengyu Ma Facebook AI Research. Based on joint work with Yuanzhi Li (Princeton) and Hongyang Zhang (Stanford)

Hoboken Public Schools. PLTW Introduction to Computer Science Curriculum

CHE 572: Modelling Process Dynamics

Statistics, Politics, and Policy

No Adults Allowed! Unsupervised Learning Applied to Gerrymandered School Districts

Deep Learning Working Group R-CNN

Dimension Reduction. Why and How

THE PREPARED CURRICULUM:

Hoboken Public Schools. Project Lead The Way Curriculum Grade 8

Congressional Gridlock: The Effects of the Master Lever

A Skeleton-Based Model for Promoting Coherence Among Sentences in Narrative Story Generation

Rock the Vote or Vote The Rock

Voting and Markov Processes

CS 229: r/classifier - Subreddit Text Classification

Category-level localization. Cordelia Schmid

Classifier Evaluation and Selection. Review and Overview of Methods

Hoboken Public Schools. Algebra II Honors Curriculum

A Study on Chinese Firms in Hamburg

Psychological Factors

NEW YORK CITY COLLEGE OF TECHNOLOGY The City University of New York

Hoboken Public Schools. AP Statistics Curriculum

AMONG the vast and diverse collection of videos in

Do natives beliefs about refugees education level affect attitudes toward refugees? Evidence from randomized survey experiments

DU PhD in Home Science

1. The augmented matrix for this system is " " " # (remember, I can't draw the V Ç V ß #V V Ä V ß $V V Ä V

THE PRIMITIVES OF LEGAL PROTECTION AGAINST DATA TOTALITARIANISMS

Students Union Elections Nomination Package

An Algorithmic and Computational Approach to Optimizing Gerrymandering

Big Data, information and political campaigns: an application to the 2016 US Presidential Election

DOWNLOAD OR READ : THE UNOFFICIAL BOOK ON HOOTSUITE THE 1 TOOL FOR SOCIAL MEDIA MANAGEMENT PDF EBOOK EPUB MOBI

P(x) testing training. x Hi

Exploring QR Factorization on GPU for Quantum Monte Carlo Simulation

Supporting Information Political Quid Pro Quo Agreements: An Experimental Study

Random Forests. Gradient Boosting. and. Bagging and Boosting

Essential Patterns of Mature Agile Leadership

Statistical Analysis of Corruption Perception Index across countries

IPSA International Conference Concordia University, Montreal (Quebec), Canada April 30 May 2, 2008

Deep Learning and Visualization of Election Data

MATH 1340 Mathematics & Politics

Experiments on Data Preprocessing of Persian Blog Networks

Can Politicians Police Themselves? Natural Experimental Evidence from Brazil s Audit Courts Supplementary Appendix

Analysis of the Reputation System and User Contributions on a Question Answering Website: StackOverflow

INTEGRATION OF THE SECOND GENERATION IN ITALY: DATA ON SCHOOL INCLUSION PATHS

Hoboken Public Schools. Algebra I Curriculum

SIMPLE LINEAR REGRESSION OF CPS DATA

Rules of the Control and Risk Committee of Eni SpA

Hoboken Public Schools. College Algebra Curriculum

A Global Perspective on Socioeconomic Differences in Learning Outcomes

Hoboken Public Schools. Project Lead The Way Curriculum Grade 7

Political Economics II Spring Lectures 4-5 Part II Partisan Politics and Political Agency. Torsten Persson, IIES

(a) Draw side-by-side box plots that show the yields of the two types of land. Check for outliers before making the plots.

Learning Systems. Research at the Intersection of Machine Learning & Data Systems. Joseph E. Gonzalez

THE PREPARED CURRICULUM: FOR POST-SECONDARY AND CAREEER READINESS

! = ( tapping time ).

Hoboken Public Schools. Geometry Curriculum

Evidence-Based Practices and Access to Justice

Analyzing and Representing Two-Mode Network Data Week 8: Reading Notes

New Jersey Frequently Asked Questions

Hoboken Public Schools. Environmental Science Honors Curriculum

CHAPTER 5 SOCIAL INCLUSION LEVEL

My career in astronomy: From watching sci-fi to studying the stars. James Wurster

What makes people feel free: Subjective freedom in comparative perspective Progress Report

DETERMINANTS OF IMMIGRANTS EARNINGS IN THE ITALIAN LABOUR MARKET: THE ROLE OF HUMAN CAPITAL AND COUNTRY OF ORIGIN

List of Tables and Appendices

Weekly Assignments March 6-10, 2017

DOWNLOAD OR READ : TOPICS IN COHOMOLOGY OF GROUPS PDF EBOOK EPUB MOBI

Pivoted Text Scaling for Open-Ended Survey Responses

Practice Questions for Exam #2

Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012

Chapter 8: Recursion

If you have questions about Speak Up or the contents of this packet, please contact the Speak Up team at

Constraint satisfaction problems. Lirong Xia

Alabama Frequently Asked Questions TABLE OF CONTENTS

CSE 190 Professor Julian McAuley Assignment 2: Reddit Data. Forrest Merrill, A Marvin Chau, A William Werner, A

World of Labor. John V. Winters Oklahoma State University, USA, and IZA, Germany. Cons. Pros

NAMI is leading a National Call-In Day for Senate Mental Health Reform on Friday, September 2 nd.

Lecture 18 Sociology 621 November 14, 2011 Class Struggle and Class Compromise

Homework 4 solutions

SUBJECT: SPECIAL CONDITION X (SEX OFFENDER CONDITION)

COMPUTATIONAL CREATIVITY EVALUATION

Understanding factors that influence L1-visa outcomes in US

Distributed representations of politicians

Behind a thin veil of ignorance and beyond the original position: a social experiment for distributive policy preferences of young people in Greece.

making infographics How to turn data into dat s awesome

CAMPAIGN MANAGEMENT & ORGANIZATION

A GENERAL TYPOLOGY OF PERSONAL NETWORKS OF IMMIGRANTS WITH LESS THAN 10 YEARS LIVING IN SPAIN

Recount Process. Ventura County Elections Division. 800 South Victoria Avenue Ventura, CA (805) venturavote.

Subreddit Recommendations within Reddit Communities

Transcription:

Instructors: Tengyu Ma and Chris Re

cs229.stanford.edu

Ø Probability (CS109 or STAT 116) Ø distribution, random variable, expectation, conditional probability, variance, density Ø Linear algebra (Math 104, Math 113, or CS205) Ø matrix multiplication Ø eigenvector Ø Basic programming (in Python) Ø Will be reviewed in Friday sections (recorded) This is a mathematically intense course. But that s why it s exciting and rewarding!

Do s Ø write down the solutions independently Ø write down the names of people with whom you ve discussed the homework Ø read the longer description on the course website Don ts Ø copy, refer to, or look at any official or unofficial previous years solutions in preparingthe answers

Ø We encourage you to form a group of 1-3 people Ø same criterion for 1-3 people Ø More information and previous course projects can be found on course website Ø List of potential topics Athletics & Sensing Devices Audio & Music Computer Vision Finance & Commerce General Machine Learning Life Sciences Natural Language Physical Sciences Theory Reinforcement Learning

Ø Piazza: cs229.stanford.edu Ø technical and logistical question (anonymous or nonanonymous, private or public) Ø to find study groups friends Ø all announcement Ø Videos on canvas Ø Course calendar: office hours and deadlines Ø Section (not Fri section) vs office hour Ø Gradescope Ø you will receive invite after Axess enrollment within 24hrs Ø Late days policy Ø FAQ

cs229.stanford.edu 2. Topics Covered in This Course

Arthur Samuel (1959): Machine Learning is the field of study that gives the computer the ability to learn without being explicitly programmed. Photos from Wikipedia

Tom Mitchell (1998): a computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E. Experience (data): games played by the program (with itself) Performance measure: winning rate Image from Tom Mitchell s homepage

Supervised Learning Unsupervised Learning Reinforcement Learning

Supervised Learning Unsupervised Learning Reinforcement Learning can also be viewed as tools/methods

Ø Given: a dataset that contains, samples! (, % (, (!., %. ) Ø Task: if a residence has! square feet, predict its price? 15th sample (! (), % () )! = 800 % =?

Ø Given: a dataset that contains ' samples! (, % (, (!,, %, ) Ø Task: if a residence has! square feet, predict its price? Ø Lecture 2&3: fitting! linear/qaudratic = 800 functions to the dataset % =?

Ø Suppose we also know the lot size Ø Task: find a function that maps (size, lot size) price features/input label/output " R % & R & Ø Dataset: " ', & ',, (" +, & + ) where " (-) = (" ' -, " % - ) Ø Supervision refers to & ('),, & (+) " % " '

Ø! R $ for large % Ø E.g.,! =! '! (! )! $ --- living size --- lot size --- # floors --- condition --- zip code + --- price Ø Lecture 6-7: infinite dimensional features Ø Lecture 10-11: select features based on the data

Ø regression: if! R is a continuous variable Ø e.g., price prediction Ø classification: the label is a discrete variable Ø e.g., the task of predicting the types of residence (size, lot size) house or townhouse? Lecture 3&4: classification! = house or townhouse?

Ø Image Classification Ø! = raw pixels of the image, # = the main object ImageNet Large Scale Visual Recognition Challenge. Russakovsky et al. 2015

Ø Object localization and detection Ø! = raw pixels of the image, # = the bounding boxes ImageNet Large Scale Visual Recognition Challenge. Russakovsky et al. 2015

Ø Machine translation! " Ø Note: this course only covers the basic and fundamental techniques of supervised learning (which are not enough for solving hard vision or NLP problems.) Ø CS224N and CS231N would be more suitable if you are interested in the particular applications

Ø Dataset contains no labels:! ",! % Ø Goal (vaguely-posed): to find interesting structures in the data supervised unsupervised

Ø Lecture 12&13: k-mean clustering,mixtureof Gaussians

Cluster 7 Genes Cluster 1 Individuals Identifying Regulatory Mechanisms using Individual Variation Reveals Key Role for Chromatin Modification. [Su-In Lee, Dana Pe'er, Aimee M. Dudley, George M. Church and Daphne Koller. 06]

documents words Ø Lecture 14: principal component analysis (tools used in LSA) Image credit: https://commons.wikimedia.org/wiki/file:topic_ detection_in_a_document-word_matrix.gif

Word2vec [Mikolovet al 13] GloVe [Pennington et al 14] models Represent words by vectors Ø word!"#$%! vector Unlabeled dataset Ø relation!"#$%! direction Italy Rome Paris Berlin France Germany

[Arora-Ge-Liang-M.-Risteski, TACL 17,18]

learning to walk to the right Iteration 10 [Luo-Xu-Li-Tian-Darrell-M. 18]

learning to walk to the right Iteration 20 [Luo-Xu-Li-Tian-Darrell-M. 18]

learning to walk to the right Iteration 80 [Luo-Xu-Li-Tian-Darrell-M. 18]

learning to walk to the right Iteration 210 [Luo-Xu-Li-Tian-Darrell-M. 18]

Ø The algorithm can collect data interactively Try the strategy and collect feedbacks Data collection Training Improve the strategy based on the feedbacks

Supervised Learning Unsupervised Learning Reinforcement Learning can also be viewed as tools/methods

Ø Deep learning basics Ø Introduction to learning theory Ø Bias variance tradeoff Ø Feature selection Ø ML advice

Thank you!