An Unbiased Measure of Media Bias Using Latent Topic Models

Similar documents
FAIR AND BALANCED? QUANTIFYING MEDIA BIAS THROUGH CROWDSOURCED CONTENT ANALYSIS

Topic Analysis of Climate Change Coverage in the UK

Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner. Abstract

Political Blogs: A Dynamic Text Network. David Banks. DukeUniffirsity

MIDTERM EXAM 1: Political Economy Winter 2017

IPSOS POLL DATA Prepared by Ipsos Public Affairs

State of the Facts 2018

1 PEW RESEARCH CENTER

Final Court Rulings: Public Equally Interested in Voting Rights, Gay Marriage

PERCEIVED ACCURACY AND BIAS IN THE NEWS MEDIA A GALLUP/KNIGHT FOUNDATION SURVEY

Identifying Ideological Perspectives of Web Videos Using Folksonomies

Identifying Ideological Perspectives of Web Videos using Patterns Emerging from Folksonomies

Hierarchical Item Response Models for Analyzing Public Opinion

VS. Who REALLY Owns the Web?

Islamophobia and the American Elections How Does It Look in America and The Middle East?

THE AUTHORITY REPORT. How Audiences Find Articles, by Topic. How does the audience referral network change according to article topic?

PEW RESEARCH CENTER FOR THE PEOPLE & THE PRESS LATE DECEMBER, 2007 POLITICAL COMMUNICATIONS STUDY FINAL TOPLINE December 19- December 30, 2007 N=1430

Partisan Accountability and Economic Voting

PEW RESEARCH CENTER FOR THE PEOPLE & THE PRESS JUNE 2005 NEWS INTEREST INDEX / MEDIA UPDATE FINAL TOPLINE JUNE 8-12, 2005 N=1,464

Digital Journalism and the Challenges of Managing a 21st Century Newsroom Workforce Pilot Study Overview: Data Collection Process

SUPPLEMENT TO WHAT DRIVES MEDIA SLANT? EVIDENCE FROM U.S. DAILY NEWSPAPERS (Econometrica, Vol. 78, No. 1, January 2010, 35 71)

PEW RESEARCH CENTER FOR THE PEOPLE & THE PRESS/WASHINGTON POST MAY OSAMA BIN LADEN SURVEY FINAL TOPLINE May 2, 2011 N=654

The Issue-Adjusted Ideal Point Model

Pew Research Center Demographics and Questionnaire. ONLINE FOR ELECTION NEWS BY DEMOGRAPHICS (Based on General Public)

Two-dimensional voting bodies: The case of European Parliament

Congressional Gridlock: The Effects of the Master Lever

Practice Questions for Exam #2

Criminal Justice Today, 15e (Schmalleger) Chapter 1 What Is Criminal Justice? 1.1 Multiple Choice Questions

PEW RESEARCH CENTER FOR THE PEOPLE & THE PRESS JULY 2003 MEDIA UPDATE FINAL TOPLINE June 19 - July 2, 2003 N=1201

U.S. Government and Politics SUMMER ASSIGNMENT!

Primary Elections and Partisan Polarization in the U.S. Congress

Fake News 101 To Believe or Not to Believe

Linguistic Biases in News Media Reporting Within a Transnational Context

August 8, 2018 POTUS RADIO. Clifford Young. President, Ipsos Public Affairs Ipsos 1

Comments on Prat and Strömberg, and Robinson and Torvik 1

Biggest Stories of 2008: Economy Tops Campaign INTERNET OVERTAKES NEWSPAPERS AS NEWS OUTLET

THE POLITICO-GWU BATTLEGROUND POLL

Publicizing malfeasance:

PEW RESEARCH CENTER July 11-14, 2013 OMNIBUS FINAL TOPLINE N=1002

NEWS RELEASE. Political Sites Gain, But Major News Sites Still Dominant MODEST INCREASE IN INTERNET USE FOR CAMPAIGN 2002

Clinton vs. Trump 2016: Analyzing and Visualizing Tweets and Sentiments of Hillary Clinton and Donald Trump

Pessimism about Fiscal Cliff Deal, Republicans Still Get More Blame

14.770: Media Bias. Ben Olken. Olken Media Bias 1 / 63

Popularity Prediction of Reddit Texts

How Media Bias is Presented in Donald Trump s News Coverage as a Presidential Candidate by The New York Times?

PEW RESEARCH CENTER S PROJECT FOR EXCELLENCE IN JOURNALISM IN COLLABORATION WITH THE ECONOMIST GROUP 2011 Tablet News Phone Survey July 15-30, 2011

Chapter. Sampling Distributions Pearson Prentice Hall. All rights reserved

1. In general, do you think things in this country are heading in the right direction or the wrong direction? Strongly approve. Somewhat approve Net

Measuring the Political Sophistication of Voters in the Netherlands and the United States

Turmoil Draws Extensive Media Coverage Limited Public Interest in Egyptian Protests

Partisan Bias in Economic News: Evidence on the. Agenda-Setting Behavior of U.S. Newspapers

USING TEXT AS DATA TO MEASURE LATENT LEGAL CONSTRUCTS: A DICTIONARY-BASED APPROACH

Text as Data. Justin Grimmer. Associate Professor Department of Political Science Stanford University. November 20th, 2014

Partisan Interest, Reactions to IRS and AP Controversies

Phrasing and its Ideological Implications:

Rising Job Worries, Bush Economic Plan Doesn t Help PRESIDENT S CRITICISM OF MEDIA RESONATES, BUT IRAQ UNEASE GROWS

Ideological Segregation and the Effects of Social Media on News Consumption

Measuring the Political Sophistication of Voters in the Netherlands and the United States

POLITICAL POLARIZATION AND THE ELECTORAL EFFECTS OF MEDIA BIAS

Vote Compass Methodology

NUMBERS, FACTS AND TRENDS SHAPING THE WORLD FOR RELEASE DECEMBER 8, 2014 FOR FURTHER INFORMATION ON THIS REPORT:

Framing China s Corruption: A Content Analysis of Coverage on New York Times from 2006 to 2015

1. One of the various ways in which parties contribute to democratic governance is by.

5 Key Facts. About Online Discussion of Immigration in the New Trump Era

A Dramatic Change of Public Opinion In the Muslim World

Gab: The Alt-Right Social Media Platform

Support for Air Strikes is Vast Easily Eclipsing Gulf War Levels

AP AMERICAN GOVERNMENT STUDY GUIDE POLITICAL BELIEFS AND BEHAVIORS PUBLIC OPINION PUBLIC OPINION, THE SPECTRUM, & ISSUE TYPES DESCRIPTION

Implications of the 2012 Election for Health Care The Voters Perspective

RECOMMENDED CITATION: Pew Research Center, March 2014, Most Say U.S. Should Not Get Too Involved in Ukraine Situation

1 PEW RESEARCH CENTER

A Measure of Media Bias

Marist College Institute for Public Opinion Poughkeepsie, NY Phone Fax

Dictators, Terrorists, Populists: What Language Can Tell Us About Opaque and Radical Politics

A Joint Topic and Perspective Model for Ideological Discourse

Ensuring freedom of the press around the world by continued protection of reports. MUNOFS VII Research Report

Fake news on Twitter. Lisa Friedland, Kenny Joseph, Nir Grinberg, David Lazer Northeastern University

Three Essays on Political Economy of Media

An Analysis of U.S. Congressional Support for the Affordable Care Act

September 2017 Toplines

Chapter 8:3 The Media

Americans on the Middle East

Social Studies. Smyth County Schools Curriculum Map Grade:9--12 th. Subject:Current Affairs. Standards

No Change in Views of Torture, Warrantless Wiretaps OBAMA FACES FAMILIAR DIVISIONS OVER ANTI-TERROR POLICIES

Little Interest in Libya, European Debt Crisis Public Closely Tracking Economic and Political News

RECOMMENDED CITATION: Pew Research Center, September 2014, Growing Public Concern about Rise of Islamic Extremism At Home and Abroad

Views of Press Values and Performance: INTERNET NEWS AUDIENCE HIGHLY CRITICAL OF NEWS ORGANIZATIONS

Concern About Peacekeeping Grows, But More Also See a Benefit of the War

PEW RESEARCH CENTER FOR THE PEOPLE AND THE PRESS NEWS SAVVY PROJECT FINAL TOPLINE February 1-13, 2007 N= 1502

Essays on Political Economy

AMERICAN VIEWS: TRUST, MEDIA AND DEMOCRACY A GALLUP/KNIGHT FOUNDATION SURVEY

Another Day, Another Poll: Trends in Media Coverage of Polls and Surveys in the Election Realm and the Non-Election Realm

Poles Apart. The international reporting of climate scepticism. James Painter

In the Eye of the Beholder: How Information Shortcuts Shape Individual Perceptions of Bias in the Media

THREE ESSAYS IN POLITICAL ECONOMY CAGDAS AGIRDAS DISSERTATION

Buying Supermajorities

CRIMINAL JUSTICE NEWS COVERAGE IN 2012 Part 2

Political Polarization and the Electoral Effects of Media Bias

Boko Haram I. Background Boko Haram is an islamic terrorist group that is primarily ran out of Nigeria and is also

THE NEW NEWS AUDIENCE 12 ways consumers have changed in the digital age

Transcription:

An Unbiased Measure of Media Bias Using Latent Topic Models Lefteris Anastasopoulos 1 Aaron Kaufmann 2 Luke Miratrix 3 1 Harvard Kennedy School 2 Harvard University, Department of Government 3 Harvard University, Department of Statistics April 16, 2015

Goals 1 Use topic models to measure partisan selection bias and presentation bias in top 13 online media sources. 2 Rank sources by least to most absolute bias. 3 Automate calculation of bias scores for texts.

Goals 1 Use topic models to measure partisan selection bias and presentation bias in top 13 online media sources. 2 Rank sources by least to most absolute bias. 3 Automate calculation of bias scores for texts.

Goals 1 Use topic models to measure partisan selection bias and presentation bias in top 13 online media sources. 2 Rank sources by least to most absolute bias. 3 Automate calculation of bias scores for texts.

Is the Mainstream Media Biased? Most scholars say YES. Groseclose and Milyo (2005) - Liberal/Democrat bias. Gentzkow and Shapiro (2010) - Toward ideology of content consumers. Quinn and Ho (2008) - Liberal bias.

Is the Mainstream Media Biased? Most scholars say YES. Groseclose and Milyo (2005) - Liberal/Democrat bias. Gentzkow and Shapiro (2010) - Toward ideology of content consumers. Quinn and Ho (2008) - Liberal bias.

Is the Mainstream Media Biased? Most scholars say YES. Groseclose and Milyo (2005) - Liberal/Democrat bias. Gentzkow and Shapiro (2010) - Toward ideology of content consumers. Quinn and Ho (2008) - Liberal bias.

Is the Mainstream Media Biased? Most scholars say YES. Groseclose and Milyo (2005) - Liberal/Democrat bias. Gentzkow and Shapiro (2010) - Toward ideology of content consumers. Quinn and Ho (2008) - Liberal bias.

What is Media Bias? Presentation Bias - Spin or slant. How the same event/topic/issue is covered. Selection Bias - What is covered.

What is Media Bias? Presentation Bias - Spin or slant. How the same event/topic/issue is covered. Selection Bias - What is covered.

Presentation Bias: Example 1 Benghazi Anger Over a Film Fuels Anti-American Attacks in Libya and Egypt - New York Times, Sept. 12, 2012.

Presentation Bias: Example 1 Benghazi In glory ous bastard riot Muslims rip flag, kill over flick - New York Post, Sept. 12, 2012.

Presentation Bias: Example 1 Benghazi Protesters attack U.S. diplomatic compounds in Egypt, Libya - Washington Post, Sept. 12, 2012.

Presentation Bias: Example 2 Ferguson Ferguson: dailymail.co.uk, Nov. 24, 2014

Presentation Bias: Example 2 Ferguson Ferguson: Fox News, Nov. 24, 2014

Presentation Bias: Example 2 Ferguson Ferguson: Huffington Post, Nov. 24, 2014

Presentation Bias: Measurement Phrases Used by Republicans in Congress Source: Gentzkow and Shapiro (2010) Comparison of words and phrases in known partisan/ideological sources with articles. Groseclose and Milyo (2005) - Think tank citations. Quinn and Ho (2008) - Supreme Court opinions. Gentzkow and Shapiro (2010) - Congressional speeches.

Presentation Bias: Measurement Phrases Used by Republicans in Congress Source: Gentzkow and Shapiro (2010) Comparison of words and phrases in known partisan/ideological sources with articles. Groseclose and Milyo (2005) - Think tank citations. Quinn and Ho (2008) - Supreme Court opinions. Gentzkow and Shapiro (2010) - Congressional speeches.

Presentation Bias: Measurement Phrases Used by Republicans in Congress Source: Gentzkow and Shapiro (2010) Comparison of words and phrases in known partisan/ideological sources with articles. Groseclose and Milyo (2005) - Think tank citations. Quinn and Ho (2008) - Supreme Court opinions. Gentzkow and Shapiro (2010) - Congressional speeches.

Presentation Bias: Measurement Phrases Used by Republicans in Congress Source: Gentzkow and Shapiro (2010) Comparison of words and phrases in known partisan/ideological sources with articles. Groseclose and Milyo (2005) - Think tank citations. Quinn and Ho (2008) - Supreme Court opinions. Gentzkow and Shapiro (2010) - Congressional speeches.

Selection Bias: Example 1 Drudge Report

Selection Bias: Example 2 Huffington Post

Measuring Selection Bias: Framework S t ={s 1t, s 2t,..., s nt } At any time t there exists a universe of unobserved stories S t.

Measuring Selection Bias: Framework N dt S t News sources, N dt, conceptualized as sets of non-random draws from S t. A news source is a set of stories or that has a partisan/ideological valence.

Measuring Selection Bias: Framework N dt S t News sources, N dt, conceptualized as sets of non-random draws from S t. A news source is a set of stories or that has a partisan/ideological valence.

Measuring Selection Bias: Literature Goal was to restrict S t enough to measure selection bias. Barret and Peake (2007) 1 S t = Each location Bush traveled to in 2001. Baum and Groeling (2008) 1 S t = Associated Press and Reuters News Feeds. Larcinese, Puglisi and Snyder (2011) 1 S t = Government reports on the economy.

Measuring Selection Bias: Literature Goal was to restrict S t enough to measure selection bias. Barret and Peake (2007) 1 S t = Each location Bush traveled to in 2001. Baum and Groeling (2008) 1 S t = Associated Press and Reuters News Feeds. Larcinese, Puglisi and Snyder (2011) 1 S t = Government reports on the economy.

Measuring Selection Bias: Literature Goal was to restrict S t enough to measure selection bias. Barret and Peake (2007) 1 S t = Each location Bush traveled to in 2001. Baum and Groeling (2008) 1 S t = Associated Press and Reuters News Feeds. Larcinese, Puglisi and Snyder (2011) 1 S t = Government reports on the economy.

Measuring Selection Bias: Literature Goal was to restrict S t enough to measure selection bias. Barret and Peake (2007) 1 S t = Each location Bush traveled to in 2001. Baum and Groeling (2008) 1 S t = Associated Press and Reuters News Feeds. Larcinese, Puglisi and Snyder (2011) 1 S t = Government reports on the economy.

Measuring Selection Bias: Literature Goal was to restrict S t enough to measure selection bias. Barret and Peake (2007) 1 S t = Each location Bush traveled to in 2001. Baum and Groeling (2008) 1 S t = Associated Press and Reuters News Feeds. Larcinese, Puglisi and Snyder (2011) 1 S t = Government reports on the economy.

Measuring Selection Bias: Literature Goal was to restrict S t enough to measure selection bias. Barret and Peake (2007) 1 S t = Each location Bush traveled to in 2001. Baum and Groeling (2008) 1 S t = Associated Press and Reuters News Feeds. Larcinese, Puglisi and Snyder (2011) 1 S t = Government reports on the economy.

Measuring Selection Bias: Literature Goal was to restrict S t enough to measure selection bias. Barret and Peake (2007) 1 S t = Each location Bush traveled to in 2001. Baum and Groeling (2008) 1 S t = Associated Press and Reuters News Feeds. Larcinese, Puglisi and Snyder (2011) 1 S t = Government reports on the economy.

Measuring Selection Bias With topic models we are able to construct comprehensive measures of selection bias automatically. Very important in an era of internet news where spin-free narratives can be easily constructed.

Measuring Selection Bias With topic models we are able to construct comprehensive measures of selection bias automatically. Very important in an era of internet news where spin-free narratives can be easily constructed.

Measuring Selection Bias [I wanted to find a] single, emblematic college rape case [that would show] what it s like to be on campus now...where not only is rape so prevalent but also that there s this pervasive culture of sexual harassment/rape culture... - Sabrina Erdely, Rolling Stone Magazine

A Model of Selection Bias S tk ={s t1, s t2,..., s tk } S t Multinom(β) θ dtk ={θ dt1, θ dt2,..., θ dtk } θ dt Dir(α) I tk ={I t1, I t2,..., I tk } I t Multinom(γ) S tk : All news at time t is comprised of k topics. θ dtk : Each news source d has a distribution over these k topics. I tk : Each topic has a partisan valence - Democrat, Republican, Neutral.

A Model of Selection Bias S tk ={s t1, s t2,..., s tk } S t Multinom(β) θ dtk ={θ dt1, θ dt2,..., θ dtk } θ dt Dir(α) I tk ={I t1, I t2,..., I tk } I t Multinom(γ) S tk : All news at time t is comprised of k topics. θ dtk : Each news source d has a distribution over these k topics. I tk : Each topic has a partisan valence - Democrat, Republican, Neutral.

A Model of Selection Bias S tk ={s t1, s t2,..., s tk } S t Multinom(β) θ dtk ={θ dt1, θ dt2,..., θ dtk } θ dt Dir(α) I tk ={I t1, I t2,..., I tk } I t Multinom(γ) S tk : All news at time t is comprised of k topics. θ dtk : Each news source d has a distribution over these k topics. I tk : Each topic has a partisan valence - Democrat, Republican, Neutral.

A Model of Selection Bias SB dt =θdt T I t K = θ dtk I tk k=1 SB d - Selection bias for a news source d is thus an average of the partisan valences for each topic k weighted by

A Model of Selection Bias Bias Value Democrat-Skewed Content 1 < SB dt < 0 Neutral Content SB dt = 0 Republican-Skewed Content 0 < SB dt < 1 Values of SB dt and Direction of Bias Define Democrat = 1, Neutral = 0, Republican = 1.

A Model of Selection Bias: Examples θ NYT,2014 = (0.4, 0.2, 0.4) θ FoxNews,2014 = (0.4, 0.55, 0.05) Imagine all news organizations cover only three topics: Immigration, Terrorism and Global Warming. New York Times - 40% of coverage to Immigration, 20% to Terrorism, 40% to Global Warming Fox News - 40% of coverage to Immigration, 55% to Terrorism, 5% to Global Warming

A Model of Selection Bias: Examples θ NYT,2014 = (0.4, 0.2, 0.4) θ FoxNews,2014 = (0.4, 0.55, 0.05) Imagine all news organizations cover only three topics: Immigration, Terrorism and Global Warming. New York Times - 40% of coverage to Immigration, 20% to Terrorism, 40% to Global Warming Fox News - 40% of coverage to Immigration, 55% to Terrorism, 5% to Global Warming

A Model of Selection Bias: Examples θ NYT,2014 = (0.4, 0.2, 0.4) θ FoxNews,2014 = (0.4, 0.55, 0.05) Imagine all news organizations cover only three topics: Immigration, Terrorism and Global Warming. New York Times - 40% of coverage to Immigration, 20% to Terrorism, 40% to Global Warming Fox News - 40% of coverage to Immigration, 55% to Terrorism, 5% to Global Warming

A Model of Selection Bias: Examples Valences are: Immigration = 0 (Neutral), Terrorism = 0.5 (Republican), Global Warming= 0.5 (Democrat). Thus... I 2014 = (0, 0.5, 0.5) 3 SB NYT,2014 = θ knyt,2014 I k k=1 = (0.40 0) + (0.20 0.5) + (0.40 0.5) = 0.10 3 SB FoxNews,2014 = θ kfox,2014 I k k=1 = (0.40 0) + (0.55 0.5) + (0.05 0.5) = 0.25

Modeling News Sources Using the Latent Dirichlet Allocation (LDA) LDA is a generative model of texts developed by Blei, Ng and Jordan (2003). LDA constructs a corpus using a documents and vocabularies. Used to automatically categorize large groups of documents.

Modeling News Sources Using the Latent Dirichlet Allocation (LDA) LDA is a generative model of texts developed by Blei, Ng and Jordan (2003). LDA constructs a corpus using a documents and vocabularies. Used to automatically categorize large groups of documents.

Modeling News Sources Using the Latent Dirichlet Allocation (LDA) LDA is a generative model of texts developed by Blei, Ng and Jordan (2003). LDA constructs a corpus using a documents and vocabularies. Used to automatically categorize large groups of documents.

Modeling News Sources Using the LDA: Overview TOPICS Topic1 Topic2.Topic30 CORPUS Internet News DOCUMENTS New York Times WalStreet Journal... LosAngeles Times TOPIC PROPORTIONS General Model of Internet News Sources Using the Latent Dirichlet Allocation

Modeling News Sources Using the LDA: Overview Corpus - 13 internet sources: CNN, NY Times, Fox News, LA Times, USA Today, Washington Post, Huffington Post, Chicago Sun Times, NY Daily News, ABC News, Wall Street Journal, NBC News. Documents - Each d = 1,..., 13 news sources. Topics - Arbitrarily set to K = 30.

Modeling News Sources Using the LDA: Overview Corpus - 13 internet sources: CNN, NY Times, Fox News, LA Times, USA Today, Washington Post, Huffington Post, Chicago Sun Times, NY Daily News, ABC News, Wall Street Journal, NBC News. Documents - Each d = 1,..., 13 news sources. Topics - Arbitrarily set to K = 30.

Modeling News Sources Using the LDA: Overview Corpus - 13 internet sources: CNN, NY Times, Fox News, LA Times, USA Today, Washington Post, Huffington Post, Chicago Sun Times, NY Daily News, ABC News, Wall Street Journal, NBC News. Documents - Each d = 1,..., 13 news sources. Topics - Arbitrarily set to K = 30.

Modeling News Sources Using the LDA: Nuts and Bolts K ψ βk α θd zd,n wd,n N D 1 β k Dir(ψ), where k {1,..., 30} - the distribution over words that defines each of the K = 30 latent topics assumed to encompass the 12 news sources. 2 θ d Dir(α), where d {1,..., 13} - the distribution over topics for each news source. This, in combination with topic ideological valence I determines selection bias. 3 z d,n - topic assignment of the n th word in the d th news source. 4 w d,n - the n th word of the d th news source.

Modeling News Sources Using the LDA: Nuts and Bolts K ψ βk α θd zd,n wd,n N D 1 β k Dir(ψ), where k {1,..., 30} - the distribution over words that defines each of the K = 30 latent topics assumed to encompass the 12 news sources. 2 θ d Dir(α), where d {1,..., 13} - the distribution over topics for each news source. This, in combination with topic ideological valence I determines selection bias. 3 z d,n - topic assignment of the n th word in the d th news source. 4 w d,n - the n th word of the d th news source.

Modeling News Sources Using the LDA: Nuts and Bolts K ψ βk α θd zd,n wd,n N D 1 β k Dir(ψ), where k {1,..., 30} - the distribution over words that defines each of the K = 30 latent topics assumed to encompass the 12 news sources. 2 θ d Dir(α), where d {1,..., 13} - the distribution over topics for each news source. This, in combination with topic ideological valence I determines selection bias. 3 z d,n - topic assignment of the n th word in the d th news source. 4 w d,n - the n th word of the d th news source.

Modeling News Sources Using the LDA: Nuts and Bolts K ψ βk α θd zd,n wd,n N D 1 β k Dir(ψ), where k {1,..., 30} - the distribution over words that defines each of the K = 30 latent topics assumed to encompass the 12 news sources. 2 θ d Dir(α), where d {1,..., 13} - the distribution over topics for each news source. This, in combination with topic ideological valence I determines selection bias. 3 z d,n - topic assignment of the n th word in the d th news source. 4 w d,n - the n th word of the d th news source.

Modeling News Sources Using the LDA: Nuts and Bolts K ψ βk α θd zd,n wd,n N D 30 k=1 p(β k ψ) 12 d=1 ( p(θ d α) p(θ, z, w, β ψ, α) = N ) p(z d,n θ d )p(w d,n z d,n, β k ) n=1 Full model of news sources and topics.

Measuring Partisan Valence SB d = θ T d I Key parameters and distributions estimated using variational expectation maximization algorithm (VEM) (Wainwright and Jordan 2008). Measuring topic partisan valence. 1 Human coders. 2 Human coders + party platforms from 1980-Present.

Measuring Partisan Valence SB d = θ T d I Key parameters and distributions estimated using variational expectation maximization algorithm (VEM) (Wainwright and Jordan 2008). Measuring topic partisan valence. 1 Human coders. 2 Human coders + party platforms from 1980-Present.

Measuring Partisan Valence SB d = θ T d I Key parameters and distributions estimated using variational expectation maximization algorithm (VEM) (Wainwright and Jordan 2008). Measuring topic partisan valence. 1 Human coders. 2 Human coders + party platforms from 1980-Present.

Measuring Partisan Valence SB d = θ T d I Key parameters and distributions estimated using variational expectation maximization algorithm (VEM) (Wainwright and Jordan 2008). Measuring topic partisan valence. 1 Human coders. 2 Human coders + party platforms from 1980-Present.

Measuring Partisan Valence: Human Coders I k Multinom(Π) Π = {Π Democrat, Π Neutral, Π Republican } I = (E[I 1 ], E[I 2 ],..., E[I 30 ]) Topics labeled manually and automatically. Topics presented to Mechanical Turk workers. Do you think Republicans, Democrats or Neither talk more about [Topic k].

Measuring Partisan Valence: Human Coders I k Multinom(Π) Π = {Π Democrat, Π Neutral, Π Republican } I = (E[I 1 ], E[I 2 ],..., E[I 30 ]) Topics labeled manually and automatically. Topics presented to Mechanical Turk workers. Do you think Republicans, Democrats or Neither talk more about [Topic k].

Measuring Partisan Valence: Human Coders I k Multinom(Π) Π = {Π Democrat, Π Neutral, Π Republican } I = (E[I 1 ], E[I 2 ],..., E[I 30 ]) Topics labeled manually and automatically. Topics presented to Mechanical Turk workers. Do you think Republicans, Democrats or Neither talk more about [Topic k].

Measuring Partisan Valence: Using Human Coders and Party Platforms Data: Party platforms for Democratic and Republican parties from 1980-Present. Procedure: 1 Extract and label topics from news sources. 2 Extract topics from Democratic and Republican party platforms. 3 Mechanical Turk workers use labels from news sources to label platform topics. 4 For each topic k calculate proportion of topics labels Democrat and Republican.

Measuring Partisan Valence: Using Human Coders and Party Platforms Data: Party platforms for Democratic and Republican parties from 1980-Present. Procedure: 1 Extract and label topics from news sources. 2 Extract topics from Democratic and Republican party platforms. 3 Mechanical Turk workers use labels from news sources to label platform topics. 4 For each topic k calculate proportion of topics labels Democrat and Republican.

Measuring Partisan Valence: Using Human Coders and Party Platforms Data: Party platforms for Democratic and Republican parties from 1980-Present. Procedure: 1 Extract and label topics from news sources. 2 Extract topics from Democratic and Republican party platforms. 3 Mechanical Turk workers use labels from news sources to label platform topics. 4 For each topic k calculate proportion of topics labels Democrat and Republican.

Measuring Partisan Valence: Using Human Coders and Party Platforms Data: Party platforms for Democratic and Republican parties from 1980-Present. Procedure: 1 Extract and label topics from news sources. 2 Extract topics from Democratic and Republican party platforms. 3 Mechanical Turk workers use labels from news sources to label platform topics. 4 For each topic k calculate proportion of topics labels Democrat and Republican.

Measuring Partisan Valence: Using Human Coders and Party Platforms Data: Party platforms for Democratic and Republican parties from 1980-Present. Procedure: 1 Extract and label topics from news sources. 2 Extract topics from Democratic and Republican party platforms. 3 Mechanical Turk workers use labels from news sources to label platform topics. 4 For each topic k calculate proportion of topics labels Democrat and Republican.

Measuring Partisan Valence: Using Human Coders and Party Platforms Data: Party platforms for Democratic and Republican parties from 1980-Present. Procedure: 1 Extract and label topics from news sources. 2 Extract topics from Democratic and Republican party platforms. 3 Mechanical Turk workers use labels from news sources to label platform topics. 4 For each topic k calculate proportion of topics labels Democrat and Republican.

Preliminary Results: Topics Topic 1 Topic 2 Topic 3 Topic 4 Topic 5 Topic 6 Police Shootings Robert Durst ISIS Academy Awards Crime Taxes & Budget 1 polic durst often show charg state 2 offic new polic year court tax 3 shot airport kill award case year 4 shoot mall man oscar prison pay 5 man say islamic best murder percent 6 fire told state new arrest million 7 depart polic attack will attorney budget 8 report man hostag one death feder 9 kill report syria time told plan 10 two york video actor prosecutor will

Preliminary Results: Topic Proportions by Source

Preliminary Results: Topic Proportions by Source

Preliminary Results: Topic Proportions by Source

Questions? Comments?

Modeling News Sources Using the LDA: Nuts and Bolts K ψ βk α θd zd,n wd,n N D p(θ d α) = K i=1 Γ(α i) Γ ( K i=1 α ) i 30 i=1 θ α i 1 di Distribution over topic proportions.

Modeling News Sources Using the LDA: Nuts and Bolts K ψ βk α θd zd,n wd,n N D p(β k ψ) = N i=1 Γ(ψ i) N Γ ( N i=1 ψ ) i i=1 β ψ i 1 ki Distribution over words for each topic..

Modeling News Sources Using the LDA: Nuts and Bolts K ψ βk α θd zd,n wd,n N D p(z d,n θ d );z d,n Multinom(θ d ) p(w d,n z d,n, β k ;w d,n Multinom(β k ) Distributions of topic assignments and word assignments.