Ushio: Analyzing News Media and Public Trends in Twitter


Fangzhou Yao, Kevin Chen-Chuan Chang and Roy H. Campbell
Department of Computer Science, University of Illinois at Urbana-Champaign
3rd International Workshop on Big Data and Social Networking Management and Security (BDSN 2015)

Introduction
Social Network Services (SNS) are evolving: more than 500 million tweets are now sent per day on Twitter, up from only 200 million in 2011. Hashtags and geographical information are embedded in tweets and Facebook statuses. Many news agencies broadcast their news via Twitter, and people like to take part in these discussions.

A Mashed Up But Curated World
Mashable develops news stories according to trending topics from SNS. Flipboard generates pages based on SNS and news topics. Pulse aggregates news and tailors it to the user's LinkedIn professional interests. Yahoo presented News Digest, which summarizes top news from different sources, such as multiple news agencies, Twitter and Wikipedia. The New York Times followed and presented NYT Now.

Trending Topics
Trends provided by many services do not differentiate between topics covered more by news media and topics discussed more by the public. Are there differences between the events that news media are more willing to cover and the stories that people are really interested in? Also, can we discover related news stories along with the trending topics?

We Would Like to Build
A system that monitors informative streams from both news media and the public, extracts meaningful data from them, and uses it to analyze trends in real time.
Discovering the correspondence between the focuses of news media and of the public, as well as which side leads on assorted topics, would help media build more relevant news coverage.

How Can We Collect Good Data?
Aggregation and statistics: we use term frequency, as we want a simple aggregation that takes the least system resources.
Meaningful and fine-grained entities: named entity extraction is an approach to detect meaningful words and phrases quickly.
Data reliability
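The term-frequency aggregation mentioned above can be illustrated with a minimal Python sketch. The function name `term_frequency` and the sample mentions are hypothetical, not taken from the Ushio implementation; the point is only that counting entity occurrences needs little more than a hash map.

```python
from collections import Counter

def term_frequency(entities):
    """Count how often each entity string occurs, most frequent first."""
    return Counter(entities).most_common()

# Hypothetical entity mentions, as they might come out of an NER step.
mentions = ["NBA", "Ukraine", "NBA", "Donald Sterling", "NBA", "Ukraine"]
ranking = term_frequency(mentions)
print(ranking)  # [('NBA', 3), ('Ukraine', 2), ('Donald Sterling', 1)]
```

A single pass over the stream with constant work per mention is what keeps this kind of aggregation cheap.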

Design
[Figure: Ushio architecture diagram. Components shown: Twitter (sample of the public timeline, account following of news media), Twitter Connector, Parser, NER Framework, Ruby Java Bridge, Database (Media, People), Query Handler, User.]

Design
Data collection: Twitter Streaming API. Twitter does not have an API for the full public timeline stream, and we did not assign any topic or keyword to the API, so it returned a random sample of all public statuses. For the news media accounts we follow, we are able to obtain the complete tweets in real time with Twitter's user account streaming API.
Named entity recognition: we used the Stanford NER framework, which achieves good accuracy and speed in extracting named entities from text. The framework tags every word with its possible type, such as PERSON, ORGANIZATION, LOCATION and MISC.
Database schema: the tables are named Media and People, respectively. Each tuple has four columns: entity, type, time and tweet_id.
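To make the step from tagged words to database tuples concrete, here is a hedged Python sketch. It assumes the tagger output is available as (token, tag) pairs with "O" marking tokens outside any entity; the function `group_entities` and the sample tweet are hypothetical and do not reproduce the authors' Ruby/Java code.

```python
from itertools import groupby

def group_entities(tagged_tokens, tweet_id, time):
    """Merge runs of consecutive tokens sharing a non-"O" tag into entities.

    Returns rows shaped like the four-column tuple described above:
    (entity, type, time, tweet_id).
    """
    rows = []
    for tag, run in groupby(tagged_tokens, key=lambda pair: pair[1]):
        if tag == "O":  # "O" marks tokens outside any entity
            continue
        entity = " ".join(token for token, _ in run)
        rows.append((entity, tag, time, tweet_id))
    return rows

# Hypothetical tagger output for one tweet.
tagged = [
    ("Donald", "PERSON"), ("Sterling", "PERSON"), ("banned", "O"),
    ("by", "O"), ("the", "O"), ("NBA", "ORGANIZATION"),
]
rows = group_entities(tagged, tweet_id=1, time="2014-04-29 18:00:00")
for row in rows:
    print(row)
```

One limitation of this simple grouping is that two distinct entities of the same type standing next to each other would be merged into one.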

Relational Data Model
Finding trending topics:
    SELECT entity, count(*) AS count
    FROM social.people
    WHERE time > $a AND time < $b [AND type = $t]
    GROUP BY entity
    ORDER BY count DESC;
Finding related topics:
    SELECT social.people.entity AS name_entity, count(*) AS count
    FROM social.people
    WHERE tweet_id IN (SELECT social.people.tweet_id
                       FROM social.people
                       WHERE entity = $e AND time > $a AND time < $b)
    GROUP BY entity
    ORDER BY count DESC;
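The two queries can be tried out with Python's built-in `sqlite3` module. This is a sketch under stated assumptions: the original database engine is not named in the slides, the rows below are made up for illustration, timestamps are stored as ISO-format text, the count column is aliased `mentions`, and a secondary sort on `entity` is added only to make ties deterministic.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Attach a second in-memory database so the social.people name resolves.
conn.execute("ATTACH DATABASE ':memory:' AS social")
conn.execute(
    "CREATE TABLE social.people "
    "(entity TEXT, type TEXT, time TEXT, tweet_id INTEGER)"
)
# Made-up rows: one (entity, type, time, tweet_id) tuple per mention.
sample = [
    ("Microsoft", "ORGANIZATION", "2014-04-29 09:00:00", 1),
    ("Xbox One", "MISC", "2014-04-29 09:00:00", 1),
    ("China", "LOCATION", "2014-04-29 09:00:00", 1),
    ("Microsoft", "ORGANIZATION", "2014-04-29 10:00:00", 2),
    ("China", "LOCATION", "2014-04-29 10:00:00", 2),
    ("NBA", "ORGANIZATION", "2014-04-29 11:00:00", 3),
]
conn.executemany("INSERT INTO social.people VALUES (?, ?, ?, ?)", sample)

def trending(conn, start, end):
    """Entities mentioned in the time window, most mentioned first."""
    return conn.execute(
        "SELECT entity, count(*) AS mentions FROM social.people "
        "WHERE time > ? AND time < ? "
        "GROUP BY entity ORDER BY mentions DESC, entity",
        (start, end),
    ).fetchall()

def related(conn, entity, start, end):
    """Entities co-occurring in tweets that mention `entity` in the window."""
    return conn.execute(
        "SELECT entity, count(*) AS mentions FROM social.people "
        "WHERE tweet_id IN (SELECT tweet_id FROM social.people "
        "                   WHERE entity = ? AND time > ? AND time < ?) "
        "GROUP BY entity ORDER BY mentions DESC, entity",
        (entity, start, end),
    ).fetchall()

window = ("2014-04-29 00:00:00", "2014-04-30 00:00:00")
top = trending(conn, *window)
linked = related(conn, "Microsoft", *window)
print(top)
print(linked)
```

As in the original query, the related-topics result includes the queried entity itself, since it appears in every matching tweet.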

What A Busy Week!
Top 10 trending topics from both the news media and the public in the week from April 28th to May 4th, 2014. In the notation below, each entity is followed by its rank and, in parentheses, its mention count.
News media cover both the Ukraine crisis and Sterling's NBA discrimination speech, but the public talks about the latter more.
Modi: 13 (152) and India: 14 (150) in the media, but only Modi: 79 (1932) and India: 33 (3516) among the public.
Sports and entertainment topics are favored more by the public, but perhaps not tech news, given Google: 36 (3411) and Apple: 39 (3179).

Rank  Media            #     Public           #
1     Ukraine          462   Chelsea          11282
2     China            369   EU               10524
3     Donald Sterling  363   God              9913
4     Obama            287   Tribez           9625
5     NBA              282   Justin           9132
6     US               281   Argentina        8848
7     Russia           220   Donald Sterling  8788
8     Clippers         173   Best             8586
9     Apple            168   NBA              6790
10    Oklahoma         153   London           6293
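The comparison between the two rankings can be sketched in a few lines of Python. The entity lists are copied from the top-10 table above; the helper names `rank_map` and `shared_ranks` are hypothetical.

```python
# Top-10 entities in rank order, taken from the table above.
media = ["Ukraine", "China", "Donald Sterling", "Obama", "NBA",
         "US", "Russia", "Clippers", "Apple", "Oklahoma"]
public = ["Chelsea", "EU", "God", "Tribez", "Justin",
          "Argentina", "Donald Sterling", "Best", "NBA", "London"]

def rank_map(ordered):
    """Map each entity to its 1-based position in a ranked list."""
    return {entity: i for i, entity in enumerate(ordered, start=1)}

def shared_ranks(a, b):
    """For entities present in both lists, return (rank in a, rank in b)."""
    ra, rb = rank_map(a), rank_map(b)
    return {e: (ra[e], rb[e]) for e in a if e in rb}

overlap = shared_ranks(media, public)
print(overlap)  # {'Donald Sterling': (3, 7), 'NBA': (5, 9)}
```

Only two entities appear in both top-10 lists, which matches the observation that media and public attention diverged that week.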

Why Do They Care About It So Much?
[Figure: bar charts of related-topic counts. News Media panel: Microsoft, Xbox One, China, Chinese. Public panel: Microsoft, China, Xbox One, Yahoo.]
On April 29th, 2014, Microsoft was mentioned more than usual, and the related topics we discovered indicated that it was about to start selling Xbox One in China.

Why Do They Care About It So Much?
[Figure: bar charts of related-topic counts. News Media panel: NBA, Donald Sterling, Clippers, Adam Silver. Public panel: NBA, Donald Sterling, Clippers, LA Clippers.]
The NBA racism scandal involving Donald Sterling and the Clippers was the trend around that day, and hence the counts for these entities overwhelmed those for Microsoft.

Who is the Winner?
The figure shows the correlation between media and the public by plotting the ranking of Donald Sterling among PERSON-type entities from April 26th until May 7th, 2014.
[Figure: line chart. Y-axis: ranking. X-axis: date, Apr 26 to May 6. Series: Media, Public.]
Are people leading the board? What about political news?
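A per-day ranking series like the one plotted could be derived as in the following Python sketch. The slides do not say how the ranks were computed, so this is an assumption: mentions are counted per day, entities are ordered by count, and ties fall back to first-seen order. The function `daily_rank` and the sample rows are hypothetical.

```python
from collections import Counter, defaultdict

def daily_rank(rows, target):
    """rows: iterable of (entity, day). Returns {day: rank of target or None}."""
    per_day = defaultdict(Counter)
    for entity, day in rows:
        per_day[day][entity] += 1
    ranks = {}
    for day, counts in sorted(per_day.items()):
        ordered = [entity for entity, _ in counts.most_common()]
        ranks[day] = ordered.index(target) + 1 if target in ordered else None
    return ranks

# Made-up PERSON mentions over three days.
person_rows = [
    ("Obama", "2014-04-26"), ("Obama", "2014-04-26"),
    ("Donald Sterling", "2014-04-26"),
    ("Donald Sterling", "2014-04-27"), ("Donald Sterling", "2014-04-27"),
    ("Obama", "2014-04-27"),
    ("Obama", "2014-04-28"),
]
series = daily_rank(person_rows, "Donald Sterling")
print(series)
```

Running the same function over the Media and People tables separately would give the two series that the figure compares.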

Future Work
Conducting more experiments on assorted topics and gathering more data.
Deploying this system with a visual interface for public access.
Adding segmentation based on geographical information in tweets.
Using MapReduce and/or a NoSQL database to aggregate the entity streams at the host system.

Questions? Thanks!