arxiv: v1 [cs.si] 20 Jun 2016

Size: px
Start display at page:

Download "arxiv: v1 [cs.si] 20 Jun 2016"

Transcription

1 Rating Effects on Social News Posts and Comments Maria Glenski 1 and Tim Weninger 1 1 Department of Computer Science and Engineering, University of Notre Dame arxiv: v1 [cs.si] 20 Jun 2016 Abstract At a time when information seekers first turn to digital sources for news and opinion, it is critical that we understand the role that social media plays in human behavior. This is especially true when information consumers also act as information producers and editors through their online activity. In order to better understand the effects that editorial ratings have on online human behavior, we report the results of a two large-scale in-vivo experiments in social media. We find that small, random rating manipulations on social media posts and comments created significant changes in downstream ratings resulting in significantly different final outcomes. We found positive herding effects for positive treatments on posts, increasing the final rating by 11.02% on average, but not for positive treatments on comments. Contrary to the results of related work, we found negative herding effects for negative treatments on posts and comments, decreasing the final ratings on average, of posts by 5.15% and of comments by 37.4%. Compared to the control group, the probability of reaching a high rating ( 2000) for posts is increased by 24.6% when posts receive the positive treatment and for comments is decreased by 46.6% when comments receive the negative treatment. 1 Introduction We often rely on online reviews contributed by anonymous users as an important source of information to make decisions about which products to buy, movies to watch, news to read, or even political candidates to support. These online reviews replace traditional word-of-mouth communication about an object s or idea s quality [16]. The sheer volume of new information being produced and consumed only increases the reliance that individuals place on anonymous others to curate and sort massive amounts of information. Because of the economic and intrinsic value involved, it is important to consider whether this new mode of social communication can successfully harness the wisdom of crowd to accurately aggregate individual information. What is becoming known as collective intelligence bares the potential to enhance human capability and accomplish what is impossible individually [64, 12]. For example, more than a century ago the experiments of Francis Galton determined that the median estimate of a group can be more accurate than estimates of experts [22]. Surowiecki s book The Wisdom of the Crowds finds similar examples in stock markets, political elections, quiz shows and a variety of other fields where large groups of people behave intelligently and perform better than an elite few [59]. However, other experiments have shown that when individuals perceptions of quality and value follow the behavior of a group, the resulting herd mentality can be suboptimal for both the individual and the group [11, 29, 39]. By relying on the judgements of others, we may be susceptible to malicious ratings with some ulterior motive. Unfortunately, there is a gap in our knowledge and capabilities in this area, including untested and contradictory social theories. Fortunately, these gaps can be filled using new experimental methodologies on large, socio-digital data sets. The main idea is to determine if these socio-digital platforms produce useful, unbiased, aggregate outcomes, or (more likely) if, and how, opinion and behavior is influenced and manipulated. Work of our own and recent tangential experiments [62, 48, 32, 54] suggest that decisions and opinions can be significantly influenced by minor manipulations, yielding different social behavior. The main focus of the present work is the determine what effect, if any, does malicious voting behavior have on social news posts and comments. 1

2 Figure 1: Composite, redacted screenshot of Reddit. (A) There are many possible ranking systems on Reddit; in this image shows the first post when ordered by the top scored posts within the past month. (B) Authenticated users may up-vote, or down-vote once on any post; the score of a post is congruent to the total number of up-votes minus the total number of down-votes. (C) Each post displays its rank on the far left corresponding to its position in the selected ranking system, a title text, and the host of the linked content on the far right; this post also has 29,266 comments. (D) The top 200 comments are displayed in order as well corresponding to the chosen ranking system and number of points the comment has received; an orangered arrow indicates that the current user up-voted this comment. (E) This comment has a score of 5,997, which is congruent to the number of up-votes minus the number of down-votes that the comment has received. (F) Comment threads are hierarchical such that each comment can have have children, siblings, etc. thus comment orderings are based on their vote score relative to the sibling comments in the thread hierarchy. Unfortunately, causal determinations are difficult to assess. In a closely related experiment, Wu and Huberman measured rating behavior in two different online platforms. The first allowed users to see prior ratings before they voted and the other platform hid the prior ratings until after the user voted. They found that when no information about previous ratings or page views are available, the ratings and user-opinions expressed tend to follow regular patterns. However, in cases where the previous ratings were made known, the user-opinions tended to be either neutral or form a polarized consensus. In the latter case, new opinions tend to reinforce previous opinions and thus become more extreme [65]. Because of the information overload caused by billions of daily shares, tweets, posts and comments, nearly all social media Web sites have sophisticated ranking algorithms that attempt to identify the relatively few items that their users will find interesting. When new or better items are shared or posted, the ranking systems rely significantly upon user ratings to accelerate the discovery of new or interesting content. Information that is rated positively will be ranked higher, and will therefore be more visible to other users, which further increases the likelihood that it will receive further ratings [13, 52]. A recent experiment by Lerman and Hogg further studied the effects that presentation order has on the choices that users make. In this study, several users were asked to read and rate social media posts ranked by various ordering algorithms. Lerman and Hogg found that different ranking systems result in very different outcomes. Random orderings result in the most unbiased ratings, but may show a lot of uninteresting content resulting in poor user engagement. The popularity ranking, which rated posts by how many previous 2

3 positive-votes it received led to highly inconsistent outcomes and showed that small early differences in ratings led to inconsistent rating scores [34]. Social news Web sites represent a stark departure from traditional media outlets in which a news organization, i.e., a handful of television, radio or newspaper producers, sets the topics and directs the narrative. Socio-digital communities increasingly set the news agenda, cultural trends, and popular narrative of the day [35]. News agencies frequently have segments on what s trending, entire television shows are devoted to covering happenings on social media, and even live presidential debates select their topics based on popular questions posed to social media Web sites. As this trend continues and grows, the number of blogs, news outlets, and other sources of user generated content has outpaced the rate at which Web users can consume information. Social news Web sites are able to automatically curate, rank and provide commentary on the top content of the day by harnessing the wisdom of the crowd. The recent popularity of social networks has led to the study of socio-digital influence and popularity cascades where models can be developed based on the adoption rate of friends (e.g., shares, retweets). Bakshy et al., find that friendship plays a significant role in the sharing of content [6]. Similarly, Leskovec et al. were able to formulate a generative model that predicts the size and shape of information cascades in online social networks [37]. However, social media users seem to be unaware of the effects of social manipulation. A recent survey of Reddit users aimed to determine what the sampled community thought drove Reddit-users to up-vote or down-vote various posts. The surveyors expected that the leading indicators would be that users are more likely to up-vote or like 1) content that others have liked, indicating social influence; 2) content that was submitted or contributed by a well known user, indicating trust or model-based bias; or 3) content that is relevant to the user s interests. Contrary to our scientific understanding of social influence, the surveyed users indicated that social influence had little effect on their voting likelihood [53]. Other than the need to raise awareness of the impact of social influence within social media communities, these results suggest that online social media aggregators are a viable testbed for theories of trust and social influence. Like social networks, online social news platforms allow individuals to contribute to the wisdom of the crowd in new ways. These platforms are typically Web sites that contain very simple mechanics. In general, there are 4 operations that are shared among social news sites: 1. individuals generate content or submit links to content, 2. submissions are rated and ranked according to their rating scores, 3. individuals can comment on the submitted content, 4. comments are rated and ranked according to their rating scores. Simply put, social news platforms allow individuals to submit content and vote on the content they like or dislike. The voting mechanism found in socio-digital platforms provides a type of Web-democracy that is open to everyone. Given the widespread use and perceived value of these voting systems [25], it is important to consider whether they can successfully harness the wisdom of the crowd to accurately aggregate individual information. In our study, we determine what effect, if any, ranking and vote score has on rating behavior. This is accomplished via an in vivo experiment on the social media Web site, Reddit, by inserting random votes into the live rating system. Reddit is a social news Web site where registered users can submit content, such as direct posts or links. Registered users can then up-vote submissions or down-vote submissions to organize the posts and determine the post s position on the site; posts with a high vote score (i.e., up-votes down-votes) are ranked more highly than posts with a low vote score. Reddit is organized into many thousands of subreddits, according to topic or area of interest, e.g., news, science, compsci, datamining, and theoryofreddit, and posts must be submitted to a subreddit. A user that subscribes to a particular subreddit will see highly ranked posts from that subreddit on their frontpage, which Reddit describes as the front page of the Internet and is unique for each user. Figure 1 illustrates an example post and a small piece of its comment section. As in most social media Web sites, users are free to comment on the posts. Reddit has a unique commenting framework that ranks comments based on their scores relative to their sibling comments. For instance, all root comments, i.e., comments with no parent, are ranked together, and all of the children-comments of 3

4 some single parent-comment are ranked together. It is possible, even frequent, that a comment deep within the comment thread-tree will have a higher score than its parent or ancestor-comments [63]. By default, Reddit only displays the top 200 comments, even though it is common for popular posts to receive thousands of comments. Therefore, many comments in popular threads are never viewed, which likely exacerbates the rich-get-richer effect that is already seen in certain ranking systems. It is important to note that, unlike other online social spaces, Reddit is not a social network. the notion of friendship and friend-links, like on Facebook, is mostly absent on Reddit. Although usernames are associated with posts and comments, the true identity of registered users is generally unknown and in many cases fiercely guarded. In fact, we attempted to find friendship by looking at user-pairs that frequently reply to each other in comments; unfortunately, more than 99.9% of the comments were in reply to a user that they had never previously replied to. Thus, we typically refer to Reddit a social non-network, and the vast amount of previous social network literature does not apply. In the present work, we report the results of two large in-vivo experiments on Reddit; the first (N = 93, 019) up-voted or down-voted posts at random and the second (N = 128, 316) up-voted or down-voted comments at random. Based on these experimental treatments we observe the effects that votes have on the final score of a post or comment as a proxy for observing herding effects in social news. Unlike the experimental study performed by Muchnik et al., and other behavioral studies our experiments: 1) manipulate votes of posts and comments rather than just comments, 2) leverages Reddit s dynamic, score-based ranking system rather than a time-only ranking system, 3) does not involve friendship or the use of social networks, and 4) randomly delays the vote treatment rather than always performing the treatment immediately upon creation. These differences are significant in that this is the first ever vote manipulation experiment on a global scale, live, working system. The use of randomized trials eliminates concerns about various confounding factors, and we have made our data and analysis scripts available to the community for replication and further research. 2 Methods 2.1 Post Experiment During the 6 months between September 1, 2013 and January 31, 2014 a computer program was executed every 2 minutes that collected post data from Reddit through an automated two-step process. First, the most recent post on Reddit was identified and assigned to one of three treatment groups: up-treated, down-treated, or control. Up-treated posts were artificially given an up-vote (a +1 rating) and down-treated posts were given a down-vote (a -1 rating). Up-treatment, down-treatment and the control have an equal likelihood of being selected. Vote treated posts are assigned a random delay ranging from no delay up to an hour delay in intervals of 0,.5, 1, 5, 10, 30 and 60 minutes. Second, each post was re-sampled 4 days later and final vote totals were recorded. These treatments created a small, random manipulation signalling positive or negative judgement that is perceived by other voters as having the same expected quality as all other votes thereby enabling estimates of the effects of a single vote while holding all other factors constant. This data collection resulted in 93,019 sampled posts, of which 30,998 were up-treated and 30,796 were down-treated; each treatment type was randomly assigned a delay interval with equal likelihood. Treatments were removed from the vote scores before data analysis was performed, i.e., up-treated post-scores were decremented by 1 and down-treated post-scores were incremented by 1. During the experimental time period, Reddit reported that their up-vote and down-vote totals were fuzzed as an anti-spam measure; fortunately, they certified that a post s score (i.e., up-votes minus down-votes) was always accurate. In July of 2014, after the data gathering phase of this experiment had ended, Reddit removed the vote totals from their Web site and replaced it with a semi-accurate points system; Reddit administrators currently assert that the rankings are always accurate, even though their reported scores may not be. 4

5 Experiment Up-Treatment Down-Treatment Control Total Post 30,998 30,796 31,225 93,019 Comment 35,704 31,830 28,952 96,486 Table 1: Summary of sample count by treatment for the data collected from September 1, 2013 to January 31, 2014 through the Post and Comment Experiments. 2.2 Comment Experiment During the 6 months between September 1, 2013 and January 31, 2014 a computer program, separate from the post experiment, was executed every 2 minutes that collected comment data from Reddit through an automated two-step process. First, the most recent comment on the top ranked post ordered by the rising ranking algorithm on the Reddit frontpage was identified and assigned to one of three treatment groups: uptreated, down-treated, or control. Up-treated comments were artificially given an up-vote (a +1 rating) and down-treated comments were given a down-vote (a -1 rating). Up-treatment, down-treatment and the control have an equal likelihood of being selected. Vote treated comments are assigned a random delay ranging from no delay up to an hour delay in intervals of 0,.5, 1, 5, 10, 30 and 60 minutes. Second, each comment was re-sampled 4 days later and final vote totals were recorded. These treatments produced a score manipulation similar to that of the post experiment, wherein all other factors were held constant enabling a clear causal signal to be measured. This data collection resulted in 96,486 sampled comments, of which 35,704 were up-treated and 31,830 were down-treated; each treatment type was randomly assigned a delay interval with equal likelihood. Treatments were removed from the vote scores before data analysis was performed, i.e., final up-treated comment-scores were decremented by 1 and final down-treated comment-scores were incremented by 1. To our knowledge, comment scores were not fuzzed in the same way that post scores are fuzzed, so absolutely point scores reported here should be accurate. The voting agents used here were periodically checked to ensure that they had not been blocked or their votes ignored. The voting agent did not target any one type of content or subreddit or content provider, which are among the most common types of vote-spam, therefore, we are certain that all of our votes were counted. 3 Results We first compared the final vote totals of each treatment group. These findings measure the overall effect that up-treatments and down-treatments have on the overall life of a post or comment. Figure 2 shows the full distribution of the final post scores and comment scores for each treatment group. Black outer error bars show the 95% confidence interval and red inner error bars show the standard error of the mean. The full distribution of post scores in Figure 2(a) is extremely positively skewed with a skewness of 11.2 and a kurtosis of If we remove the top 1% highest scoring posts from the data set the skewness and kurtosis values drop to 6.5 and 54.9 respectively giving a better, although still skewed, view of the treatment effects. Figure 3(a) shows the distribution of the final post scores with the top 1% of posts removed. In this case, the up-treated posts have a significantly higher final score, and the down-treated posts have a significantly lower final score. The distribution of comment scores in Fig. 2(b) is even more positively skewed than the distribution of post scores with a skewness of 16.4 and a kurtosis of but when the top 1% highest scoring comments are removed, the skewness and kurtosis values dropped to 6.7 and 58.1, similar to the skewness and kurtosis for the distribution of post scores when the top 1% highest scoring posts are removed. In this case, the down-treated comments have a significantly lower final score but the up-treated comments do not have a significantly higher final score. Tests of statistical significance, e.g., T-test, are known to improperly reject the null hypothesis when the data distribution is non-normal or highly skewed. This is indeed the case in our result set as is indicated by the abnormally high skewness and kurtosis scores. Removal of the top 1% of scores is one way to unskew the data, hence the tightening of error bars and narrowing of confidence internals in Fig. 3 as compared to the 5

6 (a) Posts (b) Comments Figure 2: Final scores for artificially, randomly up-treated posts, down-treated posts, and scores for untreated posts in the control group are shown. Red inner error bars show the standard error of the mean; black outer error bars show the 95% confidence interval. Fig. (a) shows the scores in the heavily skewed full distribution for posts. Fig. (b) shows the scores in the heavily skewed full distribution for comments, with significant decreases for down-treated comments when compared to the control group. full results in Fig. 2. Another way to unskew data is to take the log of each value in the distribution, which unfortunately removes negative scores from the analysis, a significant limitation for this line of work. Student s T-Test on the full set (i.e., 100%) of log-scores for posts also showed that the up-treated posts were significantly higher than the control group (p = ), and that the down-treated posts were significantly lower than the control group (p = ), although scores less than or equal to 0 were removed to calculate the log of the final scores. For comments we find that Student s T-Test on log-scores demonstrated that up-treated posts were significantly higher than the control group (p = ), and down-treated posts were significantly lower than the control group (p = ). Unfortunately, the distribution of log-scores was still far from normal, so the T-Test is likely to give improper results. With this in mind, we used the non-parametric, 1-dimensional Kolmogorov-Smirnov (K-S) test as well as the Mann-Whitney U (M-W) Test to determine the significance between treatments and control. Both the M-W and the K-S tests are nonparametric tests to compare two unpaired groups of data. They each compute p-values that test the null hypothesis that the two groups have the same distribution. They do have some important differences though. The M-W test operates by ranking all the values from low to high, and then computes a p-value that depends on the differences between the mean ranks of the two groups. The K-S test compares the cumulative distribution of the two data sets, and computes a p-value that depends on the largest difference between the two distributions. The differences between the two tests are important, but they both compute p-values that can be used to judge the statistical significance of the treatment effects. Thus, we will display the results of both tests. The K-S Tests showed that the final score distribution of all up-treated posts were more positively skewed than posts in the control group (K-S test statistic: 0.08; p < ), which were more positively skewed than down-treated posts (K-S test statistic: 0.11; p < ). The same K-S test on comment scores shows significantly higher final scores for up-treated comments and down-treated comments (p < ). The reason that the p-value of the K-S statistic is reported as being less than is because floating point underflow error prevents a more precise calculation in the R-based K-S test calculator. Finally, we performed the independent 2-group M-W Test comparing treatments (up-treated and downtreated) with the control. We again find significant differences comparing the up-treated post scores to the control (p = ) and the down-treated post scores to the control (p = ). The same M-W Test on comments also showed significant differences in the final scores of up-treated comments compared to the control group (p = ), and significantly different final scores in down-treated comments compared to the control group (p = ). In general, an up-vote increases a post s score on the site which increases its visibility according to 6

7 (a) Posts (b) Comments Figure 3: Top 99% of final scores for artificially, randomly up-treated posts, down-treated posts, and scores for untreated posts in the control group are shown. Red inner error bars show the standard error of the mean; black outer error bars show the 95% confidence interval. When the highest 1% of post scores are removed, the score distribution becomes much less skewed resulting in tighter error bounds, which further result in significant increases for up-treated posts and significant decreases for down-treated posts when compared to the control group. Again, when the highest 1% of comment scores are removed, the score distribution becomes less skewed resulting in tighter error bounds, but with slight but not significant increases for uptreated comments when compared to the control group. the default ranking algorithms. The increased visibility of the post makes it more likely to be viewed by others. However, making a post more visible does not necessarily mean that it will receive more up-votes and continue to increase or even maintain its visibility; it may instead receive down-votes, thereby decreasing the posts visibility. That is, until we consider that the vast majority of votes cast on Reddit are up-votes and downvoting is actually discouraged unless the post is spam, off-topic, or otherwise improper. Thus, we are confident that the increase in the final post score after positive vote manipulation in the presence of popularity ranking mechanics is largely due to the increase in visibility due to the treatment up-vote. Comments, in contrast, have a vastly different visibility mechanism than posts. Reddit comment threads are hierarchical, wherein the default best (highest up-vote to down-vote ratio) ordering mechanism sorts comments among its siblings only. The visibility of a comment in the hierarchy depends not only on its ordering among its siblings but also the rank of any parent or ancestor comments it has. Because our voting mechanism selected the most recent comment, it may be the case that the selected comment was a child or other descendant of a highly visible comment. As such, it may be the case that the treated comment was already highly visible by its relative position in the comment hierarchy. Unfortunately, we did not record the relative position of each treated comment and are unable to find correlation between relative visibility and treatment effects. Another difference in the comments experiment is that, by default, only the top 200 comments are visible. By selecting rising posts, our collection methodology makes it highly likely that the comment that we select is within the first 200, and is therefore at least initially visible. Unfortunately, for large comment threads a single down-treatment may be enough to make the comment no longer visible under default orderings. This is probably why down-treated comments have such a low overall score compared to up-treated or control groups. 3.1 Delay Effects Up-votes and down-votes for post receiving treatments were performed after a 0, 0.5, 1, 5, 10, 30 or 60 minute delay chosen at random, and Figures 2 and 3 does not distinguish between the effects of vote-treatments performed after the various delay periods. Figure 4 separates the results for posts from Figure 3(a) and Figure 3(b) into their respective treatment delay groups in Figure 4(a) and Figure 4(b), respectively. We 7

8 (a) Posts (b) Comments Figure 4: Final scores separated into their respective treatment delay intervals. Fig. (a) shows final scores for artificially, randomly up-treated posts, down-treated posts, and scores for untreated posts in the control group and Fig. (b) shows final scores for artificially, randomly up-treated comments, down-treated comments, and scores for untreated comments in the control group. Horizontal lines show the overall mean of each treatment group. The top 1% of scores were removed to un-skew the score distribution. expected that immediate votes would have a larger effect than votes performed after a long delay. However, these results show, surprisingly, that a delay in treatment generally did not have a significant effect on the mean outcome of a post s final score. Unfortunately, displayed error bounds and confidence intervals, which are computed from Student s T- Test, have little meaning when the data is so highly skewed; K-S tests shown in Table 2 again showed that all up-treated posts were more positively skewed than posts in the control group and that the effects generally diminished as the delay interval increased. Similarly, the control group was more positively skewed than the down-treated posts, but the effects were mixed as the delay interval increased. As for comments, the K-S test results were more mixed in Table 2, but still mostly statistically significant. The up-treated comments were significantly more positively skewed than the control group comments, and the down-treated comments resulted in a significantly lower score. Interestingly, the p-values of the comment scores diminished as the delay grows longer, meaning that the vote treatment on comments are not effective a half-hour or an hour after the comment has been made. In short, timely voting on a comment is more important than timely voting on a post on average. M-W tests of statistical significance, also shown in Table 2, demonstrate that post treatments have a significant effect across all delay periods, and that this effect only slightly diminishes (if at all) when the K-S M-W Post Com. Post Com D = D = D = D = D = D = D = p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 D = D = D = D = D = D = D = p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 p < 2.2E 16 D = D = D = D = D = D = D = p = 1.8E 07 p = 8.9E 16 p = 3.2E 10 p = 7.0E 08 p = 1.6E 09 p = 1.3E 05 p = 0.04 D = D = D = D = D = D = D = p = 1.6E 08 p = 9.7E 09 p = 2.1E 08 p = 3.5E 08 p = 2.9E 05 p = 0.1E 02 p = 4.0E 05 p = 6.1E 14 p = 1.4E 18 p = 6.2E 18 p = 1.8E 13 p = 2.2E 11 p = 6.2E 15 p = 8.6E 12 p = 3.3E 22 p = 2.5E 17 p = 4.5E 24 p = 2.6E 21 p = 9.9E 27 p = 4.3E 15 p = 1.5E 11 p = 1.1E 05 p = 3.7E 10 p = 1.5E 06 p = 1.8E 05 p = 7.3E 09 p = 3.7E 04 p = 0.11 p = 0.1E 02 p = 0.8E 02 p = 0.3E 03 p = 3.7E 05 p = 0.17 p = 0.31 p = 0.1E 02 Table 2: Results of Kolmogorov-Smirnov (K-S) and Mann-Whitney U (M-W) Tests on complete result set (i.e., with top 1% included). represents tests comparing up-treatment with the control group; represents tests comparing down-treatment with the control group. indicates results that are not statistically significant at the 99% confidence level. 8

9 (a) Posts (b) Comments Figure 5: The middle 9 deciles of final scores for each treatment according to their delay intervals. These results show that most posts and comments receive a median score of 2 or less, and that treatment has the most effect in the higher deciles of the score distribution. delay approaches 1 hour. As for comment treatments, the M-W tests showed significance results similar to those from the K-S tests. Namely, the effect of up-treatment, as measured by the p-value scores, diminished as the delay grew bigger and led to an insignificant effect when the delay was 1 hour. The effect of down-treatment was significant for short delay periods, but was not significant for delays of 10 minutes and 30 minutes, and was only barely significant for delays of 1 hour. The results from the statistical tests on the comment treatments from Table 2 and Figure 4(b) appear to be in conflict. Figure 4(b) seems to show that negative treatments have a big effect on the final outcome of the comment for all delay levels, while positive-treatments have a little effect. However, proper statistical tests show that the truth is more nuanced. Ultimately, with this type of data, the best way to show aggregate results is through n-tile plots. With this in mind, Figure 5 shows the inner-deciles of the results as a function of their treatment delay. Taken together these results show graphically what the tests of statistical significance imply: that up-treated posts tend to score more highly than the control group, and that down-treated posts tend not to not score as highly as the control group. The decile plots also show that the majority of posts (deciles 50%) receive at most a final score of 2, and that most comments never receive any votes at all. 3.2 Reaching the Front Page Overall, the results suggest that an up-treatment increases the probability that a post will result in a high score relative to the control group, and that down-treatments decrease that probability relative to the control group. However, on Reddit and other social news sites only a handful of posts become extremely popular. On Twitter and Facebook this is generally referred to as a trending topic, but on Reddit the most popular posts are the ones 9

10 (a) Posts (b) Comments Figure 6: The probability of a post (a) or a comment (b) receiving a corresponding score by treatment type. The inset graph shows the complete probability distribution function. The outer graph shows the probability of a post receiving scores between 500 and 2000 an approximation for trending or frontpage posts. Uptreated posts are 24% more likely to reach a score of 2000 than the control group. that reach the front page. Unfortunately, reaching the front page is a difficult thing to discern because each user s homepage is different, based on the topical subreddits to which the user has subscribed. Nevertheless, we crudely define a post as having become popular, i.e., is trending, on the frontpage, etc., if it has a score of more than 500. Using this definition, Figure 6(a) shows the probability that post reaches a given final score under the two treatment conditions. These probability distribution functions are monotonically decreasing, positively skewed, and show that up-treatment results in a large departure from the control group for posts and down-treatment results in a large departure from the control group for comments. However, despite our earlier claims of up-treatment and down-treatment symmetry on post results, these results show that, in the upper limits of the distribution, down-treatments do not effect the final score results. These results mean that, compared to the control group, an up-treated post is 7.9% more likely to have a final score of at least 1000, and an up-treated post is 24.6% more likely to have a final score of at least The probability that a comment reaches a high score is generally lower than the probability of a post reaching the same high score because posts are generally more viewed and voted on than comments. Indeed, in order to even view the comments, a user must first view, or at least click-on, the post. Also, lower rated comments or comments with multiple levels of ancestor comments above them are often hidden until a user chooses to reveal them. Figure 6(b) shows the probability that a comment reaches a given final score under the two treatment options as in Figure 6(a). Interestingly, we find that an up-treatment has very little effect on the probability of a comment reaching a high score; yet, a down-treatment has a dramatic negative effect on that probability. 3.3 Subreddit Effects We finally investigated treatment effects in the top 10 most frequent subreddits. These do not necessarily correspond to the top 10 most popular subreddits or the subreddits with the most comments. Rather, they are the subreddits to which posts are most frequently submitted or whose posts are most frequently ranked first on Reddit s rising ranking system due to our data collection methodology. From the top 10 subreddits for posts, we removed politic and friendsafari and from the top 10 subreddits for comments, we removed friendsafari. These subreddits were removed from our analysis because posts in politic are automatically submitted by a computer program, and because posts and comments in friendsafari cannot be down-voted according to the subreddit rules. Thus, only 8 subreddits for posts and 9 subreddits for comments are shown in Figure 7 which illustrates the effects of treatment on post and comment scores on average within top 10 subreddits. Figure 7(a) and Mann-Whittney test results show significant positive effects on post scores in AdviceAnimals, AskReddit and videos, and significant negative effects on post scores in AskReddit and pics. These 10

11 (a) Posts (b) Comments Figure 7: Mean scores of down-treated, control group and up-treated posts in the top 8 most active subreddits in Fig. a and the mean scores of down-treated, control group and up-treated comments in the top 9 subreddits from which we collected the most data in Fig. b, i.e., subreddits that are (a) most active, (b) most often appearing as rising on Reddit. Black outer error bars show the 95% confidence interval and red inner error bars show error. results illustrate similar symmetric effects that we found on posts overall. Voting effects on comment scores within subreddits are shown in Figure 7(b). While we find that down-treatments typically result in significantly lower final comments scores compared to the control, up-treatments rarely result in significantly higher final comment scores as shown by Figure 7(b). Within the top 500 subreddits for posts, we found that 22% had significant up-treatment effects, 21.6% had significant down-treatments, and 5.4% of subreddits had significant differences in both the up-treatment and down-treatment results when compared to the control group. There was also no correlation in the top 500 between up-treatment significance and number of submissions the subreddit received (r 2 = 0.014; p-value = 0.007), nor down-treatment significance and number of submissions the subreddit received (r 2 = 0.010; p-value = 0.026). 4 Related Work Although this is the first in-vivo Reddit experiment, our work is motivated and informed by multiple overlapping streams of literature and build on substantial prior work from multiple fields such as: herding behavior from theoretical and empirical viewpoints [54, 63]; social influence [6]; collective intelligence [29, 1]; and online rating systems [42]. A recent study by Muchnik et al on a small social news Web site, similar to Reddit, found that a single up-vote/like on an online comment significantly increased the final vote count of the treated comment; interestingly, the same experiment also found that a single negative rating had little effect on the final vote count [48]. In a separate line of work, Sorensen used mistaken omissions of books from the NY Times bestsellers list to identify the boost in sales that accompany the perceived popularity of a book s appearance on the list [57]. Similarly, when the download counters for different software labels were randomly increased, Hanson and Putler found that users are significantly more likely to download software that had the largest counter increase [27]. Salganik and Watts performed a study to determine the extent to which perception of quality becomes a self-fulfilling prophecy. In their experiment they inverted the true popularity of songs in an online music marketplace, and found that the perceived-but-false-popularity became real over time [55]. These experiments aim to determine the causal effect of social influence on rating behavior, as well as the mechanisms driving socio-digital influence. Although these experiments are first-of-a-kind, they are motivated and informed by multiple overlapping streams of literature and build on substantial prior work from multiple fields such as: herding behavior from theoretical [11, 8, 26] and empirical viewpoints [54, 65, 36, 14, 2]; social influence in networks [6, 37, 46, 3, 49]; collective intelligence [64, 12, 11, 29]; and online 11

12 rating systems [16, 65, 42, 15, 49, 44, 19, 20, 30, 38, 67, 18]. Interestingly, most of the previous work is geared towards marketing science because of the close relationship between business and consumer opinion. The dynamics of online reviews, ratings and votes have received a lot of recent attention in the computing and marketing literature because the dynamics of online reviews for books, restaurants, hotels, etc. have become a vital business interest [44, 19, 20, 30, 38, 67, 18]. Recent work in text mining is able to automatically determine the positivity and negativity of user-opinion [42, 41, 43] even among different aspects of a certain product (e.g., large can be a good thing when talking about portion size, but bad when talking about camera size) [40]. These papers attempt to codify ratings from plain, user-generated text and then determine relationships between the ratings and popularity. Nonetheless, studies that aim to demonstrate the ease of online manipulation of ratings or voting tend to be limited. The biggest limitation is that these studies assume that the manipulators have full knowledge of the voting preferences of every user a valid assumption in theoretical work, but a meaningless assumption in real-world applications [56, 24, 9, 10]. There is some work that considers manipulators who have a limited [17] or probabilistic [7, 45] knowledge of the voting preferences, but these assumptions are still too limiting for our purposes. On the practical side, one obvious case of online manipulation is spam, particularly a new type of spam called social spam. Social spam is on the rise, with Nexgate Research reporting a tripling of social spam activity every six months [50] and BusinessWeek magazine reporting that only 40% of online social media users are real people [31]. There has been some practical work on detecting social spam in online social networking Web sites [60] like Facebook and Twitter, but not in social news platforms like Reddit and HackerNews. The largest and perhaps most effective type of social spam relies on social networks to broadcast and propagate the advertisement or message [66, 5, 23, 47]. These social network spammers are also the easiest to detect and shut down. However, social news platforms are purposefully not social networks. 5 Discussion In general, we find that the positive treatment of a single, random up-vote on a Reddit post has a corresponding positive herding effect that increases post scores on average and in the top limits of the heavily skewed score distribution but that a single, random up-vote on a Reddit comment had no significant positive herding effects. We further found that the negative treatment of a single, random down-vote on a post or comment has a corresponding negative herding effect that significantly decreased the post or comment scores on average, in contrast to the asymmetric findings of Muchnik et al. [48], who found no significant effects of a negative treatment. However, our results begin to resemble asymmetry in the top limits of the post score distribution meaning that a negative treatment does not decrease the probability that a post will receive a high score in the way that it does for comments. Separating treatments by their delay intervals did not yield a significant difference in effect overall. K-S and M-W tests found that up- and down-treatments for most delay intervals had significant effects compared to the control.. In general, the time that a vote is placed did not change the overall effect for post scores, but longer delays did diminish the effects that votes had on comment scores. 5.1 Voting and Viewing Mechanics Research in social news manipulation has been shown a great deal of interest in recent years because of its centrality in shaping the news and opinion of society. There are several conflicting reports that now need to be teased apart. The work by Muchnik et al. showed positive herding effects but not negative herding effects [48] on non-reddit social media comments ranked by recency, rather than popularity, and in the presence of a friendship social network. The voting and visibility mechanics of Reddit, which govern the data collection in this paper, are vastly different then the small or contrived experiments studied in earlier work. The post experiment and results are actually more in line with past research on ranking and visibility bias [34] because of how the ranking mechanics of posts impact visibility. The results of our analysis as described above and the behavior of voting on Reddit, with an overwhelming majority of votes being upvotes and the discouragement of down-voting posts that are appropriate, support an increase in popularity from an 12

13 increase in visibility. Thus, we are confident that the increase in the final post score and the probability of reaching a high post score after positive vote manipulation in the presence of popularity ranking mechanics is largely due to the increase in visibility due to the treatment up-vote. 5.2 Vote-based Manipulation Collectively, this work, in the presence of other work on this subject [34, 48, 58], shows that votes determine visibility which, in turn, drives more votes. The 1% rule, or its variants like the rule, the 80/20 rule or the pareto principle, when applied to social media indicates that about 90% of users only view content, 9% of users edit content (including voting), and 1% of users actively contribute new content [61, 28]. On all manner of vote-based social media platforms, the 10% of users who actually vote are the ones who determine the kind of content that becomes widely visible and circulated among the remaining 90% of the viewing public. Therefore, that active 10% determines the ideas and opinions that the public is exposed to and influenced by. Clearly, there is a huge incentive for opinion-pushers to manipulate the visibility of certain ideas and opinions on social media Web sites. There are several types of vote-based manipulation techniques that exist. We discuss a few of them here. Vote Brigading Vote brigading is when a large group of people all conspire to up-vote or down-vote a particular post or idea. This is not unique to Reddit, as Twitter has its Retweet armies that attempt to manipulate the velocity of some discussion in order to artificially force a topic to become a trending topic. The social media Web site Digg was particularly susceptible to vote brigading, wherein only users with many friends could ever hope to have a post reach the frontpage because the poster s friends would initially vote on the post in order to raise its visibility enough so that the wider community to see it. Fortunately, most forms of this type of vote manipulation can be easily detected and stopped with spam detection and prevention techniques [21, 33]. As part of a larger strategy, Reddit now encourages hyperlinks between subreddits to be tagged with a no-participation URL, which restricts access for non-subscribers of the subreddit to read-only, in order to prevent cross-subreddit contamination 1. For example, a no-participation link from /r/yankees to a post in /r/redsox (a historical baseball rivalry) would prevent Yankees fans from downvoting posts that favor Red Sox fans. Vote Nudging Vote nudging is the type of vote manipulation that is studied in this paper and is the easiest, and most common, type of vote manipulation on social media. A post or comment is most susceptible to being ignored when it is young. Vote nudging is when someone asks a few friends to up-vote the post or comment in order to give it a positive boost during its initial appearance. After the initial boost, the post is left to grow normally. As we have shown in this study, vote nudging can be extremely successful because the default ranking system gives higher visibility to posts with more, timely votes. Vote nudging also prevents instances when an unrelated user down-votes, and effectively kills a posts changes of becoming visible, because a post with three or four up-votes may be able to withstand the effects of a down-vote better than a post with no up-votes. It is difficult to say how much vote nudging happens in social media. It is common for users to have multiple accounts for this reason, but multiple votes from the same IP address is easy for spam prevention systems to catch. Reverse Vote Nudging Reverse vote nudging is when a user down-votes all of the posts or comments that are similar to their post or comment in order to make a relative gain on competing content. For example, if a user contributes a post about the winner of a baseball game to /r/yankees, several other users may also have contributed posts about the same baseball game at about the same time to /r/yankees. In order the increase the relative ranking of their own post, the user may down-vote all of the other posts by the other users thereby increasing the relative ranking of their submission. Similarly, a user may wish to down-vote all of the posts or comments that are ranked just above the user s submission in order to increase the relative ranking of the user s submission. Using the same baseball example as earlier, if there are posts dealing with other Yankees content that are ranked just above the user s post, then

14 the user may increase the ranking of their own post, and therefore increase its visibility, by downvoting the other content. 5.3 Conformity and Influence Comment threads on Reddit are a unique supplement to the posted content. In fact, it is widely thought that most social media users, across all types of platforms, read the title of the post and skip directly to the comments section although this has not been empirically researched. Also, Reddit, Youtube, Twitter and many other social media platforms, to some extent, show the current score of each comment in the comment thread. Thus, the opinions and ideas expressed in each comment are given an explicit rating from the voting user base that is often viewed as the prevailing opinion of the overall population. The Asch conformity experiments in the 1950 s and onward showed that perceptions of popular opinion can have profound effects on individual perceptions of the truth [4]. Social comment threads frequently have instances where the highest scored comments represent an incorrect fact or are contrary to be the prevailing public opinion, perhaps due to comment manipulation discussed above. However, it is sometimes uncomfortable for many comment readers to hold opinions contrary to what they perceive to the the prevailing opinion. This disillusionment sometimes leads to a position change, but can also lead to a retreat inwards due to confirmation bias, which, in the worst case, leads to radicalization. 5.4 Voting The nature of the manner in which social platforms rank items for viewing typically utilizes the ratings, in this case the post or comment scores, of the items being ranked. The results of our experiments show that random vote perturbations through vote treatments impact the scores of posts and comments on Reddit. These results underscore the need for counter measures against vote chaining and social engineering strategies as multiple artificial votes are likely to increase the herding effect. Finally, we bring attention back to what Eric Gilbert calls, the widespread underprovision of votes in social media like Reddit [25]. Although our data does not draw these figures explicitly, we estimate that a very small number of the daily visitors to social media Web sites actually vote on the items they view. This seems to be an even further skewed anecdote of the rule of social networking [61], and may be an underestimated reason behind the results presented in this paper. 6 Acknowledgments We thank Michael Creehan for his help and discussion. This research is sponsored by the Air Force Office of Scientific Research FA The research was approved by University of Notre Dame institution review board and the Air Force Surgeon General s Research Compliance Office. Raw data files, and statistical analysis scripts are available on the corresponding authors Web site at tweninge/data/reddit_report.html. Reddit Inc was not involved in the experimental design, implementation or data analysis. References [1] A. Anderson, D. Huttenlocher, J. Kleinberg, and J. Leskovec. Discovering value from community activity on focused question answering sites. In SIGKDD, page 850, New York, New York, USA, ACM Press. [2] L. R. Anderson and C. A. Holt. Information Cascades in the Laboratory. The American Economic Review, 87:847, [3] S. Aral, L. Muchnik, and A. Sundararajan. Distinguishing influence-based contagion from homophilydriven diffusion in dynamic networks. Proceedings of the National Academy of Sciences of the United States of America, 106(51): , Dec

CSE 190 Assignment 2. Phat Huynh A Nicholas Gibson A

CSE 190 Assignment 2. Phat Huynh A Nicholas Gibson A CSE 190 Assignment 2 Phat Huynh A11733590 Nicholas Gibson A11169423 1) Identify dataset Reddit data. This dataset is chosen to study because as active users on Reddit, we d like to know how a post become

More information

CSE 190 Professor Julian McAuley Assignment 2: Reddit Data. Forrest Merrill, A Marvin Chau, A William Werner, A

CSE 190 Professor Julian McAuley Assignment 2: Reddit Data. Forrest Merrill, A Marvin Chau, A William Werner, A 1 CSE 190 Professor Julian McAuley Assignment 2: Reddit Data by Forrest Merrill, A10097737 Marvin Chau, A09368617 William Werner, A09987897 2 Table of Contents 1. Cover page 2. Table of Contents 3. Introduction

More information

Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012

Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012 Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012 Abstract In this paper we attempt to develop an algorithm to generate a set of post recommendations

More information

DU PhD in Home Science

DU PhD in Home Science DU PhD in Home Science Topic:- DU_J18_PHD_HS 1) Electronic journal usually have the following features: i. HTML/ PDF formats ii. Part of bibliographic databases iii. Can be accessed by payment only iv.

More information

Predicting Information Diffusion Initiated from Multiple Sources in Online Social Networks

Predicting Information Diffusion Initiated from Multiple Sources in Online Social Networks Predicting Information Diffusion Initiated from Multiple Sources in Online Social Networks Chuan Peng School of Computer science, Wuhan University Email: chuan.peng@asu.edu Kuai Xu, Feng Wang, Haiyan Wang

More information

VOTING DYNAMICS IN INNOVATION SYSTEMS

VOTING DYNAMICS IN INNOVATION SYSTEMS VOTING DYNAMICS IN INNOVATION SYSTEMS Voting in social and collaborative systems is a key way to elicit crowd reaction and preference. It enables the diverse perspectives of the crowd to be expressed and

More information

Social Rankings in Human-Computer Committees

Social Rankings in Human-Computer Committees Social Rankings in Human-Computer Committees Moshe Bitan 1, Ya akov (Kobi) Gal 3 and Elad Dokow 4, and Sarit Kraus 1,2 1 Computer Science Department, Bar Ilan University, Israel 2 Institute for Advanced

More information

Measurement and Analysis of an Online Content Voting Network: A Case Study of Digg

Measurement and Analysis of an Online Content Voting Network: A Case Study of Digg Measurement and Analysis of an Online Content Voting Network: A Case Study of Digg Yingwu Zhu Department of CSSE, Seattle University Seattle, WA 9822, USA zhuy@seattleu.edu ABSTRACT In online content voting

More information

arxiv:cs/ v1 [cs.hc] 7 Dec 2006

arxiv:cs/ v1 [cs.hc] 7 Dec 2006 Social Networks and Social Information Filtering on Digg Kristina Lerman University of Southern California Information Sciences Institute 4676 Admiralty Way Marina del Rey, California 9292 lerman@isi.edu

More information

EasyChair Preprint. (Anti-)Echo Chamber Participation: Examing Contributor Activity Beyond the Chamber

EasyChair Preprint. (Anti-)Echo Chamber Participation: Examing Contributor Activity Beyond the Chamber EasyChair Preprint 122 (Anti-)Echo Chamber Participation: Examing Contributor Activity Beyond the Chamber Ella Guest EasyChair preprints are intended for rapid dissemination of research results and are

More information

Chapter 9 Content Statement

Chapter 9 Content Statement Content Statement 2 Chapter 9 Content Statement 2. Political parties, interest groups and the media provide opportunities for civic involvement through various means Expectations for Learning Select a

More information

LOCAL epolitics REPUTATION CASE STUDY

LOCAL epolitics REPUTATION CASE STUDY LOCAL epolitics REPUTATION CASE STUDY Jean-Marc.Seigneur@reputaction.com University of Geneva 7 route de Drize, Carouge, CH1227, Switzerland ABSTRACT More and more people rely on Web information and with

More information

A comparative analysis of subreddit recommenders for Reddit

A comparative analysis of subreddit recommenders for Reddit A comparative analysis of subreddit recommenders for Reddit Jay Baxter Massachusetts Institute of Technology jbaxter@mit.edu Abstract Reddit has become a very popular social news website, but even though

More information

Popularity Dynamics and Intrinsic Quality in Reddit and Hacker News

Popularity Dynamics and Intrinsic Quality in Reddit and Hacker News Proceedings of the Ninth International AAAI Conference on Web and Social Media Popularity Dynamics and Intrinsic Quality in Reddit and Hacker News Greg Stoddard Northwestern University Abstract In this

More information

CASE SOCIAL NETWORKS ZH

CASE SOCIAL NETWORKS ZH CASE SOCIAL NETWORKS ZH CATEGORY BEST USE OF SOCIAL NETWORKS EXECUTIVE SUMMARY Zero Hora stood out in 2016 for its actions on social networks. Although being a local newspaper, ZH surpassed major players

More information

Reddit. By Martha Nelson Digital Learning Specialist

Reddit. By Martha Nelson Digital Learning Specialist Reddit By Martha Nelson Digital Learning Specialist In general Facebook Reddit Do use their real names, photos, and info. Self-censor Don t share every opinion. Try to seem normal. Don t share personal

More information

Social Media Audit and Conversation Analysis

Social Media Audit and Conversation Analysis Social Media Audit and Conversation Analysis February 2015 Jessica Hales Emily Lauder Claire Sanguedolce Madi Weaver 1 National Farm to School Network The National Farm School Network is a national nonprofit

More information

The language for most tablet questions was customized based on whether the respondent said they had an ipad or another type of tablet computer.

The language for most tablet questions was customized based on whether the respondent said they had an ipad or another type of tablet computer. PEW RESEARCH CENTER S PROJECT FOR EXCELLENCE IN JOURNALISM IN COLLABORATION WITH THE ECONOMIST GROUP Tablet News Web Survey September 6-19, N=300 tablet news users The language for most tablet questions

More information

BY Amy Mitchell FOR RELEASE DECEMBER 3, 2018 FOR MEDIA OR OTHER INQUIRIES:

BY Amy Mitchell FOR RELEASE DECEMBER 3, 2018 FOR MEDIA OR OTHER INQUIRIES: FOR RELEASE DECEMBER 3, 2018 BY Amy Mitchell FOR MEDIA OR OTHER INQUIRIES: Amy Mitchell, Director, Journalism Research Hannah Klein, Communications Associate 202.419.4372 RECOMMENDED CITATION Pew Research

More information

arxiv: v1 [cs.cy] 11 Jun 2008

arxiv: v1 [cs.cy] 11 Jun 2008 Analysis of Social Voting Patterns on Digg Kristina Lerman and Aram Galstyan University of Southern California Information Sciences Institute 4676 Admiralty Way Marina del Rey, California 9292, USA {lerman,galstyan}@isi.edu

More information

Analysis of Social Voting Patterns on Digg

Analysis of Social Voting Patterns on Digg Analysis of Social Voting Patterns on Digg Kristina Lerman and Aram Galstyan University of Southern California Information Sciences Institute 4676 Admiralty Way Marina del Rey, California 9292 {lerman,galstyan}@isi.edu

More information

Reddit Advertising: A Beginner s Guide To The Self-Serve Platform. Written by JD Prater Sr. Account Manager and Head of Paid Social

Reddit Advertising: A Beginner s Guide To The Self-Serve Platform. Written by JD Prater Sr. Account Manager and Head of Paid Social Reddit Advertising: A Beginner s Guide To The Self-Serve Platform Written by JD Prater Sr. Account Manager and Head of Paid Social Started in 2005, Reddit has become known as The Front Page of the Internet,

More information

Monday, March 4, 13 1

Monday, March 4, 13 1 1 2 Using Social Media to Achieve Goals Networking Your Way to Employment Friday, November 18, 2011 3 LinkedIn Establish your profile, resume, & professional picture Incorporate all keywords a recruiter

More information

WHAT IS PUBLIC OPINION? PUBLIC OPINION IS THOSE ATTITUDES HELD BY A SIGNIFICANT NUMBER OF PEOPLE ON MATTERS OF GOVERNMENT AND POLITICS

WHAT IS PUBLIC OPINION? PUBLIC OPINION IS THOSE ATTITUDES HELD BY A SIGNIFICANT NUMBER OF PEOPLE ON MATTERS OF GOVERNMENT AND POLITICS WHAT IS PUBLIC OPINION? PUBLIC OPINION IS THOSE ATTITUDES HELD BY A SIGNIFICANT NUMBER OF PEOPLE ON MATTERS OF GOVERNMENT AND POLITICS The family is our first contact with ideas toward authority, property

More information

NATIONAL CITY & REGIONAL MAGAZINE AWARDS

NATIONAL CITY & REGIONAL MAGAZINE AWARDS 2018 NATIONAL CITY & REGIONAL MAGAZINE AWARDS New Orleans June 2 4, 2018 DEADLINE NOV. 22, 2017 In association with the Missouri School of Journalism CITYMAG.ORG RULES THE CONTEST is open only to regular

More information

NP-Hard Manipulations of Voting Schemes

NP-Hard Manipulations of Voting Schemes NP-Hard Manipulations of Voting Schemes Elizabeth Cross December 9, 2005 1 Introduction Voting schemes are common social choice function that allow voters to aggregate their preferences in a socially desirable

More information

A New Computer Science Publishing Model

A New Computer Science Publishing Model A New Computer Science Publishing Model Functional Specifications and Other Recommendations Version 2.1 Shirley Zhao shirley.zhao@cims.nyu.edu Professor Yann LeCun Department of Computer Science Courant

More information

8 5 Sampling Distributions

8 5 Sampling Distributions 8 5 Sampling Distributions Skills we've learned 8.1 Measures of Central Tendency mean, median, mode, variance, standard deviation, expected value, box and whisker plot, interquartile range, outlier 8.2

More information

Research Thesis. Megan Fountain. The Ohio State University December 2017

Research Thesis. Megan Fountain. The Ohio State University December 2017 Social Media and its Effects in Politics: The Factors that Influence Social Media use for Political News and Social Media use Influencing Political Participation Research Thesis Presented in partial fulfillment

More information

Voter ID Pilot 2018 Public Opinion Survey Research. Prepared on behalf of: Bridget Williams, Alexandra Bogdan GfK Social and Strategic Research

Voter ID Pilot 2018 Public Opinion Survey Research. Prepared on behalf of: Bridget Williams, Alexandra Bogdan GfK Social and Strategic Research Voter ID Pilot 2018 Public Opinion Survey Research Prepared on behalf of: Prepared by: Issue: Bridget Williams, Alexandra Bogdan GfK Social and Strategic Research Final Date: 08 August 2018 Contents 1

More information

2011 The Pursuant Group, Inc.

2011 The Pursuant Group, Inc. Using Facebook & Social Media to Power Up your Engagement Barbara Talisman Initiate the Relationship Initiate the Relationship by reaching out to the places where your target audience aggregates Motivate

More information

Wisconsin Economic Scorecard

Wisconsin Economic Scorecard RESEARCH PAPER> May 2012 Wisconsin Economic Scorecard Analysis: Determinants of Individual Opinion about the State Economy Joseph Cera Researcher Survey Center Manager The Wisconsin Economic Scorecard

More information

The Social Web: Social networks, tagging and what you can learn from them. Kristina Lerman USC Information Sciences Institute

The Social Web: Social networks, tagging and what you can learn from them. Kristina Lerman USC Information Sciences Institute The Social Web: Social networks, tagging and what you can learn from them Kristina Lerman USC Information Sciences Institute The Social Web The Social Web is a collection of technologies, practices and

More information

Social Networking in Many Forms

Social Networking in Many Forms for Independent School Admissions Emily H.L. Surovick Director of Lower School Admission, Chestnut Hill Academy Vincent H. Valenzuela Director of Admission, Chestnut Hill Academy in Many Forms Blogging

More information

Analysis of Social Voting Patterns on Digg

Analysis of Social Voting Patterns on Digg Analysis of Social Voting Patterns on Digg Kristina Lerman Aram Galstyan USC Information Sciences Institute {lerman,galstyan}@isi.edu Content, content everywhere and not a drop to read Explosion of user-generated

More information

PRINT LG: (75,000 + circ.) Journalists are eligible whose work had significant reach into Ohio during Entrants need not be SPJ members.

PRINT LG: (75,000 + circ.) Journalists are eligible whose work had significant reach into Ohio during Entrants need not be SPJ members. PRINT LG: (75,000 + circ.) Journalists are eligible whose work had significant reach into Ohio during 2016. Entrants need not be SPJ members. Best Arts Profile One story that profiles an individual in

More information

Never Run Out of Ideas: 7 Content Creation Strategies for Your Blog

Never Run Out of Ideas: 7 Content Creation Strategies for Your Blog Never Run Out of Ideas: 7 Content Creation Strategies for Your Blog Whether you re creating your own content for your blog or outsourcing it to a freelance writer, you need a constant flow of current and

More information

Evaluating the Connection Between Internet Coverage and Polling Accuracy

Evaluating the Connection Between Internet Coverage and Polling Accuracy Evaluating the Connection Between Internet Coverage and Polling Accuracy California Propositions 2005-2010 Erika Oblea December 12, 2011 Statistics 157 Professor Aldous Oblea 1 Introduction: Polls are

More information

Publicizing malfeasance:

Publicizing malfeasance: Publicizing malfeasance: When media facilitates electoral accountability in Mexico Horacio Larreguy, John Marshall and James Snyder Harvard University May 1, 2015 Introduction Elections are key for political

More information

100 Sold Quick Start Guide

100 Sold Quick Start Guide 100 Sold Quick Start Guide The information presented below is to quickly get you going with Reddit but it doesn t contain everything you need. Please be sure to watch the full half hour video and look

More information

Social Media in Staffing Guide. Best Practices for Building Your Personal Brand and Hiring Talent on Social Media

Social Media in Staffing Guide. Best Practices for Building Your Personal Brand and Hiring Talent on Social Media Social Media in Staffing Guide Best Practices for Building Your Personal Brand and Hiring Talent on Social Media Table of Contents LinkedIn 101 New Profile Features Personal Branding Thought Leadership

More information

Rich Traffic Hack. Get The Flood of Traffic to Your Website, Affiliate or CPA offer Overnight by This Simple Trick! Introduction

Rich Traffic Hack. Get The Flood of Traffic to Your Website, Affiliate or CPA offer Overnight by This Simple Trick! Introduction Rich Traffic Hack Get The Flood of Traffic to Your Website, Affiliate or CPA offer Overnight by This Simple Trick! Introduction Congratulations on getting Rich Traffic Hack. By Lukmankim In this short

More information

Why Your Brand Or Business Should Be On Reddit

Why Your Brand Or Business Should Be On Reddit Have you ever wondered what the front page of the Internet looks like? Go to Reddit (https://www.reddit.com), and you ll see what it looks like! Reddit is the 6 th most popular website in the world, and

More information

What's in a name? The Interplay between Titles, Content & Communities in Social Media

What's in a name? The Interplay between Titles, Content & Communities in Social Media What's in a name? The Interplay between Titles, Content & Communities in Social Media Himabindu Lakkaraju, Julian McAuley, Jure Leskovec Stanford University Motivation Content, Content Everywhere!! How

More information

Chapters: Is There Such a Thing as Free Traffic? Reddit Stats Setting Up Your Account Reddit Lingo Navigating Reddit What is a Subreddit?

Chapters: Is There Such a Thing as Free Traffic? Reddit Stats Setting Up Your Account Reddit Lingo Navigating Reddit What is a Subreddit? Free Traffic Frenzy Chapters: Is There Such a Thing as Free Traffic? Reddit Stats Setting Up Your Account Reddit Lingo Navigating Reddit What is a Subreddit? Don t be a Spammer Using Reddit the Right Way

More information

The Internet and the Tragedy of the Commons

The Internet and the Tragedy of the Commons The Internet and the Tragedy of the Commons Jan. 4, 2017 The expectation of anonymity online has become extreme. By George Friedman The tragedy of the commons is a concept developed by a British economist

More information

Office of Communications Social Media Handbook

Office of Communications Social Media Handbook Office of Communications Social Media Handbook Table of Contents Getting Started... 3 Before Creating an Account... 3 Creating Your Account... 3 Maintaining Your Account... 3 What Not to Post... 3 Best

More information

Increasing Your Impact with Social. Rebecca Vander Linde, Social Media Manager Rachel Weatherly, Director of Digital Communications Strategy

Increasing Your Impact with Social. Rebecca Vander Linde, Social Media Manager Rachel Weatherly, Director of Digital Communications Strategy Increasing Your Impact with Social Rebecca Vander Linde, Social Media Manager Rachel Weatherly, Director of Digital Communications Strategy - Half of science is convincing the world what you re working

More information

INMA GLOBAL MEDIA AWARDS

INMA GLOBAL MEDIA AWARDS INMA GLOBAL MEDIA AWARDS Category Name of brand Name of product : Best use of mobile : AsiaOne : AsiaOne mobile apps for ios and Android All the top publications in one AsiaOne, Asia s leading news and

More information

Eric M. Uslaner, Inequality, Trust, and Civic Engagement (1)

Eric M. Uslaner, Inequality, Trust, and Civic Engagement (1) Eric M. Uslaner, Inequality, Trust, and Civic Engagement (1) Inequality, Trust, and Civic Engagement Eric M. Uslaner Department of Government and Politics University of Maryland College Park College Park,

More information

st ANNUAL PRESS CLUB OF NEW ORLEANS EXCELLENCE IN JOURNALISM AWARDS COMPETITION

st ANNUAL PRESS CLUB OF NEW ORLEANS EXCELLENCE IN JOURNALISM AWARDS COMPETITION 1 2019 61st ANNUAL PRESS CLUB OF NEW ORLEANS EXCELLENCE IN JOURNALISM AWARDS COMPETITION ELIGIBILITY All entrants must be Press Club of New Orleans members. All entries must have been published, broadcast

More information

Institutional aspects: What are the institutional actions to promote data sharing?

Institutional aspects: What are the institutional actions to promote data sharing? Institutional aspects: What are the institutional actions to promote data sharing? Christine Balagué Vice president Digital National Council www.cnnumerique.fr What is the digital national council? Taking

More information

We, the millennials The statistical significance of political significance

We, the millennials The statistical significance of political significance IN DETAIL We, the millennials The statistical significance of political significance Kevin Lin, winner of the 2017 Statistical Excellence Award for Early-Career Writing, explores political engagement via

More information

11th Annual Patent Law Institute

11th Annual Patent Law Institute INTELLECTUAL PROPERTY Course Handbook Series Number G-1316 11th Annual Patent Law Institute Co-Chairs Scott M. Alter Douglas R. Nemec John M. White To order this book, call (800) 260-4PLI or fax us at

More information

Chapter 8: Mass Media and Public Opinion Section 1 Objectives Key Terms public affairs: public opinion: mass media: peer group: opinion leader:

Chapter 8: Mass Media and Public Opinion Section 1 Objectives Key Terms public affairs: public opinion: mass media: peer group: opinion leader: Chapter 8: Mass Media and Public Opinion Section 1 Objectives Examine the term public opinion and understand why it is so difficult to define. Analyze how family and education help shape public opinion.

More information

Reddit Best Practices

Reddit Best Practices Reddit Best Practices BEST PRACTICES Reddit Profiles People use Reddit to share and discover information, so Reddit users want to learn about new things that are relevant to their interests, profiles included.

More information

The Cook Political Report / LSU Manship School Midterm Election Poll

The Cook Political Report / LSU Manship School Midterm Election Poll The Cook Political Report / LSU Manship School Midterm Election Poll The Cook Political Report-LSU Manship School poll, a national survey with an oversample of voters in the most competitive U.S. House

More information

Using Social Media to Build Your Brand. Susan Getgood

Using Social Media to Build Your Brand. Susan Getgood Using Social Media to Build Your Brand Susan Getgood 1 Myth: Social Media is for Kids 2 The Facts 3 The Facts Social Media has Grown Sharply Year Over Year +% Percentage of Growth (From March 2009 to March

More information

What is Public Opinion?

What is Public Opinion? What is Public Opinion? Citizens opinions about politics and government actions Why does public opinion matter? Explains the behavior of citizens and public officials Motivates both citizens and public

More information

Voting and Elections

Voting and Elections Voting and Elections General Elections Voters have a chance to vote in two kinds of elections: primary and general In a Primary election, voters nominate candidates from their political party In a General

More information

101 Ways Your Intern Can Triple Your Website Traffic & Performance This Year

101 Ways Your Intern Can Triple Your Website Traffic & Performance This Year 101 Ways Your Intern Can Triple Your Website Traffic & Performance This Year For 99% of entrepreneurs and business owners, we have identified what we believe are the top 101 highest leverage, most profitable

More information

The Ten Nation Impressions of America Poll

The Ten Nation Impressions of America Poll The Ten Nation Impressions of America Poll Submitted by: Zogby International 17 Genesee Street Utica, NY 132 (315)624-00 or 1-877-GO-2-POLL (315)624-0210 Fax http://www.zogby.com John Zogby, President

More information

2019 Missouri Press Foundation Better Newspaper Contest General Rules & Categories

2019 Missouri Press Foundation Better Newspaper Contest General Rules & Categories 2019 Missouri Press Foundation Better Newspaper Contest General Rules & Categories The 2019 Missouri Press Contest will be conducted online with procedures similar to the 2018 contest. The process is easy

More information

Americans and the News Media: What they do and don t understand about each other. Journalist Survey

Americans and the News Media: What they do and don t understand about each other. Journalist Survey Americans and the News Media: What they do and don t understand about each Journalist Survey Conducted by the Media Insight Project An initiative of the American Press Institute and The Associated Press-NORC

More information

Americans and the News Media: What they do and don t understand about each other. General Population Survey

Americans and the News Media: What they do and don t understand about each other. General Population Survey Americans and the News Media: What they do and don t understand about each General Population Survey Conducted by the Media Insight Project An initiative of the American Press Institute and The Associated

More information

6. Voting for the Program will be available for five (5) weeks from Monday 13 June 2016.

6. Voting for the Program will be available for five (5) weeks from Monday 13 June 2016. The Voice IVR Voting Terms and Conditions About the Voting Service 1. These Terms govern the Voice Voting Service. Lodging a Vote for and Artist competing in The Voice Australia 2016 is deemed acceptance

More information

Quantifying and comparing web news portals article salience using the VoxPopuli tool

Quantifying and comparing web news portals article salience using the VoxPopuli tool First International Conference on Advanced Research Methods and Analytics, CARMA2016 Universitat Politècnica de València, València, 2016 DOI: http://dx.doi.org/10.4995/carma2016.2016.3137 Quantifying and

More information

5 Key Facts. About Online Discussion of Immigration in the New Trump Era

5 Key Facts. About Online Discussion of Immigration in the New Trump Era 5 Key Facts About Online Discussion of Immigration in the New Trump Era Introduction As we enter the half way point of Donald s Trump s first year as president, the ripple effects of the new Administration

More information

Online Appendix: Political Homophily in a Large-Scale Online Communication Network

Online Appendix: Political Homophily in a Large-Scale Online Communication Network Online Appendix: Political Homophily in a Large-Scale Online Communication Network Further Validation with Author Flair In the main text we describe the use of author flair to validate the ideological

More information

BRAND GUIDELINES. Version

BRAND GUIDELINES. Version BRAND GUIDELINES INTRODUCTION Using this guide These guidelines explain how to use Reddit assets in a way that stays true to our brand. In most cases, you ll need to get our permission first. See Getting

More information

ECONOMIC GROWTH* Chapt er. Key Concepts

ECONOMIC GROWTH* Chapt er. Key Concepts Chapt er 6 ECONOMIC GROWTH* Key Concepts The Basics of Economic Growth Economic growth is the expansion of production possibilities. The growth rate is the annual percentage change of a variable. The growth

More information

CSC304 Lecture 16. Voting 3: Axiomatic, Statistical, and Utilitarian Approaches to Voting. CSC304 - Nisarg Shah 1

CSC304 Lecture 16. Voting 3: Axiomatic, Statistical, and Utilitarian Approaches to Voting. CSC304 - Nisarg Shah 1 CSC304 Lecture 16 Voting 3: Axiomatic, Statistical, and Utilitarian Approaches to Voting CSC304 - Nisarg Shah 1 Announcements Assignment 2 was due today at 3pm If you have grace credits left (check MarkUs),

More information

CHAPTER 9: THE POLITICAL PROCESS. Section 1: Public Opinion Section 2: Interest Groups Section 3: Political Parties Section 4: The Electoral Process

CHAPTER 9: THE POLITICAL PROCESS. Section 1: Public Opinion Section 2: Interest Groups Section 3: Political Parties Section 4: The Electoral Process CHAPTER 9: THE POLITICAL PROCESS 1 Section 1: Public Opinion Section 2: Interest Groups Section 3: Political Parties Section 4: The Electoral Process SECTION 1: PUBLIC OPINION What is Public Opinion? The

More information

Was This Review Helpful to You? It Depends! Context and Voting Patterns in Online Content

Was This Review Helpful to You? It Depends! Context and Voting Patterns in Online Content Was This Review Helpful to You? It Depends! Context and Voting Patterns in Online Content Ruben Sipos Dept. of Computer Science Cornell University Ithaca, NY rs@cs.cornell.edu Arpita Ghosh Dept. of Information

More information

PERCEPTIONS OF CORRUPTION OVER TIME

PERCEPTIONS OF CORRUPTION OVER TIME Duško Sekulić PERCEPTIONS OF CORRUPTION OVER TIME General perception of corruption The first question we want to ask is how Croatian citizens perceive corruption in the civil service. Perception of corruption

More information

Subreddit Recommendations within Reddit Communities

Subreddit Recommendations within Reddit Communities Subreddit Recommendations within Reddit Communities Vishnu Sundaresan, Irving Hsu, Daryl Chang Stanford University, Department of Computer Science ABSTRACT: We describe the creation of a recommendation

More information

ANNUAL SURVEY REPORT: BELARUS

ANNUAL SURVEY REPORT: BELARUS ANNUAL SURVEY REPORT: BELARUS 2 nd Wave (Spring 2017) OPEN Neighbourhood Communicating for a stronger partnership: connecting with citizens across the Eastern Neighbourhood June 2017 1/44 TABLE OF CONTENTS

More information

Facebook Guide for State Legislators

Facebook Guide for State Legislators Facebook Guide for State Legislators Facebook helps elected officials, governments, campaigns, and candidates reach and engage the people who matter most to them. Getting Started 2 Setting up your Facebook

More information

The voting behaviour in the local Romanian elections of June 2016

The voting behaviour in the local Romanian elections of June 2016 Bulletin of the Transilvania University of Braşov Series V: Economic Sciences Vol. 9 (58) No. 2-2016 The voting behaviour in the local Romanian elections of June 2016 Elena-Adriana BIEA 1, Gabriel BRĂTUCU

More information

Feedback loops of attention in peer production

Feedback loops of attention in peer production Feedback loops of attention in peer production arxiv:0905.1740v1 [cs.cy] 12 May 2009 Fang Wu, Dennis M. Wilkinson, and Bernardo A. Huberman HP Labs, Palo Alto, California 94304 June 18, 2018 Abstract A

More information

Ohio State University

Ohio State University Fake News Did Have a Significant Impact on the Vote in the 2016 Election: Original Full-Length Version with Methodological Appendix By Richard Gunther, Paul A. Beck, and Erik C. Nisbet Ohio State University

More information

Data manipulation in the Mexican Election? by Jorge A. López, Ph.D.

Data manipulation in the Mexican Election? by Jorge A. López, Ph.D. Data manipulation in the Mexican Election? by Jorge A. López, Ph.D. Many of us took advantage of the latest technology and followed last Sunday s elections in Mexico through a novel method: web postings

More information

Today s Training Video Is All About Traffic and Leads

Today s Training Video Is All About Traffic and Leads Today s Training Video Is All About Traffic and Leads I m Going To Show You How To Get Traffic And Leads For Your Business By Sharing With You My Proven Strategies That You Can Put To Use Today And See

More information

Survey Report Victoria Advocate Journalism Credibility Survey The Victoria Advocate Associated Press Managing Editors

Survey Report Victoria Advocate Journalism Credibility Survey The Victoria Advocate Associated Press Managing Editors Introduction Survey Report 2009 Victoria Advocate Journalism Credibility Survey The Victoria Advocate Associated Press Managing Editors The Donald W. Reynolds Journalism Institute Center for Advanced Social

More information

Mistake #1: Entering the Reddit world just because it has over 234 Million Users. -- It is similar with trying to dig through the desert with the hope that you will get a lot of diamonds out of your effort.

More information

BY Aaron Smith FOR RELEASE JUNE 28, 2018 FOR MEDIA OR OTHER INQUIRIES:

BY Aaron Smith FOR RELEASE JUNE 28, 2018 FOR MEDIA OR OTHER INQUIRIES: FOR RELEASE JUNE 28, 2018 BY Aaron Smith FOR MEDIA OR OTHER INQUIRIES: Aaron Smith, Associate Director, Research Lee Rainie, Director, Internet and Technology Research Dana Page, Associate Director, Communications

More information

THE GOP DEBATES BEGIN (and other late summer 2015 findings on the presidential election conversation) September 29, 2015

THE GOP DEBATES BEGIN (and other late summer 2015 findings on the presidential election conversation) September 29, 2015 THE GOP DEBATES BEGIN (and other late summer 2015 findings on the presidential election conversation) September 29, 2015 INTRODUCTION A PEORIA Project Report Associate Professors Michael Cornfield and

More information

Public Choice. Slide 1

Public Choice. Slide 1 Public Choice We investigate how people can come up with a group decision mechanism. Several aspects of our economy can not be handled by the competitive market. Whenever there is market failure, there

More information

A secure environment for trading

A secure environment for trading A secure environment for trading https://serenity-financial.io/ Bounty Program The arbitration platform will address the problem of transparent and secure trading on financial markets for millions of traders

More information

B. Executive Summary. Page 2 of 7

B. Executive Summary. Page 2 of 7 Category: Open Government Initiatives Project: NYS Open Government Initiative Submitted By: New York State Chief Information Officer/Office for Technology and New York State Senate Chief Information Officer

More information

Pioneers in Mining Electronic News for Research

Pioneers in Mining Electronic News for Research Pioneers in Mining Electronic News for Research Kalev Leetaru University of Illinois http://www.kalevleetaru.com/ Our Digital World 1/3 global population online As many cell phones as people on earth

More information

We will begin momentarily at 2pm ET. Slides available now! Recordings will be available to ACS members after one week.

We will begin momentarily at 2pm ET. Slides available now! Recordings will be available to ACS members after one week. We will begin momentarily at 2pm ET Slides available now! Recordings will be available to ACS members after one week. www.acs.org/acswebinars Contact ACS Webinars at acswebinars@acs.org 1 Have Questions?

More information

AMERICAN VIEWS: TRUST, MEDIA AND DEMOCRACY A GALLUP/KNIGHT FOUNDATION SURVEY

AMERICAN VIEWS: TRUST, MEDIA AND DEMOCRACY A GALLUP/KNIGHT FOUNDATION SURVEY AMERICAN VIEWS: TRUST, MEDIA AND DEMOCRACY A GALLUP/KNIGHT FOUNDATION SURVEY COPYRIGHT STANDARDS This document contains proprietary research, copyrighted and trademarked materials of Gallup, Inc. Accordingly,

More information

THE INDEPENDENT AND NON PARTISAN STATEWIDE SURVEY OF PUBLIC OPINION ESTABLISHED IN 1947 BY MERVIN D. FiElD.

THE INDEPENDENT AND NON PARTISAN STATEWIDE SURVEY OF PUBLIC OPINION ESTABLISHED IN 1947 BY MERVIN D. FiElD. THE INDEPENDENT AND NON PARTISAN STATEWIDE SURVEY OF PUBLIC OPINION ESTABLISHED IN 1947 BY MERVIN D. FiElD. 234 Front Street San Francisco 94111 (415) 3925763 COPYRIGHT 1982 BY THE FIELD INSTITUTE. FOR

More information

MATH4999 Capstone Projects in Mathematics and Economics Topic 3 Voting methods and social choice theory

MATH4999 Capstone Projects in Mathematics and Economics Topic 3 Voting methods and social choice theory MATH4999 Capstone Projects in Mathematics and Economics Topic 3 Voting methods and social choice theory 3.1 Social choice procedures Plurality voting Borda count Elimination procedures Sequential pairwise

More information

Demographics of News Sharing in the U.S. Twittersphere

Demographics of News Sharing in the U.S. Twittersphere Demographics of News Sharing in the U.S. Twittersphere Julio C. S. Reis Universidade Federal de Minas Gerais Belo Horizonte, Brazil julio.reis@dcc.ufmg.br Haewoon Kwak Qatar Computing Research Institute

More information

Journals in the Discipline: A Report on a New Survey of American Political Scientists

Journals in the Discipline: A Report on a New Survey of American Political Scientists THE PROFESSION Journals in the Discipline: A Report on a New Survey of American Political Scientists James C. Garand, Louisiana State University Micheal W. Giles, Emory University long with books, scholarly

More information

advertising options chromatographyonline.com

advertising options chromatographyonline.com chromatographyonline.com ChromatographyOnline.com, LCGC s global website, includes special features such as easy navigation with category zones, country-specific content and articles by industry. Fresh

More information

A Social Contagion: An Empirical Study of Information Spread on Digg and Twitter Follower Graphs

A Social Contagion: An Empirical Study of Information Spread on Digg and Twitter Follower Graphs A Social Contagion: An Empirical Study of Information Spread on Digg and Twitter Follower Graphs KRISTINA LERMAN, USC Information Sciences Institute RUMI GHOSH, University of Southern California TAWAN

More information

State of the Facts 2018

State of the Facts 2018 State of the Facts 2018 Part 2 of 2 Summary of Results September 2018 Objective and Methodology USAFacts conducted the second annual State of the Facts survey in 2018 to revisit questions asked in 2017

More information