Qualitative Text Analysis

Similar documents
Testing Prospect Theory in policy debates in the European Union

Using Quantitative Methods to Study Parliament

DU PhD in Home Science

Text Mining the Election Manifestos

national congresses and show the results from a number of alternate model specifications for

Doctoral Research Agenda

Mining Expert Comments on the Application of ILO Conventions on Freedom of Association and Collective Bargaining

DETERMINANTS OF IMMIGRANTS EARNINGS IN THE ITALIAN LABOUR MARKET: THE ROLE OF HUMAN CAPITAL AND COUNTRY OF ORIGIN

Social Science Survey Data Sets in the Public Domain: Access, Quality, and Importance. David Howell The Philippines September 2014

Sample. The Political Role of Freedom and Equality as Human Values. Marc Stewart Wilson & Christopher G. Sibley 1

DATA ANALYSIS USING SETUPS AND SPSS: AMERICAN VOTING BEHAVIOR IN PRESIDENTIAL ELECTIONS

EXTRACTING POLICY POSITIONS FROM POLITICAL TEXTS USING WORDS AS DATA. Michael Laver, Kenneth Benoit, and John Garry * Trinity College Dublin

Probabilistic Latent Semantic Analysis Hofmann (1999)

Rockefeller College, University at Albany, SUNY Department of Political Science Graduate Course Descriptions Fall 2016

MA International Relations Module Catalogue (September 2017)

Viktória Babicová 1. mail:

Hellenic Observatory / National Bank of Greece Research Tender 2-NBG2-2014: The Crisis and Political Extremism.

The UK Policy Agendas Project Media Dataset Research Note: The Times (London)

Polimetrics. Lecture 2 The Comparative Manifesto Project

Introduction to Text Modeling

Research Note: Toward an Integrated Model of Concept Formation

LL.M. (Previous) DEGREE EXAMINATION, DEC First Year COMMON TO ALL BRANCHES. Paper - I : Research Methodology

ANNUAL SURVEY REPORT: AZERBAIJAN

Learning and Visualizing Political Issues from Voting Records Erik Goldman, Evan Cox, Mikhail Kerzhner. Abstract

How to improve social surveys to provide better statistics on migrants

ANNUAL SURVEY REPORT: REGIONAL OVERVIEW

Introduction to 300 and Table 5. Version 1.3 August 2017

Towards the next Dutch general election: the issue opportunity structure for parties

Ipsos MORI June 2016 Political Monitor

We present a new way of extracting policy positions from political texts that treats texts not

Reviewed by Alice PREDA (BODOC) 1

IN COLLABORATION WITH IVTB. Diploma in Information Technology. Special Resit Examinations for / Semester 1

Rights, Values and Welfare in Parliamentary Debates on Abortion. Albert Weale

EXTRACTING POLICY POSITIONS FROM POLITICAL TEXTS USING WORDS AS DATA * January 21, 2003

NOTE from : Governing Board of the European Police College Article 36 Committee/COREPER/Council Subject : CEPOL annual work programme for 2002

To Trump (v): /tɹʌmp/ "to fabricate, devise"

COMPUTATIONAL CREATIVITY EVALUATION

Syllabus (Revised) (w.e.f. June-2009) LL.M.

Introduction to the Virtual Issue: Recent Innovations in Text Analysis for Social Science

I wish you every success with your campaign. Nicola Sturgeon SNP Leader

The Discursive Institutionalism of Continuity and Change: The Case of Patient Safety in Wales ( ).

THE DUBAI INTERNATIONAL FINANCIAL CENTER (DIFC) COURTS AND ITS APPLICATION IN THE CONSTRUCTION INDUSTRY AND DISPUTE RESOLUTION OF UAE?

Analyzing and Representing Two-Mode Network Data Week 8: Reading Notes

Measuring the Political Sophistication of Voters in the Netherlands and the United States

Researching the politics of gender: A new conceptual and methodological approach

Researching hard-to-reach and vulnerable groups

Programme Specification

Matthias Zurker. The example of the South West of England region. Materialien zur Regionalentwicklung und Raumordnung. Band 10

POLICYBRIEF EUROPEAN. - EUROPEANPOLICYBRIEF - P a g e 1 INTRODUCTION EVIDENCE AND ANALYSIS

I wish you every success with your campaign. Nicola Sturgeon SNP Leader

Improving the quality and availability of migration statistics in Europe *

Users reading habits in online news portals

Project summary Intellectual Merit: The proposed project is a large-scale study of framing by interest groups involved in consultations with the

National Survey Report. May, 2018

Rural Wiltshire An overview

Measuring the Political Sophistication of Voters in the Netherlands and the United States

Centre sampling technique in foreign migration surveys: Methodology, application and operational aspects

Core competencies in LIS education: professional, generic and personal competencies for the higher education LIS sector

A Study of the Concession Speech by President Goodluck Jonathan. Adaobi Ngozi Okoye & Benjamin Ifeanyi Mmadike

Lab 3: Logistic regression models

ACCULTURATION DIFFERENCES IN FAMILY UNITS FROM FORMER YUGOSLAVIA. Written by Ivana Pelemis (BA Hons in Psychology, Murdoch University)

Polimetrics. Mass & Expert Surveys

europolis vol. 5, no. 2/2011

Feasibility research on the potential use of Migrant Workers Scan data to improve migration and population statistics

Ipsos MORI March 2017 Political Monitor

Clarification of apolitical codes in the party identification summary variable on ANES datasets

International migration data as input for population projections

Elections for everyone. Experiences of people with disabilities at the 8 June 2017 UK Parliamentary general election

Monetary Policy Oversight in Comparative Perspective: Britain and America During the Financial Crisis

Defining migratory status in the context of the 2030 Agenda

Measuring Sustainable Tourism Project concept note

Q1) What is Socio-legal research? Explain the doctrinal and non-doctrinal research? Q2) Write a critical note on identification of a research problem?

Guidelines for Performance Auditing

Employment Outlook 2017

Improving Record-Linkage-Software for Survey-Data

Subjectivity Classification

Vote Compass Methodology

EUROBAROMETER 62 PUBLIC OPINION IN THE EUROPEAN UNION

California Subject Examinations for Teachers

Out of Africa: Sudanese refugees and the construction of difference in political and lay talk

AP United States Government and Politics Syllabus

2) What are the merits and demerits case study method of research?

ANNUAL SURVEY REPORT: BELARUS

Defending Wales 3. Defending the things that are important to Wales. Protect the Welsh Assembly 6. Protecting Welsh jobs 7

Part I Introduction. [11:00 7/12/ pierce-ch01.tex] Job No: 5052 Pierce: Research Methods in Politics Page: 1 1 8

CSI Brexit 4: People s Stated Reasons for Voting Leave or Remain

Beyond intuitions, algorithms, and dictionaries: Historical semantics and legal interpretation

MODERN STUDIES Access 3 Level

12 th Grade U.S. Government Curriculum Map FL Literacy Standards (See final pages)

2017 Politics. Higher. Finalised Marking Instructions

Greenberg Quinlan Rosner/Democracy Corps Youth for the Win! Audacity of Hope

CHAPTER 5. CONTROL. Comparability: The Limits of Comparison

Bass Connections Project Proposal Template for Projects

Overview of standards for data disaggregation

Mobility of health professionals between the Philippines and selected EU member states: A Policy Dialogue

1. Introduction. 1.1 Topics and research questions to be explored. The main topics we want to explore in this paper are:

Examiners Report June 2010

RESEARCH METHODOLOGY IN POLITICAL SCIENCE STUDY NOTES CHAPTER ONE

External Vacancy Notice in the European Asylum Support Office (EASO) REF.: EASO/2019/TA/001

Good Practices Research

Transcription:

LSE Department of Methodology, MY428/528 - LT 2014 Qualitative Text Analysis Course Convenor: Dr. Aude Bicquelet (a.j.bicquelet@lse.ac.uk) Office Hours: Thursday 11:30-13:30

EXPLORATORY CONTENT ANALYSIS Week 8

Lecture Outline 1. Definitions & Ambitions ------------------------------------------------------------------ 2. Origins & Epistemological Foundations ------------------------------------------------------------------ 3. Applications of Exploratory CA Can Text-Mining help handle the data deluge in Public Policy Analysis? (Bicquelet and Weale, 2011) In a different Parliamentary voice? (Bicquelet et al. 2012) Right-Wing Nationalism in Political Manifestos ------------------------------------------------------------------ 4. The Alceste Software

DEFINITIONS & AMBITIONS

Exploratory Content Analysis Definition and Ambitions Definition Exploratory Content Analysis/Text Mining: Process of extracting information in large corpora with the aim of identifying patterns and relationship in textual data Ambitions To identify major patterns of argumentation within large corpora To reduce complex data through categorization and visualization techniques

Exploratory Content Analysis Ambitions When/Why using it? To match passive variables with classes of words in large corpora. To help the elaboration of codebooks for inductive approaches (Summative Content Analysis). To generate hypothesis to be tested by hypothetico-deductive analyses (Classical Content Analysis). To triangulate results generated by inductive approaches.

Exploratory Content Analysis Ambitions & Applications Applications Parliamentary debates (Schonhardt-Bailey 2008; Weale et al. 2012; Bara et al. 2007) Online public consultations (Bicquelet and Weale 2011) Political Manifestos (Bicquelet, 2007) Open-ended Questionnaires (Brugidou 2003; Lahlou 1996) Social Media Analysis/Sentiment Analysis/Marketing Applications (etc )

Exploratory Content Analysis Example - Parliamentary debates about the EU.

Exploratory Content Analysis Advantages & Critiques Advantages Generate classification free from preconceptions Produce fast results High reliability/ Replicability Critiques Analysis of the classes is an interpretative process (not free from preconceptions) Danger to overlook valuable information (impossible to evaluate weight or strength) Weak validity

ORIGINS & EPISTEMOLOGICAL FOUNDATIONS

Exploratory Content Analysis Origins and Epistemological Foundations Cognitive Anthropology (Cultural Domain Analysis) + Structural Linguistics + Statistics (Descriptive and Exploratory Data Analysis)

Exploratory Content Analysis Origins and Epistemological Foundations I. Cognitive Anthropology (Spradley, 1972) Cultural Domain Analysis (Borgatti, 1999) Organised set of words, concepts and sentences tell us a lot about how people think. Cluster of words are conceptual domains referring to a set of objects reflecting the everyday taxonomy of a native people. Free lists and pile sorts help to identify items in a cultural domain.

Exploratory Content Analysis Origins and Epistemological Foundations What is it like to live in London? (Free Lists 10 words) Resp. 1: grey; cold; lonely; food; expensive; travel; tube; work; walk; park. Resp. 2: Fun; parties; Art; movie; restaurant; rainy; shops; cheap; flatmates; exciting. Resp. 3: Tea; pub; galleries; sunny; cool; pricy; cycling; yoga; colleagues; kids.

Exploratory Content Analysis Origins and Epistemological Foundations What is it like to live in London? (Pile sorts/ judged similarities) Weather Feelings Relations Money Activities Grey Cold Rainy Sunny Lonely Fun Exciting Cool Family Friends Flatmate Kids Cheap Expensive Pricy Travel Work Walk Job Movie Gallery ( )

Exploratory Content Analysis Origins and Epistemological Foundations II. Structural Linguistics Semiotics (C.S. Peirce, 1877) We make sense of reality through word associations and co-occurrences. The meaning of a word is best understood by the set of words that co-occur with it. Two words that have similar co-occurrence patterns are semantically related ( Semantic Network Analysis)

Exploratory Content Analysis Origins and Epistemological Foundations Knife Restaurant Italian Waiter Horrified Pasta Boyfriend Smiled Knife Trial Italy Judge Horrified murder Boyfriend charged

Exploratory Content Analysis Origins and Epistemological Foundations Valentine Dinner Amanda Knox s trial Knife Restaurant Italian Waiter Horrified Pasta Boyfriend Smiled Knife Trial Italy Judge Horrified murder Boyfriend charged

Exploratory Content Analysis Methods of Analysis III. Descriptive Statistics Exploratory data Analysis (J. Tukey, 1977) Word frequencies Ranking (how early an items gets mentioned) KWIC Multidimensional Scaling Cluster Analysis Correspondence Analysis

Exploratory Content Analysis Methods of Analysis: Multidimensional Scaling

Exploratory Content Analysis Methods of Analysis: Multidimensional Scaling MDS maps the relations among items in a matrix. The algorithms work out the best spatial representation of a set of items that are represented by a set of similarities. Similarity Matrix Grey Lone. Kids Sun. Cheap Friend Price Fun Cold Cool Rain. Grey 1 0 0 1 0 0 0 0 1 0 1 Lone. 0 1 0 0 0 0 0 1 0 1 0 Kids 0 0 1 0 0 1 0 0 0 0 0 Sun 1 0 0 1 0 0 0 0 1 0 1 Cheap 0 0 0 0 1 0 1 0 0 0 0 Friends 0 0 1 0 0 1 0 0 0 0 0 Price 0 0 0 0 1 0 1 0 0 0 0 Fun 0 1 0 0 0 0 0 1 0 1 0 Cold 1 0 0 1 0 0 0 0 1 0 1 Cool 0 1 0 0 0 0 0 1 0 1 0 Rain 1 0 0 1 0 0 0 0 1 0 1

Exploratory Content Analysis Methods of Analysis: Cluster Analysis Cluster Analysis is another visualization method. Like MDS it operates on similarity matrices. In cluster analysis the aim is to divide a set of items into subgroups (clusters). Members of each subgroups are more like each other than they are like members of other subgroups.

Exploratory Content Analysis Methods of Analysis: Cluster Analysis

Exploratory Content Analysis Software Alceste Website: http://www.image-zafar.com/en/alceste-software Iramuteq (r) Website: http://www.r-project.org/ QDA Miner/Wordstat Website: http://provalisresearch.com/products/qualitative-data-analysissoftware/ T-Lab Website: http://www.tlab.it/en/presentation.php

Exploratory Content Analysis A mixed-method Approach A mixed-method Approach Generating statistical outputs is only the first step towards an integrated analysis of the results. It is always necessary to check word frequencies, clusters, correspondence and multidimensional analyses against the raw data. No software can replace the interpretative process of a corpus Exploratory Content Analysis is thus a Mixed-Method approach to data analysis.

APPLICATIONS OF EXPLORATORY CONTENT ANALYSIS

BICQUELET AND WEALE S ANALYSIS OF PUBLIC CONSULTATIONS (2011)

Bicquelet and Weale (2011) Research question: What are the most commonly expressed arguments for/against funding end of life medicines by different cohorts (patients, carers, NHS professional?) Data: Public Consultation on end of life medicines run by NICE in 2008. Software: Alceste Units of Analysis: Digitised answers to public consultation Sampling strategy: Purposive

Bicquelet and Weale (2011)

Bicquelet and Weale (2011)

Bicquelet and Weale (2011)

Bicquelet and Weale (2011) Why Alceste? Suitable for very large corpora Little coding (only requires the definition of unit of analyses and variables) Works purely on the occurrence and co-occurrence of typical word pairs in a corpus. Automatically produces classes made up of key terms. Key terms and classes are matched with the variables

Bicquelet and Weale (2011)

Bicquelet and Weale (2011)

BICQUELET et al. s ANALYSIS OF GENDER DIFFERENCES IN PARLIAMENT (2012)

Bicquelet et al. (2012) Research Question: Do men and women express similar (or different) types of arguments in parliamentary debates? Data: 6 Second reading debates from 1966 to 1988 in the UK House of Commons Software: Alceste Units of Analysis: Speech-acts Sampling Strategy: Purposive

Bicquelet et al. (2012) Date House Type of debate Initiator Party Government 22 nd July 1966 Commons Second Steel Liberal Labour (Wilson) 13 th Feb. 1970 Commons Second Conservative Labour (Wilson) 7 th Feb. 1975 Commons Second White Labour Labour (Wilson) 25 th Feb. 1977 Commons Second Benyon Conservative Labour (Callaghan) 13 th July 1979 Commons Second Corrie Conservative Conservative (Thatcher) 22 nd Jan. 1988 Commons Second Liberal Democrat Conservative (Thatcher)

Bicquelet et al. (2012) Coding For each debate: name of the speaker year gender (Unit of anlysis = speech act / utterence) Corpus Construction Integrated debate (compile all six debates within a single corpus) Analysis of six debates individually Type of Analysis: - Standard (Integrated debate) - Cross-data analysis - Tri-croisé on the variable gender (on each single debate)

Class of Sentences Main Themes in Class 1. Moral concerns Sanctity and value of life. Moral status of child. Effects of disability Unwanted pregnancy, strain on families. 2. Operation of medical facilities Permits and licensing Operation of medical facilities 3. Effects of 1967 legislation Estimates of number of abortions, including illegal Legal status of abortion Some polling evidence 4. Rhetoric of debate and procedure Congratulations or criticism of other speakers Character of procedure 5. Role of committees and reports Reference to committees of enquiry Reference to parliamentary committees 6. Reflections on debate Character of debate Role of parliament

Bicquelet et al. (2012)

Bicquelet et al. (2012) Results Women have used a larger array of rhetorical strategies than men Political alignment did not seem to influence the way arguments were being framed Women and Men deploy similar rhetorical strategies but the key difference is terms of their frequency If Women and Men speak in a different voice, this is only from time to time

Right-Wing Nationalism in Political Manifestos

Right-Wing Nationalism in Political Manifestos Research Question: is there a rhetoric of right-wing nationalism that is distinct from the language deployed by self-declared mainstream parties? Data: Manifestos of right-wing nationalist parties from three different countries (France, Italy and the UK). Software: Alceste Units of Analysis: Manifestos. Sampling Strategy: Purposive

Right-Wing Nationalism in Political Manifestos

Right-Wing Nationalism in Political Manifestos

Right-Wing Nationalism in Political Manifestos

Right-Wing Nationalism in Political Manifestos Results Right-wing political manifestos commonly use four types of argumentative patterns: The Rhetoric of Decadence The Rhetoric of Defence The Rhetoric of Tradition The Rhetoric of Difference While immigration was expected to be a dominant theme, it was in fact subordinated within concerns about home countries in relation to the European Union.

Useful Resources Bicquelet, A., Weale, A & Bara,J. (2012) In a different Parliamentary Voice? An analysis of gender differences in UK parliamentary debates about abortion Politics & Gender (Vol.8,N.1) Schonhardt-Bailey, C. (2008), The congressional debate on partial-birth abortion: Constitutional gravitas and moral passion. British Journal of Political Science (38:383 410). Bicquelet, A. & Weale, A. (2011), Coping with the cornucopia: Can Text Mining Help Handle the Data Deluge in Public Policy Analysis? Policy & Internet (Vol.3, N.4) Brugidou, M. (2003) Argumentation and Values: An Analysis of Ordinary Political Competence via an Open-Ended Question International Journal of Public Opinion Research, 15:4, pp. 413-430. Guérin-Pace, F (1998) Textual Statistics. An Exploratory Tool for the Social Sciences, New Methodological Approaches in the Social Sciences, 10:1, pp. 73-95. Lahlou, S. (1996) A Method to Extract Social Representations from Linguistic Corpora. Japanese Journal of Experimental Social Psychology. 36:1, pp. 278 291. Schonhardt-Bailey, C. (2005) Measuring Ideas More Effectively: An Analysis of Bush and Kerry's National Security Speeches, PS: Political Science and Politics, 38:3, pp. 701-711.