Chapter. Estimating the Value of a Parameter Using Confidence Intervals Pearson Prentice Hall. All rights reserved


Chapter 9 Estimating the Value of a Parameter Using Confidence Intervals 2010 Pearson Prentice Hall. All rights reserved

Section 9.1 The Logic in Constructing Confidence Intervals for a Population Mean When the Population Standard Deviation Is Known

Objectives
1. Compute a point estimate of the population mean
2. Construct and interpret a confidence interval for the population mean assuming that the population standard deviation is known
3. Understand the role of margin of error in constructing the confidence interval
4. Determine the sample size necessary for estimating the population mean within a specified margin of error

Objective 1 Compute a Point Estimate of the Population Mean

A point estimate is the value of a statistic that estimates the value of a parameter. For example, the sample mean, $\bar{x}$, is a point estimate of the population mean µ.

Parallel Example 1: Computing a Point Estimate
Pennies minted after 1982 are made from 97.5% zinc and 2.5% copper. The following data represent the weights (in grams) of 17 randomly selected pennies minted after 1982.
2.46 2.47 2.49 2.48 2.50 2.44
2.46 2.45 2.49 2.47 2.45 2.46
2.45 2.46 2.47 2.44 2.45
Treat the data as a simple random sample. Estimate the population mean weight of pennies minted after 1982.

Solution
The sample mean is $\bar{x} = \frac{2.46 + 2.47 + \cdots + 2.45}{17} = \frac{41.89}{17} \approx 2.464$.
The point estimate of µ is 2.464 grams.
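The point estimate can be checked with a few lines of Python. This is a minimal sketch using only the standard library; the variable names `weights` and `x_bar` are chosen here for illustration and do not come from the slides.

```python
from statistics import mean

# Weights (in grams) of the 17 pennies from Parallel Example 1.
weights = [
    2.46, 2.47, 2.49, 2.48, 2.50, 2.44,
    2.46, 2.45, 2.49, 2.47, 2.45, 2.46,
    2.45, 2.46, 2.47, 2.44, 2.45,
]

# The sample mean is the point estimate of the population mean.
x_bar = mean(weights)
print(f"n = {len(weights)}, point estimate = {x_bar:.3f} grams")
```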

Objective 2 Construct and Interpret a Confidence Interval for the Population Mean

A confidence interval for an unknown parameter consists of an interval of numbers. The level of confidence represents the expected proportion of intervals that will contain the parameter if a large number of different samples is obtained. The level of confidence is denoted (1 − α) · 100%.

For example, a 95% level of confidence (α = 0.05) implies that if 100 different confidence intervals are constructed, each based on a different sample from the same population, we will expect 95 of the intervals to contain the parameter and 5 not to contain it.

Confidence interval estimates for the population mean are of the form
Point estimate ± margin of error.
The margin of error of a confidence interval estimate of a parameter is a measure of how accurate the point estimate is.

The margin of error depends on three factors:
1. Level of confidence: As the level of confidence increases, the margin of error also increases.
2. Sample size: As the size of the random sample increases, the margin of error decreases.
3. Standard deviation of the population: The more spread there is in the population, the wider our interval will be for a given level of confidence.

The shape of the distribution of all possible sample means will be normal provided the population is normal, or approximately normal if the sample size is large (n ≥ 30), with mean $\mu_{\bar{x}} = \mu$ and standard deviation $\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}$.

Because $\bar{x}$ is normally distributed, we know 95% of all sample means lie within 1.96 standard deviations of the population mean, µ, and 2.5% of the sample means lie in each tail.

95% of all sample means are in the interval
$\mu - 1.96 \cdot \frac{\sigma}{\sqrt{n}} < \bar{x} < \mu + 1.96 \cdot \frac{\sigma}{\sqrt{n}}$
With a little algebraic manipulation, we can rewrite this inequality and obtain:
$\bar{x} - 1.96 \cdot \frac{\sigma}{\sqrt{n}} < \mu < \bar{x} + 1.96 \cdot \frac{\sigma}{\sqrt{n}}$

It is common to write the 95% confidence interval as
$\bar{x} \pm 1.96 \cdot \frac{\sigma}{\sqrt{n}}$
so that it is of the form Point estimate ± margin of error.
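The algebraic manipulation that turns the statement about sample means into a statement about µ can be written out step by step. This is a sketch of the standard rearrangement, using $\bar{x}$, $\mu$, $\sigma$, and $n$ for the sample mean, population mean, population standard deviation, and sample size:

```latex
\begin{align*}
\mu - 1.96\,\frac{\sigma}{\sqrt{n}} &< \bar{x} < \mu + 1.96\,\frac{\sigma}{\sqrt{n}}
  && \text{(95\% of all sample means)} \\
-1.96\,\frac{\sigma}{\sqrt{n}} &< \bar{x} - \mu < 1.96\,\frac{\sigma}{\sqrt{n}}
  && \text{(subtract } \mu \text{)} \\
-\bar{x} - 1.96\,\frac{\sigma}{\sqrt{n}} &< -\mu < -\bar{x} + 1.96\,\frac{\sigma}{\sqrt{n}}
  && \text{(subtract } \bar{x} \text{)} \\
\bar{x} - 1.96\,\frac{\sigma}{\sqrt{n}} &< \mu < \bar{x} + 1.96\,\frac{\sigma}{\sqrt{n}}
  && \text{(multiply by } -1 \text{ and reverse the inequalities)}
\end{align*}
```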

Parallel Example 2: Using Simulation to Demonstrate the Idea of a Confidence Interval
We will use Minitab to simulate obtaining 30 simple random samples of size n = 8 from a population that is normally distributed with µ = 50 and σ = 10. Construct a 95% confidence interval for each sample. How many of the samples result in intervals that contain µ = 50?

Sample  Mean   95% CI
C1      47.07  (40.14, 54.00)
C2      49.33  (42.40, 56.26)
C3      50.62  (43.69, 57.54)
C4      47.91  (40.98, 54.84)
C5      44.31  (37.38, 51.24)
C6      51.50  (44.57, 58.43)
C7      52.47  (45.54, 59.40)
C8      59.62  (52.69, 66.54)
C9      43.49  (36.56, 50.42)
C10     55.45  (48.52, 62.38)
C11     50.08  (43.15, 57.01)
C12     56.37  (49.44, 63.30)
C13     49.05  (42.12, 55.98)
C14     47.34  (40.41, 54.27)
C15     50.33  (43.40, 57.25)
C16     44.81  (37.88, 51.74)
C17     51.05  (44.12, 57.98)
C18     43.91  (36.98, 50.84)
C19     46.50  (39.57, 53.43)
C20     49.79  (42.86, 56.72)
C21     48.75  (41.82, 55.68)
C22     51.27  (44.34, 58.20)
C23     47.80  (40.87, 54.73)
C24     56.60  (49.67, 63.52)
C25     47.70  (40.77, 54.63)
C26     51.58  (44.65, 58.51)
C27     47.37  (40.44, 54.30)
C28     61.42  (54.49, 68.35)
C29     46.89  (39.96, 53.82)
C30     51.92  (44.99, 58.85)

Note that 28 out of 30, or 93%, of the confidence intervals contain the population mean µ = 50. In general, for a 95% confidence interval, any sample mean that lies within 1.96 standard errors of the population mean will result in a confidence interval that contains µ. Whether a confidence interval contains µ depends solely on the sample mean, $\bar{x}$.
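The slides run this simulation in Minitab. As a rough equivalent, the sketch below repeats the experiment in Python with the standard library only. The function name `simulate_intervals` and the seed are illustrative assumptions, and because the random draws differ from Minitab's, the number of intervals that capture µ will generally not match the 28 reported above.

```python
import random
from math import sqrt

MU, SIGMA = 50.0, 10.0   # population parameters from the example
N, NUM_SAMPLES = 8, 30   # sample size and number of samples
Z_CRIT = 1.96            # critical value for 95% confidence

def simulate_intervals(seed=1):
    """Return (mean, lower, upper, contains_mu) for each simulated sample."""
    rng = random.Random(seed)  # the seed is arbitrary (assumption)
    margin = Z_CRIT * SIGMA / sqrt(N)
    results = []
    for _ in range(NUM_SAMPLES):
        sample = [rng.gauss(MU, SIGMA) for _ in range(N)]
        x_bar = sum(sample) / N
        lower, upper = x_bar - margin, x_bar + margin
        results.append((x_bar, lower, upper, lower < MU < upper))
    return results

results = simulate_intervals()
for i, (x_bar, lower, upper, hit) in enumerate(results, start=1):
    flag = "" if hit else "  <-- misses mu"
    print(f"C{i:<3} {x_bar:6.2f}  ({lower:6.2f}, {upper:6.2f}){flag}")

covered = sum(r[3] for r in results)
print(f"{covered} of {NUM_SAMPLES} intervals contain mu = {MU}")
```

Every interval has the same width because σ and n are fixed; only the center, $\bar{x}$, changes from sample to sample.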

Interpretation of a Confidence Interval
A (1 − α) · 100% confidence interval indicates that, if we obtained many simple random samples of size n from the population whose mean, µ, is unknown, then approximately (1 − α) · 100% of the intervals will contain µ. For example, if we constructed a 99% confidence interval with a lower bound of 52 and an upper bound of 71, we would interpret the interval as follows: We are 99% confident that the population mean, µ, is between 52 and 71.

Constructing a (1 − α) · 100% Confidence Interval for µ, σ Known
Suppose that a simple random sample of size n is taken from a population with unknown mean, µ, and known standard deviation σ. A (1 − α) · 100% confidence interval for µ is given by
Lower bound: $\bar{x} - z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}}$
Upper bound: $\bar{x} + z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}}$
where $z_{\alpha/2}$ is the critical Z-value.
Note: The sample size must be large (n ≥ 30) or the population must be normally distributed.

Parallel Example 3: Constructing a Confidence Interval
Construct a 99% confidence interval about the population mean weight (in grams) of pennies minted after 1982. Assume σ = 0.02 grams.
2.46 2.47 2.49 2.48 2.50 2.44
2.46 2.45 2.49 2.47 2.45 2.46
2.45 2.46 2.47 2.44 2.45

[Figure: Weight (in grams) of Pennies]

Lower bound: $\bar{x} - z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}} = 2.464 - 2.575 \cdot \frac{0.02}{\sqrt{17}} = 2.464 - 0.012 = 2.452$
Upper bound: $\bar{x} + z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}} = 2.464 + 2.575 \cdot \frac{0.02}{\sqrt{17}} = 2.464 + 0.012 = 2.476$
We are 99% confident that the mean weight of pennies minted after 1982 is between 2.452 and 2.476 grams.
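The same interval can be computed programmatically. The sketch below is one way to do it with Python's standard library; the helper name `z_interval` is hypothetical. It obtains the critical value from `statistics.NormalDist` rather than from a table, so it uses roughly 2.5758 where the slides use the rounded table value 2.575, and it starts from the rounded sample mean 2.464 as the slides do.

```python
from math import sqrt
from statistics import NormalDist

def z_interval(x_bar, sigma, n, confidence):
    """Confidence interval for a population mean when sigma is known."""
    alpha = 1 - confidence
    # Critical value z_{alpha/2}: the point with area alpha/2 to its right.
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    margin = z_crit * sigma / sqrt(n)
    return x_bar - margin, x_bar + margin

lower, upper = z_interval(x_bar=2.464, sigma=0.02, n=17, confidence=0.99)
print(f"99% CI: ({lower:.3f}, {upper:.3f})")
```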

A simple random sample of size n < 30 has been obtained. From the normal probability plot and boxplot, judge whether a Z-interval should be constructed.
[Figure: normal probability plot and boxplot of the data]
A. Yes
B. No


Objective 3 Understand the Role of the Margin of Error in Constructing a Confidence Interval

The margin of error, E, in a (1 − α) · 100% confidence interval in which σ is known is given by
$E = z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}}$
where n is the sample size.
Note: We require that the population from which the sample was drawn be normally distributed or the sample size n be greater than or equal to 30.

Parallel Example 5: Role of the Level of Confidence in the Margin of Error
Construct a 90% confidence interval for the mean weight of pennies minted after 1982. Comment on the effect that decreasing the level of confidence has on the margin of error.

Lower bound: $\bar{x} - z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}} = 2.464 - 1.645 \cdot \frac{0.02}{\sqrt{17}} = 2.464 - 0.008 = 2.456$
Upper bound: $\bar{x} + z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}} = 2.464 + 1.645 \cdot \frac{0.02}{\sqrt{17}} = 2.464 + 0.008 = 2.472$
We are 90% confident that the mean weight of pennies minted after 1982 is between 2.456 and 2.472 grams.

Notice that the margin of error decreased from 0.012 to 0.008 when the level of confidence decreased from 99% to 90%. The interval is therefore wider for the higher level of confidence.

Confidence Level  Margin of Error  Confidence Interval
90%               0.008            (2.456, 2.472)
99%               0.012            (2.452, 2.476)

Parallel Example 6: Role of Sample Size in the Margin of Error
Suppose that we obtained a simple random sample of pennies minted after 1982. Construct a 99% confidence interval with n = 35. Assume the larger sample size results in the same sample mean, 2.464. The standard deviation is still σ = 0.02. Comment on the effect increasing sample size has on the width of the interval.

Lower bound: $\bar{x} - z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}} = 2.464 - 2.575 \cdot \frac{0.02}{\sqrt{35}} = 2.464 - 0.009 = 2.455$
Upper bound: $\bar{x} + z_{\alpha/2} \cdot \frac{\sigma}{\sqrt{n}} = 2.464 + 2.575 \cdot \frac{0.02}{\sqrt{35}} = 2.464 + 0.009 = 2.473$
We are 99% confident that the mean weight of pennies minted after 1982 is between 2.455 and 2.473 grams.

Notice that the margin of error decreased from 0.012 to 0.009 when the sample size increased from 17 to 35. The interval is therefore narrower for the larger sample size.

Sample Size  Margin of Error  Confidence Interval
17           0.012            (2.452, 2.476)
35           0.009            (2.455, 2.473)
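The two effects on the margin of error (level of confidence and sample size) can be reproduced side by side. This is a small sketch; the function name `margin_of_error` is chosen for illustration, and the critical values 1.645 and 2.575 are the rounded table values used in the slides.

```python
from math import sqrt

def margin_of_error(z_crit, sigma, n):
    """E = z_{alpha/2} * sigma / sqrt(n), for a known sigma."""
    return z_crit * sigma / sqrt(n)

SIGMA = 0.02
# Critical values as used in the slides (rounded standard normal table values).
Z_90, Z_99 = 1.645, 2.575

cases = [
    ("90% confidence, n = 17", Z_90, 17),
    ("99% confidence, n = 17", Z_99, 17),
    ("99% confidence, n = 35", Z_99, 35),
]
for label, z_crit, n in cases:
    print(f"{label}: E = {margin_of_error(z_crit, SIGMA, n):.3f}")
```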

Objective 4 Determine the Sample Size Necessary for Estimating the Population Mean within a Specified Margin of Error

Determining the Sample Size n
The sample size required to estimate the population mean, µ, with a level of confidence (1 − α) · 100% with a specified margin of error, E, is given by
$n = \left( \frac{z_{\alpha/2} \cdot \sigma}{E} \right)^2$
where n is rounded up to the nearest whole number.

Parallel Example 7: Determining the Sample Size
Back to the pennies. How large a sample would be required to estimate the mean weight of a penny manufactured after 1982 within 0.005 grams with 99% confidence? Assume σ = 0.02.

$z_{\alpha/2} = z_{0.005} = 2.575$, σ = 0.02, E = 0.005
$n = \left( \frac{2.575 \cdot 0.02}{0.005} \right)^2 = 106.09$
Rounding up, we find n = 107.
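The sample-size formula translates directly into code. The sketch below assumes the critical value is supplied by the caller (here the table value 2.575 from the slides); `sample_size_for_mean` is a hypothetical helper name.

```python
from math import ceil

def sample_size_for_mean(z_crit, sigma, margin):
    """Smallest whole n for which z_crit * sigma / sqrt(n) <= margin."""
    return ceil((z_crit * sigma / margin) ** 2)

n = sample_size_for_mean(z_crit=2.575, sigma=0.02, margin=0.005)
print(f"required sample size: {n}")
```

Rounding up rather than to the nearest integer guarantees the margin of error does not exceed the target.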

A CEO wants to estimate the mean age of employees at a company. How many employees should be in a sample to estimate the mean age to within 0.5 year with 95% confidence? Assume that σ = 4.8 years.
A. 18
B. 19
C. 354
D. 355


Section 9.2 Confidence Intervals about a Population Mean When the Population Standard Deviation is Unknown

Objectives
1. Know the properties of Student's t-distribution
2. Determine t-values
3. Construct and interpret a confidence interval for a population mean

Objective 1 Know the Properties of Student's t-distribution

Student's t-distribution
Suppose that a simple random sample of size n is taken from a population. If the population from which the sample is drawn follows a normal distribution, the distribution of
$t = \frac{\bar{x} - \mu}{s / \sqrt{n}}$
follows Student's t-distribution with n − 1 degrees of freedom, where $\bar{x}$ is the sample mean and s is the sample standard deviation.

Parallel Example 1: Comparing the Standard Normal Distribution to the t-distribution Using Simulation
a) Obtain 1,000 simple random samples of size n = 5 from a normal population with µ = 50 and σ = 10.
b) Determine the sample mean and sample standard deviation for each of the samples.
c) Compute $z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$ and $t = \frac{\bar{x} - \mu}{s / \sqrt{n}}$ for each sample.
d) Draw a histogram for both z and t.

[Figure: Histogram for z]

[Figure: Histogram for t]

CONCLUSIONS: The histogram for z is symmetric and bell-shaped with the center of the distribution at 0 and virtually all the rectangles between -3 and 3. In other words, z follows a standard normal distribution. The histogram for t is also symmetric and bell-shaped with the center of the distribution at 0, but the distribution of t has longer tails (i.e., t is more dispersed), so it is unlikely that t follows a standard normal distribution. The additional spread in the distribution of t can be attributed to the fact that we use s to find t instead of σ. Because the sample standard deviation is itself a random variable (rather than a constant such as σ), we have more dispersion in the distribution of t.
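The simulation in steps a) through c) can be sketched in Python with the standard library. Instead of drawing histograms, this version summarizes the longer tails of t by counting how often each statistic falls outside ±3. The seed, the cutoff of 3, and the helper name `share_beyond` are illustrative assumptions, not part of the original example.

```python
import random
from math import sqrt
from statistics import mean, stdev

MU, SIGMA, N, NUM_SAMPLES = 50.0, 10.0, 5, 1000

rng = random.Random(2010)  # arbitrary seed (assumption)
z_values, t_values = [], []
for _ in range(NUM_SAMPLES):
    sample = [rng.gauss(MU, SIGMA) for _ in range(N)]
    x_bar, s = mean(sample), stdev(sample)  # stdev divides by n - 1
    z_values.append((x_bar - MU) / (SIGMA / sqrt(N)))  # uses the known sigma
    t_values.append((x_bar - MU) / (s / sqrt(N)))      # uses the sample s

def share_beyond(values, cutoff=3.0):
    """Fraction of values farther than `cutoff` from 0."""
    return sum(abs(v) > cutoff for v in values) / len(values)

print(f"share of |z| > 3: {share_beyond(z_values):.3f}")
print(f"share of |t| > 3: {share_beyond(t_values):.3f}")
```

With only 4 degrees of freedom, a noticeably larger share of the t-values should land beyond ±3 than of the z-values, which is the extra dispersion described in the conclusions.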

Properties of the t-distribution
1. The t-distribution is different for different degrees of freedom.
2. The t-distribution is centered at 0 and is symmetric about 0.
3. The area under the curve is 1. The area under the curve to the right of 0 equals the area under the curve to the left of 0, which equals 1/2.
4. As t increases without bound, the graph approaches, but never equals, zero. As t decreases without bound, the graph approaches, but never equals, zero.
5. The area in the tails of the t-distribution is a little greater than the area in the tails of the standard normal distribution, because we are using s as an estimate of σ, thereby introducing further variability into the t-statistic.
6. As the sample size n increases, the density curve of t gets closer to the standard normal density curve. This result occurs because, as the sample size n increases, the values of s get closer to the values of σ, by the Law of Large Numbers.

Objective 2 Determine t-values

Parallel Example 2: Finding t-values
Find the t-value such that the area under the t-distribution to the right of the t-value is 0.2 assuming 10 degrees of freedom. That is, find $t_{0.20}$ with 10 degrees of freedom.

Solution
The figure shows the graph of the t-distribution with 10 degrees of freedom. The unknown value of t is labeled, and the area under the curve to the right of t is shaded. The value of $t_{0.20}$ with 10 degrees of freedom is 0.8791.
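A t-value like this is normally read from a table or from statistical software. As an illustration of what that lookup computes, the sketch below finds the value numerically using only the standard library: it integrates the t density with Simpson's rule and inverts the tail area by bisection. The function names, step count, and tolerance are assumptions made for this example, and the approach is meant for moderate degrees of freedom, since `math.gamma` overflows for very large arguments.

```python
from math import gamma, pi, sqrt

def t_pdf(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    coeff = gamma((df + 1) / 2) / (sqrt(df * pi) * gamma(df / 2))
    return coeff * (1 + x * x / df) ** (-(df + 1) / 2)

def right_tail_area(t, df, steps=2000):
    """Area to the right of t >= 0, via Simpson's rule on [0, t]."""
    h = t / steps
    total = t_pdf(0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    # By symmetry, half of the total area lies to the right of 0.
    return 0.5 - total * h / 3

def t_critical(area, df, tol=1e-8):
    """Find t >= 0 whose right-tail area is `area` (0 < area < 0.5)."""
    lo, hi = 0.0, 1.0
    while right_tail_area(hi, df) > area:  # widen until the answer is bracketed
        hi *= 2
    while hi - lo > tol:                   # bisection
        mid = (lo + hi) / 2
        if right_tail_area(mid, df) > area:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

print(f"t_0.20 with 10 degrees of freedom: {t_critical(0.20, 10):.4f}")
```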

Objective 3 Construct and Interpret a Confidence Interval for a Population Mean

Constructing a (1 − α) · 100% Confidence Interval for µ, σ Unknown
Suppose that a simple random sample of size n is taken from a population with unknown mean µ and unknown standard deviation σ. A (1 − α) · 100% confidence interval for µ is given by
Lower bound: $\bar{x} - t_{\alpha/2} \cdot \frac{s}{\sqrt{n}}$
Upper bound: $\bar{x} + t_{\alpha/2} \cdot \frac{s}{\sqrt{n}}$
where $t_{\alpha/2}$ is computed with n − 1 degrees of freedom.
Note: The interval is exact when the population is normally distributed. It is approximately correct for non-normal populations, provided that n is large enough.

Parallel Example 3: Constructing a Confidence Interval about a Population Mean
The pasteurization process reduces the amount of bacteria found in dairy products, such as milk. The following data represent the counts of bacteria in pasteurized milk (in CFU/mL) for a random sample of 12 glasses of pasteurized milk. Data courtesy of Dr. Michael Lee, Professor, Joliet Junior College. Construct a 95% confidence interval for the bacteria count.

NOTE: Each observation is in tens of thousands, so 9.06 represents $9.06 \times 10^4$.

Solution: Checking Normality and Existence of Outliers
[Figure: Normal Probability Plot for CFU/mL]

Solution: Checking Normality and Existence of Outliers
[Figure: Boxplot of CFU/mL]

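Lower bound: $\bar{x} - t_{\alpha/2} \cdot \frac{s}{\sqrt{n}} = 6.41 - 2.201 \cdot \frac{4.55}{\sqrt{12}} \approx 3.52$
Upper bound: $\bar{x} + t_{\alpha/2} \cdot \frac{s}{\sqrt{n}} = 6.41 + 2.201 \cdot \frac{4.55}{\sqrt{12}} \approx 9.30$
The 95% confidence interval for the mean bacteria count in pasteurized milk is (3.52, 9.30).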

Parallel Example 5: The Effect of Outliers
Suppose a student miscalculated the amount of bacteria and recorded a result of $2.3 \times 10^5$. We would include this value in the data set as 23.0. What effect does this additional observation have on the 95% confidence interval?

Solution: Checking Normality and Existence of Outliers
[Figure: Boxplot of CFU/mL]

Solution
Lower bound: $\bar{x} - t_{\alpha/2} \cdot \frac{s}{\sqrt{n}} = 7.69 - 2.179 \cdot \frac{6.34}{\sqrt{13}} \approx 3.86$
Upper bound: $\bar{x} + t_{\alpha/2} \cdot \frac{s}{\sqrt{n}} = 7.69 + 2.179 \cdot \frac{6.34}{\sqrt{13}} \approx 11.52$
The 95% confidence interval for the mean bacteria count in pasteurized milk, including the outlier, is (3.86, 11.52).

CONCLUSIONS:
With the outlier, the sample mean is larger because the sample mean is not resistant.
With the outlier, the sample standard deviation is larger because the sample standard deviation is not resistant.
Without the outlier, the width of the interval decreased from 7.66 to 5.78.

                 $\bar{x}$  s     95% CI
Without Outlier  6.41       4.55  (3.52, 9.30)
With Outlier     7.69       6.34  (3.86, 11.52)
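Both intervals can be reproduced from the summary statistics in the table. The sketch below assumes the critical values $t_{0.025}$ = 2.201 (11 degrees of freedom) and 2.179 (12 degrees of freedom) taken from a standard t-table; these values and the helper name `t_interval` are not stated in the slides.

```python
from math import sqrt

def t_interval(x_bar, s, n, t_crit):
    """Confidence interval for a mean when sigma is unknown."""
    margin = t_crit * s / sqrt(n)
    return x_bar - margin, x_bar + margin

# t_crit values are assumed from a standard t-table (area 0.025 in the right tail).
cases = {
    "without outlier": dict(x_bar=6.41, s=4.55, n=12, t_crit=2.201),
    "with outlier": dict(x_bar=7.69, s=6.34, n=13, t_crit=2.179),
}
for label, stats in cases.items():
    lower, upper = t_interval(**stats)
    print(f"{label}: ({lower:.2f}, {upper:.2f}), width = {upper - lower:.2f}")
```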

Section 9.3 Confidence Intervals for a Population Proportion

Objectives
1. Obtain a point estimate for the population proportion
2. Construct and interpret a confidence interval for the population proportion
3. Determine the sample size necessary for estimating a population proportion within a specified margin of error

Objective 1 Obtain a Point Estimate for the Population Proportion

A point estimate is an unbiased estimator of the parameter. The point estimate for the population proportion is
$\hat{p} = \frac{x}{n}$
where x is the number of individuals in the sample with the specified characteristic and n is the sample size.

Parallel Example 1: Calculating a Point Estimate for the Population Proportion
In July of 2008, a Quinnipiac University Poll asked 1783 registered voters nationwide whether they favored or opposed the death penalty for persons convicted of murder. 1123 were in favor. Obtain a point estimate for the proportion of registered voters nationwide who are in favor of the death penalty for persons convicted of murder.

Solution
The point estimate for the proportion of registered voters nationwide who are in favor of the death penalty for persons convicted of murder is $\hat{p} = \frac{x}{n} = \frac{1123}{1783} \approx 0.63$.

Objective 2 Construct and Interpret a Confidence Interval for the Population Proportion

Sampling Distribution of $\hat{p}$
For a simple random sample of size n, the sampling distribution of $\hat{p}$ is approximately normal with mean $\mu_{\hat{p}} = p$ and standard deviation $\sigma_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}}$, provided that np(1 − p) ≥ 10.
NOTE: We also require that each trial be independent when sampling from finite populations.

Constructing a (1 − α) · 100% Confidence Interval for a Population Proportion
Suppose that a simple random sample of size n is taken from a population. A (1 − α) · 100% confidence interval for p is given by the following quantities
Lower bound: $\hat{p} - z_{\alpha/2} \cdot \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}$
Upper bound: $\hat{p} + z_{\alpha/2} \cdot \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}$
Note: It must be the case that $n\hat{p}(1 - \hat{p}) \geq 10$ and n ≤ 0.05N to construct this interval.

Parallel Example 2: Constructing a Confidence Interval for a Population Proportion
In July of 2008, a Quinnipiac University Poll asked 1783 registered voters nationwide whether they favored or opposed the death penalty for persons convicted of murder. 1123 were in favor. Obtain a 90% confidence interval for the proportion of registered voters nationwide who are in favor of the death penalty for persons convicted of murder.

Solution
$n\hat{p}(1 - \hat{p}) = 1783(0.63)(1 - 0.63) \approx 415.6 \geq 10$, and the sample size is definitely less than 5% of the population size.
α = 0.10, so $z_{\alpha/2} = z_{0.05} = 1.645$.
Lower bound: $0.63 - 1.645 \cdot \sqrt{\frac{0.63(1 - 0.63)}{1783}} \approx 0.61$
Upper bound: $0.63 + 1.645 \cdot \sqrt{\frac{0.63(1 - 0.63)}{1783}} \approx 0.65$

Solution
We are 90% confident that the proportion of registered voters who are in favor of the death penalty for those convicted of murder is between 0.61 and 0.65.
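The interval for the poll can be reproduced with a short function. This is a sketch under stated assumptions: the name `proportion_interval` is hypothetical, the critical value comes from `statistics.NormalDist` rather than a table, and only the $n\hat{p}(1 - \hat{p}) \geq 10$ condition is checked, because the population size N is not available to the code.

```python
from math import sqrt
from statistics import NormalDist

def proportion_interval(x, n, confidence):
    """Return (p_hat, lower, upper) for a population proportion."""
    p_hat = x / n
    if n * p_hat * (1 - p_hat) < 10:
        raise ValueError("n * p_hat * (1 - p_hat) must be at least 10")
    z_crit = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z_crit * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat, p_hat - margin, p_hat + margin

p_hat, lower, upper = proportion_interval(x=1123, n=1783, confidence=0.90)
print(f"p_hat = {p_hat:.2f}, 90% CI: ({lower:.2f}, {upper:.2f})")
```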

Objective 3 Determine the Sample Size Necessary for Estimating a Population Proportion within a Specified Margin of Error

Sample size needed for a specified margin of error, E, and level of confidence (1 − α):
$n = \hat{p}(1 - \hat{p}) \left( \frac{z_{\alpha/2}}{E} \right)^2$
Problem: The formula uses $\hat{p}$, which depends on n, the quantity we are trying to determine!

Two possible solutions:
1. Use an estimate of p based on a pilot study or an earlier study.
2. Let $\hat{p} = 0.5$, which gives the largest possible value of n for a given level of confidence and a given margin of error.

The sample size required to obtain a (1 − α) · 100% confidence interval for p with a margin of error E is given by
$n = \hat{p}(1 - \hat{p}) \left( \frac{z_{\alpha/2}}{E} \right)^2$
(rounded up to the next integer), where $\hat{p}$ is a prior estimate of p. If a prior estimate of p is unavailable, the sample size required is
$n = 0.25 \left( \frac{z_{\alpha/2}}{E} \right)^2$

Parallel Example 4: Determining Sample Size
A sociologist wanted to determine the percentage of residents of America that only speak English at home. What size sample should be obtained if she wishes her estimate to be within 3 percentage points with 90% confidence, assuming she uses the 2000 estimate obtained from the Census 2000 Supplementary Survey of 82.4%?

Solution
E = 0.03, $z_{\alpha/2} = z_{0.05} = 1.645$, $\hat{p} = 0.824$
$n = 0.824(1 - 0.824) \left( \frac{1.645}{0.03} \right)^2 \approx 436.04$
We round this value up to 437. The sociologist must survey 437 randomly selected American residents.
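Both versions of the sample-size formula (with and without a prior estimate) fit in one small function. In this sketch the helper name `sample_size_for_proportion` is hypothetical and the critical value is passed in explicitly, using the table value 1.645 as in the slides.

```python
from math import ceil

def sample_size_for_proportion(z_crit, margin, p_prior=None):
    """Sample size for estimating p; uses p = 0.5 when no prior estimate exists."""
    p = 0.5 if p_prior is None else p_prior
    return ceil(p * (1 - p) * (z_crit / margin) ** 2)

with_prior = sample_size_for_proportion(z_crit=1.645, margin=0.03, p_prior=0.824)
without_prior = sample_size_for_proportion(z_crit=1.645, margin=0.03)
print(f"with prior estimate 0.824: n = {with_prior}")
print(f"without a prior estimate:  n = {without_prior}")
```

The unrounded value in this example sits just above 436, so the result is sensitive to how the critical value is rounded: a more precise critical value (about 1.6449) appears to land just under 436 instead.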

Section 9.4 Putting It Together: Which Procedure Do I Use?

Objective
1. Determine the appropriate confidence interval to construct

Objective 1 Determine the Appropriate Confidence Interval to Construct
