The words you are searching are inside this book. To get more targeted content, please make full-text search by clicking here.

AP Statistics Course and Exam Description

Discover the best professional documents and content resources in AnyFlip Document Base.
Search
Published by INTERTU℠ EDUCATION, 2022-08-19 16:20:37

AP Statistics Course and Exam Description

AP Statistics Course and Exam Description

6UNIT Inference for Categorical Data: Proportions

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

UNC-4.K UNC-4.K.1

Calculate an appropriate For a comparison of proportions,
confidence interval
for a comparison of the interval estimate is
population proportions.
[Skill 3.D] ( pˆ1  pˆ2 )  z pˆ1(1  pˆ1)  pˆ2(1  pˆ2 ) .
n1 n2

CLARIFYING STATEMENT:

Formulas for interval estimates do not
appear explicitly on the AP Statistics Formula
Sheet provided with the AP Statistics Exam.
However, these formulas do not need to be
memorized, as they can be constructed
based on the general test statistic formula
and the relevant standard error formulas that
are provided on the formula sheet.

UNC-4.L UNC-4.L.1

Calculate an interval estimate Confidence intervals for a difference in
based on a confidence proportions can be used to calculate interval
interval for a difference estimates with specified units.
of proportions. [Skill 3.D]

AP Statistics Course and Exam Description Course Framework V.1 | 144

Return to Table of Contents

© 2020 College Board

Inference for Categorical Data: Proportions 6UNIT

TOPIC 6.9 SKILLS

Justifying a Statistical
Claim Based on a Argumentation
Confidence Interval
for a Difference of 4.B
Population Proportions Interpret statistical
calculations and findings to
Required Course Content assign meaning or assess
a claim.
4.D
Justify a claim based on a
confidence interval.

AVAILABLE RESOURCE
§§ Classroom Resource >
Calculations
Aren’t Enough!
The Importance of
Communication in
AP Statistics

ENDURING UNDERSTANDING

UNC-4

An interval of values should be used to estimate parameters, in order to account
for uncertainty.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

UNC-4.M UNC-4.M.1

Interpret a confidence In repeated random sampling with the same
interval for a difference sample size, approximately C% of confidence
of proportions. [Skill 4.B] intervals created will capture the difference in
population proportions.

UNC-4.M.2

Interpreting a confidence interval for difference
between population proportions should
include a reference to the sample taken and
details about the population it represents.

UNC-4.N UNC-4.N.1

Justify a claim based on a A confidence interval for difference in
confidence interval for a population proportions provides an interval of
difference of proportions. values that may provide sufficient evidence to
[Skill 4.D] support a particular claim in context.

AP Statistics Course and Exam Description Course Framework V.1 | 145

Return to Table of Contents

© 2020 College Board

6UNIT Inference for Categorical Data: Proportions

SKILLS TOPIC 6.10

Selecting Statistical Setting Up a Test for
Methods the Difference of Two
Population Proportion
1.F
Identify null and alternative Required Course Content
hypotheses.
1.E ENDURING UNDERSTANDING
Identify an appropriate
inference method for VAR-6
significance tests.
The normal distribution may be used to model variation.
Statistical
Argumentation LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

4.C VAR-6.H VAR-6.H.1
Verify that inference
procedures apply in a Identify the null and alternative For a two-sample test for a difference of two
given situation. hypotheses for a difference of proportions, the null hypothesis specifies
two population proportions.
AVAILABLE RESOURCE [Skill 1.F] a value of 0 for the difference in population
§§ Classroom Resource >
Inference VAR-6.I proportions, indicating no difference or effect.

Identify an appropriate testing VAR-6.H.2
method for the difference of
two population proportions. The null hypothesis for a difference in
[Skill 1.E]
proportions is: H0 : p1 = p2, or H0 : p1 − p2 = 0.

VAR-6.H.3

A one-sided alternative hypothesis for a

fdHoifrafea: rpde1inf>fceepre2in.nAcpertowopofo-psrrtiodiopenodsratisiloteHnrsna ia:stpiHv1 e<a h:pyp2p1, oo≠rt,hpe2.sis

VAR-6.I.1

For a single categorical variable, the appropriate
testing method for the difference of two

population proportions is a two-sample z-test for

a difference between two population proportions.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 146

Return to Table of Contents

© 2020 College Board

Inference for Categorical Data: Proportions 6UNIT

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-6.J VAR-6.J.1

Verify the conditions for In order to make statistical inferences when
making statistical inferences testing a difference between population
when testing a difference of proportions, we must check for independence
two population proportions. and that the sampling distribution is
[Skill 4.C] approximately normal:

a. To check for independence:

i. Data should be collected using two
independent, random samples or a
randomized experiment.

ii. When sampling without replacement,

check that n1 ≤ 10%N1 and n2 ≤ 10%N2.

b. Tpˆo1 −chpˆe2cisk that the sampling distribution of
approximately normal (shape):

i. For the combined sample, define the

combined (or pooled) proportion,

pˆc = n1 pˆ1 + n2 pˆ2 . Assuming that H0 is

true ( n1 + n2 0 or p1 = p2), check that
p1 − p2 =

   n1pˆc, n1 1  pˆc , n2 pˆc , and n2 1 pˆc

are all greater than or equal to some

predetermined value, typically either 5
or 10.

AP Statistics Course and Exam Description Course Framework V.1 | 147

Return to Table of Contents

© 2020 College Board

6UNIT Inference for Categorical Data: Proportions

SKILLS TOPIC 6.11

Using Probability Carrying Out a Test for
and Simulation the Difference of Two
Population Proportions
3.E
Calculate a test statistic Required Course Content

and find a p-value, provided ENDURING UNDERSTANDING

conditions for inference VAR-6
are met.
The normal distribution may be used to model variation.
Statistical
Argumentation LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

4.B VAR-6.K VAR-6.K.1
Interpret statistical
calculations and findings to Calculate an appropriate test The test statistic for a difference in
assign meaning or assess statistic for the difference of
a claim. two population proportions.
4.E [Skill 3.E]
Justify a claim using
a decision based on
significance tests.

AVAILABLE RESOURCE
§§ Classroom Resource >
Inference

proportions is:

z= (pˆ1  pˆ2 )  0 ,

pˆc (1  pˆc ) 1 1
n1 n2
n1 pˆ1 + n2 pˆ2
where pˆc = n1 + n2 .

CLARIFYING STATEMENT:

The formulas for test statistics do not appear
explicitly on the AP Statistics Formula
Sheet provided with the AP Statistics Exam.
However, these formulas do not need to
be memorized, as they can be constructed
based on the general test statistic formula
and the standard error formulas for each of
the relevant test statistics that are provided
on the formula sheet.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 148

Return to Table of Contents

© 2020 College Board

Inference for Categorical Data: Proportions 6UNIT

ENDURING UNDERSTANDING

DAT-3

Significance testing allows us to make decisions about hypotheses within a
particular context.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

DAT-3.C DAT-3.C.1

Interpret the p-value An interpretation of the p-value of a

of a significance test significance test for a difference of two
for a difference of population proportions should recognize that
population proportions.
[Skill 4.B] the p-value is computed by assuming that the

null hypothesis is true, i.e., by assuming that
the true population proportions are equal to
each other.

DAT-3.D DAT-3.D.1

Justify a claim about A formal decision explicitly compares the
the population based on p-value to the significance . If the p-value ≤ ,
the results of a significance H 0 : p1 = p2, or
test for a difference of then reject the null hypothesis, ,
population proportions. then fail to
[Skill 4.E] H0 : p1 − p2 = 0. If the p-value >

reject the null hypothesis.

DAT-3.D.2

The results of a significance test for a
difference of two population proportions can
serve as the statistical reasoning to support
the answer to a research question about the
two populations that were sampled.

AP Statistics Course and Exam Description Course Framework V.1 | 149

Return to Table of Contents

© 2020 College Board

THIS PAGE IS INTENTIONALLY LEFT BLANK.

AP STATISTICS

UNIT 7

Inference for
Quantitative
Data: Means

10–18%

AP EXAM WEIGHTING

~14–16

CLASS PERIODS

AP Statistics Course and Exam Description Course Framework V.1 | 151

Return to Table of Contents

© 2020 College Board

Remember to go to AP Classroom
to assign students the online
Personal Progress Check for
this unit.

Whether assigned as homework or
completed in class, the Personal
Progress Check provides each
student with immediate feedback
related to this unit’s topics and skills.

Personal Progress Check 7
Multiple-choice: ~50 questions
Free-response: 2 questions

§§ Inference and Collecting Data

§§ Investigative Task

AP Statistics Course and Exam Description Course Framework V.1 | 152

Return to Table of Contents

© 2020 College Board

7UNIT 10–18% ~14–16 CLASS PERIODS
  AP EXAM WEIGHTING

Inference for
Quantitative Data:
Means

Developing Understanding

BIG IDEA 1 In this unit, students will analyze quantitative data to make inferences about population
Variation and
Distribution  VAR means. Students should understand that t∗ and t-tests are used for inference with means
when the population standard deviation,  , is not known. Using s for  in the formula for z
§§ How do we know gives a slightly different value, t, whose distribution, which depends on sample size, has more

whether to use a t-test area in the tails than a normal distribution. The boundaries for rejecting a null hypothesis using
or a z-test for inference
a t-distribution tend to be further from the mean than for a normal distribution. Students
with means?
should understand how and why conditions for inference with proportions and means are
BIG IDEA 2
Patterns and similar and different.
Uncertainty  UNC
Building Course Skills numerical reference to support their claim
§§ How can we make
1.E 1.F 4.C 4.E (e.g., “Because p < 0.05, we reject the null
sure that samples
Unit 7 focuses on means, which has many hypothesis.”) and interpreting findings
are independent? similarities to the conditions and procedures within the context of the question.
for proportions. Since students sometimes
BIG IDEA 3 Preparing for the AP Exam
Data-Based confuse t-tests with z-tests, it will help to
Predictions, Decisions, It is critical for students to recognize that
and Conclusions  DAT review the underlying rationales each time free-response questions asking whether
conditions come up. This will help students data provide convincing evidence of
§§ Why is it inappropriate develop understanding through repeated some finding are asking for a significance
practice in new situations. Teachers can test, not just a descriptive analysis. When
to accept a hypothesis encourage students to be mindful of notation using statistical inference for significance
and use the formula sheet as a reference. tests, students should identify the correct
as true based on the parameter and hypotheses, identify an
Teachers can reinforce that inference appropriate test procedure and check
results of statistical testing requires careful selection of a conditions, calculate a test statistic and
procedure based on specific conditions for
inference testing? a given problem. Common errors include p-value, and provide a conclusion in context,
mislabeling conditions (e.g., incorrectly
associating the large sample condition along with a justification based on linkage
with independence), relying upon vague
references to the normal distribution, or between the p-value and the conclusion. For
applying an inappropriate large sample
condition. The null and alternative inference with means, the appropriate test
hypotheses must be clearly stated in terms
of population parameters, not sample will often be a t-test, but if is known, a z-test
statistics. A formal decision compares
would be appropriate (see 2018 FRQ 6(a)).
the p-value to the level of significance.
For a t-test, conditions are (1) random sample
Students should also practice providing a and (2) large sample (e.g., n > 30). When

sampling without replacement, students
should also verify that the sample size is at
most 10% of the population.

AP Statistics Course and Exam Description Course Framework V.1 | 153

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

UNIT AT A GLANCE

Enduring Topic Skills Class Periods
Understanding 7.1 I ntroducing Statistics: ~14–16 CLASS PERIODS

VAR-1 Should I Worry 1.A Identify the question to be answered or problem
About Error? to be solved (not assessed).

VAR-7, UNC-4 7.2 C onstructing a 3.C Describe probability distributions.
Confidence Interval for a
Population Mean 1.D Identify an appropriate inference method for
confidence intervals.

4.C Verify that inference procedures apply in a
given situation.

3.D Construct a confidence interval, provided
conditions for inference are met.

UNC-4 7.3 J ustifying a Claim About a 4.B Interpret statistical calculations and findings to
Population Mean Based on assign meaning or assess a claim.
a Confidence Interval
4.D Justify a claim based on a confidence interval.

4.A Make an appropriate claim or draw an
appropriate conclusion.

7.4 S etting Up a Test for a 1.E Identify an appropriate inference method for
Population Mean significance tests.

VAR-7 1.F Identify null and alternative hypotheses.

4.C Verify that inference procedures apply in a
given situation.

VAR-7, DAT-3 7.5 C arrying Out a Test for a 3.E Calculate a test statistic and find a p-value,
Population Mean
provided conditions for inference are met.

4.B Interpret statistical calculations and findings to
assign meaning or assess a claim.

4.E Justify a claim using a decision based on
significance tests.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 154

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

UNIT AT A GLANCE (cont’d)

Enduring Topic Skills Class Periods
Understanding 7.6 C onfidence Intervals ~14–16 CLASS PERIODS
1.D Identify an appropriate inference method for
UNC-4 for the Difference of confidence intervals.
Two Means
VAR-7 4.C Verify that inference procedures apply in a
7.7 J ustifying a Claim given situation.
VAR-7, DAT-3 About the Difference of
Two Means Based on a 3.D Construct a confidence interval, provided
Confidence Interval conditions for inference are met.

7.8 S etting Up a Test for 4.B Interpret statistical calculations and findings to
the Difference of Two assign meaning or assess a claim.
Population Means
4.D Justify a claim based on a confidence interval.
7.9 C arrying Out a Test for
the Difference of Two 4.A Make an appropriate claim or draw an
Population Means appropriate conclusion.

7.10 S kills Focus: Selecting, 1.E Identify an appropriate inference method for
Implementing, and significance tests.
Communicating
Inference Procedures 1.F Identify null and alternative hypotheses.

4.C Verify that inference procedures apply in a
given situation.

3.E Calculate a test statistic and find a p-value,

provided conditions for inference are met.

4.B Interpret statistical calculations and findings to
assign meaning or assess a claim.

4.E Justify a claim using a decision based on
significance tests.

N/A

Go to AP Classroom to assign the Personal Progress Check for Unit 7.
Review the results in class to identify and address any student misunderstandings.

AP Statistics Course and Exam Description Course Framework V.1 | 155

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

SAMPLE INSTRUCTIONAL ACTIVITIES

The sample activities on this page are optional and are offered to provide possible ways to
incorporate various instructional approaches into the classroom. They were developed in
partnership with teachers from the AP community to share ways that they approach teaching
some of the topics in this unit. Please refer to the Instructional Approaches section beginning
on p. 207 for more examples of activities and strategies.

Activity Topic Sample Activity
1 7.2
Predict and Confirm
2 7.2
3 7.3 iAsfktenroiwntnr,oxduczin*gthe, hcaovnefisdteundceentinstdeirsvcaul sfosrimn uslma afollrgaropuoppsuwlahtiaotnwmillehaanpwpheennifswigema
4 7.5 n
5 7.4
substitute the sample standard deviation s into the formula when σ is unknown. Have
7.9
students use the Simulating Confidence Intervals for Population Parameter applet to test
7.10 their conjectures (see link for Rossman/Chance Applets on page 210).

Team Challenge
Give each team of three to four students a copy of 2004 FRQ 6, which focuses on the

connection between a one-sample t-interval, a one-sample t-test, and the unfamiliar

concept of a one-sided confidence interval. Challenge teams to collaboratively produce a
model solution in 30 minutes.

Discussion Groups
Ask each group of three to four students to identify the conditions for performing a test
about a population mean. For each condition, have them explain why the condition is
required and what would go wrong with the test if the condition were violated. Have groups
pair up and compare answers.

Team FRQ
Give each team of four students copies of a free-response question that involves

performing a two-sample t-test (e.g., 2011 FRQ 4). Have each team member take

responsibility for writing one part of the model solution (hypotheses, procedure and
conditions, calculations, conclusion) with group input.

Graphic Organizer
Have students work in teams of two to three to develop a flowchart for determining which
inference procedure from Units 6 and 7 to use in a given setting.

AP Statistics Course and Exam Description Course Framework V.1 | 156

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

TOPIC 7.1 SKILL

Introducing Statistics: Selecting Statistical
Why Should I Worry Methods
About Error?
1.A
Required Course Content Identify the question to
be answered or problem
to be solved.

ENDURING UNDERSTANDING

VAR-1

Given that variation may be random or not, conclusions are uncertain.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-1.I VAR-1.I.1

Identify questions suggested Random variation may result in errors in
by probabilities of errors in statistical inference.
statistical inference. [Skill 1.A]

AP Statistics Course and Exam Description Course Framework V.1 | 157

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

SKILLS TOPIC 7.2

Using Probability Constructing a
and Simulation Confidence Interval
for a Population Mean
3.C
Describe probability Required Course Content
distributions.
3.D ENDURING UNDERSTANDING
Construct a confidence
interval, provided VAR-7
conditions for inference
are met. The t-distribution may be used to model variation.

Selecting Statistical LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE
Methods
VAR-7.A VAR-7.A.1
1.D
Identify an appropriate Describe t-distributions. When s is used instead of  to calculate a test
inference method for
confidence intervals. [Skill 3.C] statistic, the corresponding distribution, known

Statistical as the t-distribution, varies from the normal
Argumentation
distribution in shape, in that more of the area is
4.C allocated to the tails of the density curve than
Verify that inference in a normal distribution.
procedures apply in a
given situation. VAR-7.A.2

As the degrees of freedom increase, the area in

the tails of a t-distribution decreases.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 158

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

ENDURING UNDERSTANDING

UNC-4

An interval of values should be used to estimate parameters, in order to account
for uncertainty.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

UNC-4.O UNC-4.O.1

Identify an appropriate Because  is typically not known for distributions
confidence interval
procedure for a population of quantitative variables, the appropriate
mean, including the mean confidence interval procedure for estimating the
difference between values in population mean of one quantitative variable for
matched pairs. [Skill 1.D]
one sample is a one-sample t-interval for a mean.

UNC-4.O.2

dFoisrtroibnuetqeuda, tnhteitadtiisvterivbaurtiiaobnleo,fXt,=th(axt is normally

− μ) is a

s

n
t-distribution with n −1 degrees of freedom.

UNC-4.O.3

Matched pairs can be thought of as one sample
of pairs. Once differences between pairs of
values are found, inference for confidence
intervals proceeds as for a population mean.

UNC-4.P UNC-4.P.1

Verify the conditions for In order to calculate confidence intervals to
calculating confidence estimate a population mean, we must check
intervals for a population for independence and that the sampling
mean, including the mean distribution is approximately normal:
difference between values in
matched pairs. [Skill 4.C] a. To check for independence:
i. Data should be collected using a random
sample or a randomized experiment.
ii. When sampling without replacement,

check that n ≤ 10%N , where N is the size

of the population.

b. To check that the sampling distribution of x

is approximately normal (shape):
i. If the observed distribution is skewed,

n should be greater than 30.

ii. If the sample size is less than 30, the
distribution of the sample data should be
free from strong skewness and outliers.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 159

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

UNC-4.Q UNC-4.Q.1

Determine the margin of error The critical value t * with n −1 degrees of
for a given sample size for a
one-sample t-interval. freedom can be found using a table or
[Skill 3.D]
computer-generated output.

UNC-4.Q.2

The standard error for a sample mean is

given by SE = s  , where s is the sample
n

standard deviation.

UNC-4.Q.3

For a one-sample t-interval for a mean, the
margin of error is the critical value (t *) times the

standard error (SE), which equals t *  s .
 n 


UNC-4.R UNC-4.R.1

Calculate an appropriate The point estimate for a population mean is the
confidence interval for a
population mean, including sample mean, x .
the mean difference between
values in matched pairs. UNC-4.R.2
[Skill 3.D]
For the population mean for one sample with
unknown population standard deviation, the
confidence interval is x  t * s .

n

CLARIFYING STATEMENT:

Formulas for interval estimates do not
appear explicitly on the AP Statistics Formula
Sheet provided with the AP Statistics Exam.
However, these formulas do not need to be
memorized, as they can be constructed
based on the general test statistic formula
and the relevant standard error formulas that
are provided on the formula sheet.

AP Statistics Course and Exam Description Course Framework V.1 | 160

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

TOPIC 7.3 SKILLS

Justifying a Claim Statistical
About a Population Argumentation
Mean Based on a
Confidence Interval 4.B
Interpret statistical
Required Course Content calculations and findings to
assign meaning or assess
a claim.

4.D
Justify a claim based on a
confidence interval.

4.A
Make an appropriate claim
or draw an appropriate
conclusion.

ENDURING UNDERSTANDING ILLUSTRATIVE EXAMPLE

UNC-4 UNC-4.S.3:

An interval of values should be used to estimate parameters, in order to account For interpreting a 96%
for uncertainty. confidence interval for
mean foot length for all
LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE footprints found in a cave
based on a particular
UNC-4.S UNC-4.S.1 randomly selected sample
of footprints in the cave:
Interpret a confidence A confidence interval for a population mean
interval for a population either contains the population mean or it does “We are 96% confident
mean, including the mean not, because each interval is based on data that the mean foot length
difference between values in from a random sample, which varies from for all footprints found in
matched pairs. [Skill 4.B] sample to sample. the cave falls within the
confidence interval” (based
UNC-4.S.2 on 2000 FRQ 2).

We are C% confident that the confidence
interval for a population mean captures the
population mean.

UNC-4.S.3

An interpretation of a confidence interval for
a population mean includes a reference to the
sample taken and details about the population
it represents.

UNC-4.T UNC-4.T.1

Justify a claim based on a A confidence interval for a population mean
confidence interval for a provides an interval of values that may provide
population mean, including sufficient evidence to support a particular
the mean difference between claim in context.
values in matched pairs.
[Skill 4.D]

AP Statistics Course and Exam Description continued on next page

Course Framework V.1 | 161

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

UNC-4.U UNC-4.U.1

Identify the relationships When all other things remain the same, the width
between sample size, width of a confidence interval for a population mean
of a confidence interval, tends to decrease as the sample size increases.
confidence level, and margin
of error for a population mean. UNC-4.U.2
[Skill 4.A]
pFororpaosritniognlealmtoean1, the width of the interval is
.
n

UNC-4.U.3

For a given sample, the width of the confidence
interval for a population mean increases as the
confidence level increases.

AP Statistics Course and Exam Description Course Framework V.1 | 162

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

TOPIC 7.4 SKILLS

Setting Up a Test for Selecting Statistical
a Population Mean Methods

Required Course Content 1.E
Identify an appropriate
ENDURING UNDERSTANDING inference method for
significance tests.
VAR-7 1.F
Identify null and alternative
The t-distribution may be used to model variation. hypotheses.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE Statistical
Argumentation
VAR-7.B VAR-7.B.1
4.C
Identify an appropriate The appropriate test for a population mean Verify that inference
testing method for a procedures apply in a
population mean with with unknown  is a one-sample t-test for a given situation.

unknown σ , including the population mean. AVAILABLE RESOURCE
§§ Classroom Resource >
mean difference between VAR-7.B.2 Inference
values in matched pairs.
[Skill 1.E] Matched pairs can be thought of as one sample
of pairs. Once differences between pairs of
values are found, inference for significance
testing proceeds as for a population mean.

VAR-7.C VAR-7.C.1

Identify the null and The null hypothesis for a one-sample t-test
alternative hypotheses for for a population mean is H0 : μ = μ0 , where
a population mean with μ0 is the hypothesized value. Depending upon

unknown σ , including the the situation, the alternative hypothesis is

mean difference between Ha : μ < μ0, or Ha :   0, or Ha :   0 .
values in matched pairs.
[Skill 1.F] VAR-7.C.2

When finding the mean difference, μd , between

values in a matched pair, it is important to
define the order of subtraction.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 163

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-7.D VAR-7.D.1

Verify the conditions for In order to make statistical inferences when
the test for a population testing a population mean, we must check
mean, including the mean for independence and that the sampling
difference between values in distribution is approximately normal:
matched pairs. [Skill 4.C]
a. To check for independence:

i. Data should be collected using a random
sample or a randomized experiment.

ii. When sampling without replacement,

check that n ≤ 10%N.
b. To check that the sampling distribution of x

is approximately normal (shape):

i. If the observed distribution is skewed,

n should be greater than 30.

ii. If the sample size is less than 30, the
distribution of the sample data should be
free from strong skewness and outliers.

AP Statistics Course and Exam Description Course Framework V.1 | 164

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

TOPIC 7.5 SKILLS

Carrying Out a Test Using Probability
for a Population Mean and Simulation

Required Course Content 3.E
Calculate a test statistic
ENDURING UNDERSTANDING
and find a p-value, provided
VAR-7
conditions for inference
The t-distribution may be used to model variation. are met.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE Statistical
Argumentation
VAR-7.E VAR-7.E.1
4.B
Calculate an appropriate For a single quantitative variable when random Interpret statistical
test statistic for a population calculations and findings to
mean, including the mean assign meaning or assess
difference between values in a claim.
matched pairs. [Skill 3.E] 4.E
Justify a claim using
a decision based on
significance tests.

AVAILABLE RESOURCE
§§ Classroom Resource >
Inference

sampling with replacement from a population

that can be modeled with a normal distribution

with mean  and standard deviation  , the
x −μ
sampling distribution of t = s has a

n
t-distribution with n – 1 degrees of freedom.

CLARIFYING STATEMENT:

The formulas for test statistics do not appear
explicitly on the AP Statistics Formula
Sheet provided with the AP Statistics Exam.
However, these formulas do not need to
be memorized, as they can be constructed
based on the general test statistic formula
and the relevant standard error formulas that
are provided on the formula sheet.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 165

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

ENDURING UNDERSTANDING

DAT-3

Significance testing allows us to make decisions about hypotheses within a
particular context.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

DAT-3.E DAT-3.E.1

Interpret the p-value of An interpretation of the p-value of a

a significance test for a significance test for a population mean should
population mean, including
the mean difference between recognize that the p-value is computed by
values in matched pairs.
[Skill 4.B] assuming that the null hypothesis is true,
i.e., by assuming that the true population mean
is equal to the particular value stated in the
null hypothesis.

DAT-3.F DAT-3.F.1

Justify a claim about the A formal decision explicitly compares
population based on the
results of a significance test the p-value to the significance  . If the
for a population mean. p-value ≤  , then reject the null hypothesis,
[Skill 4.E] H0 : μ = μ0. If the p-value >  , then fail to

reject the null hypothesis.

DAT-3.F.2

The results of a significance test for a
population mean can serve as the statistical
reasoning to support the answer to a
research question about the population
that was sampled.

AP Statistics Course and Exam Description Course Framework V.1 | 166

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

TOPIC 7.6 SKILLS

Confidence Intervals Selecting Statistical
for the Difference Methods
of Two Means
1.D
Required Course Content Identify an appropriate
inference method for
ENDURING UNDERSTANDING confidence intervals.

UNC-4 Statistical
Argumentation
An interval of values should be used to estimate parameters, in order to account
for uncertainty. 4.C
Verify that inference
procedures apply in a
given situation.

Using Probability
and Simulation

3.D
Construct a confidence
interval, provided
conditions for inference
are met.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

Identify an appropriate UNC-4.V.1
confidence interval
procedure for a difference of Consider a simple random sample from
two population means.
[Skill 1.D] population 1 of size n1 , mean μ1 , and standard

deviation σ 1 and a second simple random
n2 , mean μ2 ,
sample from population 2 of size
 . If the distributions
and standard deviation σ 2
of populations 1 and 2 are normal or if both n1
and n2 are greater than 30, then the sampling
distribution of the difference of means, x 1 − x 2
is also normal. The mean for the sampling

distribution of x 1 − x 2 is μ1 − μ2. The standard

deviation of x 1 − x 2 is (σ1)2 + (σ 2 )2 .
n1 n2

UNC-4.V.2

The appropriate confidence interval

procedure for one quantitative variable for

two independent samples is a two-sample

t-interval for a difference between

population means.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 167

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

UNC-4.W UNC-4.W.1

Verify the conditions to In order to calculate confidence intervals to
calculate confidence estimate a difference of population means, we
intervals for the difference of must check for independence and that the
two population means. sampling distribution is approximately normal:
[Skill 4.C]
a. To check for independence:
UNC-4.X
i. Data should be collected using two
Determine the margin of independent, random samples or a
error for the difference of two randomized experiment.
population means. [Skill 3.D]
ii. When sampling without replacement,
UNC-4.Y
check that n1 ≤ 10%N1 and n2 ≤ 10%N2.
Calculate an appropriate
confidence interval b. To check that the sampling distribution
for a difference of two
population means. [Skill 3.D] of (x1 − x2 ) should be approximately

normal (shape):

i. If the observed distributions are skewed,

both n1 and n2 should be greater than 30.

UNC-4.X.1

For the difference of two sample means,
the margin of error is the critical value (t *) times
the standard error (SE) of the difference of

two means.

UNC-4.X.2

The standard error for the difference in

two sample means with sample standard

deviations, s1 and s2, is (s1)2 + (s2 )2 .
n1 n2

UNC-4.Y.1

The point estimate for the difference of
two population means is the difference in

sample means, x1 − x2.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 168

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

UNC-4.Y UNC-4.Y.2

Calculate an appropriate For a difference of two population means
confidence interval
for a difference of two where the population standard deviations
population means. [Skill 3.D]
are not known, the confidence interval is

 x1  x2  t* s12  s22 where ±t * are the critical
n1 n2
values for the central C% of a t-distribution

with appropriate degrees of freedom that can

be found using technology.

CLARIFYING STATEMENT:

Formulas for interval estimates do not
appear explicitly on the AP Statistics Formula
Sheet provided with the AP Statistics Exam.
However, these formulas do not need to be
memorized, as they can be constructed
based on the general test statistic formula
and the relevant standard error formulas that
are provided on the formula sheet.

AP Statistics Course and Exam Description Course Framework V.1 | 169

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

SKILLS TOPIC 7.7

Statistical Justifying a Claim
Argumentation About the Difference of
Two Means Based on
4.B a Confidence Interval
Interpret statistical
calculations and findings to Required Course Content
assign meaning or assess
a claim.

4.D
Justify a claim based on a
confidence interval.

4.A
Make an appropriate
claim or draw an
appropriate conclusion.

ILLUSTRATIVE EXAMPLE ENDURING UNDERSTANDING

UNC-4.Z.2: UNC-4

For interpreting a An interval of values should be used to estimate parameters, in order to account
confidence interval for a for uncertainty.
difference between mean
response times for two LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE
fire stations (northern –
southern): “Based on these UNC-4.Z UNC-4.Z.1
samples, one can be 95
percent confident that the Interpret a confidence In repeated random sampling with the same
difference in the population interval for a difference of sample size, approximately C% of confidence
mean response times population means. [Skill 4.B] intervals created will capture the difference of
(northern – southern) is population means.
between −2.37 minutes and
0.37 minutes” (2009 FRQ 4). UNC-4.Z.2

An interpretation for a confidence interval for
the difference of two population means should
include a reference to the samples taken and
details about the populations they represent.

UNC-4.AA UNC-4.AA.1

Justify a claim based A confidence interval for a difference of
on a confidence interval population means provides an interval of
for a difference of values that may provide sufficient evidence to
population means. [Skill 4.D] support a particular claim in context.

UNC-4.AB UNC-4.AB.1

Identify the effects of When all other things remain the same,
sample size on the width of the width of the confidence interval for the
a confidence interval for the difference of two means tends to decrease as
difference of two means. the sample sizes increase.
[Skill 4.A]

AP Statistics Course and Exam Description Course Framework V.1 | 170

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

TOPIC 7.8 SKILLS

Setting Up a Test Selecting Statistical
for the Difference of Methods
Two Population Means
1.E
Required Course Content Identify an appropriate
inference method for
significance tests.
1.F
Identify null and alternative
hypotheses.

Statistical
Argumentation

4.C
Verify that inference
procedures apply in a
given situation.

ENDURING UNDERSTANDING

VAR-7

The t-distribution may be used to model variation.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-7.F VAR-7.F.1

Identify an appropriate For a quantitative variable, the appropriate
selection of a testing test for a difference of two population means
method for a difference
of two population means. is a two-sample t-test for a difference of
[Skill 1.E]
two population means.

VAR-7.G VAR-7.G.1

Identify the null and The null hypothesis for a two-sample t-test
alternative hypotheses
for a difference of two for a difference of two population means,
population means. [Skill 1.F]
HoTμrh1aeHa:naμadl1t:eμμ>r12nμ−,a2its,μivo:2Her>Hh0 0y:ap,μ:oo1μrt−1hHe<μsa2μi:s=2μ,i0s1o,−rHoHμar 2aH: μ≠:0μ10:1−μ,≠oμ1 rμ2=<2μ. 02 .,

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 171

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-7.H VAR-7.H.1

Verify the conditions for In order to make statistical inferences when
the significance test for testing a difference between population
the difference of two means, we must check for independence
population means. [Skill 4.C] and that the sampling distribution is
approximately normal:

a. Individual observations should be
independent:

i. Data should be collected using simple
random samples or a randomized
experiment.

ii. When sampling without replacement,

check that n1 ≤ 10%N1 and n2 ≤ 10%N2 .
b. The sampling distribution of x1 − x2 should

be approximately normal (shape).

i. If the observed distribution is skewed,

both n1 and n2 should be greater than 30.

ii. If the sample size is less than 30, the
distribution of the sample data should
be free from strong skewness and
outliers. This should be checked for
BOTH samples.

AP Statistics Course and Exam Description Course Framework V.1 | 172

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

TOPIC 7.9 SKILLS

Carrying Out a Test Using Probability
for the Difference of and Simulation
Two Population Means
3.E
Required Course Content Calculate a test statistic

ENDURING UNDERSTANDING and find a p-value, provided

VAR-7 conditions for inference
are met.
The t-distribution may be used to model variation.
Statistical
LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE Argumentation

VAR-7.I VAR-7.I.1 4.B
Interpret statistical
Calculate an appropriate test For a single quantitative variable, data collected calculations and findings to
statistic for a difference of assign meaning or assess
two means. [Skill 3.E] using independent random samples or a a claim.

randomized experiment from two populations, 4.E
Justify a claim using
each of which can be modeled with a normal a decision based on
significance tests.
tdi=st(rxib1u−tixo2ns)12,−t+h(eμs22s1 a−mμp2l)inisg distribution of
an approximate AVAILABLE RESOURCE
§§ Classroom Resource >
n1 n2 Inference

t-distribution with degrees of freedom that can ILLUSTRATIVE EXAMPLE
VAR-7.I.1:
be found using technology. The degrees of In a study comparing
1 mean recovery times for
freedom fall between the smaller of n1 − and two surgical procedures
to repair a torn anterior
n2 −1 and n1 + n2 − 2. cruciate ligament (ACL),
the group receiving one
CLARIFYING STATEMENT: procedure had a sample
size of 110, while the
The formulas for test statistics do not group receiving the other
appear explicitly on the AP Statistics Formula procedure had a sample
Sheet provided with the AP Statistics Exam. size of 100. The degrees of
However, these formulas do not need to be freedom fall between 100
memorized, as they can be constructed (the smaller of 110 and 100)
based on the general test statistic formula and 208 (110 + 100 − 2).
and the standard error formulas for each of The degrees of freedom
the relevant test statistics that are provided may be determined using
on the formula sheet. technology. If the test
statistic for this study is

t ≈ 7.13, then the p-value

is the area greater than

7.13 for a t-distribution with
df = 207.18 (2018 FRQ 4).

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 173

Return to Table of Contents

© 2020 College Board

7UNIT Inference for Quantitative Data: Means

ENDURING UNDERSTANDING

DAT-3

Significance testing allows us to make decisions about hypotheses within a
particular context.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

DAT-3.G DAT-3.G.1

Interpret the p-value An interpretation of the p-value of a

of a significance test significance test for a two-sample difference
for a difference of of population means should recognize that the
population means. [Skill 4.B]
p-value is computed by assuming that the null

hypothesis is true, i.e., by assuming that the
true population means are equal to each other.

DAT-3.H DAT-3.H.1

Justify a claim about the A formal decision explicitly compares the
population based on the p-value to the significance  . If the p-value ≤  ,
results of a significance Infutlhl ehypp-voathluees>is, H, t0h:eμn1 − μ2 = 0,
test for a difference of two then reject the
population means in context. fail to
[Skill 4.E] or H 0 : μ1 = μ2.

reject the null hypothesis.

DAT-3.H.2

The results of a significance test for a
two-sample test for a difference between
two population means can serve as the
statistical reasoning to support the answer
to a research question about the populations
that were sampled.

AP Statistics Course and Exam Description Course Framework V.1 | 174

Return to Table of Contents

© 2020 College Board

Inference for Quantitative Data: Means 7UNIT

TOPIC 7.10

Skills Focus: Selecting,
Implementing, and
Communicating
Inference Procedures

Required Course Content

This topic is intended to focus on the skill of selecting an appropriate inference
procedure, now that students have a range of options. Students should be given
opportunities to practice when and how to apply all learning objectives relating to
inference involving proportions or means.

AP Statistics Course and Exam Description Course Framework V.1 | 175

Return to Table of Contents

© 2020 College Board

THIS PAGE IS INTENTIONALLY LEFT BLANK.

AP STATISTICS

UNIT 8

Inference for
Categorical
Data:
Chi-Square

2–5%

AP EXAM WEIGHTING

~10–11

CLASS PERIODS

AP Statistics Course and Exam Description Course Framework V.1 | 177

Return to Table of Contents

© 2020 College Board

Remember to go to AP Classroom
to assign students the online
Personal Progress Check for
this unit.

Whether assigned as homework or
completed in class, the Personal
Progress Check provides each
student with immediate feedback
related to this unit’s topics and skills.

Personal Progress Check 8
Multiple-choice: ~30 questions
Free-response: 2 questions

§§ Inference

§§ Inference and Exploring Data/
Collecting Data

AP Statistics Course and Exam Description Course Framework V.1 | 178

Return to Table of Contents

© 2020 College Board

8UNIT 2–5% ~10–11 CLASS PERIODS
  AP EXAM WEIGHTING

Inference for
Categorical Data:
Chi-Square

Developing Understanding

BIG IDEA 1 Unit 6 introduced inference for proportions of categorical data. Unit 8 introduces chi-
Variation and square tests, which can be used when there are two or more categories. Students need to
Distribution  VAR understand how to select from the following tests: the chi-square test for goodness of fit (for
a distribution of proportions of one categorical variable in a population), the chi-square test
§§ How does increasing for independence (for associations between categorical variables within a single population),
the degrees of freedom or the chi-square test for homogeneity (for comparing distributions of a categorical variable
influence the shape across populations or treatments). To integrate conceptual understanding, teachers can make
of the chi-square connections between frequency tables, conditional probability, and calculating expected
distribution? counts. The chi-square statistic is introduced to measure the distance between observed and
expected counts relative to expected counts.
BIG IDEA 3
Data-Based Building Course Skills Preparing for the AP Exam
Predictions, Decisions,
and Conclusions  DAT 1.E 3.E 4.C 4.E When writing hypotheses, students should
refer to the population, using language
§§ Why is it inappropriate to In Unit 8, students should continue applying from the question. For example, “The
use statistical inference the same problem-solving structure to null hypothesis is that the age group at
to justify a claim that chi-square significance testing: State the diagnosis and gender are independent
there is no association hypotheses in words, explicitly identify (i.e., they are not associated) for the
between variables? the correct procedure, verify conditions, population of people currently being
treated for schizophrenia” (see Scoring
calculate the test statistic and the p-value, Guidelines for 2017 FRQ 5). As always,
students should name the test and provide
and then draw a conclusion in context that evidence verifying appropriate conditions.
For chi-square tests, the conditions are
is directly linked to the p-value. Students (1) random selection or randomized
experiment and (2) large counts. Students
should have opportunities to practice should be sure to say that all expected
the distinctive elements for each type of counts (rather than actual counts) are at
chi-square test, such as analysis of expected least 5. Students need to clearly present
counts, degrees of freedom, verbally stated calculations and state the conclusion in
hypotheses, and two-way tables.
context with linkage to p-values. Students
When the p-value is large, drawing an
should avoid tacitly accepting the null
appropriate conclusion is challenging for
students. Saying there is “no association” hypothesis. If the p-value is greater
between two variables is equivalent to
incorrectly “accepting the null hypothesis.” than conventional significance levels,
Instead, teachers can teach students to use the correct conclusion of a chi-square
nondeterministic language in their conclusions, test for independence would be that
that is, “The data do not provide strong enough there is insufficient evidence that there
evidence to conclude that the variables are is an association.
associated.” Students should have frequent
opportunities to practice writing, with detailed
feedback to help them improve.

AP Statistics Course and Exam Description Course Framework V.1 | 179

Return to Table of Contents

© 2020 College Board

8UNIT Inference for Categorical Data: Chi-Square

UNIT AT A GLANCE

Enduring Topic Skills Class Periods
Understanding ~10–11 CLASS PERIODS

VAR-1 8.1 I ntroducing Statistics: Are 1.A Identify the question to be answered or problem
My Results Unexpected?
to be solved (not assessed).

8.2 S etting Up a Chi-Square 3.C Describe probability distributions.
Goodness of Fit Test
1.F Identify null and alternative hypotheses.
VAR-8
1.E Identify an appropriate inference method for
significance tests.

3.A Determine relative frequencies, proportions,
or probabilities using simulation or calculations.

4.C Verify that inference procedures apply in a
given situation.

VAR-8, DAT-3 8.3 C arrying Out a Chi-Square 3.E Calculate a test statistic and find a p-value,
Test for Goodness of Fit
provided conditions for inference are met.

4.B Interpret statistical calculations and findings to
assign meaning or assess a claim.

4.E Justify a claim using a decision based on
significance tests.

8.4 E xpected Counts in 3.A Determine relative frequencies, proportions,
Two-Way Tables or probabilities using simulation or calculations.

VAR-8 8.5 S etting Up a Chi-Square 1.F Identify null and alternative hypotheses.
Test for Homogeneity
or Independence 1.E Identify an appropriate inference method for
significance tests.

4.C Verify that inference procedures apply in a
given situation.

VAR-8, DAT-3 8.6 C arrying Out a Chi-Square 3.E Calculate a test statistic and find a p-value,
Test for Homogeneity
or Independence provided conditions for inference are met.

4.B Interpret statistical calculations and findings to
assign meaning or assess a claim.

4.E Justify a claim using a decision based on
significance tests.

8.7 S kills Focus: Selecting an N/A
Appropriate Inference
Procedure for
Categorical Data

Go to AP Classroom to assign the Personal Progress Check for Unit 8.
Review the results in class to identify and address any student misunderstandings.

AP Statistics Course and Exam Description Course Framework V.1 | 180

Return to Table of Contents

© 2020 College Board

Inference for Categorical Data: Chi-Square 8UNIT

SAMPLE INSTRUCTIONAL ACTIVITIES

The sample activities on this page are optional and are offered to provide possible ways to
incorporate various instructional approaches into the classroom. They were developed in
partnership with teachers from the AP community to share ways that they approach teaching
some of the topics in this unit. Please refer to the Instructional Approaches section beginning
on p. 207 for more examples of activities and strategies.

Activity Topic Sample Activity
1 8.1
Simulation
2 8.5 Prepare several bags with an identical mix of at least 250 chips or beads of
3 8.7 three colors in different proportions (e.g., red = 0.5, white = 0.3, blue = 0.2). Have
each student take a random sample of 25 chips or beads from the bag, calculate

 (Observed count − Expected count)2  , and plot their value on a class dotplot.
Expected count

Use this graph to introduce the chi-square distribution with df = 2.

Discussion Groups
Give each group of three to four students an example of a chi-square test involving a
two-way table. Have students work together to state appropriate hypotheses, describe a
Type 1 and Type 2 error in context, and give a possible consequence of each of those errors.

Graphic Organizer
Have students work in teams of two to three to develop a chart that summarizes the three
types of chi-square tests, including when each is appropriate, as well as the hypotheses,
conditions, and degrees of freedom.

AP Statistics Course and Exam Description Course Framework V.1 | 181

Return to Table of Contents

© 2020 College Board

8UNIT Inference for Categorical Data: Chi-Square

SKILL TOPIC 8.1

Selecting Statistical Introducing Statistics:
Methods Are My Results
Unexpected?
1.A
Identify the question to be
answered or problem to
be solved.

Required Course Content

ENDURING UNDERSTANDING

VAR-1

Given that variation may be random or not, conclusions are uncertain.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-1.J VAR-1.J.1

Identify questions suggested Variation between what we find and what we
by variation between expect to find may be random or not.
observed and expected
counts in categorical data.
[Skill 1.A]

AP Statistics Course and Exam Description Course Framework V.1 | 182

Return to Table of Contents

© 2020 College Board

Inference for Categorical Data: Chi-Square 8UNIT

TOPIC 8.2 SKILLS

Setting Up a Chi-Square Using Probability
Goodness of Fit Test and Simulation

Required Course Content 3.C
Describe probability
ENDURING UNDERSTANDING distributions.

VAR-8 Selecting Statistical
Methods
The chi-square distribution may be used to model variation.
1.F
LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE Identify null and alternative
hypotheses.
VAR-8.A VAR-8.A.1 1.E
Identify an appropriate
Describe chi-square Expected counts of categorical data are counts inference method for
distributions. [Skill 3.C] consistent with the null hypothesis. In general, significance tests.
an expected count is a sample size times
a probability. Using Probability
and Simulation
VAR-8.A.2
3.A
The chi-square statistic measures the distance Determine relative
between observed and expected counts frequencies, proportions,
relative to expected counts. or probabilities using
simulation or calculations.
VAR-8.A.3
Statistical
Chi-square distributions have positive values Argumentation
and are skewed right. Within a family of density
curves, the skew becomes less pronounced 4.C
with increasing degrees of freedom. Verify that inference
procedures apply in a
given situation.

AVAILABLE RESOURCE
§§ Classroom Resource >
Inference

VAR-8.B VAR-8.B.1

Identify the null and For a chi-square goodness-of-fit test, the null
alternative hypotheses hypothesis specifies null proportions for each
in a test for a distribution category, and the alternative hypothesis is
of proportions in a set of that at least one of these proportions is not as
categorical data. [Skill 1.F] specified in the null hypothesis.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 183

Return to Table of Contents

© 2020 College Board

8UNIT Inference for Categorical Data: Chi-Square

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-8.C VAR-8.C.1

Identify an appropriate When considering a distribution of proportions
testing method for a for one categorical variable, the appropriate
distribution of proportions test is the chi-square test for goodness of fit.
in a set of categorical data.
[Skill 1.E] VAR-8.D.1

VAR-8.D Expected counts for a chi-square
goodness-of-fit test are (sample size)
Calculate expected counts (null proportion).
for the chi-square test for
goodness of fit. [Skill 3.A] VAR-8.E.1

VAR-8.E In order to make statistical inferences for a
chi-square test for goodness of fit we must
Verify the conditions for check the following:
making statistical inferences a. To check for independence:
when testing goodness of fit
for a chi-square distribution. i. Data should be collected using a random
[Skill 4.C] sample or randomized experiment.

ii. When sampling without replacement,

check that n ≤ 10%N .

b. The chi-square test for goodness of
fit becomes more accurate with more
observations, so large counts should be
used (shape).
i. A conservative check for large counts
is that all expected counts should be
greater than 5.

AP Statistics Course and Exam Description Course Framework V.1 | 184

Return to Table of Contents

© 2020 College Board

Inference for Categorical Data: Chi-Square 8UNIT

TOPIC 8.3 SKILLS

Carrying Out a Using Probability
Chi-Square Test for and Simulation
Goodness of Fit
3.E
Required Course Content Calculate a test statistic

ENDURING UNDERSTANDING and find a p-value, provided

VAR-8 conditions for inference
are met.
The chi-square distribution may be used to model variation.
Statistical
LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE Argumentation

VAR-8.F VAR-8.F.1 4.B
Interpret statistical
Calculate the appropriate The test statistic for the chi-square calculations and findings to
statistic for the chi-square test for goodness of fit is assign meaning or assess
test for goodness of fit. a claim.
[Skill 3.E]  2   (Observed count  Expected count)2 , 4.E
Expected count Justify a claim using
a decision based on
with significance tests.

degrees of freedom = number of categories −1. AVAILABLE RESOURCE
§§ Classroom Resource >
VAR-8.F.2 Inference

The distribution of the test statistic assuming
the null hypothesis is true (null distribution) can
be either a randomization distribution or, when
a probability model is assumed to be true, a
theoretical distribution (chi-square).

VAR-8.G VAR-8.G.1

Determine the p-value for The p-value for a chi-square test for goodness

chi-square test for goodness of fit for a number of degrees of freedom is
of fit significance test. found using the appropriate table or computer
[Skill 3.E] generated output.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 185

Return to Table of Contents

© 2020 College Board

8UNIT Inference for Categorical Data: Chi-Square

ENDURING UNDERSTANDING

DAT-3

Significance testing allows us to make decisions about hypotheses within a
particular context.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

DAT-3.I DAT-3.I.1

Interpret the p-value for the An interpretation of the p-value for the

chi-square test for goodness chi-square test for goodness of fit is the
of fit. [Skill 4.B] probability, given the null hypothesis and
probability model are true, of obtaining a
test statistic as, or more, extreme than the
observed value.

DAT-3.J DAT-3.J.1

Justify a claim about the A decision to either reject or fail to reject the
population based on the null hypothesis is based on comparison of the
results of a chi-square test
for goodness of fit. [Skill 4.E] p-value to the significance level, .

DAT-3.J.2

The results of a chi-square test for goodness
of fit can serve as the statistical reasoning to
support the answer to a research question
about the population that was sampled.

AP Statistics Course and Exam Description Course Framework V.1 | 186

Return to Table of Contents

© 2020 College Board

Inference for Categorical Data: Chi-Square 8UNIT

TOPIC 8.4 SKILL

Expected Counts in Using Probability
Two-Way Tables and Simulation

3.A
Determine relative
frequencies, proportions,
or probabilities using
simulation or calculations.

Required Course Content

ENDURING UNDERSTANDING

VAR-8

The chi-square distribution may be used to model variation.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-8.H VAR-8.H.1

Calculate expected counts The expected count in a particular cell of
for two-way tables of a two-way table of categorical data can be
categorical data. [Skill 3.A] calculated using the formula:

expected count = (row total)(column total).
table total

AP Statistics Course and Exam Description Course Framework V.1 | 187

Return to Table of Contents

© 2020 College Board

8UNIT Inference for Categorical Data: Chi-Square

SKILLS TOPIC 8.5

Selecting Statistical Setting Up a
Methods Chi-Square Test
for Homogeneity
1.F or Independence
Identify null and alternative
hypotheses. Required Course Content
1.E
Identify an appropriate
inference method for
significance tests.

Statistical
Argumentation

4.C
Verify that inference
procedures apply in a
given situation.

AVAILABLE RESOURCE ENDURING UNDERSTANDING
§§ Classroom Resource >
Inference VAR-8

The chi-square distribution may be used to model variation.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-8.I VAR-8.I.1

Identify the null and The appropriate hypotheses for a chi-square
alternative hypotheses test for homogeneity are:
for a chi-square test
for homogeneity H0: There is no difference in distributions of
or independence. [Skill 1.F]
a categorical variable across populations or
treatments.

Ha : There is a difference in distributions of

a categorical variable across populations
or treatments.

VAR-8.I.2

The appropriate hypotheses for a chi-square
test for independence are:

H0: There is no association between two

categorical variables in a given population or
the two categorical variables are independent.

Ha : Two categorical variables in a population

are associated or dependent.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 188

Return to Table of Contents

© 2020 College Board

Inference for Categorical Data: Chi-Square 8UNIT

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

VAR-8.J VAR-8.J.1

Identify an appropriate When comparing distributions to determine
testing method for whether proportions in each category for
comparing distributions categorical data collected from different
in two-way tables of populations are the same, the appropriate test
categorical data. [Skill 1.E] is the chi-square test for homogeneity.

VAR-8.K VAR-8.J.2

Verify the conditions for To determine whether row and column
making statistical inferences variables in a two-way table of categorical data
when testing a chi-square might be associated in the population from
distribution for independence which the data were sampled, the appropriate
or homogeneity. [Skill 4.C] test is the chi-square test for independence.

VAR-8.K.1

In order to make statistical inferences
for a chi-square test for two-way tables
(homogeneity or independence), we must
verify the following:

a. To check for independence:
i. For a test for independence: Data
should be collected using a simple
random sample.
ii. For a test for homogeneity: Data should
be collected using a stratified random
sample or randomized experiment.
iii. When sampling without replacement,

check that n ≤ 10%N.

b. The chi-square tests for independence and
homogeneity become more accurate with
more observations, so large counts should
be used (shape).
i. A conservative check for large counts
is that all expected counts should be
greater than 5.

AP Statistics Course and Exam Description Course Framework V.1 | 189

Return to Table of Contents

© 2020 College Board

8UNIT Inference for Categorical Data: Chi-Square

SKILLS TOPIC 8.6

Using Probability Carrying Out a
and Simulation Chi-Square Test
for Homogeneity
3.E or Independence
Calculate a test statistic
Required Course Content
and find a p-value, provided
ENDURING UNDERSTANDING
conditions for inference
are met. VAR-8

Statistical The chi-square distribution may be used to model variation.
Argumentation
LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE
4.B
Interpret statistical VAR-8.L VAR-8.L.1
calculations and findings to
assign meaning or assess Calculate the appropriate The appropriate test statistic for a
a claim. statistic for a chi-square chi-square test for homogeneity or
4.E test for homogeneity or independence is the chi-square statistic:
Justify a claim using independence. [Skill 3.E]
a decision based on  2   (Observed count  Expected count)2 ,
significance tests. Expected count

AVAILABLE RESOURCE with degrees of freedom equal to:
§§ Classroom Resource >
Inference (number of rows −1)(number of columns −1).

VAR-8.M VAR-8.M.1

Determine the p-value for The p-value for a chi-square test for

a chi-square significance independence or homogeneity for a number
test for independence or of degrees of freedom is found using the
homogeneity. [Skill 3.E] appropriate table or technology.

VAR-8.M.2

For a test of independence or homogeneity for

a two-way table, the p-value is the proportion

of values in a chi-square distribution with
appropriate degrees of freedom that are equal
to or larger than the test statistic.

continued on next page

AP Statistics Course and Exam Description Course Framework V.1 | 190

Return to Table of Contents

© 2020 College Board

Inference for Categorical Data: Chi-Square 8UNIT

ENDURING UNDERSTANDING

DAT-3

Significance testing allows us to make decisions about hypotheses within a
particular context.

LEARNING OBJECTIVE ESSENTIAL KNOWLEDGE

DAT-3.K DAT-3.K.1

Interpret the p-value An interpretation of the p-value for the

for the chi-square test chi-square test for homogeneity or
for homogeneity or independence is the probability, given the null
independence. [Skill 4.B] hypothesis and probability model are true, of
obtaining a test statistic as, or more, extreme
than the observed value.

DAT-3.L DAT-3.L.1

Justify a claim about the A decision to either reject or fail to reject
population based on the the null hypothesis for a chi-square test for
results of a chi-square homogeneity or independence is based on
test for homogeneity or
independence. [Skill 4.E] comparison of the p-value to the significance
level, .

DAT-3.L.2

The results of a chi-square test for
homogeneity or independence can serve as
the statistical reasoning to support the answer
to a research question about the population
that was sampled (independence) or the
populations that were sampled (homogeneity).

AP Statistics Course and Exam Description Course Framework V.1 | 191

Return to Table of Contents

© 2020 College Board

8UNIT Inference for Categorical Data: Chi-Square

TOPIC 8.7

Skills Focus: Selecting
an Appropriate Inference
Procedure for Categorical Data

AVAILABLE RESOURCE Required Course Content
§§ Classroom Resource >
Inference This topic is intended to focus on the skill of selecting an appropriate inference
procedure now that students have a range of options. Students should be given
opportunities to practice when and how to apply all learning objectives relating to
inference for categorical data.

AP Statistics Course and Exam Description Course Framework V.1 | 192

Return to Table of Contents

© 2020 College Board

AP STATISTICS

UNIT 69

Inference for
Quantitative
Data: Slopes

2–5%

AP EXAM WEIGHTING

~7–8

CLASS PERIODS

AP Statistics Course and Exam Description Course Framework V.1 | 193

Return to Table of Contents

© 2020 College Board


Click to View FlipBook Version