# AP Stats Multiple Choice

a. census

b. sample survey

c. observational study

d. experiment

e simulation

I. The population of interest is all U.S. teens

II. 8% is a statistic and not the actual percentage of all U.S. teens who would rank this movie as their favorite.

III. This sampling design should provide a reasonably accurate estimate of the actual percentage of all U.S. teens who would rank this movie as their favorite.

a. I only

b. II only

c. III only

d. I and II

e. I, II, and III

a. Yes, because the students were chosen at random

b. Yes, because each student is equally likely to be chosen

c. Yes, because they could have chosen any 250 students from throughout the district

d. No, because we can’t guarantee that there are students from each school in the sample

e. No, because we can’t guarantee that there are students from each grade in the sample

I. Let 0,1 represent making the first shot and 2,3 represent making the second shot…..8,9 represent making the fifth shot. Generate five random numbers 0-9 ignoring repeats

II. Let 0,1,2 represent missing a shot and 3,4….,9 represent making a shot. Generate five random numbers 0-9 and count how many numbers are in 3-9

III. Let 0,1,2 represent missing a shot and 3,4….,9 represent making a shot. Generate five random numbers 0-9 and count how many numbers are in 3-9, ignoring repeats

a. I only

b. II only

c. III only

d. II and III

e I, II, and III

a. randomized experiment

b. survey

c. prospective study

d. retrospective study

e. blocked experiment

a. voluntary response bias

b. nonresponse bias

c. response bias

d. undercoverage

e. none of the above

a. sampling

b. blocking

c. blinding

d. control

e. randomization

I. The evaluators do not know which treatment group the participants are in

II. The participants do not know which treatment group they are in

III. No one knows which treatment any of the participants are getting

a. I only

b. II only

c. III only

d. I and II

e. I, II, and III

a. blocking

b. control

c. randomization

d. replication

e. all are required in an experimental design

a. is completely randomized

b. has three factors (shampoo type, gender, whether hair is dyed)

c. has two factors (shampoo type and whether hair is dyed) blocked by gender

d. has two factors (gender and whether hair is dyed) blocked by shampoo type

e. has one factor (shampoo type), blocked by gender and whether hair is dyed

a. near -1.0

b. near 0

c. near +0.7

d. exactly +1.0

e. somewhat greater than 1.0

a. equal to

b. 0.36 SD above

c. 0.60 SD above

d. 0.90 SD above

e. 1.5 SD above

a. is 0.35

b. is 0.50

c. is 0.70

d. is 0.90

e. cannot be determined

a. 0.70

b. 7

c. 70

d. 700

e. 7000

I. Families tend to spend about 30% of their incomes in restaurants

II. In general, the higher the income, the more families spend in restaurants

III. The line of best fit passes through 30% of the (income, restaurant$) data points

a. I only

b. II only

c. III only

d. II and III only

e. I, II, and III

I. This model can correctly predict the profit for 72% of companies

II. On average, about 72% of a company’s profit results from advertising

III. On average, companies spend about 72% of their profits on advertising

a. None

b. I only

c. II only

d. III only

e. I and III

I. The correlation between pulse rate and weight is 0.33

II. If you lose 6 pounds your pulse rate will slow down 2 beats per minute

III. a positive residual means a person’s pulse rate is higher than he model predicts

a. none

b. I only

c. II only

d. III only

e. II and III only

a. class (freshman, sophomore, junior, senior)

b. whether the student is in AP classes

c. grade point average

d. whether the student has taken the SAT

e. none of these

a. family income

b. heights of singers in a co-ed choir

c. weights of adult male elephants

d. scores on an easy test

e. all of these

a. boxplot

b. timeplot

c. dotplot

d. pie chart

e. histogram

a. 84.3

b. 85.0

c. 85.7

d. none of these

e. cannot be determined

I. You got 75% on the test

II. You can’t really tell what this means without knowing the standard deviation

III. You can’t really tell what this means unless the class distribution is nearly Normal

a. none of these

b. I only

c. II only

d. III only

e. II and III

a. is 2.5 points above average for the class

b. is 2.5 standard deviations above average for the class

c. has a standard deviation of 2.5

d. has a sore that is 2.5 times the average for the class

e none of the above

a. satisfies the area principle

b. shows the shape of the distribution better than a dotplot

c. preserves the individual date values

d. a stem-and-leaf plot is for quantitative data, while a dotplot shows categorical data

e. none of these

I. the mean

II. the median

III. the standard deviation

a. I only

b. III only

c. I and II

d. I and III

e. I, II, and III

a. the correlation between a football player’s weight and the position he plays is 0.54

b. the correlation between the amount of fertilizer used and the yield of beans is 0.42

c. the correlation between a car’s length and its fuel efficiency is 0.71 miles per gallon

d. there is a high correlation (1.09) between height of a corn stalk and its age in weeks

e. There is a correlation of 0.63 between gender and political party

a. possible models not explore by the researcher

b. variation in the data that is explained by the model

c. the difference between observed responses and values predicted by the model

d. data collected from individuals that is not consistent with the rest of the group

e. none of these

I. Removal of an influential point changes the regression line

II. Data points that are outliers in the horizontal direction are more likely to be influential than points that are outliers in the vertical directions

III. Influential points have large residuals

a. I only

b. I and II

c. I and III

d. II and III

e. I, II, and III

I. Random scatter in the residuals indicates a model with high predictive power

II. If two variables are very strongly associated, then the correlation between them will be near +1.0 or -1.0

III. The higher the correlation between two variables the more likely the association is based in cause and effect

a. none

b. I only

c. II only

d. I and II only

e. I, II, and III

a. linear

b. exponential

c. logarithmic

d. power

e. quadratic

a. linear

b. exponential

c. logarithmic

d. power

e. quadratic

a. the correlation between X and Y is near +1.0

b. the scatterplot of Y vs X also shows a linear pattern

c. the residuals plot for regression of Y on X shows a curved pattern

d. large values of X are associated with large values of Y

e. accurate predictions can be made for Y even if extrapolation is involved

a. 5′

b. 11′

c. 19′

d. 83′

e. 93′

a. leverage

b. a lurking variable

c. extrapolation

d. regression

e. an outlier

a. 9%

b. 11%

c. 34%

d. 57%

e. 91%

a. 9%

b. 11%

c. 34%

d. 57%

e. 91%

a. 0.395

b. 0.444

c. 0.506

d. 0.569

e. 0.722

a. less than 50%, since “tails” is due to come up

b. 50%

c. greater than 50%, since it appears that we are in a streak of “heads”

d. it cannot be determined

a. 18.0%

b. 43.5%

c. 56.5%

d. 82.0%

e. 91.8%

a. 6.5%

b. 12.6%

c. 49.5%

d. 50.5%

e. 93.5%

a. the number of people we survey until we find someone who has taken Statistics

b. the number of people we survey until we find two people who have taken statistics

c. the number of people in a class of 25 who have taken Statistics

d.the number of aces in a five-card Poker hand

e. the number of sodas students drink per day

a. the number of people we survey until we find someone who has taken Statistics

b. the number of people we survey until we find two people who have taken statistics

c. the number of people in a class of 25 who have taken Statistics

d.the number of aces in a five-card Poker hand

e. the number of sodas students drink per day

a. 5

b. 100

c. 200

d. 995

e. 2000

a. 2 minutes

b. 4.5 minutes

c. 6 minutes

d. 20 minutes

e. it cannot be determined