53-App-Answers.Rmd


# Answers to end-of-chapter exercises {#Answers}

This Appendix contains answers to *most* (not all) exercises.
Some are fully worked, and some are only brief solutions.

* Answers to Chap. \@ref(Intro) (Introduction): Sect. \@ref(IntroAnswer).
* Answers to Chap. \@ref(RQs) (Research questions): Sect. \@ref(RQsAnswer).
* Answers to Chap. \@ref(ResearchDesign) (Research design): Sect. \@ref(ResearchDesignAnswer).
* Answers to Chap. \@ref(Ethics) (Ethics): Sect. \@ref(EthicsAnswer).
* Answers to Chap. \@ref(Sampling) (Sampling): Sect. \@ref(SamplingAnswer).
* Answers to Chap. \@ref(FactorsInfluenceY) (Factors that influence the response variable): Sect. \@ref(FactorsInfluenceYAnswer).
* Answers to Chap. \@ref(DesignExperiment) (Designing experiments): Sect. \@ref(DesigningExperimentsAnswer).
* Answers to Chap. \@ref(DesignObservational) (Designing observational studies): Sect. \@ref(DesigningObservationalAnswer).
* Answers to Chap. \@ref(Interpretation) (Interpretation): Sect. \@ref(InterpretationAnswer).
* Answers to Chap. \@ref(CollectingDataProcedures) (Collecting data): Sect. \@ref(CollectionAnswer).
* Answers to Chap. \@ref(DescribingVars) (Describing variables): Sect. \@ref(DescribeAnswer).
* Answers to Chap. \@ref(Graphs) (Graphs): Sect. \@ref(GraphsAnswer).
* Answers to Chap. \@ref(NumericalQuant) (Numerical summaries for quantitative data): Sect. \@ref(NumericalQuantAnswer).
* Answers to Chap. \@ref(NumericalQual) (Numerical summaries for qualitative data): Sect. \@ref(NumericalQualAnswer).
* Answers to Chap. \@ref(MakingDecisions) (Making decisions): Sect. \@ref(MakingDecisionsAnswer)
* Answers to Chap. \@ref(Probability) (Probability): Sect. \@ref(ProbabilityAnswer).
* Answers to Chap. \@ref(SamplingDistributions) (Sampling distributions): Sect. \@ref(SamplingDistributionsAnswer).
* Answers to Chap. \@ref(SamplingVariation) (Sampling variation): Sect. \@ref(SamplingVariationExercisesAnswer).
* Answers to Chap. \@ref(CIOneProportion) (CIs for one proportion): Sect. \@ref(CIOneProportionAnswer).
* Answers to Chap. \@ref(AboutCIs) (More about forming CIs): Sect. \@ref(AboutCIsAnswer).
* Answers to Chap. \@ref(OneMeanConfInterval) (CIs for one mean): Sect. \@ref(OneMeanConfIntervalAnswer).
* Answers to Chap. \@ref(PairedCI) (CIs for paired data): Sect. \@ref(PairedCIExercisesAnswer).
* Answers to Chap. \@ref(CITwoMeans) (CIs for two independent means): Sect. \@ref(MeansIndSamplesAnswer).
* Answers to Chap. \@ref(OddsRatiosCI)  (CIs for odds ratios): Sect. \@ref(OddsRatiosCIAnswer).
* Answers to Chap. \@ref(EstimatingSampleSize) (Sample size estimation): Sect. \@ref(EstimatingSampleSizeAnswer)
* Answers to Chap. \@ref(TestOneProportion) (Tests for one proportion): Sect. \@ref(TestOneProportionAnswer).
* Answers to Chap. \@ref(TestOneMean) (Tests for one mean): Sect. \@ref(TestOneMeanAnswer).
* Answers to Chap. \@ref(MoreAboutTests) (More about hypothesis tests): Sect. \@ref(MoreAboutTestsAnswer).
* Answers to Chap. \@ref(TestPairedMeans) (Tests for paired mean): Sect. \@ref(TestPairedMeansAnswer).
* Answers to Chap. \@ref(TestTwoMeans) (Tests for two independent mean): Sect. \@ref(TestTwoMeansAnswer).
* Answers to Chap. \@ref(TestsOddsRatio) (Tests for odds ratios): Sect. \@ref(TestsOddsRatioAnswer).
* Answers to Chap. \@ref(TwoQuant) (Relationships between two quantitative variables): Sect. \@ref(TwoQuantAnswer).
* Answers to Chap. \@ref(Correlation)  (Correlation): Sect. \@ref(CorrelationAnswer).
* Answers to Chap. \@ref(Regression) (Regression): Sect. \@ref(RegressionAnswer).
* Answers to Chap. \@ref(Reading) (Reading research): Sect. \@ref(ReadAnswer).
* Answers to Chap. \@ref(WritingResearch) (Writing research): Sect. \@ref(WriteAnswer).


## Answers: Introduction {#IntroAnswer}

Answers to exercises in Sect. \@ref(IntroExercises).

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsTypeTourniquet)**:
The RQ requires numerical information to be answered, such as the *average time* taken to apply the tourniquets.
This RQ would be answered using a **quant**itative RQ.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsTypeMangroves)**:
The RQ does not require numerical information to be answered.
This RQ would be answered using a **qual**itative RQ.
:::


## Answers: RQs {#RQsAnswer}

Answers to exercises in Sect. \@ref(RQsExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsBloodPressure)**:
**1.** P: Danish University students.
**2.** O: *Average* resting diastolic blood pressure.
**3.** C: between students who regularly drive to university and those who regularly ride their bicycles.
**4.** No intervention.
**5.** Relational.
**6.** *Conceptual*: What is meant by 'regularly'; 'university student' (on-campus and online? undergraduate *and* postgraduate? full-time and part-time?). *Operational*: how 'resting diastolic blood pressure' will be measured.
**7.** Resting diastolic blood pressure; whether they regularly drive to university or regularly ride their bicycles.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsNutrition)**:
**1.** Some elements are not well defined, but perhaps: P: Children aged under 3 in a Peruvian peri-urban community; O: proportion of children with diarrhoea; C: nutritional status; No intervention.
**2.** Hard to be sure; perhaps something like: 'In children aged under 3 in a Peruvian peri-urban community, is there a a relationship between diarrhoea status and nutritional status?'.
**3.** Relational.
**4.** How is 'diarrhoea status' measured? 
Likewise, how is 'nutritional status' measured? 
There are probably others.
**5.** Response: diarrhoea status; explanatory: nutritional status.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsOutcomeResponse)**:
Recall that the outcome is used to describe a *group* (the population), not the individuals.

**1.** The *percentage* of vehicles that crash.
**2.** The *average* jump height.
**3.** The *average* number of tomatoes per plant.
**4.** The *percentage* of people who own a car.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsWalkingSpeed)**:
**1.** What *individual* people are doing with their phones probably explains their walking speed: the *explanatory variable* is the way in which the mobile phone is being used.  
Notice that 'talking on the phone' and 'texting on the phone' are not *variables*.
They are particular values that the *variable* can take.
That is, 'what people are doing on their phone' is the variable, because it can vary: Sometimes people will be talking, sometimes texting, etc.
**2.** Waking speed probably depends on (or *responds to*) how *individual* people are using their phone: the *response variable* is the walking speed.
**3.** The *outcome* is how the response *variable* is summarised over a group of individuals.
The walking speeds from many individuals could be summarised numerically using the *average walking speed*, which would be the outcome.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsComparisonExplanatory)**:
Recall that the explanatory variable is what is actually measured on the individuals in the population.

**1.** The type of car fuel.
**2.** The type of coffee.
**3.** The dose of iron supplement.
**4.** The diet.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsComparisonVsPaired)**:
**1.** *Does* have a comparison (between a group of people in winter, and a different group of people in summer).
The outcome is 'the percentage of people wearing hats'.
**2.** *Does not* have a comparison. 
Two subsets of the population are not being compared: instead, each person is measured twice.
So an Outcome may be 'average *change* in cholesterol levels'.
**3.** *Does not* have a comparison. 
Two subsets of the population are not being compared: instead, each person gets two measurements. 
So an Outcome may be 'average *difference* between right- and left-leg balance times'.
**4.** *Does* have a comparison: The three subsets of the population are being compared: the three groups of tomato plants.
The Outcome is 'average yield' (which could be measured in kg/plant, tomatoes/plant, kg/hectare, etc).
:::


<!-- SEM 2 2018: Christopher WALTON; WILLIAM LUDOWYK; LOK RAJ BHATTA-->
<!-- ::: {.answer data-latex=""} -->
<!-- **Answer to Exercise \@ref(exr:ProjectRQ1)** -->

<!-- * What does 'is there a differentiation' actually mean? Do they simply mean 'is there a difference'? -->
<!-- * People who work 20 hours per week don't actually belong to either group. (This isn't necessarily a problem, but it is odd and looks like the students inadvertently left out people who work 20 hours per week.) -->

<!-- Perhaps try this: -->

<!-- > Among the USC students,  -->
<!-- > is the mean study time per week different -->
<!-- > for those who work more than 20 hours per week compared to those that work 20 hours or less? -->

<!-- ::: -->


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsAnimals)**:
The *unit of observation* is the *animal*; the animals, for example, are weighed.

The *unit of analysis* is the *pen*, as the food is allocated to the animals in the whole pen.
In addition, the animals in the same pen are not independent: they compete for the same space, food, resources, and would all have similar environments that they share.
:::


<!-- SEM 2 2018: Christopher WALTON; WILLIUAM LUDOWYK; LOK RAJ BHATTA--> 
::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ProjectRQ2)**:
The *population* surely is not 10 adults; that sounds like the sample.
It does not make clear how many fonts are being compared (or which fonts are being used).

Perhaps try this:
  
> Among Australian adults, is the average time taken to read a passage of text different when Arial font is used compared to Times Roman font?
:::


<!-- SEM 2 2018: Alex Bruce; Levi Crawley; Stuart Stevens--> 
::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ProjectRQ3)**:
The RQ is about comparing *groups*, so it should talk about the *average* lung capacity of males and females.
Perhaps:

> Of students that study at UniSC, Sippy Downs, do males have a larger *average* lung capacity than females? 
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsCommonCold)**:
Remember that the **outcome** describes a group ('the *average* cold duration') whereas the response **variable** is measured from an individual ('the cold duration for each individual').

The explanatory *variable* is measured on each individual: Whether they take a tablet or not.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RQsBlueGum)**:

The unit of observation is the leaf: the size of leaves can only be measured from individual leaves.
However, all the leaves on the same tree have 'lived their life together': They all get nutrition from the same trunk and root system, share genetics, and are located in the same place... so the unit of analysis is the tree.

The sample size is the number of units of analysis, so the sample size is 20.
:::


## Answers: Research designs {#ResearchDesignAnswer}

Answers to exercises in Sect. \@ref(ResearchDesignsExercises).

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignConcreteBeams)**:
The researchers could decide which beams go into Group A and into Group B.
Researchers could also allocate treatments to the groups: they could select what treatments is applied to each group of beams.
This is a *true experiment*.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignMatresses)**:
The researchers had no say in who was in hospital at the time: they could not allocate the patients to the two groups (overlay; mattress).
This is a *quasi-experiment*.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignPetsAndHealth)**:

1. *P*: Perhaps people in a suburb of the Sunshine Coast;  
   *O*: number of doctor's visits in the next six months;  
   *C*: between people owning a pet for those six months, and those who do not own a pet for those six months.
2. For an experiment, we would need to *intervene* to *give* subjects a pet, or *not* give them a pet.
3. For an observational study, we would *not* intervene: We would find the subjects who *already* owned a pet, or who did not *already* own a pet.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignDietsForWeightLoss)**:

1. *P*: A bit vague from this small extract: people of some kind;  
   *O*: the *average change* in body weight over two years;  
   *C*: Between the four diets;  
   *I*: The diets seems to be have been imposed.
2. Experimental: The diets have been *manipulated* and *imposed* by the researchers, with the intent of changing the outcome (the weight change).
3. Probably a true experiment.
4. The individuals: the diets are allocated to each individual.
5. The individuals: those from whom the weight change is taken.
6. The *change* in body weight over two years.
7. The type of diet.
:::


## Answers: Ethics {#EthicsAnswer}

Answers to exercises in Sect. \@ref(EthicsExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:EthicsCougars)**:
Answers will vary.
:::   


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:EthicsSideEffects)**:
Answers will vary.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:EthicsLying)**:
Answers will vary.
:::


## Answers: Sampling {#SamplingAnswer}

Answers to exercises in Sect. \@ref(SamplingExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingBooks)**:
A tricky thing here is that some books are not physically in the library, as they have been borrowed.

1. Simple random sample: 
  A list of all the books held by the UniSC library is needed. 
  This may be possible for a librarian (it may not be, and would be really huge), it certainly is not possible for a student or non-library staff member.
   In principle though, number each book, and randomly select a sample from that list.
2. Stratified: 
  Use locations (Sippy Downs; Fraser Coast; Caboolture; Gympie; Southbank; SCHI) as strata, and then a random sample of all the book in each locations.
3. Cluster:
  Consider each set of shelves as a cluster, and randomly select some shelves, and determine the number of pages in each book on the selected shelves.
4. Convenience: 
  Finding books in the libraries within reach and easily accessible and on the shelves,
5. Multi-stage: Consider taking a random of campuses, then a random sample of the sets of shelves in the selected libraries, then selecting a random shelf from each one, then a small number of random book from each shelf.
6. Multi-stage perhaps.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingApartments)**:
**1.** Multi-stage.
**2.** It's a bit like stratified... but not quite.
**3.** Convenience.
**4.** A combination of multi-stage and convenience.
**5.** The second last is poor, and the last is a slight improvement.
The second is bit odd but is probably OK. 
The first might be the best.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingShoppingCentre)**:
**1.** Convenience, but by approaching every 10th person they are trying to make it a little more representative... but they can do a lot better.
**2.** Convenience, but by approaching every 5th person and going every day for a week they are trying to make it a little more representative... but they can do a lot better.
**3.** Self-selecting.
**4.** Convenience. At least the researcher is trying to get a more representative sample, by going every day for two weeks, and at different times and locations each week, and approaching someone every 15 minutes.
**5.** The fourth is the best, but it is still far from 'random'. 
**6.** None.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingSchools)**:
A bit like *cluster sampling* (randomly taking a small sample from many groups, and taking everyone (or everything) in those selected groups)... but not every person in the selected schools would respond (they would decide if they responded).

A *combination* of *cluster* and *voluntary response* sampling.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingMalaria)**:
True; False; True.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingForest)**:
:::


## Answers: Overview of internal validity {#FactorsInfluenceYAnswer}

Answers to exercises in Sect. \@ref(FactorsInfluenceYExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:FactorsInfluenceYStudy1)**:
Presumably *all* are extraneous variables, as all are possibly related to the response variable (incidence of depression): That is why the researchers obtained this information. 
*None* can be lurking variables, as the researchers measure or observe all of them.

To be a confounding variable, the extraneous variable should be related to *both* the response variable (incidence of depression) *and* the explanatory variable (diet quality).
As a result, all of the extraneous variables could potentially be confounding variables. 
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:FactorsInfluenceYStudy2)**:
*Response* variable: something like 'risk of developing a cancer of the digestive system'.
*Explanatory* variable: 'whether or not the participants drank green tea at least three times a week'.

*Lurking* variable: 'health consciousness of the participants', because the researchers don't seem to have measured or observed this.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:FactorsInfluenceYStudy3)**:
Older children would probably be more likely to be smokers, and would be larger and older in general: age would be a confounding variable.
Age is easy to record, and usually *is* recorded in these types of studies, so probably *not* a lurking variable.
(The age, height and gender of each child *is* recorded.)
:::


## Answers: Designing experimental studies {#DesigningExperimentsAnswer}

Answers to exercises in Sect. \@ref(DesigningExperimentsExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DesignExpTrueFalse).**
Only Statement 5 is true.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DesignExpExtraneous).**
Lurking variables and confounding variables are special types of extraneous variables.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DesignExpWeightLoss).**
Statements 2, 3, 7 and 8 are true.
'Sex of the patient' and 'Initial weight of the patients' are probably possible confounding variables.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DesignExpParamedicsPills).**
**1.** A group receiving a pill that looks just like Treatment A and B, but has no effective ingredient.
**2.** Blinding the participants.
**3.** To ensure that participants did not change their behaviour because of the treatment they were receiving.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DesignExpFillBlanks).**
Random allocation; analysis; washout; control group; researchers; participants.

:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchTasteOfWater2)**:
Observer bias.
The researcher is directly contacting the subjects, so may unintentionally influence their responses.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignTasteOfWater3)**:
**1.** Randomly allocate the type of water to the subject (or the *order* in which the subjects taste-test each drink.)
**2.** The subjects do not know which type of water they are drinking.
**3.** The person providing the water and receiving the ratings does not know which type of water they are drinking.
**4.** Hard to find a control.
**5.** Any random sampling is good, if possible.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignSunscreen)**:
**1.** *Response*: The amount of sunscreen used; 
       *Explanatory*: The time spent on sunscreen application.
**2.** They were looking at potential confounding variables.
**3.** If the mean of both the response and explanatory variables was different for females and males, then the sex of the participant would be a *confounding* variable, and this would need to be factored into the analysis of the data.
**4.** The participants are blinded to what is happening in the study.
:::


## Answers: Designing observational studies {#DesigningObservationalAnswer}

Answers to exercises in Sect. \@ref(DesigningObservationalExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignObsTrueFalse).**
Only the second statement is true.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignObsPollen).**
Probably in case the size of the hive is a confounding variable.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignTasteOfWater)**:
**1.** Since this is an *observational study*, we *cannot* allocate students to receive bottled or tap water (because then the study would be an *experimental study*).
       In an *experiment* we could randomly allocate students to receive *either* bottled or tap water and have them rate the taste (or even randomly allocate students to receive bottled or tap water *first*, then swap to the other type of water, and each student would then provide *two* ratings).
**2.** The students would not be aware of which water they would be drinking.
**3.** Neither the students nor the researchers who give the students the water would know which type of water the students are drinking.
**4.** We can't really set up a control here. 
**5.** Any of the random sampling methods are possible, and are preferred. 
       In practice, perhaps use a convenience sample, but try to get a sample as representative as possible (Sect. \@ref(Representative-samples)).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignHawthorneEffect)**:
No. 
People can know they are being observed.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ResearchDesignSleep)**:
The descriptions indicates that patients probably knew they were involved, so the Hawthorne effect should be considered when interpreting the results.
:::


## Answers: Interpretation {#InterpretationAnswer}

Answers to exercises in Sect. \@ref(InterpretationExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ValidityLighting).**

External.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ValidityFarmManagement).**

External; internal.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:InterpretationExerciseTrueFalse}.**
True; false; false; false.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:InterpretationExerciseValidities}.**
Ecological; external; internal; confounding; sampling.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:InterpretationExerciseExternalValidity)**:
Population: 'UniSC students on-campus'.
External validity refers to whether the results apply to other members of *this* population, not to people outside this population (such as members of the general public).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:InterpretationExerciseParachutes)**:
**1.** P: Aircraft passengers aged 18 and over. 
       O: Unclear; something about 'composite of death or major traumatic injury'. 
       C: Between wearing a parachute and wearing a backpack. 
       I: Yes: Having participants wear the parachute or backpack.
**2.** Experimental: The researchers decide if the participants use a parachute or backpack.
**3.** *Explanatory*: 'whether or not a parachute is worn'. 
       *Response*: harder to understand; is it 'whether or not the participant dies or sustains a major injury'?
**4.** These results won't apply in the real world; not ecologically valid. 
       In the real world, parachutes are used at high altitude, for example.
**5.** The study is not very useful!
**6.** Speaking loosely: 
       That jumping from a small plane that is on the ground, parachutes are equally effective as backpacks in keeping people safe.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:InterpretationSleep)**:
Because the sample is not a random sample, the researchers are (rightly) noting that the results may not *generalise* to all hospitals.
Because the data was only collected at night, perhaps the study is not *ecologically valid*.
:::


## Answers: Data collection {#CollectionAnswer}

Answers to exercises in Sect. \@ref(CollectionExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CollectSurveyQuestions1)**: 
People aged 18 do not have a category.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CollectSurveyQuestions2)**: 
The second.
The first is *leading*: Should *concerned* cat owners...
The third is *leading* also (Do you *agree*...)
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SunscreenQuestions)**.
The first question seems fine. 
However, as always with the types of response options, what is meant by "seldom" (for instance) may vary from person to person.
The second question has options that overlap: Both 1h and 2h are in two categories.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:KidsEnvironmentQuestions).**
1. The first two are *closed*, with one option to be selected. The other is *open*.
1. I'm not sure primary school children could recall the answers to the first two questions accurately...
:::


## Answers: Describing variables {#DescribeAnswer}

Answers to exercises in Sect. \@ref(DescribeExercises).


::: {.answer data-latex=""}
**Answer to Exercise\@ref(exr:DescribeClassifyingGraphsLimeTrees)**:
*Foliage biomass*: quantitative continuous.
*Tree diameter* (in cm): quantitative continuous.
*Age of the tree* (in years): quantitative continuous.
*Origin of the tree*: Qualitative nominal.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DescribeClassifyingVariables1)**:
**1.** Systolic blood pressure: quantitative continuous.
**2.** Program of enrolment: qualitative nominal.
**3.** Academic grade: qualitative ordinal.
**4.** Number of times people visited the doctor last year: quantitative discrete.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DescribeClassifyingVariables2)**:
**1.** Age: qualitative ordinal.
**2.** Gender: qualitative nominal.
**3.** Location: qualitative nominal.
**4.** Social media use: qualitative ordinal.
**5.** BMI: quantitative continuous.
**6.** Total sitting time, in minutes per day: quantitative continuous.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DescribeClassifyingOrthoses)**:
*Gender*: Qualitative nominal.
*Age*: Quantitative continuous.
*Height*: Quantitative continuous.
*Weight*: Quantitative continuous.
*GMFCS*: Qualitative ordinal.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DescribeClassifyingNitrogenInSoil)**:
*Fertilizer dose*: Quantitative continuous.
*Soil nitrogen*: Quantitative continuous.
*Fertilizer source*: Qualitative nominal.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DescribeClassifyingKangaroos)**:
*Response of kangaroos*: Qualitative ordinal. (Or perhaps nominal?)
*Height of drone*: 'Height' is quantitative, 
but with just four values used it would probably be treated as qualitative ordinal.
*Mob size*s: Quantitative discrete.
*Sex*: Qualitative nominal.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DescribeSelfieDeaths)**:
*Location* is the only variable (something observed from the *individuals*).
The *number of people* and the *percentage of people* who died at each location is a *summary* of the data collected from the individuals.
'Location' is a *nominal*, *qualitative* variable, with seven *levels*.
:::

 
## Answers: Graphs {#GraphsAnswer}

Answers to exercises in Sect. \@ref(GraphsExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphsCars)**:
None of them are *bad* graphs.
I'd prefer the bar chart, but any are OK.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphsLimeTrees)**:
A graph of the individual variables is always useful as a starting point: so a *bar chart* for the origin, and a *histogram* for the others.

But *relationships* are the main focus.
Relationships between foliage biomass and tree origin: *boxplot*.
Relationships between foliage biomass and the other variables: *scatterplot*.
On the scatterplot, the different origins of the trees could be *encoded* 
by using different colours or plotting symbols.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphsOrthoses)**:
`Gender` and `GFMCS`: both qualitative; the others are quantitative.
Relationships between two *quantitative* variables: use a scatterplot.
Relationships between two *qualitative* variables: (say) a side-by-side bar chart.
With one of each: boxplot.
See Fig. \@ref(fig:CPalsyPlots) for some examples.
:::

```{r CPalsyPlots, fig.align="center", fig.cap="Some graphs from the cerebral palsy data", fig.width=9, fig.height=4, out.width='100%'}
NumP <- 15
Gender <- rep("M", 
              NumP)

Gender[ c(11, 14, 15)] <- "F"
Age <- c(9, 7, 7, 12, 11, 5, 6, 8, 8, 6, 7, 11, 7, 9, 8)
Ht <- c(136, 106, 129, 152, 146, 113, 112, 112, 138, 116, 113, 141, 136, 128, 133)
Wt <- c(34.5, 16.2, 21.1, 40.4, 39.3, 18.1, 16.7, 19.1, 28.6, 19.3, 17.6, 34.9, 34.5, 21.9, 23.0)
GMFCS <- c(1, 2, 1, 1, 2, 1, 2, 1, 1, 1, 1, 1, 1, 1, 1)

CPalsy <- data.frame(Gender = Gender,  
                     Age = Age,
                     Height = Ht,
                     Weight = Wt,
                     GMFCS = GMFCS)

par( mfrow = c(1, 3))
plot( Height ~ Age, 
      data = CPalsy,
      las = 1,
      xlab = "Age (in years)",
      ylab = "Ht (in cm)")

boxplot( Height ~ Gender, 
         data = CPalsy,
      las = 1,
      xlab = "Gender",
      ylab = "Ht (in cm)")

barplot( xtabs( ~ GMFCS + Gender, 
                data = CPalsy),
      las = 1,
      beside = FALSE,
      xlab = "Gender",
      ylab = "GMFCS")
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphNitrogenInSoil)**:
*Fertilizer* (quantitative): histogram (response variable).
*Soil nitrogen* (quantitative): Histogram (explanatory variable).
*Source* (qualitative nominal): Bar chart (explanatory variable).
*Relationships*: Between fertilizer dose and soil nitrogen: scatterplot. 
*Source* could be encoded using different *coloured* points.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphSurveyData)**:
A bar chart (or dot chart).
A pie chart would *not* be appropriate, as respondents could select more than one option.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphsAIS)**:
In general, female basketball players are taller than female netballer players (the first, second and third quartiles are all greater for basketball players).
For the second and third quartiles, the differences look quite substantial.
The minimum heights are similar.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphNoisyMiners)**:
What do the different plotting symbols mean?
The labels on the axes are not helpful.
The vertical axis goes up to 35, but could easily stop at 20.
See Fig. \@ref(fig:MinersGood).
:::

```{r MinersGood, fig.cap="The number of noisy miners and the number of eucalyptus trees", fig.align="center", fig.width=5, fig.height=3.5}
data(NMiner)

plot(Minerab ~ Eucs,
     data = NMiner,
     ylim = c(0, 20),
     las = 1,
     xlab = "Number of eucalypt trees",
     ylab = "Number of noisy miners",
     main = "No. of eucalypts and the no. of noisy miners in\n 2 ha. buloke woodland patches in Vic.",
     pch = 19)
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphHorseshoeCrabs)**:
The graph is *inappropriate*!
Both variables are qualitative, but the graph is a scatterplot (used for two *quantitative* variables).
What does that plot even tell you?

A stacked or side-by-side bar chart should be used (Fig. \@ref(fig:CrabsPlotBar)).
:::


```{r CrabsPlotBar, fig.cap="The colour of female horseshoe crabs and the condition of their spines. There are no missing values.", fig.align="center", fig.width=5, fig.height=4}
data(HCrabs)

CrabsMat <- xtabs(~ Spine + Col, 
                   data = HCrabs)

barplot( CrabsMat,
         beside = FALSE,
         xlab = "Condition of the spine",
         ylab = "Carapace colour",
         ylim = c(0, 100),
         col = grey( seq(0, 1, length = 3)),
         las = 1)
legend("topleft", 
       fill = grey( seq(0, 1, length = 3)), 
       legend = rownames(CrabsMat))
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphsMADRS)**:
**1.** Response variable: *Change* in MADRS (quantitative continuous).
**2.** Explanatory variable: treatment group (qualitative nominal with three levels).
**3.** Response variable: Histogram. Explanatory: bar chart. Relationship: boxplot.
:::


:::{.answer data-latex=""}
**Answer to Exercise \@ref(exr:GreenBuilding)**:
See Fig. \@ref(fig:OfficeTempsBoxplot).

```{r OfficeTempsBoxplot, fig.cap="Boxplot of the office temperatures", fig.align="center", fig.width=5, fig.height=4}
 Green.stats <- matrix( 
   c(16.4    ,15.9    ,20.1,
     22.8    ,23.8    ,24.6,
     24.4    ,25.5    ,26.1,
     25.5   ,26.9    ,27.2,
     27.4    ,31.0    ,30.3), 
   ncol = 3, 
   byrow = TRUE)

Green.bxp <- list( stats = Green.stats,
                   n = rep(100, 3), # MADE UP
                   conf = NA,
                   out = rep(numeric(0), 3),
                   group = rep(numeric(0), 3),
                   names = c("Office A", 
                             "Office B", 
                             "Office C"))
bxp( Green.bxp,
     las = 1,
     xlab = "Office",
     ylab = "Temperature (in degrees C)",
     ylim = c(15, 35)
     )
```
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:GraphsSkewBar)**:
Variable is the 'Sport' (*qualitative*).
The bars can be ordered any way.
*Skewness makes no sense*: It only makes sense to talk about skewness for *quantitative* variables.
:::


## Answers: Numerical summaries for quantitative data {#NumericalQuantAnswer}

Answers to exercises in Sect. \@ref(NumericalQuantExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:NumericalQuantRides)**:
**1.** 3.7.
**2.** 3.5.
**3.** 1.888562.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:NumericalQuantFulmars)**:
**1.** 643g.
**2.** 12.884g.
**3.** 637.5g.
**4.** We do not know.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:NumericalQuantNHANES)**:
Probably the median as slightly skewed right, with some outliers.
*Both* the mean and median *can* be quoted...
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:NumericalQuantSOI)**:
**1.** Sample mean: 0.467.
**2.** Sample median: 3.35.
**3.** Range: 29.6 (from -19.8 to 9.8).
**4.** Sample standard deviation: 10.40263.
(SOI has no units of measurement.)
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:NumericalQuantMatchingHistogramsAndBoxplots)**:
**A**: II (median; IQR).
**B**: I (mean; standard deviation).
**C**: III (median; IQR).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:NumericalQuantConstructionWorkerProductivity)**:
See Fig \@ref(fig:PanelsBoxplot).
Worker 2 is faster in general (more panels installed per minute), including one  *fast* outlier.
Workers 1 and 3 have similar medians, but Worker 3 is more consistent (smaller IQR).
:::


```{r PanelsBoxplot, fig.align="center", fig.cap="The boxplot for the productivity data", fig.width=5, fig.height=4}
Panels <- array(NA, 
                dim = c(7, 3))

rownames(Panels) <- c("Mean", 
                      "Minimum", 
                      "1st quartile", 
                      "Median", 
                      "3rd quartile", 
                      "Maximum", 
                      "Range")
colnames(Panels) <- c("Worker 1", 
                      "Worker 2", 
                      "Worker 3")

Panels[, 1] <- c(1.24, 0.59, 0.88, 1.35, 1.49, 1.88, 1.28)
Panels[, 2] <- c(1.73, 1.13, 1.51, 1.70, 1.91, 3.00, 1.87)
Panels[, 3] <- c(1.36, 0.86, 1.16, 1.38, 1.58, 2.17, 1.31)

Panels.bxp <- list(
  stats = as.matrix(Panels[2:6, ]),
  n = rep(10, 3), # No idea; a guess
  conf = matrix( NA, ncol = 3),
  out = rep( numeric(0), 3),
  group = rep(numeric(0), 3),
  names = c("Worker 1", 
            "Worker 2", 
            "Worker 3")
)


bxp( Panels.bxp, 
     las = 1,
     ylab = "Productivity (panels/minute)")
```


::: {.answer data-latex=""}
**Answer to exercise \@ref(exr:NumericalQuantMatchingMicroPlastics)**:
**1.** $\bar{x} = 3.09$.
**2.** median: 2.0.
**3.** $s = 2.77$.
**4.** IQR: 4.
:::


## Answers: Numerical summaries for qualitative data {#NumericalQualAnswer}

Answers to exercises in Sect. \@ref(NumericalQualExercises).

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:NumericalQualHangovers)**:
**1.** *Vomited*: 0.50 had beer then wine; 0.50 had wine only.  
*Didn't vomit*: 0.738 had beer then wine, 0.262 had wine only. 
They tell us the proportion that drank various things, among those who did and didn't vomit.
**2.** *Beer then wine*: 8.8% vomited and 91.2% didn't; *Wine only*: 21.4% vomited and 78.6% didn't. 
They tell us the percentage that vomited, for each drinking type.
**3.** $(6 + 6)/(6 + 6 + 62 + 22) = 0.125$.
**4.** $6/22 = 0.2727$.
**5.** $6/62 = 0.096774$.
**6.** $0.27272/0.096774 = 2.82$.
**7.** $0.096774/0.27272 = 0.354$.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:Wallabies)**:
**1.** $91/(91 + 188) = 0.32616$.
**2.** $188/91 = 2.0659$, or about 2.07.
**3.** $22/13 = 1.6923$, or about 1.69.
**4.** $13/(13 + 22) = 0.37142$, or about 37.1%.
**5.** $2.0659/1.6923 = 1.22$.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OddsAugustRainfall)**:
**1.** 21/114, or about 18.4%.
**2.** 14/54, or about 25.9%.
**3.** 7/60, or about 11.7%.
**4.** 21/93, or about 0.226.
**5.** 14/40, or 0.35.
**6.** 7/53, or about 0.132.
**7.** 0.35/0.132, or about 2.7.
**8.** The odds of no August rainfall in Emerald is 2.7 times higher in months with non-positive SOI.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OddsBackpacks)**:
**1.** 45.9%.
**2.** 61.4%.
**3.** 0.848.
**4.** 1.59.
**5.** 1.15.
**6.** 0.533.
**7.** The odds of reporting back pain from carrying school bags, comparing boys to girls. 
:::


## Answers: Making decisions {#MakingDecisionsAnswer}

Answers to exercises in Sect. \@ref(MakingDecisionsExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MakingDecisionsDice)**:
**1.** Yes! Seems likely there is a problem (we can't be certain).
**2.** **Assuming** the die was fair, I would not **expect** to get a **6** ten times in a row; sounds highly unusual.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MakingDecisionsClaim)**:
**1.** That the population mean is 12 inches, as claimed. 
       We have no evidence to refute this claim.
**2.** *First*: the *population* mean diameter is $\mu = 12$ inches; the *sample* mean is not 12 inches due to sampling variation. 
       *Second*: the *population* mean diameter isn't 12 inches, reflected in the sample.
**3.** 11.48 is 0.52 inches from the target of 12; seems unlikely that the sample mean would be that far from 12 inches through sampling variation alone.
**4.** $\bar{x} = 11.25$ inches is further from $\mu = 12$ that $\bar{x} = 11.48$: claim probably not supported. 
**5.** Smaller sample sizes: sample mean would vary more (in general, larger samples give more precise estimates).
:::


## Answers: Probability {#ProbabilityAnswer}

Answers to exercises in Sect. \@ref(ProbabilityExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ProbabilityThreeEvents).**
False; true; it depends on what the first card was.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ProbabilityDie2).**
$3/6$; $5$; yes: what happens on the die won't change what happens on the coin.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ProbabilityCards)**:
**1.** *Probability* draw a King: $4/52 = 0.07692$.
**2.** *Odds* draw a King: $4/48 = 0.08333$.
**3.** *Probability* draw a picture card: $16/52 = 0.3077$.
**4.** *Odds* draw a picture card: $16/(52 - 16) = 0.4444$.
**5.** *Not* independent. This is like Example \@ref(exm:IndependenceCards). 
**6.** *Are* independent. What happens on the die does not change what happens with the cards.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:AndrewsQuote)**:
Only a 50--50 chance if the events were equally likely... they clearly are not.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:FirstNationStudents)**:
NA.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:IndependentEvents)**:
**1.** *Not independent* events: If it rains, less likely to walk to work than if it doesn't rain.
**2.** *Not independent* events: A smoker is far more likely to suffer from lung cancer than a non-smoker.
**3.** *Dependent* events: If it rains, I won;t water my garden. 
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SensitivitySpecifity)**:
**1.** Expect $100\times 0.99 = 99$ people to return a positive test result.
**2.** Expect $900\times (1 - 0.98) = 18$ people to return a positive test result.
**3.** $18 + 99 = 117$ positive test results.
**4.** $99/117$, or $84$% of having the disease.

A positive test result may or may not mean the person has the disease.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CoinOutcomes)**:
The reasoning assumes that the three outcomes (HH, TT, HT) are *equally likely*, which is not true.
For example, consider tossing a 20-cent coin (shown in lower-case, normal font) and a 1-dollar coin (shown in capitals, **bold** font).
The *four* outcomes are:
h**H**,
h**T**,
t**H**
t**T**.
:::


## Answers: Sampling distributions {#SamplingDistributionsAnswer}

Answers to exercises in 
Sect. \@ref(SamplingDistributionsExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingDistributionsTrees)**:
**1.** $z = (8 - 8.8)/2.7 = 0.2962$, or $z = -0.30$. 
       From tables, the probability is 0.3821, or about 38.2%.
**2.** $z = 0.07$; probability is $1 - 0.52379 = 0.4721$, or about 47.2%.
**3.** The $z$-scores are $z_1 = -0.67$ and $z_2 = 0.44$; the probability is $0.6700 - 0.2514 = 0.4186$, or about 41.9%. (Draw a diagram!)
**4.** Using the tables 'backwards': $z$-score is about 1.04; corresponding tree diameter is $x = 8.8 + (1.04\times 2.7) = 11.608$, or about 11.6 inches. 
       About 15% of tress will have diameters larger than about 11.6 inches.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingDistributionsGestationLength)**:
**1.** $z = (39 - 40)/1.64 = -0.6097561$, or $z=-0.61$. 
       Using tables: probability *less than* this value of $z$ is 0.2709, so the answer is $1 - 0.2709 =0.7291$, or about 72.9%.
**2.** $z = (37 - 40)/1.64 = -1.83$; probability is 0.0336, about 3.4%.
**3.** The two $z$-scores: $z_1 = -4.878$ and $z_2 = -1.83$. 
       Drawing a diagram, probability is $0.0336 - 0 = 0.0336$, or about 3.4%.
**4.** The $z$-score: 1.64 (or 1.65). 
       Gestation length: $x = 40 + (1.64 \times 1.64) = 42.7$ (same answer to one decimal place using $z = 1.65$). 
       5% of gestation lengths *longer* than about 42.7 weeks. 
**5.** $z$-score is -1.64 (or -1.65). 
       Gestation length: $x = 40 + (-1.64 \times 1.64) = 37.3$ (same answer to one decimal place using $z = -1.65$). 
       5% of gestation lengths *shorter* than about 37.3 weeks.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingDistributionsIQs)**:
$z$-score: about $z = 2.05$.
Corresponding IQ: $x = 100 + (2.05\times 15) = 130.75$.
An IQ greater than about 130 is required to join Mensa.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingDistributionsIQsMilitary)**: 
An IQ score lower than about 80.8 leads to a rejection by the US military.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingDistributionsIQForwards)**: 
**1:** C; **2:** A; **3:** B; **4:** D.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingDistributionsIQBackwards)**: 
**1:** A; **2:** C; **3:** B; **4:** D.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SamplingDistributionsChargingEVs)**:
Be *very* careful: work with the *number of minutes from the mean, or from 5:30pm*.
The standard deviation already is in decimal, but converted to minutes, standard deviation is 120 minutes, plus $0.28\times 60 = 16.8$ minutes. 
The standard deviation is 136.8 minutes.

**1.** 9pm is 3 hours and 30 minutes from 5:30pm: 210 minutes. 
       $z$-score: $z = (210 - 0)/136.8 = 1.54$; probability: $1 - 0.9382 = 0.0618$, or about 6.2%.
**2.** $z = (5 - 5.5)/2.28 = -0.22$; probability: 0.4129$, or about 41.3%.
**3.** $z$-scores are $z_1 = -0.22$ and $z_2 = 0.22$; probability: $0.5871 - 0.4129 = 0.1742$, or about 17.4%.
**4.** $z$-score is $0.52$; time is $x = 0 + (0.52\times 136.8) = 71.136$ minutes after 5pm; about one hour and 11 minutes after 5:30pm, or 6:41pm.
**5.** $z$-score: $-1.04$; time is $x = 0 + (-1.04\times 136.8) = -141.272$, or 141.272 minutes *before* 5pm; about two hours and 21 minutes before 5:30pm, or 3:09pm.
:::


## Answers: Sampling variation {#SamplingVariationExercisesAnswer}

Answers to exercises in Sect. \@ref(SamplingVariationExercises)


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:StdErrorOrStdDeviation)**:
**1.** Standard deviation.
**2.** Standard error. More specifically, the standard error of the mean.
**3.** Standard deviation.
**4.** Standard error. More specifically, the standard error of the proportion.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:HasStandardError)**:
**1.** No: Population proportions don't vary from sample to sample.
**2.** Yes: varies from sample to sample.
**3.** Yes: varies from sample to sample.
**4.** Yes: varies from sample to sample.
**5.** No: Population odds don't vary from sample to sample.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:QuoteStdError)**:
The *standard error of the mean* is used to describe how much the sample mean is likely to vary from sample to sample.
Alternatively, it describes how precisely the sample mean is estimating the (unknown) population mean.
:::


## Answers: CIs for one proportion {#CIOneProportionAnswer}


Answers to exercises in Sect. \@ref(CIOneProportionExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CIOneProportionSaltIntake)**:
$\hat{p} = 2182/6882 =  0.317059$ and $n=6882$.
So:

\[
  \text{s.e.}(\hat{p}) = \sqrt{ \frac{0.317059 \times(1 - 0.317059)}{6882} } =  0.005609244.
\]
The CI is $0.317059 \pm (2\times 0.005609244)$, or $0.317059\pm 0.01121849$.
Rounding sensibly: $0.317\pm 0.011$ (notice we keep lots of decimal places in the *working*, but round the final answer).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CIOneProportionSnacking)**:
$\hat{p} = 8/154 = 0.05194805$; $\text{s.e.}(\hat{p}) = 0.0017833$; approximate 95% CI is $0.05194 \pm (2 \times 0.0017833)$, or $0.0519\pm 0.0358$, equivalent to 0.016 to 0.088.
The CI is statistically valid.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CIOneProportionHiccups)**:
Use $\hat{p} = 708/864 = 0.8194444$ and $n = 864$.
Standard error: $\text{s.e.}(\hat{p}) = 0.01308604$; approximate 95% CI is $0.8194444 \pm (2\times 0.01308604)$.

The CI is statistically valid.
:::

```{r, fig.cap="The sampling distribution of the proportion of males in samples of 864 people with hiccups", fig.width=6, fig.height=3.5, fig.align="center"}
z <- seq(-3.5, 3.5, 
         length = 100)
mn <-  0.8194444
sdev <- 0.01308604
y <- dnorm(z)

z.special <- seq(-3, 3, 
                 by = 1)
labs <- round(mn + (z.special * sdev), 3)

plot( y ~ z,
      axes = FALSE,
      type = "l",
      lwd = 2,
      ylab = "",
      xlab = "Proportion of males")
axis(side = 1, 
     at = seq(-3, 3, by = 1),
     cex = 0.9,
     labels = labs)
abline(v = 0, 
       lwd = 2, 
       col = "grey")
abline(v = c(-1, 1), 
       lwd = 1, 
       col = "grey")
abline(v = c(-2, 2), 
       lwd = 1, 
       col = "grey")
abline(v = c(-3, 3), 
       lwd = 1, 
       col = "grey")
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CIOneProportionAustSmokers)**:
**1.** Approximately $n = 1/(0.05^2)  = 400$.
**2.** Approximately $n = 1/(0.025^2) = 1600$.
**3.** To *halve* the width of the interval, *four* times as many people are needed.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CITurbines)**:
After 3000 hours: $\hat{p} = 0.2143$; $\text{s.e.}(\hat{p}) = 0.06331$.
The CI is from 0.088 to 0.341.
The statistical validity conditions are satisfied.

After 400 hours: $\hat{p} = 0$; $\text{s.e.}(\hat{p}) = 0$.
The CI is from 0 to 0: clearly silly (implies no sampling variation).
This is because the statistical validity conditions are **not** satisfied.
:::


## Answers: More about formings CIs {#AboutCIsAnswer}

Answers to exercises in Sect. \@ref(AboutCIsExercises).

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:AboutCIsInterpretation)**:
The conclusion states that the interval is one in which they are reasonably sure (i.e., 95% sure) that the *sample* proportion will lie.
But the researcher knows *exactly* what the sample proportion is: it is $\hat{p} = 0.314$.

CIs give intervals in which we are reasonably certain that the *population* value is within, because the population proportion is unknown. 
(In addition, the CI is a 68% anyway, not a 95% CI as claimed.)
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:AboutCIsInterpretation2)**:
The CI is not about *individual* trees; it is about a population parameter.
Presumably, it should read something like 'This CI means that between 22.3% and 40.5% of trees are infected with apple scab'.
:::


## Answers: CIs for one mean {#OneMeanConfIntervalAnswer}

Answers to exercises in Sect. \@ref(OneMeanConfIntervalExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneMeanCIBears).**

**1.** The *parameter* of interest is the population mean weight of an American black bear, $\mu$.
**2.** $\text{s.e.}(\bar{x}) = \frac{s}{\sqrt{n}} = \frac{51.1}{\sqrt{185}} = 3.756947$.
**3.** From $84.9 - (2 \times 3.756947) = 77.38611$ to $84.9 + (2 \times 3.756947) = 92.41389$; from 77.4 to 92.4&nbsp;kg.
**4.** We are approximately 95% confidence that the *population mean* weight of male American black bears is between 77.4 and 92.4&nbsp;kg.
**5.** Statistically valid, since the sample size is much larger than 25.
We *do not* have to assume that the distribution of the weights of male black bears has a normal distribution in the population, as the sample size is (much) larger than 25.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CIOneMeanLungCapacityInChildren)**:
Standard error: $\text{s.e.} = s/\sqrt{n} = 0.43/\sqrt{45} = 0.06410062$ (keeping lots of decimal places in the working).
*Approximate* 95% CI: $2.85 \pm(2\times 0.06410062)$, or $2.85\pm 0.1282012$, or from 2.72 litres to 2.98 litres.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CIOneMeanEnvironmentalPollution)**:
Standard error: $\text{s.e.} = s/\sqrt{n} = 7571.74/\sqrt{58} = 994.2182$ (keeping lots of decimal places in the working).
*Approximate* 95% CI is: 4967.984 micrograms to 8944.86 micrograms.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CIOneMeanToothbrushing)**:
Approximate 95% CI for the mean brushing time: 29.9 seconds to 36.1 seconds.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CIOneMeanBloodLoss)**:
**1.** Standard error: $\text{s.e.}(\bar{x}) = 651.1/\sqrt{199} = 46.15526$; approximate 95% CI: 754.1ml to 938.7ml.
**2.** They don't seem very good at estimating (the article reports that the guesses ranged from 50ml to 3000ml).
**3.** The sample size is much larger than 25; the CI should be statistically valid.
**4.** Using the margin-of-error as 50, and $s = 651.1$:

\[
  \left( \frac{2\times 651.1}{50}\right)^2 = 678.2899.
\]
We would need about 679 participants (remembering to round *up*).

**5.** Using margin-of-error as 25, and $s = 651.1$:

\[
  \left( \frac{2\times 651.1}{25}\right)^2 = 2713.16.
\]
Need about 2714 participants (remembering to round *up*).
**6.** To *halve* the width of the margin of error, *four* times as many subjects are needed.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneMeanCINHANESInterpret)**:
*None* of these interpretations are acceptable.
**1.** CIs are not about how individual observations vary; they are about how a *statistic* varies (in this case, the sample mean).
       In addition, CIs are about populations and not samples.
**2.** CIs are not about how individual observations vary; they are about how a *statistic* varies (in this case, the sample mean.
**3.** This doesn't make sense: *samples* can't vary between two values. 
       Sample *statistics* vary.
       In addition, CIs are about populations, not samples.
**4.** This doesn't make sense:  *populations* can't vary between two values. 
       Even population *parameters* don't vary.
**5.** The population parameter does not vary. 
       It is a fixed (but unknown) value to be estimated.
       (If the value of the population mean was constantly changing, it would be very hard to estimate...)
**6.** We know *exactly* what the sample mean is ($\bar{x}=1.3649$mmol/L: We don't need a interval for the sample mean.
**7.** We know *exactly* what the sample mean is ($\bar{x}=1.3649$mmol/L: We don't need a interval for the sample mean.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneMeanStdError)**:
*Neither* is correct.
To learn about the variation in *individuals* trees, use the *standard deviation* rather than the standard error.
The standard error tells us about the population mean diameter, not about individual trees.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ChewingTime)**:
$\text{s.e.}(\bar{x}) = 5.36768$, so the approximate 95% CI is $61.3\pm (2 \times 5.36768)$, which is $61.3\pm 10.74$s (or 50.56s to 72.04s).
Since $n = 30$, which is greater than 25, the CI is likely to be statistically valid.
:::


## Answers: CIs for paired data {#PairedCIExercisesAnswer}

Answers to exercises in Sect. \@ref(PairedCIExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:PairedCIExercisesBrocolli)**:
Mean of the *differences*: 5.2; standard error 3.6.
*Approximate* 95% CI: $5.2 \pm (2\times 3.06)$, or $5.2\pm 6.12$, from -0.92 to 11.22.
Mean taste preference between preferring it better *with* dip by up to 11.2mm on the 100mm visual analogue scale, or preferring it *without* dip by a little (up to -0.9mm on the 100mm visual analogue scale.
(Understanding how the *differences* are defined is needed to understand where this came from.)

A useful summary might be like Table \@ref(tab:BrocolliSummaryTable).
:::


```{r BrocolliSummaryTable}
BrocolliSummaryTable <- array( NA, 
                               dim = c(3,3))
rownames(BrocolliSummaryTable) <- c("Raw", 
                                    "With dip", 
                                    "Differences")
colnames(BrocolliSummaryTable) <- c("Mean", 
                                    "Standard deviation", 
                                    "Standard error")
BrocolliSummaryTable[1, ] <- c(56.0, 
                               26.6, 
                               round(26.6/sqrt(101), 2))
BrocolliSummaryTable[2, ] <- c(61.2, 
                               28.7, 
                               round(28.7/sqrt(101), 2))
BrocolliSummaryTable[3, ] <- c(5.2, 
                               "", 
                               3.06)

tab.cap.br <- "A  numerical summary for the brocilli data"

if( knitr::is_latex_output() ) {
  kable( BrocolliSummaryTable,
         format="latex",
         booktabs = TRUE,
         digits = c(1, 1, 2),
         longtable = FALSE,
         col.names = colnames(BrocolliSummaryTable),
         caption = tab.cap.br) %>%
    column_spec(1, bold = TRUE)  %>%
    row_spec(0, bold = TRUE) %>%
    kable_styling(font_size = 9)
}
if( knitr::is_html_output() ) {
  out <- kable( BrocolliSummaryTable,
         format = "html",
         booktabs = TRUE,
         digits = c(1, 1, 2),
         longtable = FALSE,
         col.names = colnames(BrocolliSummaryTable),
         caption = tab.cap.br) %>%
    column_spec(1, bold = TRUE) 
}
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:PairedCIExercisesCaptopril)**:
**1.** Computing differences as `Before` minus the `After` measurements seems sensible: the average blood pressure *decrease*, the purpose of the drug.
**2.** The differences (when defined as*reductions*): 9, 4, 21, 3, 20, 31, 17, 26, and so on.
**3.** Mean difference: 18.933; standard deviation: 9.027; standard error: $9.027/\sqrt{15} = 2.331$. 
       *Approximate* 95% CI: $14.271$ to $23.56$&nbsp;mm&nbsp;Hg.
**4.** Exact 95% CI: 13.934 to 23.93&nbsp;mm&nbsp;Hg from output.
**5.** The first uses *approximate* multipliers; the second uses exact multipliers.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:PairedCIExercisesSmokeExercise)**:

**1.** *Approximate* 95% CI for *reduction*: $0.66 \pm(2\times 0.37)$, or -0.08 to 1.4: average could be an *increase* of up to 0.08 to a *reduction* of up to 1.4 on the given scale for women.
**2.** Sample size is not larger than 25, but close: probably reasonably statistically valid.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:StressSurgeryCI).**

$\bar{d} = 7.70$ fmol/mol and the *standard error for this increase* is 3.10 fmol/mol.
The approximate 95% CI is $7.70 \pm (2\times 3.10)\quad\text{or:}\quad 7.70 \pm 6.20$, or from $1.50$ to $13.90$ fmol/m.

Using jamovi, the *exact* 95% CI is from 1.18 to 14.22 fmol/mol, similar to the *approximate* CI computed manually.
The sample size is $n = 19$, just less than 25, so the results may not be statistically valid (but shouldn't be too bad).
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MeanDiffCOVIDCI).**


**1.** The *differences* are defined as *during* minus *before*, so that *positive* difference therefore means the *during* value is higher.
**2.**
1. $\text{s.e.}(\bar{d}) = {s_d}/{n} = {51.30}/{\sqrt{213}} = 3.515018$.
1. From -4.35 to 9.71 minutes.
   The lower value is *negative*; this does **not** mean a negative amount of PA (which would make no sense).
   The CI is for the population mean *increase* during compared to before lockdown, so, a negative value means that the *before* values are higher than the *during* values on average. The CI means: 'In the population, the mean difference between the amount of vigorous PA by Spanish health students is between 4.35 minutes more *during* lockdown, and 9.71 minutes more *before* lockdown.'
:::


## Answers: CIs for two means {#MeansIndSamplesAnswer}

Answers to exercises in Sect. \@ref(MeansIndSamplesExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MeansIndSamplesExercisesWhalesCI).**


**1.** Define the *difference* as the mean length of female gray whales *minus* the mean length of male gray whales. (Either direction is fine, as long as you are consistent.)
**2.** 1. The parameter: $\mu_F - \mu_M$ (where $F$ and $M$ represent female and male gray whales respectively).
The estimate: $\bar{x}_F - \bar{x}_M = 4.66 - 4.60 = 0.06$.
**3.** See \@ref(fig:WhalesErrorbar).
**4.** From $-0.12$&nbsp;m to $0.25$&nbsp;m. 'The population mean difference between the length of female and male gray whales at birth has a 95% chance of being between $0.12$&nbsp;m longer for male whales to $0.25$&nbsp;m longer for female whales.'
**5.** Both sample sizes are larger than 25, so the CI is likely to be statistically valid.
:::


```{r WhalesErrorbar, fig.cap="Error bar chart comparing the lengths of female and male gray whales", fig.align="center", fig.width=4.5, fig.height=3.5}

ciF <- c(4.66 - 2 * (0.379 / sqrt(26)),
         4.66 + 2 * (0.379 / sqrt(26)) )
ciM <- c(4.60 - 2 * (0.305 / sqrt(30)),
         4.60 + 2 * (0.305 / sqrt(30)) )


plot( range( c( ciF, ciM) ) ~ c(1, 2), 
      type = "n",
      xlim = c(0.5, 2.5),
      ylim = c(4.45, 4.85),
      pch = 19,
      axes = FALSE,
      main = "Error bars chart:\n length of gray whales at birth",
      xlab = "Sex", 
      ylab = "Length (in metres)",
      sub = "(Error bars are 95% confidence intervals)",
      las = 1)
axis(side = 1, 
     at = 1:2, 
     labels = c("Females", "Males"), 
     las = 1) 
axis(side = 2, 
     las = 1)
box()

points( c(4.66, 4.60) ~ c(1:2), 
        pch = 19,
        col = plot.colour,
        cex = 1.5)

arrows(1:2, c(ciF[1], ciM[1]), 
       1:2, c(ciF[2], ciM[2]), 
       length = 0.05, 
       angle = 90, 
       code = 3)
```    


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MeansIndSamplesExercisesNHANES)**:
**1.** Table \@ref(tab:NHANESSummaryTable).
**2.** From SPSS, *exact* 95% CI: 0.05438 to 0.11501 (bottom row).
       Exact 95% CI for the difference between the mean direct HDL cholesterol concentrations: 0.05438 to 0.11501&nbsp;mm&nbsp;Hg higher for non-smokers.
:::


```{r NHANESSummaryTable}
NHANESSummaryTable <- array( NA, 
                             dim = c(3,4))
rownames(NHANESSummaryTable) <- c("Non-smokers", 
                                  "Smokers", 
                                  "Differences")
colnames(NHANESSummaryTable) <- c("Mean", 
                                  "Standard deviation", 
                                  "Standard error",
                                  "Sample size")
NHANESSummaryTable[1, ] <- c(1.3924, 
                             0.42792, 
                             0.01048, 
                             1668)
NHANESSummaryTable[2, ] <- c(1.3077, 
                             0.42353, 
                             0.01137, 
                             1388)
NHANESSummaryTable[3, ] <- c(0.08470, 
                             "", 
                             0.01546, 
                             "")


if( knitr::is_latex_output() ) {
  kable(NHANESSummaryTable, 
        format = "latex",
        longtable = FALSE,
#        escape = FALSE,
        col.names = colnames(NHANESSummaryTable),
        booktabs = TRUE,
        #linesep = c("\\addlinespace",  # Otherwise adds a space after five lines... 
        align = c("r", "r", "r", "r"),
        caption = "A summary table for the NHANES data; statistics in mm Hg") %>%
    column_spec(1, bold = TRUE) %>%
    kable_styling(font_size = 9)
}
if( knitr::is_html_output() ) {
  kable(NHANESSummaryTable, 
        format = "html",
        longtable = FALSE,
        col.names = colnames(NHANESSummaryTable),
        booktabs = TRUE,
        align = c("r", "r", "r", "r"),
      caption = "A summary table for the NHANES data; statistics in mm Hg") %>%
    kable_styling(full_width = FALSE) %>%
    column_spec(1, bold = TRUE) 
   
}
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MeansIndSamplesExercisesEchinacea)**:
**1.** Placebo group: $3.62/\sqrt{176} = 0.2728678$ days; echinacea group $3.31/\sqrt{183} = 0.2446822$ days.
**2.** $0.53 \pm (2\times 0.367)$, or -0.204 to 1.264 days.
**3.** Placebo *minus* echinacea: the difference between the means show how much *longer* symptoms last with placebo, compared to echinacea.
**4.** $6.34 \pm (2\times 0.2446822)$, or 5.85 to  6.83 days.
**5.** Sample sizes are large, so the CIs statistically valid.

The difference between the means is an average of 0.53 days; about half a day (quicker on echinacea). 
Probably not that important when a cold last for almost seven days.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MeansIndSamplesExercisesCarpalTunnelSyndrome)**:
**1.** Exercise group: $1.4/\sqrt{10} = 0.4427189$; splinting group: $1.1/\sqrt{10} = 0.3478505$.
**2.** Splinting *minus* exercise: the difference are how much greater the pain is with splinting.
**3.** $0.3\pm (2\times 0.563$, or from -0.826 to 1.426: 0.826 greater pain with exercise to 1.426 greater pain with splinting.
**4.** $1.1\pm (2\times 0.3478505)$: from 0.404 to 1.796.
**5.** Sample sizes are small; CIs may not be statistically valid, roughly correct only.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:DecelerationCI)**
**1.** The parameter could be written as $\mu_{\text{After}} - \mu_{\text{Before}}$, the *increase* is the deceleration.
**2.** The approximate CI is $-0.00162$ to $0.00562$ m/s.
       That is, the difference between the mean decelerations is likely to be somewhere between $-0.0016$ m/s (i.e, a mean acceleration of 00016 m/s) to $0.0056$ m/s.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:FacePlantCI)**

**1.** Either direction is fine; we use $\mu_Y - \mu_O$: the amount by which younger women can lean further forward than older women.
**2.** Quite a small dataset... a dot plot? (We would probably produce an error bar chart, too, but that does not display the *data*.)
**3.** See Table \@ref(tab:FacePlantSummary).
**4.** Approximate CI: . Exact CI: From the *second row* of the output: $9.10$ to $19.90$.
   These are different, since the sample sizes are not large.
**5.** Both sample sizes are less than 25, so probably not statistically valid.
**6.** Based on the sample, a 95% CI  for the difference between population mean one-step fall-recovery angle for healthy women is between $9.1$ and $19.9$ degrees *greater* for younger women than for older women (two independent samples).
:::


```{r FacePlantSummary}
data(ForwardFall)

FF.Summary <- array( NA, dim=c(3, 4))

FF.Summary[1:2, 1] <- aggregate(ForwardFall$LeanAngle ~ ForwardFall$Group, 
                                FUN = "mean")[, 2]
FF.Summary[1:2, 2] <- aggregate(ForwardFall$LeanAngle ~ ForwardFall$Group, 
                                FUN = "sd")[, 2]
FF.Summary[1:2, 3] <- aggregate(ForwardFall$LeanAngle ~ ForwardFall$Group, 
                                FUN = function(x){sd(x) / sqrt(length(x))})[, 2]
FF.Summary[1:2, 4] <- aggregate(ForwardFall$LeanAngle ~ ForwardFall$Group, 
                                FUN = "length")[, 2]

FF.Summary[3, 1] <- FF.Summary[1, 1] - FF.Summary[2, 1] 
FF.Summary[3, 3] <- sqrt( FF.Summary[1, 2]^2/FF.Summary[1, 4] + 
                          FF.Summary[2, 2]^2/FF.Summary[2, 4] )
row.names(FF.Summary) <- c("Younger women", 
                           "Older women",
                           "Difference")

FF.Summary2 <- FF.Summary
FF.Summary2[, 1] <- round(FF.Summary[, 1], 1) 
FF.Summary2[, 2] <- round(FF.Summary[, 2], 2) 
FF.Summary2[, 3] <- round(FF.Summary[, 3], 2) 
FF.Summary2[, 4] <- round(FF.Summary[, 4], 0)
FF.Summary2[3, 2] <- ""
FF.Summary2[3, 4] <- ""
  
  
if( knitr::is_latex_output() ) {
  kable(FF.Summary2,
      format = "latex",
      booktabs = TRUE,
      align = "r",
      col.names = c("Mean", 
                    "Standard deviation", 
                    "Standard error", 
                    "Sample size"),
      caption = "Numerical summary for the face-plant data (means, standard deviations and standard errors are in degrees)"
      ) %>%
    row_spec(0, bold = TRUE) %>%
    kable_styling(font_size = 10)
}
if( knitr::is_html_output() ) {
  kable(FF.Summary2,
      format = "html",
      booktabs = TRUE,
      align = "r",
      col.names = c("Mean", 
                    "Standard deviation", 
                    "Standard error", 
                    "Sample size"),
      caption = "Numerical summary for the face-plant data (means, standard deviations and standard errors are in degrees)"
      )
}
```


## Answers: CIs for odds ratios {#OddsRatiosCIAnswer}

Answers to exercises in Sect. \@ref(OddsRatiosCIExercises).

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OddsRatiosCIScarHeights)**:
**1.** $99/62 = 1.596774$; about 1.60.
**2.** $216/115 = 1.878261$; about 1.88.
**3.** $1.596774/1.878261 = 0.850$, as in the output.
**4.** A few ways; for example: For every 100 men with a smooth scar, about 85 women with a smooth scar.
**5.** (Graph not shown, but use a stacked or side-by-side bar chart.)
**6.** Table \@ref(tab:ScarsNumericalSummary).
**7.** Exact 95% CI for the OR, from the output: 0.576 to 1.255.
**8.** If study repeated study many times (with the same numbers of men and women), about 95% of the CIs would contain population OR.
       In practice: population OR is probably between 0.576 and 1.255.
:::


```{r ScarsNumericalSummary}
tab.cap <- "The odds and percentage of having smooth scars, for women and men"
ScarTable <- array(dim = c(3, 3) )
ScarTable[1, ] <- c("1.60", 
                    "61.5%", 
                    "161")
ScarTable[2, ] <- c("1.88", 
                    "65.3%", 
                    "331")
ScarTable[3, ] <- c("0.850", 
                    "", 
                    "")

colnames(ScarTable) <- c("Odds with smooth scars",
                         "Percentage with smooth scars", 
                         "Sample size")
rownames(ScarTable) <- c("Women:", 
                         "Men:",
                         "Odds ratio:")
if( knitr::is_latex_output() ) {
  kable(ScarTable, 
        format = "latex",
        longtable = FALSE,
#        escape = FALSE,
        col.names = colnames(ScarTable),
        booktabs = TRUE,
      #linesep = c("\\addlinespace",  # Otherwise adds a space after five lines... 
        align = c("r", "r", "r"),
        caption = tab.cap) %>%
    column_spec(1, bold = TRUE) %>%
    kable_styling(font_size = 9)
} 
if( knitr::is_html_output() ) {
  out <- kable(ScarTable, 
        format = "html",
        longtable = FALSE,
        col.names = colnames(ScarTable),
        booktabs = TRUE,
        align = c("r", "r", "r"),
        caption = tab.cap) %>%
    kable_styling(full_width = FALSE) %>%
    column_spec(1, bold = TRUE) 
}
```


::: {.answer data-latex=""}
**Answer to \@ref(exr:OddsRatiosCITurbines)**:
**1.** Table \@ref(tab:TurbinesORData).
**2.** Table \@ref(tab:TurbinesORSummary).
**3.** OR: Odds of a 1800-hr turbine getting a fissure is 0.389 times the odds of a 3000-hr turbine getting a fissure.
**4.** CI from 0.133 to 1.14. 
       Plausible values for the population OR that may have produced the sample OR likely to be between these values.
:::


::: {.answer data-latex=""}
**Answer to \@ref(exr:OddsRatiosCIEarInf)**:
The output can be interpreted in one of two ways (Sect. \@ref(CIOddsRatiosSummaries)):
  
* Odds are the odds of swimming at the beach; OR compares these odds between those without an ear infection, to those with an ear infection.
* Odds are the odds of *not* having an ear infection; OR compares these odds for beach swimmers to non-beach swimmers.

:::

<!-- library(epitools) -->
<!-- oddsratio.wald(TurbineCITable) -->

```{r TurbinesORData}
tab.cap <- "The number of fissures for two sets of turbines, run for different numbers of hours"
TurbineCITable <- array(dim = c(3, 3) )

TurbineCITable[1, ] <- c(7, 66, 73)
TurbineCITable[2, ] <- c(9, 33, 42)
TurbineCITable[3, ] <- c(16, 106, 122)

colnames(TurbineCITable) <- c("Fissures", 
                              "No fissures",
                              "Total")
rownames(TurbineCITable) <- c("About 1800 hours", 
                              "About 3000 hours",
                              "Total")

if( knitr::is_latex_output() ) {
  kable(TurbineCITable, 
        format = "latex",
        longtable = FALSE,
        col.names = colnames(TurbineCITable),
        booktabs = TRUE,
        #linesep = c("\\addlinespace",  # Otherwise adds a space after five lines... 
        align = c("r", "r", "r"),
        caption = tab.cap) %>%
    column_spec(1, bold = TRUE) %>%
    kable_styling(font_size = 9)
}
if( knitr::is_html_output() ) {
  out <- kable(TurbineCITable, 
               format = "html",
               longtable = FALSE,
               col.names = colnames(TurbineCITable),
               booktabs = TRUE,
               align = c("r", "r", "r"),
               caption = tab.cap) %>%
    kable_styling(full_width = FALSE) %>%
    column_spec(1, bold = TRUE) 
}
```


```{r TurbinesORSummary}
tab.cap <- "The numerical summary for the fissures data"
TurbineCISumTable <- array(dim = c(3, 3) )

TurbineCISumTable[1, ] <- c("0.1061", 
                            "9.59%", 
                            "73")
TurbineCISumTable[2, ] <- c("0.2727", 
                            "21.43%", 
                            "42")
TurbineCISumTable[3, ] <- c("0.389", 
                            "", 
                            "")

colnames(TurbineCISumTable) <- c("Odds with fissures",
                                 "Percentage with fissures", 
                                 "Sample size")
rownames(TurbineCISumTable) <- c("About 1800 hours", 
                                 "About 3000 hours",
                                 "Odds ratio")

if( knitr::is_latex_output() ) {
  kable(TurbineCISumTable, 
        format = "latex",
        longtable = FALSE,
        col.names = colnames(TurbineCISumTable),
        booktabs = TRUE,
        #linesep = c("\\addlinespace",  # Otherwise adds a space after five lines... 
        align = c("r", "r", "r"),
        caption = tab.cap) %>%
    column_spec(1, bold = TRUE) %>%
    kable_styling(font_size = 9)
}
if( knitr::is_html_output() ) {
  out <- kable(TurbineCISumTable, 
               format = "html",
               longtable = FALSE,
               col.names = colnames(TurbineCISumTable),
               booktabs = TRUE,
               align. = c("r", "r", "r"),
               caption = tab.cap) %>%
    kable_styling(full_width = FALSE) %>%
    column_spec(1, bold = TRUE) 
}
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OddsRatiosCIAugustRainfall)**:
Odds of no rainfall (non-positive SOI): $14/40 = 0.35$.
Odds of no rainfall (negative SOI): $7/53 = 0.1320755$.

Required OR is $0.35/0.1320755 = 2.65$, as in output.
95% CI from 0.979 to 7.174.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CIOddsRatioSunglasses)**:
The 95% CI is from 0.151 to 0.408.
The OR of not wearing a hat, comparing males to females (males*less* likely to be *not* wearing a hat; rewording, males *more* likely to be wearing a hat).
:::


## Answers: Estimating sample sizes {#EstimatingSampleSizeAnswer}

Answers to exercises in Sect. \@ref(EstimatingSampleSizeExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SampleSizeProp1)**:

**1.** Use $\displaystyle n = \frac{1}{0.04^2} = 625$; 625 needed.
**2.** Use $\displaystyle n = \frac{1}{0.02^2} = 2500$; 2500 needed. That is *four times as many* as when the margin-of-error was 0.04.
**3.** Use $\displaystyle n = \frac{1}{0.01^2} = 10,00$; 10,000 needed. That is *sixteen times as many* as when the margin-of-error was 0.04.
**4.** To get an estimate *half* as wide, we need *four* times as many units of analysis in the sample.
**5.** To get an estimate *a quarter* as wide, how many *sixteen* more units of analysis in the sample.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SampleSizePropEating)**:

**1.** Use $\displaystyle n = \left(\frac{1}{0.01} \right)^2 = 10000$; 10,000 needed.
**2.** Use $\displaystyle n = \left(\frac{1}{0.02} \right)^2 = 2500$; 2,500 needed.
**3.** Use $\displaystyle n = \left(\frac{1}{0.10} \right)^2 = 100$; 1000 needed.
**4.** To have the same give-or-take, but to be *more* confident that this interval contained the value of $\mu$, we would need a *larger* sample.
**5.** Probably is expensive (both time *and* money), so 10,000 and 2500 are both probably unrealistic without lots of funding.

:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:SampleSizeMeanLungCapacity)**:

Using $s = 0.43$:

**1.** Use $\displaystyle n = \left(\frac{2\times 0.43}{0.02} \right)^2 = 1849$; 1849 needed.
**2.** Use $\displaystyle n = \left(\frac{2\times 0.43}{0.05} \right)^2 = 295.85$; 296 needed.
**3.** Use $\displaystyle n = \left(\frac{2\times 0.43}{0.10} \right)^2 = 73.96$; 74 needed.
**4.** Probably is expensive (both time *and* money), so 74 probably the most realistic.
:::


## Answers: Tests for one proportion {#TestOneProportionAnswer}

Answers to exercises in Sect. \@ref(OneProportionTestExercises).

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneProportionTestExercisesPlacebos).**

**1.** One-in-five; or 0.2.
**2.** $H_0$: $p = 0.2$; $H_1$: $p > 0.2$.
**3.** One-tailed.
**4.** $\text{s.e.}(\hat{p}) = 0.044444$; so draw a normal distribution centred on $0.2$, with a standard deviation of $\text{s.e.}(\hat{p}) = 0.0444$.
**5.** $\hat{p} = 50/81 = 0.6173$; $z = (0.6173 - 0.2)/0.04444 = 9.39$.
   This is *very* large, so the $P$-value will be *very small*:
   Very strong evidence to support the alternative hypothesis that people do better-than-guessing at identifying the placebo.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneProportionTestExercisesEPL).**
$H_0$: $p = 0.5$ and $H_1$: $p > 0.5$ (which is *one*-tailed).
$\hat{p} = 91/(91 + 67) = 0.5759$ and $n = 91 + 67 = 158$.
$\text{s.e.}(\hat{p}) = 0.03977786$, so $z = 1.91$.
Using the normal-distribution tables, the one-tailed $P$-value is 0.0281; there is slight evidence in the sample of a home-ground advantage in the population.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneProportionTestExercisesPedalMachines).**
$H_0$: $p = 0.0602$ and $H_1$: $p < 0.602$ (which is *one*-tailed).
$\hat{p} = 295/589 = 0.5008489$ and $n = 589$, so $\text{s.e.}(\hat{p}) = 0.0201689$, so $z = -5.015$.
This is a very large $z$-score, so the $P$-value will be very small.
Strong evidence exists that the proportion of females using the machines was lower than the proportion of females in the university population.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneProportionTestExercisesCasinos).**
$H_0$: $p = 0.255$ and $H_1$: $p \ne 0.255$.
$\hat{p} = 88/357 = 0.2464986$ and $n = 357$, so $\text{s.e.}(\hat{p}) = 0.02306822$, so $z = -0.368533$.
This is very small; the $P$-value will be large.
No evidence exists that the proportion of smokers is different for casino-goers compared to the general U.S. population.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneProportionBreadfruitPasta).**
$H_0$: $p = 0.5$ and $H_1$: $p > 0.5$ (one-tailed).
$\hat{p} = 0.8028169$ and $n = 71$, so $\text{s.e.}(\hat{p}) = 0.05933908$, so $z = 5.10$.
This is very large; the $P$-value will be very small.
Strong evidence exists that the majority of people like breadfruit pasta (for the population that the sample represents anyway).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneProportionTestExercisesCTS).**
$H_0$: $p = 0.15$ and $H_1$: $p \ne 0.15$.
$\hat{p} = 33/516 = 0.06395349$ and $n = 516$, so $\text{s.e.}(\hat{p}) = 0.01571919$, so $z = -5.473$.
This is very large; the $P$-value will be very small.
Strong evidence exists that the proportion of people with CTS with a PL tendon absent is different for people with CTS.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneProportionTestExercisesBorers).**
$H_0$: $p = 1/16 = 0.625$; $H_1$: $p \ne 0.0625$.
With $n = 172$, find $\hat{p} = 24/172 = 0.1395349$ and $\text{s.e.}(\hat{p}) = 0.01845701$ and so $z = 26.3$, wich is *massive*; the $P$-value will be incredibly small.
Very strong evidence that the population proportion is not $1/16$, and the borers were *not* resistant.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:OneProportionTestExercisesLEDlights).**
$H_0$: $p = 0.5$ and $H_1$: $p \ne 0.5$.
Use $n = 52$ and so $\hat{p} = 29/52 = 0.5576923$ and $\text{s.e.}(\hat{p}) = 0.06933752$, giving $z = -0.8320503$, which is not large at all so the $P$-value will be 'large'.
There is no evidence to suggest that choice in non-random.
:::


## Answers: Tests for one mean {#TestOneMeanAnswer}

Answers to exercises in Sect. \@ref(TestOneMeanAnswerExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestOneMeanExercisesSleep).**

$H_0$: $\mu = 10$ (or $\mu \ge 10$) and $H_1$: $\mu < 10$.

**Females**: $\text{s.e.}(\bar{x}) = 0.05924742$; $t = -25.32$ which is *huge*, so $P$-value is extremely small.


**Males**: $\text{s.e.}(\bar{x}) = 0.0700152$; $t = -19.42$ which is *huge*, so $P$-value is extremely small.

For both boys and girls, very strong evidence exists (girls: $t = -25.32$; boys: $t = -19.42$; $P < 0.001$ for both) that the mean sleep time one weekend is less than ten hours (girls: mean $8.64$ hours; boys: $8.50$ hours).
:::


<!-- ::: {.answer data-latex=""} -->
<!-- **Answer to Exercise \@ref(exr:TestOneMeanExercisesEnergyIntake).** -->

<!-- **1.** $H_0$: $\mu=7725$; $H_1$: $\mu\ne7725$ (two tailed). -->
<!-- **2.** $\bar{x} = 6753.64$ and $\text{s.e.}(\bar{x}) = s/\sqrt{n} = 1142.123/\sqrt{11} = 344.363$. -->
<!-- **3.** $t = (6753.64 - 7725)/344.363 = -2.821$, as in output.  -->
<!--        This 'large'; expect small $P$-value; software confirms this: two-tailed $P=0.018$. -->
<!-- **4.** Moderate evidence ($P = 0.018$) that the *mean* energy intake is not meeting the recommended daily energy intake (mean: 6753.6kJ; std. dev.: 1142.1kJ).   -->
<!-- ::: -->


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestOneMeanExercisesToothbrushing)**:
$H_0$: $\mu = 120$ and $H_1$: $\mu \ne 120$ (two-tailed), where $\mu$ is the mean time *in seconds*.
Standard error: $\text{s.e.}(\bar{x}) = 23.8/\sqrt{85} = 2.581472$. 
$t$-score: $(60.3 - 120)/2.581472 = -23.13$, which is *huge*; $P$-value will be *really* small.

*Very* strong evidence ($P<0.001$) that children do not spend 2 minutes (on average) brushing their teeth (mean: 60.3s; std. dev.: 23.8s).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestOneMeanExercisesAutomatedVehicles)**:
$H_0$: $\mu=50$ and $H_1$: $\mu>50$ (one-tailed), where $\mu$ is the mean mental demand.
Standard error: $\text{s.e.}(\bar{x}) = 22.05/\sqrt{22} = 4.701076$. 
$t$-score: $(84 - 50)/4.701076 = 7.23$, which is very large; $P$-value will be very small.

*Very* strong evidence ($P < 0.001$) that the mean mental demand is *greater* than 50.
(Notice we say *greater* than, because of the RQ and the alternative hypothesis.)
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestOneMeanExercisesQualityOfLife)**:
*Physical:* $t = -1.28$; *Mental:* $t = 1.80$. 
The $P$-values both larger than 5%.

No evidence that the mean score for patients is different than the general population score.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestOneMeanExercisesCherryRipes)**:
$H_0$: $\mu = 12$ and $H_1$: $\mu \ne 12$ (two-tailed), where $\mu$ is the mean weight in grams.
Standard error: $\text{s.e.}(\bar{x}) = 0.60652/\sqrt{43} = 0.09249343$. 
$t$-score: $(14.9577 - 12)/0.09249343 = 31.98$, which is **huge**; $P$-value will be very small.

*Very* strong evidence ($P < 0.001$) that the mean weight of a Fun Size Cherry Ripe bar is not 12 grams (mean: 14.9577; std. dev.: 0.067g), and they may be larger.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestOneMeanBloodLoss)**:
$H_0$: $\mu = 1000$ and $H_1$: $\mu \ne 1000$, where $\mu$ is the population mean guess of the spill volume.
Standard error: 46.15526.
$t$-score: $(846.4 - 1000)/46.15526 = -3.33$, which is very large (and negative), so the $P$-value will be very small.

Very strong evidence that the mean guess of blood volume is not 1000\,ml, the actual value.
The sample is much larger than 25: the test is statistically valid.
:::


```{r}
QC <- structure(list(High1 = c(61.63, 63.11, 66.88, 62.56, 66.12, 65.34, 
64.83, 64.22, 65.54, 65.33, 67.83, 65.97, 65.01, 64.51, 64.3, 
64.38, 64.75, 63.12, 62.93, 63.59, 67.04, 64.38, 63.41, 64.19, 
64.86, 63.7, 63.14, 66, 63.43, 64.25, 63.67, 64.78, 58.02, 63.46, 
65.2, 63.74), Mid1 = c(18.36, 18.77, 18.98, 17.97, 19.69, 19.63, 
19.5, 19.39, 19.87, 19.66, 20.41, 18.37, 19.09, 18.98, 19.22, 
18.99, 19.58, 19.7, 19, 19.11, 19.97, 19.52, 19.39, 17.2, 19.75, 
19.35, 19.95, 19.38, 19.33, 19, 19.05, 19.24, 19.29, 19.29, 19.18, 
19.35), High2 = c(62.64, 64.36, 66.06, 65.39, 66.85, 65.56, 66.6, 
66.9, 65.5, 65.92, 65.51, 65.03, 64.22, 64.19, 64.95, 64.95, 
64.45, 64.77, 65.58, 64.87, 63.52, 65.02, 66.05, 63.82, 64.58, 
65.09, 64.86, 64.85, 66.75, 65.37, 64.29, 65.5, 62.85, 64, 63.72, 
64.3), Mid2 = c(19.12, 19.07, 19.58, 19.35, 19.65, 19.13, 19.55, 
19.85, 19.45, 19.87, 19.73, 19.6, 20.76, 18.98, 19.93, 19.15, 
19.23, 19.2, 19.52, 19.38, 19.08, 18.89, 19.57, 19.35, 19.13, 
19.44, 19.27, 19.58, 19.71, 19.64, 19.14, 19.28, 19.68, 18.27, 
18.75, 19.57)), .Names = c("High1", "Mid1", "High2", "Mid2"), class = "data.frame", row.names = c(NA, 
-36L))

DataAndTargets <- array(dim = c(3, 4))
DataAndTargets[1, ]<- c(64.31, 
                        19.24, 
                        64.97, 
                        19.40)
DataAndTargets[2, ]<- c(1.700, 
                        0.588, 
                        1.029, 
                        0.413)
DataAndTargets[3, ]<- c(64.22, 
                        19.01, 
                        65.05, 
                        19.45)

n <- length(QC$High1)

rownames(DataAndTargets) <- c("Mean of data", 
                              "Std. dev. of data", 
                              "Pre-determined target")
colnames(DataAndTargets) <- c("High (Instrument 1)", 
                              "Mid (Instrument 1)", 
                              "High (Instrument 2)", 
                              "Mid (Instrument 2)")

ts <- ( DataAndTargets[1,] - DataAndTargets[3,] ) / ( DataAndTargets[2,]/sqrt(n))
```

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestOneMeanQualityControl)**:
Hypotheses have the form $H_0$: $\mu = \text{pre-determined target}$, and $H_1$: $\mu \ne \text{pre-determined target}$.
$t$-scores: $t_1 = 0.318$, $t_2 = 2.347$, $t_3 = -0.466$, $t_4 = -0.726$.
$P$-values will be large, except for second test.

No evidence that the instruments are dodgy, except perhaps for the first instrument for mid-level LH concentrations.
Should be statistically valid.

While assessing the means is useful, how *variable* the measurements are is also useful (but beyond us).
:::


## Answers: More about hypothesis tests {#MoreAboutTestsAnswer}

Answers to exercises in Sect. \@ref(MoreAboutTestsExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MoreAboutExercisesApproximatingPValues)**:
Using the [68--95--99.7 rule](#def:EmpiricalRule):
**1.** Very small; certainly less than 0.003 (99.7% between -3 and 3).
**2.** Very small; bit bigger than 0.003  (99.7% between -3 and 3).
**3.** Large-ish: Between 0.32 (68% between 1 and -1) and 5% (95% between -1 and 1), but closer to 0.32.
**4.** Bit smaller than 0.32 (68% between -1 and 1).
**5.** *Very* large! Almost 0.50.
**6.** *Very* small; *much* smaller than 0.003.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MoreAboutExercisesApproximatingPValuesOneTailed)**:
The answers are *half* the values given to Exercise \@ref(exr:MoreAboutExercisesApproximatingPValues).
Using the [68--95--99.7 rule](#def:EmpiricalRule):
**1.** Very small; certainly less than 0.0015 (99.7% between -3 and 3).
**2.** Very small; bit bigger than 0.0015  (99.7% between -3 and 3).
**3.** Large-ish: Between 0.16 (68% between 1 and -1) and 2.5% (95% between -1 and 1), but closer to 0.16.
**4.** Bit smaller than 0.16 (68% between -1 and 1).
**5.** *Very* large! Almost 0.25.
**6.** *Very* small; *much* smaller than 0.0015.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MoreAboutTestsInterpretingResults)**:
Using the 68--95--99.7 rule, the $P$-value is *just under* 0.05, and hence 'small'.
If the $t$-score was 0.0499, the $P$-value would be *just larger* than 0.05 and hence 'big'.

The difference between 0.0501 and 0.0499 is trivial though...  it is silly to jump from 'evidence supports the alternative hypothesis!' to the complete opposite conclusion 'evidence doesn't support the alternative hypothesis!' over such a minor difference.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MoreAboutTestsInterpretingHypotheses)**:
**1.** Hypotheses are about *population* parameters like $\mu$, not *sample* statistics like $\bar{x}$. 
**2.** Hypotheses are about parameters like $\mu$, not statistics like $\bar{x}$. 
       The value of 36.8051 is a sample mean, but hypothesis are meant to be written *before* the data are collected. 
       In any case, these hypotheses are asking to test if the *sample* mean is 36.8051... which we *know* it is.
**3.** 36.8051 is a sample mean, but hypothesis are meant to be written down *before* the data are collected.
**4.** 36.8051 looks like a sample mean, but hypothesis are meant to be written down *before* the data are collected. 
**5.** Hypotheses are about parameters like $\mu$, not statistics like $\bar{x}$.
**6.** This would be fine, if the RQ was one-tailed... but it is two-tailed.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MoreAboutTestsConclusions)**:
**1.** The conclusion is about the **mean** energy intake (*population* mean energy intake specifically).
**2.** Conclusions are *never* about sample statistics. 
       We want to know what the *statistic* that tells us about the *population* parameter.
**3.** The conclusion is about the population **mean** energy intake.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:MoreAboutTestsBatteries)**:
**1.** *No* evidence of a difference in lifetime between the two brands, as the $P$-value is "large".
**2.** No. The difference is 0.29 hours, or about 17 minutes. 
       A difference of 17 minutes in over 5 hours of use is trivial.
**3.** Conclusion: no evidence of a difference between the mean lifetimes. 
       That's cumbersome for advertising. 
       A common advertising trick: "No other battery lasts longer!"... 
       meaning there is no evidence of a difference in means.
**4.** Price!
:::


## Answers: Tests for paired means {#TestPairedMeansAnswer}

Answers to exercises in Sect. \@ref(TestPairedMeansExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestPairedMeansTasteOfBrocolli)**:
$H_0$: $\mu_d = 0$ and $H_1$: $\mu_d>0$: *differences* are positive when the dip rating is better than the raw rating.
$t = (5.2 - 0)/3.06 = 1.699$; the *approximate* one-tailed $P$-value, from using the 68--95--99.7 rule, is somewhere between 16% and 2.5%. 

So we cannot be sure if the $P$-value is larger than 0.05... but it is likely that it is (since the calculated $t$-score is quite a long distance from $z=1$).
The evidence *probably* doesn't support the alternative hypothesis.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestPairedMeansCaptopril)**:
**1.** Because it is the blood pressure *reduction*, and a reduction is what the drug is meant to produce, so expect the reductions to be positive numbers. 
**2.** Differences shown below.
**3.** Histogram of *differences*: Fig. \@ref(fig:SBPHistDiffs).
**4.** $H_0$: $\mu_d = 0$ and $H_1$: $\mu_d>0$ (because the differences are *reductions*).
**5.** $t=8.12$.
**6.** $P = 0.001\div 2 = 0.0005$ (one-tailed test).
**7.** Very strong evidence ($P=0.0005$) that the drug reduces the average systolic blood pressure (mean reduction: 8.6 mm Hg) in the population.
:::

```{r}
blood <- structure(list(Patient = c(1L, 2L, 3L, 4L, 5L, 6L, 7L, 8L, 9L, 
10L, 11L, 12L, 13L, 14L, 15L, 1L, 2L, 3L, 4L, 5L, 6L, 7L, 8L, 
9L, 10L, 11L, 12L, 13L, 14L, 15L), Before = c(210L, 169L, 187L, 
160L, 167L, 176L, 185L, 206L, 173L, 146L, 174L, 201L, 198L, 148L, 
154L, 130L, 122L, 124L, 104L, 112L, 101L, 121L, 124L, 115L, 102L, 
98L, 119L, 106L, 107L, 100L), After = c(201L, 165L, 166L, 157L, 
147L, 145L, 168L, 180L, 147L, 136L, 151L, 168L, 179L, 129L, 131L, 
125L, 121L, 121L, 106L, 101L, 85L, 98L, 105L, 103L, 98L, 90L, 
98L, 110L, 103L, 82L), BP = structure(c(2L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L), .Label = c("D", "S"), class = "factor")), .Names = c("Patient", 
"Before", "After", "BP"), class = "data.frame", row.names = c(NA, 
-30L))

blood$Differences <- blood$Before - blood$After

bloodS <- subset(blood, 
                 BP == "S")
bloodS <- bloodS[, c("Before", 
                     "After", 
                     "Differences")]
```

```{r SBPHistDiffs, fig.cap="A histogram of the systolic blood pressure reductions (in mm Hg)", fig.align="center", fig.width=5, fig.height=3}
hist(bloodS$Differences,
     breaks = seq(0, 40, by = 10),
     main = "SBP reductions",
     xlim = c(-5, 35),
     las = 1,
     xlab = "SBP reduction (in mm Hg)",
     col = plot.colour)
box()
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestPairedMeansSmokingAndExercise)**:
$H_0$: $\mu_d = 0$ and $H_1$: $\mu_d > 0$, where *differences* are positive when the intention to smoke is reduced after exercise.
$t = (0.66 - 0)/0.37 = 1.78$; $P$-value larger than 0.05: the evidence doesn't support the alternative hypothesis.

No evidence ($P > 0.05$) that the mean 'intention to smoke' reduced after exercise in women (mean change in intention to smoke: -0.66; std. error: 0.37).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestPairedMeansFerritin)**:
$H_0$: $\mu_d = 0$ and $H_1$: $\mu_d > 0$, where *differences* refer to the *reduction* in ferritin. 
$\bar{d} = -424.25$ and $s = 2092.693$ and $n = 20$, so $t = -0.90663$.
$t$ is 'small'; $P > 0.05$ (actually $P = 0.376$): the evidence doesn't support the alternative hypothesis.
Since $n < 25$, the test may not be statistically valid (the histogram of data (Fig. \@ref(fig:FerritinGraph)) suggests that the population *might* have a normal distribution), though the $P$-value is very large so it probably makes little difference.
:::

```{r FerritinGraph, fig.align="center", fig.cap="A histogram of the change in ferritin concentration", fig.width=5, height=2}
FR <- structure(list(September = c(6630L, 4590L, 3510L, 6375L, 2500L, 
1400L, 4580L, 6885L, 4200L, 5600L, 5360L, 6110L, 5300L, 3120L, 
3300L, 11400L, 3100L, 2800L, 3500L, 12700L), March = c(5100L, 
3510L, 6600L, 8000L, 2800L, 2860L, 3640L, 9030L, 4420L, 7910L, 
6780L, 7250L, 6000L, 4300L, 4680L, 8500L, 3735L, 2730L, 6600L, 
7000L), Reduction = c(1530L, 1080L, -3090L, -1625L, -300L, -1460L, 
940L, -2145L, -220L, -2310L, -1420L, -1140L, -700L, -1180L, -1380L, 
2900L, -635L, 70L, -3100L, 5700L)), .Names = c("September", "March", 
"Reduction"), class = "data.frame", row.names = c(NA, -20L))

hist(FR$Reduction, 
     breaks = seq(-4500, 6000, by = 1500),
     col = plot.colour,
     main = "Histogram of ferritin reduction",
     las = 1,
     ylim = c(0, 10),
     axes = FALSE,
     xlab = "Reduction (micrograms per litre)")
axis(side = 1,
     at = seq(-4500, 6000, by = 1500))
axis(side = 2, 
     las = 1)
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:StressSurgeryHT).**
$H_0$: $\mu_D = 0$ and $H_1$: $\mu_D > 0$, provided we define the difference as the beta-endorphin concentration 10 minutes before surgery, minus the concentration 12--14 hours before surgery.
The $t$-score is $(\bar{d} - \mu_d)/{\text{s.e.}(\bar{d})} = (7.70 - 0)/{3.10} = 2.48$, which is quite large.
Using the 68--95-99.7 rule, we would expect a small **one-tailed** $P$-value (certainly less than 0.025).
jamovi reports a *two*-tailed $P$-value of 0.023, so a *one*-tailed $P$-value of $0.023/2 = 0.0115$.
Conclude (where the CI was [found earlier](#StressSurgery)) that strong evidence exists in the sample (paired $t = 2.48$; one-tailed $P = 0.0115$) of a population mean increase in beta-endorphin concentrations from 12--14 hours before surgery to 10 minutes before surgery (95% CI: 1.62 to 13.78 fmol/mol.)

The sample size is $n = 19$, just less than 25, so the results may not be statistically valid (but shouldn't be too bad).
:::


## Answers: Tests for two means {#TestTwoMeansAnswer}

Answers to exercises in Sect. \@ref(TestTwoMeansExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TwoSampleHTWhalesHT).**

**1.** The mean length of female gray whales *minus* the mean length of male gray whales.
(Either direction is fine, provided you are consistent.)
**2.** $H_0: \mu_F - \mu_M = 0$ and $H_1: \mu_F - \mu_M \ne 0$.
**3.** $t = [(\bar{x}_F - \bar{x}_M) - (\mu_F - \mu_M)]/{\text{s.e.}(\bar{x}_F - \bar{x}_M)} = (0.06 - 0)/0.0929 = 0.65$.
This is a *very* small $t$-score, so the $P$-value will be very large.
**4.** `There is no evidence ($t = 0.65$; two-tailed $P > 0.10$) in the sample that the mean length of adult gray whales is different in the population for females (mean: 12.70\ m; standard deviation: 0.611\ m) and males (mean: 12.07\ m; standard deviation: 0.705\ m; $95$% CI for the difference: $-1.26$\ m to $0.246$\ m).'
**5.** Since both sample sizes are large, the test is statistically valid.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestTwoMeansNHANES)**:
$H_0$: $\mu_S - \mu_{NS} = 0$ and $H_1$: $\mu_S - \mu_{NS} \ne 0$.
From output: $t = 5.478$ and $P < 0.001$.
Very strong evidence to support $H_1$.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestTwoMeansFitnessOfParamedics)**:
**1.** Table \@ref(tab:ParamedicsTestAnswerTable).
**2.** $H_0$: $\mu_C - \mu_{SO} = 0$ and $H_1$: $\mu_C - \mu_{SO} \ne 0$. 
       Then $t=( (51 - 56) - 0 )/3.3044 = -1.513$;  $P$-value larger than 5%. 
       Sample size are small; test may not be statistically valid.
**3.** $H_0$: $\mu_C - \mu_{SO} = 0$ and $H_1$: $\mu_C - \mu_{SO} \ne 0$. 
       Then $t=( (36 - 47) - 0 )/4.0689 = -2.70$; $P$-value smaller than 5%. 
       Sample size are small; the test may not be statistically valid.
:::


```{r ParamedicsTestAnswerTable}
tab.cap <- "The physical profile of conventional and special operation paramedics in Western Australia"

ParaTable <- array(dim = c(7, 2) )
ParaTable[1, ] <- c("11", 
                    "11")
ParaTable[2, ] <- c("51", 
                    "56")
ParaTable[3, ] <- c("8", 
                    "9")
ParaTable[4, ] <- c("1.86", 
                    "2.71")
ParaTable[5, ] <- c("36", 
                    "47")
ParaTable[6, ] <- c("10", 
                    "11")
ParaTable[7, ] <- c("2.36", 
                    "3.3")


colnames(ParaTable) <- c("Conventional",
                         "Special Operations")
rownames(ParaTable) <- c("Sample size",
                         "Mean", 
                         "Standard deviation", 
                         "Standard error",
                         "Mean", 
                         "Standard deviation", 
                         "Standard error")
if( knitr::is_latex_output() ) {
  kable(ParaTable, 
        format = "latex",
        longtable = FALSE,
        col.names = colnames(ParaTable),
        booktabs = TRUE,
        align = c("r", "r"),
        caption = tab.cap) %>%
    row_spec(0, bold = TRUE) %>%
  pack_rows("Grip strength (in kg)", 2, 4, 
             bold = TRUE, 
             italic = FALSE) %>%
  pack_rows("Push-ups (per minute)", 5, 7, 
            bold = TRUE, 
            italic = FALSE) %>%
    kable_styling(font_size = 9)
}
if( knitr::is_html_output() ) {
  out <- kable(ParaTable, 
               format = "html",
               longtable = FALSE,
               col.names = colnames(ParaTable),
               booktabs = TRUE,
               align = c("r", "r"),
              caption = tab.cap) %>%
    kable_styling(full_width = FALSE) %>%
    row_spec(0, bold = TRUE) %>%
    pack_rows("Grip strength (in kg)", 2, 4, 
               bold = TRUE, 
               italic = FALSE) %>%
    pack_rows("Push-ups (per minute)", 5, 7, 
               bold = TRUE, 
               italic = FALSE)
}
```


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestTwoMeansBodyTemperature)**:
$H_0$: $\mu_M - \mu_{F} = 0$ and $H_1$: $\mu_M - \mu_{F} \ne 0$.
From output, $t = -2.285$; (two-tailed) $P$-value is 0.024.
Moderate evidence to support $H_1$.

Moderate evidence ($P = 0.024$) that the mean internal body temperature is different for females (mean: $36.886^{\circ}\text{C}$) and males (mean: $36.725^{\circ}\text{C}$).

The difference between the means, of 0.16 of a degree, is hardly of any *practical* importance in everyday use.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestTwoMeansDental)**:
**1.** $H_0$: The means are equal: $\mu_I = \mu_{NI}$ or $\mu_I - \mu_{NI} = 0$.
       $H_1$: The means are not equal: $\mu_I \ne \mu_{NI}$ or $\mu_I - \mu_{NI} \ne 0$.
**2.** CI from -22.54 to -11.95: the mean sugar consumption between 11.95 and 22.54 kg/person/year *greater* in industrialised countries. 
**3.** Very strong evidence in the sample ($P<0.001$) that the mean annual sugar consumption per person is different for industrialised (mean: 41.8 kg/person/year) and non-industrialised (mean: 24.6 kg/person/year) countries (95% CI for the difference 11.95 to 22.54).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:FacePlantHT).**

**1.** Either direction is fine; the amount by which younger ($Y$) women can lean further forward is $\mu_Y - \mu_O$.
**2.** One-tailed, according to the RQ.
**3.** $H_0$: $\mu_Y - \mu_O = 0$ or $\mu_Y = \mu_O$. $H_1$: $\mu_Y - \mu_O > 0$ or $\mu_Y > \mu_O$.
**4.** $t = 6.69$ (from the *second row*); $P < 0.001/2$ since one-tailed; i.e., $P < 0.0005$.
**5.** Very strong evidence exists in the sample ($t = 6.691$;  one-tailed $P < 0.0005$) that the population mean one-step fall recovery angle for healthy women is  *greater* for young women (mean: $30.7^\circ$; std. dev.: $2.58^\circ$; $n = 10$) compared to older women (mean: $16.20^\circ$; std. dev.: $4.44^\circ$; $n = 5$; 95% CI for the difference: $9.1^\circ$ to $19.9^\circ$).
:::


## Answers: Tests for odds ratios {#TestsOddsRatioAnswer}


Answers to exercises in Sect. \@ref(TestsOddsRatioExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestsOddsSandflies)**:
The missing entries: Odds: 1.15; Percentage: 58.1%.

$\chi^2 = 4.593$; approximately $z = \sqrt{4.593/1} = 2.14$; expect small $P$-value.
Software gives $P = 0.032$.
Evidence that the difference between the sample proportions is unlikely to be due to sampling variation.
The test is statistically valid.

> The sample provides *moderate evidence* ($\text{chi-square}=4.593$; two-tailed $P = 0.032$) that the *population* odds of finding a male sandfly in eastern Panama is different at 3 feet above ground (odds: 1.15) compared to 35 feet above ground (odds: 1.71; OR: $0.67$; 95% CI\ from $0.47$ to $0.97$).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestsOddsRatioScars)**:
One option: $H_0$: The population OR is one; $H_1$: The population OR is not one.
From software, $\chi^2 = 0.667$; $P = 0.414$, which is large.
No evidence ($P = 0.414$) that the odds of having a smooth scar is different for women and men (chi-square: 0.667).
The test is statistically valid.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestsOddsRatioAugustRainfallInEmerald)**:
One option: $H_0$: The population OR is one; $H_1$: The population OR is not one.
From software, $\chi^2 = 3.845$; $P = 0.050$.

Moderate evidence ($P = 0.05$) that the odds of having no rainfall is different for non-positive SOI Augusts and negative-SOI Augusts (chi-square: 3.845).
The test is statistically valid.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestOddsRatioSunglasses)**:
**1.** $22/366 \times 100 = 6.0$%.
**2.** $79/386 \times 100 = 20.5$%.
**3.** $22/344 = 0.06395349$, or about 0.0640.
**4.** $79/307 = 0.257329$, or about 0.257.
**5.** $0.257/0.0640 = 4.02$.
**6.** $0.0640/0.257= 0.249$.
**7.** From $0.151$ to $0.408$.
**8.** $\chi^2 = 33.763$% (approximately $z = 5.81$) and $P < 0.001$. 
**9.** Strong evidence ($P < 0.001$; $\chi^2 = 33.763$; $n = 752$) that the odds of wearing hat is different for males (odds: 0.257) and females (odds: 0.0640; OR: 0.249, 95% CI from 0.151 to 0.408).
**10.** Yes.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TestOddsRatioPedestrianPhones)**:
**1.** Low exposure (in order): 73.7%, 72.5%, 85.6%. 
       High exposure (in order): 26.3%, 27.5%, 14.4%.
**2.** In order: 2.80, 2.64, 5.92.
**3.** Various ways; probably the easiest: $H_0$: No association between level of exposure and type of interaction (in the population).
**4.** Table \@ref(tab:PhoneTableExpected).
**5.** Approximately $z=\sqrt{20.923/2} = 3.23$: expect small $P$-value.
**6.** Very strong evidence in the sample of an association between level of exposure and type of interaction in the population ($\chi^2 = 20.923$; $P < 0.001$).
:::

```{r PhoneTableExpected}
PhoneTable <- array( dim = c(2,3))
PhoneTable[1, ] <- c(263, 259, 302)
PhoneTable[2, ] <- c(94, 98, 51)
colnames(PhoneTable) <- c("Answer call", 
                          "Respond to text", 
                          "Reply to email")
rownames(PhoneTable) <- c("Low exposure", 
                          "High exposure")

if( knitr::is_latex_output() ) {
  kable(round(chisq.test(PhoneTable)$expected, 2),
        format = "latex",
        longtable = FALSE,
        booktabs = TRUE,
        digits = 2,
        col.names = c("Answer call", 
                      "Respond to text", 
                      "Reply to email"),
        caption = "The expected counts for the phone-use data"
      ) %>%
   kable_styling(full_width = FALSE, 
                 font_size = 9)
}
if( knitr::is_html_output() ) {
  out <- kable(round(chisq.test(PhoneTable)$expected, 2),
               format = "html",
               longtable = FALSE,
               booktabs = TRUE,
               digits = 2,
               col.names = c("Answer call",
                             "Respond to text", 
                             "Reply to email"),
               caption = "The expected counts for the phone-use data"
      ) %>%
    kable_styling(full_width = FALSE)
}
```


## Answers: Relationships between two quantitative variables {#TwoQuantAnswer}

Answers to exercises in Sect. \@ref(TwoQuantExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TwoQuantExercisesWindmill)**:
Non-linear; higher wind speed related to higher DC output (in general); a small to moderate amount of variation.
The DC output increases as wind speed increases, but not linearly.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TwoQuantExercisesMandible)**:
The relationship probably linear... but a few observations at the top right look a bit different. 
Variation seems to increase a little as the Age increases.
Of note: A few observations in the top right of the scatterplot seem to not follow the linear relationship.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:TwoQuantExercisesSoftdrink)**:
Approximately linear; positive relationship; variation seems to get larger for a larger number of cases.
:::


## Answers: Correlation {#CorrelationAnswer}

Answers to exercises in Sect. \@ref(CorrelationExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CorrelationExerciseDrawNegR)**:
Many correct answers.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CorrelationExercisePunting)**:
**1.** $R^2 = 0.881^2 = 77.6$%. 
       About 77.6% of the variation in punting distance can be explained by the variation in right-leg strength.
**2.** $H_0$: $\rho = 0$ and $H_1$: $\rho \ne 0$. 
       $P$-value *very* small. 
       Very strong evidence of a correlation in the population.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CorrelationSoftdrink)**:
The plot looks linear; $n = 25$; variation doesn't seem constant.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CorrelationExerciseBitumen)**:
**1.** Very close to $-1$.
**2.** $r = -\sqrt{0.9929} = -0.9964$. ($r$ must be negative!)
**3.** Very small. This is a very large value for $r$ on a reasonable sized sample.
**4.** Yes.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:CorrelationExerciseSoil)**:
**1.** Close to $-1$, but not super close.
**2.** $r = -\sqrt{0.6707} = -0.819$. ($r$ must be negative!)
**3.** Very small. This is a large value for $r$ on a reasonable sized sample.
       (The $P$-value turns out to be 0.000104.)
**4.** Since $n < 25$, the test may not be statistically valid.
:::


## Answers: Regression {#RegressionAnswer}

Answers to exercises in Sect. \@ref(RegressionExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RegressionExercisePunting)**:
**1.** Intercept *not* about 110; that's where the line 'stops', but the intercept is the predicted value of $y$ when $x = 0$. 
       We have to extend the line quite a bit.
       Using rise-over-run, guess slope is $(190 - 110)/(180 - 110) = 1.14$.
**2.** $\hat{y} = -3.69 + 1.04x$, where $y$ is punting distance (in feet), and $x$ is right leg strength (in pounds).
**3.** For each extra pound of leg strength, the punting distance increases, on average, by about 1 foot.
**4.** $H_0$: $\beta=0$; $H_1$: $\beta\ne 0$. 
       (You could answer in terms of correlations.) 
       The question is stated as a two-tailed question, but testing if stronger legs *increase* kicking distance seems sensible.
**5.** $t = 6.16$, which is huge; $P = 0.0001$ (two-tailed).
**6.** $1.0427 \pm (2\times 0.1692)$, or $1.0427 \pm 0.3384$, or 0.70 to 1.4.
**7.** Very strong evidence in the sample ($t = 6.16$; $P = 0.0001$ (two-tailed)) that punting distance is related to leg strength (slope: $1.0427$; $n = 13$).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RegressionExerciseManifold)**:
**1.** *Way* too many decimal places. 
       $r$ is not relevant as relationship is non-linear.
**2.** Regression is inappropriate: the relationship is non-linear.
**3.** $y$ should be $\hat{y}$; the slope and intercept have been *swapped* (from the plot, the intercept for their line is about 0.4, which they give as the slope).
**4.** The whole thing is as dodgy-as...
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RegressionExerciseBitumen)**:
**1.** $\hat{y} = 17.47 - 2.59x$, where $x$ is the percentage bitumen by weight, and $y$ is the percentage air voids by volume.
**2.** *Slope*: an increase in the bitumen weight by one percentage point *decreases* the average percentage air voids by volume by 2.59 percentage points. 
       *Intercept*: dodgy (extrapolation); in principle 0% bitumen content by weight, the percentage air voids by volume is about 17.47%.
**3.** $t = -74.9$: Massive! Extremely strong evidence ($P < 0.001$) of a relationship.
**4.** $\hat{y} = 17.4712 - (2.5937\times 5) = 4.5027$, or about 4.5%. 
       Expected good prediction, as relationship is strong.
**5.** $\hat{y} = 17.4712 - (2.5937\times 6) = 1.909$, or about 1.9%. 
       Might be a poor prediction, since this is extrapolation.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RegressionExerciseSunscreen)**:
**1.** $b_0$: When someone spends *no* time on sunscreen application, an average of 0.27g has been applied; nonsense.
$b_1$: Each extra minute spent on application adds an average of 2.21g of sunscreen: sensible.
**2.** The value of $\beta_0$ could be zero... which would make sense.
**3.** $\hat{y} = 0.27 + (2.21\times 8) = 17.95$; an average of about 18g.
**4.** About 64% of the variation in sunscreen amount applied can be explained by the variation in the time spent on application.
**5.** $r = \sqrt{0.64} = 0.8$, and need a positive value of $r$. 
       A strong and positive correlation between the variables.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:RegressionPredictBirthWeight)**:
**1.** No.
**2.** Possibly; no idea of accuracy of predictions really.
**3.** Intercept: Weight of infant with chest circumference zero; silly.
       Slope: average increase in birth weight (in g) for each increase in chest circumference by one cm.
**4.** Intercept: cm; slope: cm/gram.
**5.** $\hat{y} = 2538.7$g.
**6.** Too many decimal places!
       Regression equation implies predicting to 0.0001 of a gram.
       $r$ has too many decimal places too.
:::


## Answers: Reading research {#ReadAnswer}

Answers to exercises in Sect. \@ref(ReadExercises).

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ReadExerciseiPhoneStepCounts)**:
**1.** Not ecologically valid.
**2.** Ethical. People understand that sometimes unexpected things happen.
**3.** Convenience; self-selected. 
       However, nothing obvious to suggest the people in the study would record different accuracies than people not in the study.
**4.** Inclusion criteria.
**5.** Paired $t$-test.
**6.** Evidence in the sample that the mean difference in step-count between the two methods cannot be explained by chance: likely is a difference.
**7.** From the given information: Probably valid.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:ReadExerciseHeadphones)**:
**1.** Students at that university (QUMS).
**2.** A random sampling method has been used, so results should be generalisable to the population (students at that university).
**3.** $t$-test comparing two means.
**4.** Three groups. 
       *Null* hypothesis: the population mean hearing loss score is the same in all three groups.
       *Alternative* hypothesis: the mean hearing loss score is *not* the same in all three groups.
**5.** Standard error: $\text{s.e.}(\bar{x}) = 3.08/\sqrt{745} = 0.1128$; CI is $19.8 \pm 0.26$.
**6.** Need the standard error for the *difference* between two means, which is not reported.
:::


## Answers: Writing research {#WriteAnswer}

Answers to exercises in Sect. \@ref(WriteExercises).


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:WriteExercisesDecimals)**:

The number of decimal places given (ten!) is ridiculous.
:::

::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:WriteExercisesNHANES)**:
**The graph**: odd colour choice; vertical axis label isn't helpful; horizontal axis isn't labelled at all; units of measurement not given; title and/or caption would be helpful.

**The table**: the two limits of the CI are under the *Mean* and *Std dev* columns, but that is not what they are; units of measurement are not given; no caption, or any way to know what the table is about; number of decimal places is inconsistent; sample sizes not given; *difference*, and probably the other rows too, should report a standard error.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:WriteExercisesStudent1)**:
RQ: P, O, C and I are not clear or explicit; the two fonts being compared should be identified.
Perhaps better: 

> For students, is the mean reading speed for text in the Georgia font the same as for text in Calibri font?

**Abstract**: statement poorly constructed (*fonts* are not fast or slow!).
Perhaps: 

> The sample provided evidence that the mean reading speeds were different ($P = ???$), when comparing text in Georgia font (mean: ???) and Calibri font (mean: ???; 95% CI for the difference: ??? to ???).
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:WriteExercisesStudent2)**:
No units of measurement given (they are centimetres.); jump-heights are given to 0.001 of a centimetre... which seems rather optimistic; table gives information for the *differences*, which is great but it could also provide numerical summary information for each individual jump type too; a numerical summary shouldn't include a $P$-value, $t$-score, or confidence interval.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:WriteExercisesStudent3)**:
Variables are *qualitative*, so means inappropriate; the appropriate summary is an odds ratio, so the values almost certainly refer to the CI for the OR.
Without more information, we can't really be sure what the OR means though.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:WriteExercisesStudent4)**:
Such a study cannot *prove* anything just by itself (only a sample studied); the two CIs for the hang time for each plane design is fine... but really, the *difference* is of interest: The appropriate CI is for the difference between the mean hang times.
:::


::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:WriteExerciseBaur)**:
**1.** Table: Reasonably good!
**2.** Figure: Poor (it is 3D). 
Use a stacked or side-by-side bar chart.
:::
       
       
::: {.answer data-latex=""}
**Answer to Exercise \@ref(exr:WriteExerciseBaughman)**:
Reasonably good: should not be gaps between the bars of the histogram. 
:::