38-Read.Rmd

# (PART) Reporting, writing and reading research {-}


# Reading research {#Reading}

<!-- Introductions; easier to separate by format -->
```{r, child = if (knitr::is_html_output()) {'./introductions/38-Read-HTML.Rmd'} else {'./introductions/38-Read-LaTeX.Rmd'}}
```


## Introduction {#Chap36-Intro}


<div style="float:right; width: 222x; border: 1px; padding:10px">
<img src="Illustrations/pexels-pixabay-258353.jpg" width="200px"/>
</div>


Scientific practice requires reading the research of others.
Advances in every science-based discipline build on existing research, so being able to critique the research of others is important.
Research is usually communicated in *journal articles* (also called *papers*), or in *presentations* (conferences; seminars).
Millions of journal articles are available (this book references many articles).

At some time during your studies, you will need to read researches articles: to understand current practices in your discipline; to know *why* your discipline does things as it does; to critique the evidence for current practices; and to identify open or unresolved questions in your discipline.
Understanding the language of research is important for understanding these articles, even if you will not be conducting your own research.

Reading research articles can be challenging.
Start by reading the *Abstract* (also called a *Summary*, or *Overview*), as it provides a useful overview of the whole paper (without the details).

To understand a paper, the six steps of the research process (Sec. \@ref(SixStepsOfResearch)) can be used as a guide:

1. **Ask the question**:
   What *research question* is the paper answering?
   To what population do the results apply?
   Are inclusion and/or exclusion criteria given?
2. **Design the study**:
   How did the authors *design the study*? 
   Is the study observational or experimental?
   Is the study designed to maximize internal and external validity? 
   What are the design limitations?
3. **Collect the data**:
   How did the authors *collect the data*? 
   Could the study be approximately repeated if needed?
4. **Describe and summarise the data**:
   Is the data *summary* appropriate, complete and clear?
   What do these summaries reveal about the data?
5. **Analyse the data**:
   Is the *analysis* appropriate, accurate, valid and clear?
   What do the results mean?
6. **Report the results**:
   Are the results accurately, appropriately and well *reported*? 
   What is the answer to the RQ? 
   Are the study limitations acknowledged, and their implications discussed? 
   What other questions have emerged?

In the examples that follow, some extracts from articles will be studied to demonstrate.


## Example 1: reading research {#ReadExample1}


<div style="float:right; width: 222x; border: 1px; padding:10px">
<img src="Illustrations/pexels-pixabay-531446.jpg" width="200px"/>
</div>


A study [@data:Fritts2018:Vegetables] explored the impact of adding herbs and spices on the consumption of vegetables by adolescent school children.
Part of the Abstract states (slightly edited for brevity):

> *Purpose*: We evaluated whether new vegetable recipes using herbs and spices would increase preference for vegetables served to adolescents at this school.\smallskip
>
> *Methods*: To evaluate recipe acceptance, we assessed liking (100 mm visual analog scales) among students ($n =$ 96--110; aged 14--18 years) for 8 plain (oil and salt) and 8 seasoned vegetables. 
> Liking ratings between plain and seasoned vegetables were compared with paired $t$-tests... \smallskip
>
> *Results*: Students reported higher liking for several seasoned recipes compared to plain: broccoli ($P = 0.02$), vegetable dip ($P < 0.0001$), black beans and corn ($P < 0.001$) and cauliflower ($P < 0.0001$).\smallskip
>
> *Conclusions*: Common herbs and spices improved liking for several school lunch vegetables compared to plain varieties among rural high school students...
>
> --- @data:Fritts2018:Vegetables, p. 125

Later we read this (again, slightly edited):

> This is a cross-sectional [i.e., non-directional] study assessing preference for plain and seasoned vegetables in a population of middle/high school students (aged 14--18 years) attending a rural Pennsylvania public school. 
>
> --- @data:Fritts2018:Vegetables, p. 126

Even using this small amount of information (the original article is 10 pages long), much can be learnt about the study. 
For example:

1. **Ask the question**: 
   The POCI elements are:

   * *Population*: 'middle/high school students (aged 14--18 years) attending a rural Pennsylvania public school'
   * *Outcome*: The *mean difference* in taste ratings between plain and seasoned vegetables.
     The taste ratings are given using a '100 mm visual analog scale'.
   * *Comparison*: There is **no** comparison:
     Every member of the population is treated the same way.
     A comparison exists if different subsets of the population are treated differently (for example, one group of students is given plain vegetables, and a different group is given seasoned vegetables).
   * *Intervention*: No; there is no comparison, so there is no comparison to be allocated to different groups of the population.
2. **Design the study**: 
    Since this RQ is *descriptive*, the study is *descriptive*.
    The participants were probably not *blinded*, since the presence of seasoning was probably obvious.
    About 100 sudents were used, so the sample size is large.
3. **Collect the data**:
    No details are given about the data collection in this information.
4. **Describe and summarise the data**:
   The Abstract gives no summary data (since eight vegetables were studied, this would have consumed too much space I guess).
5. **Analyse the data**:
   The data were analysed using *paired* $t$-tests, one for each different vegetable used.
   (Each subject gave two ratings for each vegetable: one for *plain* vegetables and one for *seasoned* vegetables),
6. **Report the results**:
   Evidence exists of a mean difference (that students preferred the seasoned vegetables) in many cases, but not all: the Abstract states that eight vegetables were used, with statistically significant differences (i.e., $P < 0.05$) for five.

For more details, the whole paper could be read.


## Example 2: reading research {#ReadExample2}


<div style="float:right; width: 222x; border: 1px; padding:10px">
<img src="Illustrations/pexels-taryn-elliott-4652248.jpg" width="200px"/>
</div>

Consider this Abstract [@data:Groves:bicycleweight] (slighty edited):

> **Objective** To determine whether the author's 20.9 lb (9.5 kg) carbon frame bicycle reduced commuting time compared with his 29.75 lb (13.5 kg) steel frame bicycle.\smallskip
>
> **Design** Randomised trial.\smallskip
>
> **Setting** Sheffield and Chesterfield, UK, between mid-January 2010 and mid-July 2010. \smallskip
>
> **Participants** One consultant in anaesthesia and intensive care. \smallskip
>
> **Main outcome measure** Total time to complete the 27 mile (43.5 kilometre) journey from Sheffield to Chesterfield Royal Hospital and back.\smallskip
>
> **Results** The total distance travelled on the steel frame bicycle during the study period was 809 miles (1302 km) and on the carbon frame bicycle was 711 miles (1144 km). 
> The difference in the mean journey time between the steel and carbon bicycles was 00:00:32 (hr:min:sec; 95% CI -00:03:34 to 00:02:30; $P = 0.72$).\smallskip
>
> **Conclusions** A lighter bicycle did not lead to a detectable difference in commuting time.
> Cyclists may find it more cost effective to reduce their own weight rather than to purchase a lighter bicycle.\smallskip
>
> --- @data:Groves:bicycleweight, p. 341

Based on this Abstract, again we can learn many things about the study:

1. **Ask the question**:
   The POCI elements are:

   * *Population*: The trips by *this* rider, on *his* bikes, on *his* route to work.
     This is not easy to identify, but notice that there are many examples of this rider, on his bikes, on his route.
     Only one bike, one rider, and one route is used.
   * *Outcome*: 'Total time to complete the 27 mile (43.5 kilometre) journey'.
   * *Comparison*: Between the steel-frame and carbon-frame bicycles.
   * *Intervention*: Yes, because the elements of the population (the different commutes) can be randomly allocated to be taken with the steel- or carbon-frame bikes.
2. **Design the study**:
   The study is 'randomised controlled trial', a type of experimental study.
   Random allocation has been used (which bike to use each day).
3. **Collect the data**:
   The Abstract gives no information.
4. **Describe and summarise the data**:
   The Abstract gives no summary data for each bike, but summarises the *difference* between the means: 32 seconds (95% CI between -3:34 and 2:30 minutes, but *which* bike produces the faster mean time is not stated).
5. **Analyse the data**:
   Though not stated, probably a two-sample $t$-test.
6. **Report the results**:
   'A lighter bicycle did not lead to a detectable difference in commuting time':
   There is no evidence that the carbon-frame bicycle reduced the commmuting time (for this rider, on his route to work, with his bikes...).
   In any case, the difference between the two mean commuting times is 32 seconds... over a 43.5 kilometre journey:
   Hardly of any *practical* importance (Sect. \@ref(PracticalSignificance))!


One thing to observe:
The RQ is *one*-tailed (does the carbon frame bicycle *reduce* commuting time), but the conclusion gives a *two*-tailed $P$-value.
(This may not be obvious, but a one-tailed $P$-value cannot be larger than 0.5.)


## Exercises {#ReadExercises}

Selected answers are available in Sect. \@ref(ReadAnswer).


::: {.exercise #ReadExerciseiPhoneStepCounts}
A research article [@duncan2018walk] examined the accuracy of step counts recorded on iPhones.
The paper records this information about the selection of participants:

> Participants were recruited through word of mouth and posters displayed around the [researcher's] university.
> Participants were eligible if they were ambulatory, $\ge 18$ years of age, and owned an iPhone 6 [...] or newer model.

Although 33 participants were selected, the authors note some parts of the study used a smaller sample size because one subject lost their phone, while others chose to withdraw from the study.
The paper notes that previous studies have been able to:

> [...] demonstrate the accuracy of the iPhone pedometer function in laboratory test conditions. However, no studies have attempted to evaluate evidence [...] in the field.

1. What is the issue that the authors raise with previous studies?
1. Why did the authors discuss the changes in sample size for some parts of the study?
1. How would you describe the sampling method?
1. What would you call the information about given about the subjects needing to be ambulatory and 18 years of age or over?
1. Among many other things, the researchers compared the *mean diference* between the number of step counts recorded by manually counting steps and the iPhone-recorded number of steps. 
   What type of test would be appropriate?
1. While walking at 2.5 km/h, the above test resulted in $P = 0.006$.
   What does this mean?
1. The sample size for the part of the study mentioned above was $n = 32$. 
   Do you think the test will be statistically valid?
:::


::: {.exercise #ReadExerciseHeadphones}
One study of hearing loss among Iranian students [@mohammadpoorasl2018prevalence] used a non-directional study to explore the relationship between hearing loss and headphone use.
The article states that

> ... 890 students were randomly selected from five schools at QUMS (Medicine, Dentistry, Nursing and Midwifery, Public Health, and Paramedical Sciences schools) using a proportional cluster sampling method...

The participants completed a hearing test and completed a Hearing Loss Questionnaire (values are between 17 and 34:  higher scores indicating more severe hearing loss).

1. What is the population?
1. Critique the sampling method: 
   What is the implication for interpreting the results of the study?
1. Some of the results are presented in 
   Table \@ref(tab:HearingLossTable). 
   What statistical test do you think was used to compare the scores for males and females?
1. What are the hypotheses being tested about 'Frequency of use'?
1. Form an approximate 95% CI for the mean hearing loss score for students who use earphones. 
1. What information is needed to be able to form an approximate 95% CI for the *difference* between the hearing loss scores for females and males?
:::

<!-- http://jhealthscope.com/en/articles/65901.html -->


```{r HearingLossTable}
HearingTable <- array( dim = c(7,6) )

HearingTable[, 1] <- c("Sex", 
                       "", 
                       "Earphone use", 
                       "", 
                       "Frequency of use", 
                       "", 
                       "")
HearingTable[, 2] <- c("Female", 
                       "Male", 
                       "Yes", 
                       "No", 
                       "0, 1 times/day", 
                       "2 to 3 times/day", 
                       "More than 3 times/day")
HearingTable[, 3] <- c(543, 302, 745, 100, 194, 319, 278)
HearingTable[, 4] <- c(19.37, 19.99, 19.8, 19.0, 19.2, 19.6, 20.2)
HearingTable[, 5] <- c(2.91, 3.51, 3.08, 1.71, 2.87, 2.66, 3.54)
HearingTable[, 6] <- c("0.009", "", "< 0.001", "", "0.001", "", "")
tab.cap.HL <- "The Hearing Loss Questionnaire scores for various demographic variables"

# Swap rows around so the space inserted by kable is in a useful place
HearingTable[ c(1, 2, 6, 7, 3, 4, 5), ] <- HearingTable

colnames(HearingTable) <- c("Criterion", 
                            "Levels", 
                            "Sample size", 
                            "Mean", 
                            "Std. dev", 
                            "P-value")

if( knitr::is_latex_output() ) {
  kable(HearingTable,
        format = "latex",
        longtable = FALSE,
        booktabs = TRUE,
        align = c("r", "r", "c", "l", "l", "r"),
        linesep = c("", "\\addlinespace", 
                    "", "", "\\addlinespace",
                    "",""), # Otherwise addes a space after five lines... which looks odd when there are only six
        col.names = colnames(HearingTable),
        caption = tab.cap.HL) %>%
   column_spec(column = 1, 
               bold = TRUE) %>%
   row_spec(row = 0, 
            bold = TRUE) %>%
    kable_styling(font_size = 10)
}

if( knitr::is_html_output() ) {
  kable(HearingTable,
               format = "html",
               longtable = FALSE,
               booktabs = TRUE,
               align = c("r", "r", "c", "l", "l", "r"),
               col.names = colnames(HearingTable),
               caption = tab.cap.HL ) %>%
    column_spec(column = 1, 
                bold = TRUE) %>%
    row_spec(row = 0, 
             bold = TRUE)
}
```


::: {.exercise #ReadExerciseSoftDrinks}
The Abstract from a large study is given below:

> *OBJECTIVE*:
> This study aims to elucidate any existing link between energy-containing liquids, consumed in various forms within the diet, and the effect they may have on body weight or other diseases [...] \smallskip
>
> *METHODS*:
> A self-administered online survey was conducted in 2496 participants from different countries, in six languages 
> (Spanish, English, Chinese, French, German and Portuguese). 
> Questions referred to their soft drink and water consumption habits, physical exercise performed, presence or absence of certain diseases and medication. \smallskip
> 
> *RESULTS*:
> There is statistically significant difference ($p < 0.001$) in BMI and consumption of cola per week: those who consumed 0--3 cans a week have a lower BMI than those who consume >7 cans of cola a week [...] 
> There is greater presence of obesity ($p < 0.001$), gastritis (p < 0.001), constipation ($p < 0.001$) and mental illness ($p = 0.003$) among people who drink cola soft drinks. \smallskip
>
> *CONCLUSION*:
> Removal of energy-containing beverages from our diet may be an appropriate public health message to support those interested in preventing weight gain as well as other diseases.
>
> --- @martin2018soft, p. 1

Evaluate the study using the six steps of research discussed in this book.
:::