forked from jrnold/bugs-examples-in-stan
-
Notifications
You must be signed in to change notification settings - Fork 1
/
genbeetles.Rmd
98 lines (82 loc) · 5.52 KB
/
genbeetles.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
# Generalized Beetles: Generalizing Link Functions for Binomial GLMs {#genbeetles}
```{r genbeetles_setup,message=FALSE,cache=FALSE}
library("rstan")
library("tidyverse")
```
GLMs rely on link functions, linking the linear predictors and the response probability, $\pi$.[^genbeetles-src]
Logit and probit are perhaps the most familiar link functions, mapping from the unit probability interval to the real line using the inverse CDFs of the logistic and standard Normal distributions, respectively.
The logit and probit link functions have the interesting property that they are symmetric
about $\pi = 0.5$, and guarantee the effects of $x_i$ on $\pi$ to be greatest when $\pi = 0.5$.
To see this, recall that in GLMs for binomial data the effects of $x_i$ on $\pi$ are not constant, but vary over $\pi$.
For logit and probit, with link functions symmetric around zero, the effect of $x_i$ on $\pi$ is at its greatest when $f(x_i \beta)$ is its maximum, which for logit and probit occurs at $x_i \beta = 0$.
In dose-response studies, this means that responsiveness to dose is at its greatest when subjects are on the cusp of a response, at, that is, when $E(\pi) = 0.5$.
In a study of voter turnout, ordinary logit or probit is estimated subject to the constraint that the effects of the covariates are at their greatest for citizens who are indifferent between turning out and abstaining [@Nagler1994a].
Furthermore, for logit/probit, these marginal effects diminish in magnitude symmetrically as we move away from $E(\pi) = 0.5$.
This symmetry follows from the symmetry of the logistic and normal PDFs/CDFs.
One can easily envisage situations where the researcher would not want to impose these features of the logit or probit link functions on their data.
In many settings, knowledge of exactly where the marginal impact of the covariates is maximized is of tremendous practical importance, with implications for targeting policy interventions, resource allocation, and so on.
For example, how to distribute resources for educational or health improvements?
Given that the effects of interventions are not constant across a set of baseline probabilities, knowing where proposed interventions are likely to have bigger or smaller effects is valuable information for policy-makers.
As we have seen, logit or probit models constrain these effects to be at their greatest when $E(\pi) = 1/2$, via their symmetric S-shaped link functions.
Ceteris paribus we would prefer to estimate the shape of the link function from the data.
A relatively straightforward way to let the data be informative as to the shape of the link function is via a simple one-parameter transformation of the logit link [@Prentice1976a]:
$$
\pi = \frac{1}{(1 + \exp(-x_i \beta))^\nu}
$$
where $\nu > 0$ is a parameter that skews the logit link. The standard logit model is a special case, where $\nu = 1$.
Thus the model for the binomial responses, $r_i \in \{0, n_i}$, for $i \in 1, \dots, N$,
$$
\begin{aligned}[t]
r_i &\sim \mathsf{Binomial}(n_i, \pi_i) , \\
\pi_i &= 1 - \frac{1}{(1 + e^{(\alpha + x_i' \beta_i)})^\nu} .
\end{aligned}
$$
Estimating $m$ and $b$ by maximum likelihood is relatively straightforward, although there is little reason to believe the frequentist sampling distribution for $m$ is likely to be well approximated by the normal in a finite sample.
Notice that $m$ enters the model in a highly non-linear fashion, and that different ranges of $m$ imply quite different relationships between the linear predictors and $\pi$.
In Bayesian terms, we can reasonably expect the posterior density of $m$ to be non-normal, and probably log-
Inferences for these quantities could well be misleading if we were to rely only on point estimates and asymptotic normal approximations; instead, a Bayesian approach via MCMC offers a way for us to obtain arbitrarily precise approximations to the posterior densities of these quantities.
Give $\mu$ a Gamma prior with a mean of 1, corresponding to the standard logit model, and a standard deviation of 2,
$$
\nu \sim \mathsf{Gamma}(0.25, 0.25) .
$$
The regression coefficients are given weakly informative priors,
$$
\begin{aligned}[t]
\alpha &\sim N(0, 10) , \\
\beta &\sim N(0, 2.5) .
\end{aligned}
$$
```{r genbeetles_mod,results='hide',cache.extra=tools::md5sum("stan/genbeetles.stan")}
genbeetles_mod <- stan_model("stan/genbeetles.stan")
```
```{r echo=FALSE,cache=FALSE,results='asis'}
genbeetles_mod
```
## Data
To demonstrate the use of MCMC methods in this context, I use the famous
beetles data of @Bliss1935a. These data have been extensively used by
statisticians in studies generalized link functions [@Prentice1976a;
@Stukel1988a], and are used by @SpiegelhalterBestGilks1996a to demonstrate how
BUGS handles GLMs for binomial data. @CarlinLouis2000a use these data in an
MCMC implementation of the one-parameter generalization used here.
These data are included with the **VGAM"** package as `flourbeetle`:
```{r genbeetles_flourbeetle}
(data("flourbeetle", package = "VGAM"))
```
```{r genbeetles_data}
genbeetles_data <-
within(list(), {
r <- flourbeetle$killed
N <- length(r)
n <- flourbeetle$exposed
x <- as.matrix(flourbeetle$logdose) %>%
scale() %>% as.numeric()
})
```
```{r genbeetles_fit,results='hide'}
genbeetles_fit <- sampling(genbeetles_mod, data = genbeetles_data)
```
```{r}
genbeetles_fit
```
[^genbeetles-src]: This example is derived from Simon Jackman, "[Generalized Beetles: generalizing link functions for binomial GLMs](https://web-beta.archive.org/web/20051027084129/http://jackman.stanford.edu:80/mcmc/genbeetles.odc)", 2005-10-27.