Suggestion: in $ k=\inf\left{ k:\sum_{i=1}^k\hat{f}(X_{test})_{\pi_i}\geq 1-\alpha \right} $, using the same symbol
Suggestion: since regression is usually done on tabular data, and boosting regressors tend to do better than NNs on tabular data you may want to mention that scikit-learn offers the possibility to train a GradientBoostingRegressor
with pinball loss. XGBoost usually gives better point estimates than GradientBoostingRegressor
, but that library doesn't currently offer the possibility to train with pinball loss. However, they're working on it (see issue #7435).
Typo: the caption of Figure 8 is wrong (part of it is copy-pasted from Figure 6)
Suggestion: you may want to mention that Henrik Bostrom wrote a sklearn
-like library crepes which offers the possibility to conformalize generic regressors using uncertainty scalars. I wasn't paid by Henrik to tell you this (I don't even know him 😀)
Typo: "We now we will examine the distribution of distributions random variables$-$does not have a closed form".
Typo: "If the simulated average empirical coverage does not align well with the coverage observed on the real data, there is likely a problem in the conformal implementation.".
Typo: "In words, this is the observed coverage for all units for which to the discrete feature takes value g"
Not sure if typo: "For example, in classification we might divide the observations into units into three groups". The last part is a bit unclear to me, not sure if it's a typo or not.
Typo: in the table at the top of page 25,, the two null hypotheses are
Typo: "For example, we may we may require"
Typo: "[..] so diseases present during to infancy will be over-predicted."
Typo: the formula for
Typo: equation in the middle of the page, more or less: