This lecture is part of the "Machine Learning in R" graduate course held at University of Münster, School of Business and Economics (winter term 2021/22). 🎓
Slides: https://simonschoe.github.io/ml-with-tidymodels/
This lecture gives a deep insight into tidymodels
, a unified framework towards modeling and machine learning in R
using tidy data principles. It introduces and motivates tools that facilitate every step of the machine learning workflow, from resampling, over preprocessing and model building, to model tuning and performance evaluation.
More specifically, after this lecture you will
- be familiar with the core packages of the
tidymodels
ecosystem and hopefully realize the value of a unified modeling framework, - know how to design a full-fledged machine learning pipeline for a particular prediction task,
- broaden your technical skills by learning about declarative programming, hyperparameter scales and parallel processing, and
- most importantly, be capable of conducting your own machine learning projects in
R
.
1 Learning Objectives
2 Introduction to tidymodels
3 Himalayan Climbing Expeditions Data
4 The Core tidymodels
Packages
4.1
rsample
: General Resampling Infrastructure
4.2recipes
: Preprocessing Tools to Create Design Matrices
4.3parsnip
: A Common API to Modeling and Analysis Functions
4.4workflows
: Modeling Workflows
4.5dials
: Tools for Creating Tuning Parameter Values
4.6tune
: Tidy Tuning Tools
4.7broom
: Convert Statistical Objects into Tidy Tibbles
4.8yardstick
: Tidy Characterizations of Model Performance
5 Additions to the tidymodels
Ecosystem