Skip to content

Latest commit

 

History

History
35 lines (22 loc) · 1.29 KB

File metadata and controls

35 lines (22 loc) · 1.29 KB

3.7 Feature importance: Correlation

Slides

Notes

Correlation coefficient measures the degree of dependency between two variables. This value is negative if one variable grows while the other decreases, and it is positive if both variables increase. Depending on its size, the dependency between both variables could be low, moderate, or strong. It allows measuring the importance of numerical variables.

Functions and methods:

  • df[x].corrwith(y) - returns the correlation between x and y series.

The entire code of this project is available in this jupyter notebook.

⚠️ The notes are written by the community.
If you see an error here, please create a PR with a fix.

Nagivation