data leakage
#1084
Replies: 1 comment 8 replies
-
Hi @bbb801 - yes, you are indeed correct. This is not the best practice and therefore, our example should not include it. |
Beta Was this translation helpful? Give feedback.
8 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Dear Sir/Madam,
I noticed in your '01 Feature Extraction and Selection.ipynb' that it first uses feature extraction and then splits the data into train and test. Is it leading to data leakage? Can we split the data first and then train the model to do feature extraction on the test set?
Beta Was this translation helpful? Give feedback.
All reactions