Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add validation option to TrainDataset #3067

Open
wants to merge 1 commit into
base: dev
Choose a base branch
from

Conversation

melopeo
Copy link
Contributor

@melopeo melopeo commented Nov 30, 2023

Description of changes:

TrainDatasets currently have only fields for train and test datasets. In this commit we add the optional case for validation dataset.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Please tag this pr with at least one of these labels to make our release process faster: BREAKING, new feature, bug fix, other change, dev setup

@@ -427,6 +433,7 @@ def __call__(self, data: DataEntry) -> DataEntry:
def load_datasets(
metadata: Path,
train: Path,
validation: Optional[Path],
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is breaking the tests I think

@lostella
Copy link
Contributor

lostella commented Dec 1, 2023

Hey @melopeo, could you maybe update the OP to explain the motivation behind this? For example, why would one want to fix the validation data, when that could be considered a degree of freedom (some ways of setting it up may be better than others in some use cases; how to do that best may even depend on whether you use it for say model tuning vs early stopping)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants