Skip to content

adding functionality to provide explicit validation dataset #48

Open
llevitis wants to merge 1 commit intoalawryaguila:masterfrom
llevitis:add_validation_data_explicitly
Open

adding functionality to provide explicit validation dataset #48
llevitis wants to merge 1 commit intoalawryaguila:masterfrom
llevitis:add_validation_data_explicitly

Conversation

@llevitis
Copy link

@llevitis llevitis commented Aug 1, 2024

I'm adding functionality to provide a validation dataset directly instead of splitting training data in scenarios where data leakage needs to be avoided between training and validation. I've made the following changes:

  • The fit function in base_model.py has been updated to accept an optional split_labels argument that is a list comprised of train and val entries to be used for splitting the input *data.
  • Functionality has been added to dataloaders.py to split the input data either using the split_labels or by splitting data directly into 90% for training and 10% for validation

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant