-
Notifications
You must be signed in to change notification settings - Fork 202
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unbalanced dataset splits #125
Conversation
Pulling openfl develop changes
Rename tensorboard option Co-authored-by: Ilya Trushkin <76161256+itrushkin@users.noreply.github.com>
If we dont define this callback in notebook it will be taken from workspace, if exists
Pulling changes from the main repo
It's necessary for logger to parse this tag
Collaborator logs metric task result, but sometimes collaborator and aggregator have different consoles (in interactive api e.g.)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please fix the integration to the shard_descriptor part.
enforce_image_hw: str = None) -> None: | ||
"""Initialize KvasirShardDescriptor.""" | ||
super().__init__() | ||
class KvasirDataset(Dataset): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is no need in pytorch dataset here. I suggest removing all pytorch mentions
|
||
# Sharding | ||
shard_idx = data_splitter.split(labels, self.world_size)[self.rank] | ||
self.shard = Subset(dataset, shard_idx) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this line could be just
self.images_names = [self.images_names[i] for i in shard_idx]
""" | ||
self.shuffle = shuffle | ||
|
||
def split(self, labels, num_collaborators): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
seems like the signature is changed in this subclass
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me!
Jenkins please retry a build |
This PR introduces a data splitting interface with the following implementations: