Supervised Reconstruction Dataset + bug fixes and formatting #25

GabrielBG0 · 2024-02-12T18:47:09Z

No description provided.

otavioon

I suggest removing the __getitem__ implementation, as it is equivalent to its parent class implementation. If you don't want to, I would suggest changing the return type of __getitem__ to Tuple[Any, Any], as the inner types cannot be inferred yet (they depend on the readers' return types).

otavioon · 2024-02-12T20:43:17Z

sslt/data/datasets/supervised_dataset.py

+        """
+        return len(self.readers[0])
+
+    def __getitem__(self, index: int) -> Tuple[np.ndarray, np.ndarray]:


SimpleDataset already implements code equivalent to this __getitem__.
Thus, you can omit this __getitem__ implementation.
It is worth noting that we cannot infer the return type (inside the tuple) yet, as it depends on the reader. The second element could be an int that represents the label instead of an np.ndarray, for instance.
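
A minimal sketch of the typing concern follows. `PairedDataset` and the list-based readers are hypothetical stand-ins, not the repository's actual `SimpleDataset`; the sketch only assumes that each reader supports `len()` and integer indexing. It shows that the element types of the returned tuple are whatever the readers yield, which is why `Tuple[Any, Any]` is the only annotation that holds in general.

```python
from typing import Any, Sequence, Tuple


class PairedDataset:
    """Hypothetical stand-in: pairs the i-th item of two readers."""

    def __init__(self, readers: Sequence[Sequence[Any]]):
        if len(readers) != 2:
            raise ValueError("expected exactly two readers")
        self.readers = readers

    def __len__(self) -> int:
        return len(self.readers[0])

    def __getitem__(self, index: int) -> Tuple[Any, Any]:
        # The inner types are unknown here: they depend on the readers.
        return (self.readers[0][index], self.readers[1][index])


# One reader yields samples, the other yields integer class labels.
samples = [[0.1, 0.2], [0.3, 0.4]]
labels = [0, 1]
dataset = PairedDataset([samples, labels])
sample, label = dataset[1]
```

Here `label` is a plain `int`, so annotating the return value as a pair of arrays would be wrong for this combination of readers.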

The goal with this dataset is to be as specific as possible, hence the Transform Pipeline and the (exactly) two readers. The implementation is almost the same, but it uses only one transform for both data points and it returns a known type as its output (the numpy array tuple), which is better for code downstream in the full pipeline process. Sure, as it is implemented, I can't be sure what types are returned by the readers, but if that's a problem I would rather add a check to enforce it than return Any.
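
One way such a check could look is sketched below. `ensure_array_pair` is a hypothetical helper, not part of the code under review; it assumes the dataset produces a two-item tuple and that both items must be `np.ndarray` instances.

```python
from typing import Any, Tuple

import numpy as np


def ensure_array_pair(pair: Tuple[Any, Any]) -> Tuple[np.ndarray, np.ndarray]:
    """Return the pair unchanged if both items are NumPy arrays, else raise."""
    if len(pair) != 2:
        raise ValueError(f"expected 2 items, got {len(pair)}")
    for position, item in enumerate(pair):
        if not isinstance(item, np.ndarray):
            raise TypeError(
                f"item {position} is {type(item).__name__}, expected np.ndarray"
            )
    return pair


# Passes: both items are arrays.
checked = ensure_array_pair((np.zeros((2, 2)), np.ones((2, 2))))
```

Calling the helper on the value returned by `__getitem__` would turn a mismatched reader (for example one that yields integer labels) into an immediate `TypeError` instead of a silent type-hint violation.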

Hello @GabrielBG0. I did not see the class name, my bad.

I agree with a more specific implementation. However, I think this class could be a bit more generic. The name suggests that this dataset would only be used for Semantic Segmentation tasks, which is not true. In fact, the same implementation would be used for any other reconstruction task (predicting seismic attributes, seismic facies classification/segmentation, etc.). Thus, any task that takes an input and has a target with the same shape as the input should subclass this one. Therefore, maybe we can change its name to something like SupervisedReconstructionDataset. What do you think?

I agree with the type hint, as it is a specific class. However, it is worth noting that, yes, this is the same behavior as in the base class. If you have only one Transform and several readers, the same transform will be applied to all data fetched from the readers (equivalent to the code here). We can still rewrite this whole __getitem__ implementation as a simple super().__getitem__(index). This would reduce code duplication and the number of unit test cases.
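
The delegation suggested above can be sketched as follows. `BaseDataset` is a hypothetical stand-in for the parent class, written only to make the example runnable; its constructor signature and the way it applies a single transform to every reader are assumptions based on the description in this thread, not the repository's actual code.

```python
from typing import Any, Callable, Sequence, Tuple

import numpy as np


class BaseDataset:
    """Hypothetical stand-in for the parent class discussed above."""

    def __init__(
        self,
        readers: Sequence[Sequence[Any]],
        transform: Callable[[Any], Any],
    ):
        self.readers = readers
        self.transform = transform

    def __len__(self) -> int:
        return len(self.readers[0])

    def __getitem__(self, index: int) -> Tuple[Any, ...]:
        # The single transform is applied to the item from every reader.
        return tuple(self.transform(reader[index]) for reader in self.readers)


class SupervisedReconstructionDataset(BaseDataset):
    """Narrows the return type; the indexing logic lives in the parent."""

    def __getitem__(self, index: int) -> Tuple[np.ndarray, np.ndarray]:
        return super().__getitem__(index)


inputs = [np.zeros((4, 4)), np.ones((4, 4))]
targets = [np.ones((4, 4)), np.zeros((4, 4))]
dataset = SupervisedReconstructionDataset(
    [inputs, targets], transform=lambda x: x * 2.0
)
x, y = dataset[1]
```

With this layout the subclass only contributes the narrower annotation, so the indexing behavior needs to be unit-tested once, on the parent.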

Sure, what do you think about it now?

otavioon · 2024-02-13T12:43:08Z

otavioon · 2024-02-13T16:15:44Z

LGTM! @GabrielBG0 I've fixed some typos in the class documentation and added some examples. Please check if it is OK.

GabrielBG0 added 2 commits February 12, 2024 15:39

Supervised Semantic Segmentation Dataset

8fa5854

some bug fixes

42f5551

GabrielBG0 linked an issue Feb 12, 2024 that may be closed by this pull request

Supervised Reconstruction Dataset #14

Closed

GabrielBG0 requested a review from otavioon February 12, 2024 18:47

otavioon requested changes Feb 12, 2024

take out redundant __len__

f8582b3

otavioon requested changes Feb 13, 2024

corrections to supervised_dataset

1d33e4f

GabrielBG0 requested a review from otavioon February 13, 2024 15:11

GabrielBG0 changed the title ~~Supervised Semantic Segmentation Dataset + bug fixes and formatting~~ Supervised Reconstruction Dataset + bug fixes and formatting Feb 13, 2024

Fix typos in documentation and add more docs

ffb620f

Signed-off-by: Otavio Napoli <otavio.napoli@gmail.com>

otavioon approved these changes Feb 13, 2024

otavioon requested review from otavioon and removed request for otavioon February 13, 2024 16:14

GabrielBG0 merged commit 1aa1f34 into main Feb 13, 2024
1 check passed

GabrielBG0 deleted the 14-supervised-dataset branch February 13, 2024 16:29

GabrielBG0 self-assigned this Feb 14, 2024
