-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feat_categorical_labels #1053
Merged
Merged
Feat_categorical_labels #1053
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
might approximately work already, but haven't tested much yet basic idea is (1) a CategoricalLabels class which stores lists of integer indices for class labels along with the class list, and provides various "views" such as sparse or dense matrices or dfs, or multihot labels for a single row; (2) convert sparse label dtypes to dense when creating AudioSample objects
…ions' into feat_categorical_labels
these methods replace multihot_clip_labels() and multihot_labels_like(), and support four different output formats: multihot, categorical with integers, categorical with class names, or CategoricalLabels class. Adds CategoricalLabels class, which stores labels as lists of integers and provides methods to "view" or access the labels in various formats including sparse or dense 2d arrays or dataframes. Also provides methods to create from multihot or categorical dataframes. Will be useful for storing labels in a lightweight format when there are many classes and samples. Adds tests for the new BoxedAnnotations methods and removes/updates outdated tests. Still needs tests for the CategoricalLabels class itself.
avoids FutureWarnings when creating Pandas dfs from scipy sparse matrix types, but also seems like unnecessary memory use, consider reverting to bool pending info on this issue: pandas-dev/pandas#59739
this property wont exist when the method is called with Lightning
fixes passing list of classes (rather than pd.Index) in classmethod from_multihot_df adds properties .labels and .class_labels for CategoricalLabels adds missing docstrings for properties and methods of CategoricalLabels
updated labels_df to have categorical rather than int labels, needed to update assertion in test accordingly
This was
linked to
issues
Sep 10, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
these methods replace multihot_clip_labels() and multihot_labels_like(), and support four different output formats: multihot, categorical with integers, categorical with class names, or CategoricalLabels class.
Adds CategoricalLabels class, which stores labels as lists of integers and provides methods to "view" or access the labels in various formats including sparse or dense 2d arrays or dataframes. Also provides methods to create from multihot or categorical dataframes. Will be useful for storing labels in a lightweight format when there are many classes and samples.
Adds tests for the new BoxedAnnotations methods and removes/updates outdated tests. Still needs tests for the CategoricalLabels class itself.