Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Outlier detection #88

Closed
mzur opened this issue Apr 7, 2021 · 5 comments · Fixed by #120
Closed

Outlier detection #88

mzur opened this issue Apr 7, 2021 · 5 comments · Fixed by #120
Assignees
Labels
MI3 Idea for the MI3 project

Comments

@mzur
Copy link
Member

mzur commented Apr 7, 2021

Use the mechanism implemented in biigle/core#336 to sort Largo patches by "similarity". Depending on how fast we can implement the sorting method, this can be done on demand with a click on a button (and the user has to wait a few seconds for the sorting). The sorting cannot be computed a priori because the annotations can constantly change.

@mzur mzur added the student label Apr 7, 2021
@mzur
Copy link
Member Author

mzur commented Aug 13, 2021

The mechanism proposed in biigle/core#336 can't be easily transferred to Largo. Instead, we should find an algorithm to quickly detect and highlight outliers for a given label selected in Largo. These outliers can be shown first in the grid so they can be quickly selected and dismissed. There is no need for a general "similarity sorting".

@mzur mzur changed the title Sort patches by similarity Outlier detection Aug 13, 2021
@mzur
Copy link
Member Author

mzur commented Aug 20, 2021

Idea (no clue if it could work): Construct a "prototype" image hash (e.g. the mean bit vector of all hashes) and then compute the distance between the hash of each Largo patch and the prototype hash. The patches with the largest distance are most likely to be outliers. This also heavily depends on the image hashing algorithm that is used.

@tschoeni
Copy link

With Hash = Feature Vector you could also use (an) MPEG7 descriptor(s). See also: https://marine-imaging.com/fair/ifdos/iFDO-content/

@mzur
Copy link
Member Author

mzur commented Aug 20, 2021

Thanks for pointing that out. We've experimented with image hashes that are used for image retrieval (i.e. "find the images most similar to a given image"). Any method should work that allows you to compute a distance between hashes/feature vectors.

@mzur mzur moved this to Medium Priority in BIIGLE Roadmap Oct 15, 2021
@mzur mzur mentioned this issue Aug 15, 2022
5 tasks
@mzur mzur removed the student label Oct 20, 2022
@mzur mzur added the MI3 Idea for the MI3 project label Oct 18, 2023
@mzur
Copy link
Member Author

mzur commented Oct 24, 2023

The outlier detection will be implemented in the same way than biigle/maia#128 in MAIA, only the sorting is reversed in the "dismiss" step (i.e. the most dissimilar patches are shown first). In the "relabel" step, the sorting is regular (i.e. the most similar patches are shown first). Later when #97 is implemented, users can also choose to "reverse" the sorting again.

@mzur mzur self-assigned this Oct 24, 2023
@mzur mzur mentioned this issue Dec 6, 2023
33 tasks
@mzur mzur closed this as completed in #120 Jan 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
MI3 Idea for the MI3 project
Projects
Archived in project
Development

Successfully merging a pull request may close this issue.

2 participants