-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Potential bug in neighborhood assignments #968
Comments
Looks like the color assignment is accurate based on the outputted kmeans clustering results. Seems like it could be an issue with either the neighbor matrices or distance matrices calculation, which have both been adjusted in the last 4 months. Which previous data should I test out? |
You could either rerun it on some data, like Erin’s or the example dataset,
where you know what it should like look like. Or you could take the same
data, and run it with the commit from a couple months ago before the
refactoring.
I’m not 100% convinced there’s a problem, but I think there might be. So
just some initial validation to figure out if there’s an obvious issue or
not
… —
Reply to this email directly, view it on GitHub
<#968 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADJB47JDPK6F4GMF3L36AZLW6YOOJANCNFSM6AAAAAAWNVSZTA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
I verified that the the generated neighbors matrices have not changed, but it looks like there was an issue with the Kmeans function call itself. I was able to get the same results as before by adding I can open a quick PR now to fix this. |
Sounds good, thanks! Then we can redo the TONIC clustering and see if things still look weird or if this was the issue. |
Please refer to our FAQ and look at our known issues before opening a bug report.
Describe the bug
I'm running into some weird behavior with the neighborhood analysis script. Specifically, it seems like cells with very similar neighborhoods are being assigned to different clusters.
For example, in the upper right hand corner, all of the blue cancer cells seem to have almost exactly the same neighbors
However, they are assigned to different neighborhoods in the output.
I'm not sure if this is related to #967. It could be that the visualization isn't working correctly. However, the heatmap of the clusters roughly lines up with the visual, so I think that's less likely. Not sure exactly what's going on. I think a good first step once #967 is resolved will be to re-run on some previous data and confirm that we still get the qualitatively same clustering results, making sure to re-generate the neighbor_counts, rather than using the previously extracted ones.
The text was updated successfully, but these errors were encountered: