Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

compute center for each treatment #6

Closed
eschen42 opened this issue Dec 16, 2019 · 4 comments
Closed

compute center for each treatment #6

eschen42 opened this issue Dec 16, 2019 · 4 comments

Comments

@eschen42
Copy link
Collaborator

Many multivariate procedures such as PCA suggest through clustering of samples how treatments might differ, but they are not as helpful in showing similarity the way that a heatmap does. However, producing a heatmap based on individual samples intermingles the samples for different treatments, making it more difficult to surmise quickly which treatments are more similar in effect.

It may be useful to identify the "center" of the samples for each treatment and look at the relationship among the centers. There are several conceptual centers from which to choose. This issue is open to add the following options for centering:

  • none - Do not compute centers
  • centroid - For each treatment, compute the center as the vector comprising the mean for each feature.
  • median - For each treatment, compute the center as the vector comprising the median for each feature.
  • medoid - For each treatment, compute the center as the medoid, i.e., the sample that is most "central".
    • Because this calculation is sensitive to correlation among features, perform this computation in principal components space.
    • An elegant solution in R for identification of the medoid is presented here
@eschen42
Copy link
Collaborator Author

Addressed in release v0.98.15

@eschen42
Copy link
Collaborator Author

Closing because release has been published.

@eschen42 eschen42 reopened this Dec 17, 2019
@eschen42
Copy link
Collaborator Author

eschen42 commented Dec 17, 2019

Data should be scaled as part of PCA for more consistent distance measurement: Have prcomp standard-scale the data before performing PCA.

@eschen42
Copy link
Collaborator Author

bug addressed in release v0.98.16

@eschen42 eschen42 removed the bug label Dec 29, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant