Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PFM for amino acid sequences? #137

Open
fcharih opened this issue Mar 10, 2021 · 1 comment
Open

PFM for amino acid sequences? #137

fcharih opened this issue Mar 10, 2021 · 1 comment

Comments

@fcharih
Copy link

fcharih commented Mar 10, 2021

I noticed that the PFM function does not run on protein sequences. Is there a biological rationale for this or is this simply missing functionality?

pfm_matrix
ArgumentError: sequence element must be DNA or RNA

BioSequences.PFM(::Array{BioSequences.LongSequence{BioSequences.AminoAcidAlphabet},1})@pwm.jl:52
top-level scope@Local: 1[inlined]
@jakobnissen
Copy link
Member

As far as I can tell, the PFM/PWM functionality has not been used much.
The functionality has been updated with three commits: The initial back in 2017, and once for v 1.0 and v2.0. It's probably a relic from older times, and I think it belongs in a separate package. Might be worth removing for v3.0

But anyway yeah, it looks to me like it could be implemented with arbitrary alphabets, not just DNA/RNA. It also looks like it handles ambiguous nucleotides a little strangely, too.

A PR would be much welcome! :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants