Overhaul rfreplace.pl script for new independent SEED paradigm #40

Open
nawrockie opened this issue Jan 10, 2020 · 0 comments

This was originally motivated by a suggestion from Franz Lang: users should not need to enforce the Rfam convention for sequence names in SEED alignments themselves, because a script should be able to do that. Anton suggested that rfreplace could do it. After looking at the current rfreplace code, it is clear that it needs to be scrapped and rewritten anyway for the new independent SEED paradigm, in which SEED seqs only need to be in GenBank/RNAcentral rather than in Rfamseq (the old paradigm).

Whereas the current rfreplace only considers hits in Rfamseq as candidates for replacing existing SEED seqs, we can now consider all seqs in GenBank/RNAcentral. Getting back to Franz's suggestion: if a sequence in GenBank/RNAcentral is 100% identical to a SEED sequence, its name can be used to rename that sequence in the input SEED.
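
The renaming step described above could look roughly like the sketch below. It is written in Python for illustration only and is not the rfreplace.pl code. Every name in it is hypothetical: `normalize`, `rename_seed`, the in-memory `database` dict standing in for a GenBank/RNAcentral lookup, and the example sequence names. It also assumes that "100% identical" means whole-sequence equality after removing alignment gaps and treating T and U as equivalent; matching a SEED sequence to a subsequence of a longer database entry is not covered.

```python
from typing import Dict, List, Tuple


def normalize(seq: str) -> str:
    """Remove gap characters, uppercase, and map T to U so DNA and RNA compare equal."""
    return seq.replace("-", "").replace(".", "").upper().replace("T", "U")


def rename_seed(seed: Dict[str, str], database: Dict[str, str]) -> List[Tuple[str, str]]:
    """Rename SEED sequences that have an exact match in the database.

    seed:     name -> aligned sequence (gaps allowed)
    database: name -> unaligned sequence; a hypothetical in-memory stand-in
              for a GenBank/RNAcentral search
    Returns (name, aligned sequence) pairs in the input order. Sequences
    without an exact match keep their original name.
    """
    # Index the database by normalized sequence; keep the first name seen.
    by_sequence: Dict[str, str] = {}
    for db_name, db_seq in database.items():
        by_sequence.setdefault(normalize(db_seq), db_name)

    renamed = []
    for name, aligned in seed.items():
        new_name = by_sequence.get(normalize(aligned), name)
        renamed.append((new_name, aligned))
    return renamed


if __name__ == "__main__":
    # Made-up names and sequences, for illustration only.
    seed = {"my_seq_1": "GG-AU.CC", "my_seq_2": "AAAA"}
    database = {"ACC0001.1/1-6": "GGATCC"}
    for name, aligned in rename_seed(seed, database):
        print(name, aligned)
```

Returning a list of pairs rather than a dict keeps the alignment order and avoids silently dropping a row if two SEED sequences map to the same database name; a real rewrite would need to decide how to handle that case.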
