Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Validate FTP files #101

Open
AntonPetrov opened this issue Dec 16, 2021 · 0 comments
Open

Validate FTP files #101

AntonPetrov opened this issue Dec 16, 2021 · 0 comments
Assignees

Comments

@AntonPetrov
Copy link
Member

AntonPetrov commented Dec 16, 2021

We need an automated way to validate FTP files before a release. Based on previous experience, I propose the following checks:

  • The number of FASTA files must match the expected number of families
  • No empty FASTA files
  • No .log files in the fasta_files folder
  • The clanin file is not empty and the number of lines is not smaller than in the previously released clanin file (it is possible that some clans may get smaller but this is rare; it is more likely that fewer lines indicates a problem)
  • The Rfam.cm file contains the expected number of families
  • The Rfam.cm file contains ACC and DESC fields
  • All database_files exist and are non-empty, and for each table there is an .sql and a .txt.gz file
  • Genome browser folder exists and is non-empty (until Trackhub improvements #21 is done, this level of checking is enough)
  • rfam2go and md5 files exist and are non-empty
  • the headed of the rfam2go file contains the correct release number
  • Rfam.seed_tree.tar.gz contains the expected number of .seed_tree files
  • SVN temporary folders are empty (https://xfamsvn.ebi.ac.uk/svn/data_repos/trunk/FamiliesPending/ and https://xfamsvn.ebi.ac.uk/svn/data_repos/trunk/ClansPending/). If they are not, expect problems with the families that are "stuck" in the temporary folders.
  • README is updated with the correct release number and there is a section under NOTABLE CHANGES IN RECENT RELEASES
  • COPYING is updated with the correct release number
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants