Job: https://build.berkeleybop.org/job/panther-update
Using wget, bring new data to geneontology/data/trees/panther SVN repository Then svn commit
We're currently using the following command:
## copy svn content from project sync-svn-cvs-trigger to cvs
# use wget
# -o Log all messages to panther-wgetlog
# -t Set number of attempts
# -N only retrieve files that are new (moot until panther site is returning time-stamps)
# -r recursive retrieval
# -np guarantees that only the files below a certain hierarchy will be downloaded
# -nH Disable generation of host-prefixed directories
# --cut-dirs This will not create the two intermediate directories "PANTHER9.0" and "books"
# -A accept files with these names
# -R reject files with these names
# -X exclude SF folders
# -P Local directory prefix
wget -q -S -N -r -nH -np --cut-dirs=2 -A attr.tab,tree.tree,tree.mia,cluster.wts -R cluster.fasta,cluster.ortholog,"hmm.*",cluster.pir,tree.sfan,index.html -X "/PANTHER10.0/books/*/SF*" -P panther data.pantherdb.org/PAINT_PANTHER10.0/books/
## commit to svn, set proper username and key file location for svn
#SVN_SSH="${global_svn_ssh_commit_line}" svn commit -m "v10 of PANTHER" panther/PTHR*/*
Need to know what's changed from the PANTHER side.
Annotated nodes that move within trees and lose the original experimental annotations that were used as 'evidence'
First: is the list of PTNids that have changed families. These need to be a) moved from the GAF file in the old family to the GAF file in the new family, and 2) the updated GAFs have to be updated in the SVN repository. See liftover code for the java app that does reads and writes the revised GAFs. The excel spreadsheet has to be slightly modified as text to be readable.
Second: is the list of families that are no longer in PANTHER. These need to be moved from the primary paint GAF directory in SVN to the "retired" subdirectory in SVN. Likewise in they need to be removed/retired from the data/trees/panther directory on SVN
Make sure that all the taxa included in this PANTHER release are accounted for by the current taxon checker
Load into GOlr/Amigo