Scoring 3D protein-protein interaction models using deep learning
Generates consistent PSSM and PDB files for protein-protein complexes
PSSMGen is geared toward computing the pssm files for all models of a particular protein-protein complex. This tool assumes your files have following structure: caseID | pdb | fasta | pssm_raw | pssm |_ pdb_raw caseID is the ID of protein-protein complex, e.g. PDB ID 1AK4 or CAPRI target ID T161. Only the pdb dir must exist at run time, and it must contain the PDB files of the models generated by HADDOCK. Based on these PDB files, the code will extract protein sequences and generate the FASTA files (stored in the fasta subdir). It will then use the BLAST tool psiblast to compute PSSM (stored in pssm_raw subdir). Finally the sequences from PSSM files and PDB files will be aligned and then the consistent PSSM files and/or PDB files will generated. The consistent PSSM files are stored in pssm subdir. If applicable, the consistent PDB files are stored in pdb subdir, while the original non-consistent PDB files are moved to pdb_raw subdir. You can then use PDB files from pdb and PSSM files from pssm for further analysis.