Flexible protein structural alignment with a hierarchical segmentation technique.
One of the recurring problems in structural alignment is to compare structures belonging to the same fold family but in different conformations. The classical algorithms are inappropriate because they consider both proteins to align as rigid bodies. This problem is also critical when the proteins belong to the same folding family but were obtained by experimental methods in different conformations.
This project aims to achieve a new method of structural alignment by pair of two proteins. The novelty brought by this method is to cut out one of the two proteins into small subunits beforehand, in order to allow superimposition by "flexible" alignment and facilitate the alignment of proteins in different conformations.
git clone https://github.com/gabrielctn/flexible_structural_alignment.git
cd flexible_structural_alignment
-
A linux distribution.
-
Install the few required python packages :
pip3 install -r requirements.txt
# This command will install the following modules:
# biopython == 1.72
# schema == 0.6.8
# docopt==0.6.2
flex
takes in input two protein structures as two PDB files. The proteins should have only one chain A. In any case the program reindexes both files to make them have 1 chain A starting with residue index 1.
In the terminal run:
./flex data/1MBA.pdb data/101M.pdb
$./flex -h
Usage:
./flex PDB_FILE_1 PDB_FILE_2
Arguments:
PDB_FILE_1 Path to the first PDB file to align
PDB_FILE_2 Path to the second PDB file to align
Options:
-h, --help Show this
This project is licensed under the MIT License.