GitHub - kad-ecoli/CE: Combinatorial Extension (CE)

                         CCCC     EEEEE 
                        CC        EE
                        CC        EEEE     
                        CC        EE    
                         CCCC     EEEEE

Protein Structure Alignment by Incremental COMBINATORIAL EXTENSION of the Optimal Path.

           Authors:   Shindyalov I.N., Bourne P.E. 

             e-mail:   {shindyal,bourne}@sdsc.edu

              URL:   http://ce.sdsc.edu/ce.html

Permission to use, copy, modify and distribute any part of this CE software for educational, research and non-profit purposes, without fee, and without a written agreement is hereby granted, provided that the above copyright notice, this paragraph and the following three paragraphs appear in all copies.

Those desiring to incorporate this CE Software into commercial products or use for commercial purposes should contact the Technology Transfer Office, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0910, Ph: (858) 534-5815, FAX: (858) 534-7345.

IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS CE SOFTWARE, EVEN IF THE UNIVERSITY OF CALIFORNIA HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

THE CE SOFTWARE PROVIDED HEREIN IS ON AN "AS IS" BASIS, AND THE UNIVERSITY OF CALIFORNIA HAS NO OBLIGATION TO PROVIDE MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR MODIFICATIONS. THE UNIVERSITY OF CALIFORNIA MAKES NO REPRESENTATIONS AND EXTENDS NO WARRANTIES OF ANY KIND, EITHER IMPLIED OR EXPRESS, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE, OR THAT THE USE OF THE CE SOFTWARE WILL NOT INFRINGE ANY PATENT, TRADEMARK OR OTHER RIGHTS.

How to compile and run the software.

To compile CE software in this directory, use command:

$ gmake

or

$ make

To run CE software:

$ mkdir -p scratch/
$ CE - pdb_file1 chain_id1 pdb_file2 chain_id2 scratch/

Note: '-' can be used for chain id1 and chain id2 for iterative comparison of all chains parsed from the corresponding files.

Examples

$ mkdir -p scratch/
$ CE - 4hhb.pdb A 1hba.pdb B scratch
$ CE - 4hhb.pdb - 1hba.pdb - scratch

Interpreting results:

Structure Alignment Calculator, version 1.02, last modified: Mar 13, 2003.

CE Algorithm, version 1.00, 1998.

Chain 1: 4hhb.pdb:A (Size=141) 
Chain 2: 1hba.pdb:B (Size=146)

Alignment length = 139 Rmsd = 1.36A Z-Score = 6.5 Gaps = 8(5.8%) CPU = 0s Sequence identities = 43.2%

Chain 1:    1 VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDL------SHGSAQVKGHGKKVAD
Chain 2:    2 HLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPRTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLG

Chain 1:   65 ALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVST
Chain 2:   70 AFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVAN

Chain 1:  135 VLTSKYR
Chain 2:  140 ALAHKYH

     X2 = ( 0.980144)*X1 + ( 0.023409)*Y1 + (-0.196902)*Z1 + (   -0.134999)
     Y2 = ( 0.023575)*X1 + (-0.999721)*Y1 + (-0.001503)*Z1 + (   -0.552679)
     Z2 = (-0.196883)*X1 + (-0.003169)*Y1 + (-0.980422)*Z1 + (    1.982153)

Alignment length - number of aligned positions;

Rmsd - root mean square deviation (in Angstroms) calculated for the best superposition;

Z-Score - z-score from CE statistical model. Typical values:

        >4.5    - family level similarity;

        4.0-4.5 - superfamily level similarities, strong 
                  function related similarities or strong
                  recurring fold;

        3.7-4.0 - twilight zone where some similarities
                  of biological significance can be seen;

        <3.7    - similarities of low significance, but still 
                  some biologically important similarities 
                  can be revealed, but interpretation normally
                  requires additional evidence.

Gaps - number of not aligned position within the alignment;

CPU - computation time in seconds;

Sequence identities - sequence identites for aligned positions;

Sequence alignment based on structure alignment is given followed by transformation matrix based on structure superposition calculated for this alignment.

Reference:

Shindyalov I.N., Bourne P.E. Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Engng, 1998, 9, 739-747.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
ce		ce
pom		pom
scratch		scratch
.gitignore		.gitignore
1hba.pdb		1hba.pdb
4hhb.pdb		4hhb.pdb
CE		CE
Makefile		Makefile
README		README
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to compile and run the software.

About

Releases

Packages

Languages

kad-ecoli/CE

Folders and files

Latest commit

History

Repository files navigation

How to compile and run the software.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages