* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Protein Threading - Laboratory of Molecular Modelling
Multi-state modeling of biomolecules wikipedia , lookup
Artificial gene synthesis wikipedia , lookup
Paracrine signalling wikipedia , lookup
Biosynthesis wikipedia , lookup
Amino acid synthesis wikipedia , lookup
Ribosomally synthesized and post-translationally modified peptides wikipedia , lookup
Gene expression wikipedia , lookup
Genetic code wikipedia , lookup
G protein–coupled receptor wikipedia , lookup
Magnesium transporter wikipedia , lookup
Expression vector wikipedia , lookup
Point mutation wikipedia , lookup
Biochemistry wikipedia , lookup
Ancestral sequence reconstruction wikipedia , lookup
Metalloprotein wikipedia , lookup
Bimolecular fluorescence complementation wikipedia , lookup
Interactome wikipedia , lookup
Western blot wikipedia , lookup
Protein purification wikipedia , lookup
Structural alignment wikipedia , lookup
Protein–protein interaction wikipedia , lookup
Protein Threading Zhanggroup 2003 10 22 Overview Background protein structure protein folding and designability  Protein threading  Current limitations to protein threading  Computational complexity of certain formulations of the protein threading problem  Performance of protein threading systems  References  Protein Structure  Primary, secondary, tertiary structure Can only refer to the structure of a protein if a particular environment is assumed solvent environment (aqueous trans-membrane ……)  temperature  pH etc  Different environments yield different structures or no stable structure at all  Proteins molecules are not completely rigid structures kinetic energy energetic collisions with solvent molecules  vibrations sidechain conformational changes  flexible sections of the peptide chain  The native tertiary structure of a protein is thus an average  Protein Folding  Protein folding = searching for a conformation having minimum energy Factors in protein folding hydrophobic effects  electrostatic charges in residues  hydrogen bonding  Chaperonins,ribosomes  3 stages of folding denatured unfolded state  molten globule state  native compact state  most proteins will return to their native state after forced denaturation  The Protein Folding Problem  Given a proteins amino acid sequence what is its tertiary structure  The protein folding problem is hard Direct approach :molecular dynamics simulation Simulate on an atomic level the folding of a single protein molecule  protein = thousands of atoms  solvent environment = hundreds to thousands of molecules => thousands of atoms  Sub-picosecond time scales  run the simulation for 1-5 seconds  We need another years of Moores law to make this computation feasible  Designability A protein with a stable native state can not have another low-energy state nearby in conformational space  A structure is highly designable if its minimum energy state has no lowenergy neighbours  Protein Threading inverse protein folding problem: given  a tertiary structure, find an amino acid sequence that folds to that structure  Protein threading: given a library of possible protein folds and an amino acid sequence find the fold with the  best sequence -> structure alignment (threading)   Evolution depends on designability to preserve function under mutation  Estimate only different protein structures exist in nature (Chothia,1992) four components a library of protein folds (templates)  a scoring function to measure the fitness of a sequence -> structure alignment  a search technique for finding the best alignment between a fixed sequence and structure   a means of choosing the best fold from among the best scoring alignments of a sequence to all possible folds Scoring Schemes for Sequence->Structure Alignments  The scoring scheme for a particular threading of a sequence onto a structure measures the degree to which        environmental preferences are satisfied Different amino acid types prefer different environments e.g. structural preferences: in helix in sheet not exposed to solvent pairwise interactions with neighbouring amino acids Formal Statement of the ProteinThreading Problem C is a protein core having m segments Ci representing a set of contiguous amino acids Let ci be the length of Ci  Sequence a = a1a2…an of amino acids  Current limitations to protein threading   Statistical problems Definition of neighbor and /or pairwise contact environments:  energetic neighbor ? contact neighbor Computational Complexity of Finding an Optimal Alignment The complexity of the protein threading problem depends on whether:  Variable-length gaps are allowed in alignments  the scoring function for an alignment incorporates pairwise interactions between amino acids  Property(I) makes the search space exponential in size to the length of the sequence  Property(Ii) forces a solution to take non-local effects into account  Any protein threading scheme with both properties is NP-complete (3-SAT Lathrop 1994) (MAX-CUT Akutsu,Miyano 1999) Thus all protein threading approaches can be divided into four groups: 1 no variable length gaps allowed 2 no pairwise interactions considered in scoring function 3 no optimal solution guarantee 4 exponential runtime Performance of Protein Threading Systems CASP1(1994) CASP2(1996) CASP3(1998): Critical Assessment of Structure Prediction meetings protein threading methods have consistently been the winners success depends on structural similarity of target to known structures successful even when target sequence and library sequence have low homology Much room for improvement in all areas of protein threading e.g.: algorithms for searching the threading space reliable biologically accurate scoring functions
 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
                                             
                                             
                                             
                                             
                                             
                                             
                                             
                                             
                                             
                                             
                                            