Survey							
                            
		                
		                * Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Geometric Search and Crystal Structure Determination Andy Wilson Geometric Search and Crystal Structure Determination Overview Driving Problem  Crystallographic Methods  Relation to computational geometry  Solving for phases  Complications and Future Work  Conclusions  Geometric Search and Crystal Structure Determination Driving Problem  What is a protein’s shape? – – – – Shape determines function DNA sequencing gives amino acid sequence AA sequence determines primary structure Unknown mapping between AA sequence and secondary structure Geometric Search and Crystal Structure Determination Two Molecules Geometric Search and Crystal Structure Determination More Molecules Geometric Search and Crystal Structure Determination Driving Problem 2  Mechanical optimization alone won’t work – Problem space has several thousand dimensions – Local minima are everywhere – Can refine a “close” guess  Humans are good at fitting models – Need something to fit to – Can hand off to automated methods Geometric Search and Crystal Structure Determination Objective Generate an approximate electron density map.  Have a scientist fit a model to this map.  Refine using other methods.  Geometric Search and Crystal Structure Determination Observing Proteins  X-ray crystallography to the rescue! – Crystallize protein, exploit repetitive structure – Observe electron density of molecule  XRC alone isn’t good enough – Output is (roughly) the Fourier transform of the electron density map – BUT… the phases are lost irretrievably – Magnitudes aren’t enough to reconstruct input Geometric Search and Crystal Structure Determination Recovering Phases  Random phases – doesn’t work - not enough structure in magnitudes  Similar structure – Guess that trial molecule is like a known one  Direct methods – Exploit relationships between reflections Geometric Search and Crystal Structure Determination Input  Cloud of reflections – each corresponds to a beam of X-rays – has position, magnitude, (unknown) phase – arranged on regular lattice  Symmetry group – If molecule has symmetry, only need solve part of it Geometric Search and Crystal Structure Determination Output  System of linear constraints on phases  Phase and magnitude for reflections  Approximate electron density map Geometric Search and Crystal Structure Determination Direct Methods Solve for sums of phases of a group of reflections  Probabilistic, symbolic method  Relates phases of 3 or 4 reflections  Objective: solve for enough phases to synthesize a rough electron map  Method: search for phase invariants  Geometric Search and Crystal Structure Determination Phase Invariants Groups of 3 or 4 reflections with a certain geometric relationship  Sum of phases remains constant  – probably! Probability increases with strength of reflections  Invariant to rotation of molecule Geometric Search and Crystal Structure Determination Finding Invariants Compute strength of reflections  Choose triplets  Test strength  Incorporate into constraints  Geometric Search and Crystal Structure Determination Reflection Strength  For each reflection: – Find average magnitude of nearby reflections – “Strength” of a reflection is its magnitude divided by average over neighborhood – An especially strong reflection has magnitude greater than 2x average Geometric Search and Crystal Structure Determination Searching for Invariants Choose three reflections h, k, -(h-k)  If vector sum of positions is zero, sum of phases is (probably) zero  Geometric Search and Crystal Structure Determination Searching for invariants  Naïve search is at least N choose 2 (or 3) – O(n2) or O(n3), which is expensive with >20000 reflections  Accelerate search with a spatial data structure – k-D tree is well suited to this task – Has to support nearest-neighbor queries – Could probably fake it with range queries Geometric Search and Crystal Structure Determination Let the computer search  Idea 1: search for third reflection – Pick the first two with for-loops – Search nearest neighbors to look for the third – Accept or reject based on distance, strength  Idea 2: search small chunks – Subdivide space with a regular grid – Choose 3 chunks in “invariant pattern” Geometric Search and Crystal Structure Determination Using the results Invariants specify constraints on phases  Fix one phase, then solve for others  Use phases and magnitudes to construct electron map  Let scientist try to fit model to map  Generate more constraints if necessary  Geometric Search and Crystal Structure Determination Complications  Strength of reflections – As problem size increases, strength goes down  Do certain structures make certain patterns? – Disulfide bonds – Alpha helices – Beta sheets/barrels  Memory locality – Nested loops in search are harmful Geometric Search and Crystal Structure Determination Future Work  Implement invariant search. – CORWIN already has groundwork  Consider substructure invariants. – Finding them is hard – Searching for them is even harder Geometric Search and Crystal Structure Determination For More Information  GRIP library or team members  Dickerson and Geis, Protein Structure and Action. Glusker and Trueblood, Crystal Structure Analysis: A Primer. Schenk, Introduction to Structure Invariants and   Seminvariants.  See Andy or Darlene Freedman to get these. Geometric Search and Crystal Structure Determination