So, what does it do?

3 downloads 0 Views 25MB Size Report
Feb 1, 2018 - Fei Long. Rob Nicholls. Gideon Davies. Kevin Cowtan. Eleanor Dodson. Keith Wilson. Stuart McNicholas. Scott Hoh. Paul Bond. Wendy Offen.
Privateer So, what does it do?

Dr Jon Agirre Royal Society University Research Fellow @alwaysonthejazz

A bit of history Needed to mine PDB for carbohydrates Conformational analysis was critical → wanted nice, low-energy chairs We knew there were issues with nomenclature of monosaccharides and glycosidic linkages

Existing tools didn’t do what we wanted we = Gideon Davies, Keith Wilson, Kevin Cowtan & I at York Structural Biology Laboratory

Let us produce a simple program to leave out the 0.5% of structures that we expect will contain high-energy conformations…

A bit of history Needed to mine PDB for carbohydrates Conformational analysis was critical → wanted nice, low-energy chairs We knew there were issues with nomenclature of monosaccharides and glycosidic linkages

Existing tools didn’t do what we wanted we = Gideon Davies, Keith Wilson, Kevin Cowtan & I at York Structural Biology Laboratory

Let us produce a simple program to leave out the 0.5% of structures that we expect will contain high-energy conformations…

🤣

A look at ring conformation in N-glycosylation

Agirre, Davies, Wilson & Cowtan. Nature Chemical Biology, 11, 2015

A look at ring conformation in N-glycosylation

Agirre, Davies, Wilson & Cowtan. Nature Chemical Biology, 11, 2015

Agirre. Acta Cryst D, CCP4sw special issue, 2017

Privateer Tools for spotting errors Model vs data Omit mFo-DFc maps using model and original data Real space correlation for each monosaccharide

Model Ring puckering and conformation, anomericity, absolute stereochemistry, nomenclature checks 2D diagrams of glycosylation (SVG, vector goodness!), easier to spot funky linkages

Support for fixing errors Keywords for Refmac, scripts for Coot, dictionaries with aperiodic torsions…

Graphical interfaces CCP4i2 (me), CCP4mg (with Stuart McNicholas), Coot (Paul Emsley) CCP-EM (me, coming soon-ish)

Use cases Wrong anomer or absolute stereochemistry Built as beta, but used alpha 3-letter code: rename Used correct code, but linked it wrong: rebuild

Unlikely puckering amplitude (Q) Too puckered? check linkage distances Flat ring? check chiral volume signs, look out for 0º torsions in dictionaries - these are most certainly wrong

High-energy conformations At high resolution: rebuild unless in -1 subsite of an enzyme At low resolution: correct automatically with aperiodic torsions (privateerlib.cif), keep them on in subsequent iterations of Coot and Refmac5

Agirre. Acta Cryst D, CCP4sw special issue, 2017

Stereochemistry Same chemical formula! C6 H12 O6

beta-d-glucose ‘BGC’ in the PDB beta-d-mannose ‘BMA’ in the PDB

Stereochemistry Same chemical formula! C6 H12 O6 Anomeric carbon

⍺-D-mannose ‘MAN’ in the PDB

β-D-mannose ‘BMA’ in the PDB

“About 30% of all carbohydrate containing PDB entries comprise one or several errors” Lütteke, Frank and von der Lieth. Carb Res, 339(5), 2004

GLC

BGC

(alpha anomer)

(beta anomer)

demo

Use cases Wrong anomer or absolute stereochemistry Built as beta, but used alpha 3-letter code: rename Used correct code, but linked it wrong: rebuild

Unlikely puckering amplitude (Q) Too puckered? check linkage distances Flat ring? check chiral volume signs, look out for 0º torsions in dictionaries - these are most certainly wrong

High-energy conformations At high resolution: rebuild unless in -1 subsite of an enzyme At low resolution: correct automatically with aperiodic torsions (privateerlib.cif), keep them on in subsequent iterations of Coot and Refmac5

Cremer-Pople analysis 𝛩 and Φ tell us which atoms move away from the mean ring plane, thus describing the conformation

Q tells us how much:

“total puckering amplitude”

demo Cremer and Pople. JACS 97(6), 1975

Cremer-Pople analysis 𝛩 and Φ tell us which atoms move away from the mean ring plane, thus describing the conformation

Q tells us how much:

Z4

Z5 Z3

Z2

Z0 Z1

“total puckering amplitude”

demo Cremer and Pople. JACS 97(6), 1975

Use cases Wrong anomer or absolute stereochemistry Built as beta, but used alpha 3-letter code: rename Used correct code, but linked it wrong: rebuild

Unlikely puckering amplitude (Q) Too puckered? check linkage distances Flat ring? check chiral volume signs, look out for 0º torsions in dictionaries - these are most certainly wrong

High-energy conformations At high resolution: rebuild unless in -1 subsite of an enzyme At low resolution: correct automatically with aperiodic torsions (privateerlib.cif), keep them on in subsequent iterations of Coot and Refmac5

Cremer-Pople analysis

Pyranoses (Q,Φ,𝛩)

Agirre & Cowtan. Computational Crystallography Newsletter Jan:10-12, 2015

Cremer-Pople analysis

Furanoses (Q,Φ)

Agirre & Cowtan. Computational Crystallography Newsletter Jan:10-12, 2015

Conformations 0.0

beta-D-mannopyranose

beta-D-glucopyranose

20.0 (kcal/mol)

alpha-L-fucopyranose

Ardevol, Biarnés, Planas & Rovira. JACS 132(45), 2010

cost of breaking 3 or 4 H-bonds Sheu, Yang, Selzle and Schlag. PNAS 100(22), 2002

0.0

beta-D-mannopyranose

beta-D-glucopyranose

20.0 (kcal/mol)

alpha-L-fucopyranose

Ardevol, Biarnés, Planas & Rovira. JACS 132(45), 2010

Geometric restraints With weak/incomplete density, conformation should be imposed L(p)=wLX(p)+LG(p)

Geometric restraints With weak/incomplete density, conformation should be imposed L(p)=wLX(p)+LG(p) bond lengths

? ?

bond angles

harmonic torsions

NCS, etc

Geometric restraints With weak/incomplete density, conformation should be imposed L(p)=wLX(p)+LG(p) bond lengths

Eclipsed substituents

? ?

bond angles

NCS, etc

harmonic torsions

Staggered substituents

(…) publication and deposition of incorrect structures informs subsequent statistical analyses that suggest the deposited structures are normal. Agirre, Davies, Wilson and Cowtan. Nature Chemical Biology, 11, 2015

We can now use Privateer to detect and prevent these and other anomalies

How to validate and correct structures with Privateer, Coot and Refmac5 At lower resolution, even a correct starting model can result in a higher-energy conformation after refinement

Doing sphere refine with harmonic torsions

Doing sphere refine with aperiodic torsions

Doing sphere refine with harmonic torsions

Doing sphere refine with aperiodic torsions

Doing sphere refine with harmonic torsions

Doing sphere refine with aperiodic torsions

‘TM9’ in 4K3T

demo

Graphical analysis

The Standard Symbol Nomenclature

McNicholas & Agirre. Acta Cryst D, CCP4sw special issue, 2017

The Standard Symbol Nomenclature 4A5T (human)

4BYH (human)

5FJI (fungal) 5AOG (plant)

demo

Agirre, Davies, Wilson and Cowtan. Curr Op Str Biol, 2017

Got a structure to report?

Acta Crystallographica section F STRUCTURAL BIOLOGY COMMUNICATIONS Special issue: Crystallography of glycoproteins and protein-carbohydrate complexes Edited by Mark van Raaij & Jon Agirre

Submit: https://tinyurl.com/glycoissue-acta-f Final deadline: 1st of February 2018

Acknowledgements Paul Emsley Garib Murshudov Fei Long Rob Nicholls

Robbie Joosten

Tristan Croll

Supported by

Gideon Davies Kevin Cowtan Eleanor Dodson Keith Wilson Stuart McNicholas Scott Hoh Paul Bond Wendy Offen

Carme Rovira

Martyn Winn Tom Burnley Colin Palmer

Acknowledgements Paul Emsley Garib Murshudov Fei Long Rob Nicholls

Robbie Joosten

Tristan Croll

Supported by

Gideon Davies Kevin Cowtan Eleanor Dodson Keith Wilson Stuart McNicholas Scott Hoh Paul Bond Wendy Offen

Carme Rovira

Martyn Winn Tom Burnley Colin Palmer