Automating best practices: Good for users, good for ...

4 downloads 81101 Views 4MB Size Report
Now that we have best practices, let's automate them: Chodera's Yank. • Automates alchemical free energy calculations in implicit or explicit solvent. • Hydration ...
Automating best practices: Good for users, good for science

David Mobley

Hydration free energies are a case where we have good established practices we automate

Hydration free energy predictions have improved greatly over the years

Hydration free energy predictions have improved greatly over the years

Hydration free energy predictions have improved greatly over the years

Now that we have best practices, let’s automate them: Chodera’s Yank

• • •

Automates alchemical free energy calculations in implicit or explicit solvent Hydration or binding We put it on Orion: We have a hydration floe and a draft binding floe

And we have our FreeSolv database of calculated and experimental hydration free energies



• •

643 curated hydration free energies for neutral compounds, with literature references, structures, etc. github.com/mobleylab/freesolv We put it on Orion

So, now we have a Floe for hydration free energies and it will take FreeSolv as input



We ran in implicit (GBSA) solvent as an initial test

Explicit solvent is working also

Explicit solvent is working also

Outcome: Something that took 200K+ CPU hours over a month, plus a couple of weeks of human time now takes a button press and a few hours

The same technology allows for easy automation of relative solubility prediction A

Solvent C

Solvent B A

A A

A A A A

A A A A

A A A A

A

A A A A

A A A A

A A A A

A A A A

A A A A

Relative solubilities can be calculated fairly accurately in arbitrary organic solvents

• •

Process chemistry (separations) Force field testing

Transfer

Partition and distribution coefficients are also accessible via solvation calculations

−ΔGTransfer log P = RT ln10 ΔGTransfer = ΔGSolvation − ΔGHydration

• • •

Passive membrane permeability Separations Force field testing

Transfer

Partition and distribution coefficients are also accessible via solvation calculations

−ΔGTransfer log P = RT ln10 ΔGTransfer = ΔGSolvation − ΔGHydration

• • •

Passive membrane permeability Separations Force field testing

SAMPL5 focused on calculating cyclohexanewater logD, which we could do on Orion Some of the compounds considered

We’ve had a dire need for good automated workflows with interchangeable components; containers and standard data formats Orion is enabling that allow us to evaluate the technology standardized! data formats

standardized! data formats

industry! datasets

standard
 benchmarks preparation! pipeline

modeling tool

automated! analysis/! evaluation

It’s interchanging the components which will reallypractices advance our science best can be

evaluated by testing variations standardized! data formats

standardized! data formats

industry! datasets

standard
 benchmarks

modeling tool preparation! pipeline! variations

automated! analysis/! evaluation

We need to be able to use the latest science in existing workflows to see how much it helps, rather than waiting years (if ever) for software company X to decide it’s valuable and incorporate it in a polished application Orion can change that — we can drop a new method into an existing floe and test immediately Side benefit: If your speciality is in the area of one cube, you don’t have to be responsible for the rest of the floe

One thing to (try to) avoid: forming new silos around many incompatible workflow engines so we STILL can’t compare the latest methods head-tohead easily •

Old silos: simulation packages/modeling ecosystems

AMBER

CHARMM

CHARMM

Desmond

….

One thing to (try to) avoid: forming new silos around many incompatible workflow engines so we STILL can’t compare the latest methods head-tohead easily •

Old silos: simulation packages/modeling ecosystems

AMBER •

CHARMM

CHARMM

Desmond

New silos: workflows/pipelines

Orion

MDT (?)



….

….

Orion is pretty exciting

Orion is pretty exciting

Automation is great

Orion is pretty exciting

Automation is great

Large scale hydration for anyone!

Orion is pretty exciting

Automation is great

Large scale hydration for anyone!

SAMPL5_050

OH N

N N

SAMPL5_050_conformer2

O N

N

N N

N H

We can calculate distribution, partition, and relative solubility

Orion is pretty exciting

Automation is great

Large scale hydration for anyone!

best practices can be evaluated by testing variations

SAMPL5_050

OH N

N N

standardized! data formats

SAMPL5_050_conformer2

standardized data formats

O N

N

standard
 benchmarks

N N modeling tool

N H

We can calculate distribution, partition, and relative solubility

preparation! pipeline! variations

Automation isn’t just for users, but enables science

Acknowledgments • Distribution coefficients: JW Feng, Baiwei Lin, Dan Ortwine, Joe Pease, Justin Dancer, Bas Rustenberg, John Chodera • Yank: John Chodera, Andrea Rizzi, Levi Naden, Chodera lab • Mobley lab present and former including highlighted work from Nathan Lim, Guilherme Matos, Caitlin Bannan, Kalli Burley, Shuai Liu • OpenEye Orion crew (esp. Craig, Forrest, Matt) & Ant • Christopher Bayly • Many, many collaborators including Chris Fennell, Michael Shirts, Mike Gilson, Andrew Paluch, J. Peter Guthrie, and many others.

We also looked at excess solubility in binary mixtures of water/ethanol

We also looked at excess solubility in binary mixtures of water/ethanol

Suggest Documents