Molecular Modeling in Drug Design

Molecular Modeling in Drug Design
Martin Smieško
Molecular Modeling : Department of Pharmaceutical Sciences : University of Basel : Switzerland
Molecular Modeling in Drug Design
Part 5: Virtual Screening + Publication Search
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
Virtual Screening (VS)
●
computational technique used in drug discovery to search libraries of small molecules (publicly
available, commercial, in-house, proprietary) in order to identify structures, which are most likely
to bind to a selected drug target
●
in silico analog of biological high-throughput screening (HTS)
●
score, rank, and/or filter a set of structures using one or more computational procedures
Motivation
●
cheaper and time effective alternative to the HTS
●
decrease the count of molecules than need to be screened experimentally
●
new scaffold identification (patent protection, solubility, selectivity...) → lesson next week
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
High-throughput (experimental) or Virtual Screening
ligand(s) or
protein known?
HTS feasible ?
yes
no
time &
costs OK ?
yes
no
VS
no
protein
available ?
VS
no
diversity
needed ?
yes
yes
structurebased VS
VS
no
ligandbased VS
yes
HTS
VS
docking
& scoring
HTS and VS can be performed in a
complementary manner
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
pharmacophore
& similarity
Molecular Modeling in Drug Design
Virtual Screening
increasing computational costs
decreasing number of compounds
public/commercial/company libraries
simple fast pre-filtering
(e.g. Lipinski, Veber)
in depth filters / similarity evaluation
molecular docking / scoring
(several stages/precisions)
consensus search
best hits (bought/synthesized/retrieved)
experimentally tested
success if a few real hits (Kd < 10 M) found
(enrichment = real hits/computed hits)
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
Virtual Screening Filters
AstraZeneca style :)
Cumming J.G. et al.: Chemical predictive modelling to improve compound quality.
Nat Rev Drug Discov. 2013, 12(12), 948-962
http://www.nature.com/nrd/journal/v12/n12/pdf/nrd4128.pdf
Initial property filter:
●
CORE compounds: hit no chemical filters and fulfill the following property filters: molecular
weight between 100 and 550, cLogP between 2 and 6, polar surface area (PSA) between 1 and 160
●
BACKUP compounds: fail on one property filter
●
UGLY compounds: fail on 2 or more property filters or hit at least one chemical filter
The AZ screening collection was split into CORE, BACKUP and UGLY sets, based on these filters. Only
CORE and BACKUP compounds are solubilised for HTS, and usually only CORE compounds are
purchased from external vendors.
And the chemical filters are…
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
AZ Chemical Filters
Class 1: Bland Structures
(remove isotopes, unwanted elements, unbranched structures, structures where heteroatoms are
provided by only one functional group)
●
disconnected structure: salts, hydrates, racemates >= 1
●
alkyl/aryl amine (bin if >=1 and no other heteroatom)
●
isotopes >= 1
●
OH/SH (bin if >=1 and no other heteroatom)
●
compounds containing atoms other than: H, C, N, O, S,
●
(thio)ethers (bin if >=1 and no other heteroatom)
F, Cl, Br, I >= 1
●
quaternary (bin if >=1 and no other heteroatom)
●
<= 3 C atoms, must have at least 4 carbon atoms
●
N-oxide (bin if >=1 and no other heteroatom)
●
less than 12 heavy atoms
●
halophenols
●
no polar atoms (N, O, S)
●
haloketones
●
straight/unbranched structures
●
mono and dinitro phenols
●
positive charged atoms >= 1
●
no aromatic carbon polyamines
●
compounds with >=3 acidic groups
●
only hetero is 1 acid or derivatives
●
C-nitro (bin if >=1 and no other heteroatom)
●
p-diaminobiphenyl
●
C-C≡N (bin if >=1 and no other heteroatom)
●
m- or p- diaminobenzene with no other heteroatoms
●
sulphone SO2 (bin if >=2 and no other heteroatom)
●
unusual valence (C, N, O, P, S, Hal)
http://www.nature.com/nrd/journal/v12/n12/pdf/nrd4128.pdf
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
AZ Chemical Filters
Class 2: Reactive structures
●
●
●
isourea
Michael acceptors: C=C-C=O, C=C-CN, C=C-SO2, C=C-
●
benzoquinone
NO2
●
aliphatic and aromatic C-aldehyde RC(=O)H
acetylene Michael acceptors: C≡C-C=O, C≡C-CN, C ≡C-
●
peroxides
SO2, C≡C-NO2
●
Schiffs base/imine RN=CRR (excluding benzodiazepine,
●
vinyl and Hal, C=C-Hal
●
reactive beta keto quaternary O=CCN+
●
reactive
cyclic imines, ArN=C(Ar) NO)
●
imine halide
●
N-halogen & S-halogen & P-halogen & O-halogen
(X=Hal/CN/Ph)
●
diazetidinone
sulphonamide imine but not sulphonamide amidine nor
●
azetidin-2-one
in ring
●
epoxides & aziridines & thiiranes & oxazirane
●
anhydride
●
thiocyanate
●
alpha halo (hal=1,2,3) ketone
●
isocyanates & isothiocyanates
●
halo methylene ether
●
isocyanide = isonitrile -N≡C
●
CH2Hal: CBr3, CBr2, CBr, CCl2, CCl; hal-C-C-R is OK
●
azoanhydrides
●
acid halide & thio acid halide
●
1,3-benzodioxan-2,4-diones
●
halo (incl. F) triazine or 2-, 4-pyrimidine but do not bin
●
furazanes 4,7-disubstituted
2-amino, 4-Cl pyrimidine nor 2-amino, 4-Cl triazine
●
nitrite
●
esters
and
thioesters
RCOOCX2
http://www.nature.com/nrd/journal/v12/n12/pdf/nrd4128.pdf
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
AZ Chemical Filters
Class 3: Frequent hitters
Cass 5: Unlikely drug candidates
●
more than 2 nitro groups
●
di-OH benzene
●
Large ring >= C9
nitro phenols
●
C9 chain not in any rings
●
crown ethers
●
multi alkene chain C=CC=CC=C and N=CC=CC=C
Class 4: Dye-like structures
●
diyne -C≡C-C≡C-
●
>= 2 nitro on same aromatic ring including naphthalene
●
annelated rings phenanthrene, anthracene, phenalene
●
diphenyl ethylene cyclohexadiene
●
all compounds with 4 fused aromatic rings and some
●
aliphatic quaternary N and C9 chain
●
SO3H/R plus benztriazole
●
SO3H/R plus aminotriazine
●
SO3H/R plus aminobenzene
●
beta (mesyl or halo) amine or O/S mustard
●
SO3H/R plus naphthalene
●
two S-atoms (not sulfones SO2 ) in 5-rings or 6-rings
●
quaternary aromatic N+(amine or aminoethylene)
●
cyclobut-3-ene-1,2-dione
●
quaternary pyridine
●
triphenylmethyl
●
neutral p-amino pyridine bin if >=2
●
acridine
●
& Unsuitable fragments
with 3 rings
●
beta naphthylamine but not if alpha keto, nor if
substituted with aromatic carbon
http://www.nature.com/nrd/journal/v12/n12/pdf/nrd4128.pdf
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
AZ Chemical Filters
Class 6: Difficult series / Natural compounds
Cass 8: General UGLY Oxygen
●
steroid
●
too many OH's - bin if >= 5
●
penicillin or cephalosporin
●
p-, p'-dihydroxy biphenyl
●
prostaglandin
●
p-, p'-dihydroxy stilbene
●
exocyclic double bond
●
acids
(COOH/COSH/CSOH/CSSH)
(esters, lactones), bin if COOR >= 3
●
di tBu ester
Class 7: General UGLY Halogenated Structures
●
poly (CCO)
halogen counts: bin if (Cl + Br + I + X >= 4) OR (CF3 >=
●
formic acid esters
4) OR (Br + I >=2) where X can be F or CF3
●
triacyloximes
●
di or trivalent halogens
●
sugar
●
N-halogen & S-halogen & P-halogen & O-halogen
●
2,3,4, & 2,4,5 trihydroxyphenyl
●
sulfonyl halide
●
triflates SO3CX3
●
perhalo ketones CH 2 C(=O)CX3
●
perfluoroacetates
●
C(Br)NO2
●
http://www.nature.com/nrd/journal/v12/n12/pdf/nrd4128.pdf
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
and
derivatives
Molecular Modeling in Drug Design
AZ Chemical Filters
Class 9: General UGLY Nitrogen
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
hydrazine RRN-NRR not in ring, nor diketo
hydrazone RRC=N-NRR and not in ring (the C can be
aromatic)
guanidine >=3
azine C=N-N=C and azide NNN and not in ring (see
azodiaz)
quaternary N >= 2
N oxide >= 2
phenanthroline [or di(2-imidazolyl)methane] chelator
CNOC in chain not in ring (keep hydroxamic acid)
azo N=N or diazonium N≡N
carbodiimide
N-nitroso
aromatic nitroso
geminal cyanides C(CN)2
cyanohydrins and (thio)acylcyanides
cloramidine
nitrite
nitramines
oxime
Cass 10: General UGLY Sulphur
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
sulphur >= 5; disulfide >= 1; sulfate >= 1
R-SO3 -polar atom and R-SO2 -S-R, not sulfonic acids
nor esters
sulfonic acid; disulfone; thioketone
sulfonic esters except aryl/alkyl-SO 3 -aryl
cyclic –NSN- or –SNS- not -NS(=O)(=O)SN not sulphonamides; SO not S=O nor in ring
1,2-thiazol-3-one
dithiocarbamate NC(=S)S
CNS or CSO eg thiourea, isothiourea, thiocarbamic acid
or thiocarbonate
S+ but not S(O-)3
sulfonyl cyanides
isocyanates & isothiocyanates, thiocyanate
alkyl-SH, carbaryl-SH
dithioic and thioic acids O- and S-esters
sulphonamide imine (Michael) but not sulphonamide
amidine nor in ring
1,3-oxathiolane (not oxathiolane-3,3-dioxides)
dithioic and thioic acids
http://www.nature.com/nrd/journal/v12/n12/pdf/nrd4128.pdf
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
Virtual Screening Approaches
Ligand-based
- only 1 active molecule known: perform similarity search (structure, descriptors, fingerprints)
- a few active molecules known: identify 3D pharmacophore → perform a 3D database search
- multiple active and inactive structures known: train a machine learning technique (QSAR, CoMFA)
- typically faster
Structure-based
- many good algorithms for pose generation available (molecular docking)
- scoring functions are limiting factor for the reliability of poses and predictions (approximations,
water treatment, partial charges, entropic terms; general lack of reference systems)
- extremely complex → high computational costs
Are we about to see automated tools based on artificial intelligence?
We know what we want, but we do not yet have completely reliable methods. We can:
- define desired physico-chemical properties (pharmacokinetics, toxicology)
- find the active principle using 2D or 3D methods
- define feasible/accessible fragments and reactions to be used for chemical synthesis
- fully automatize processes → revolution in robotics → automatic synthesis and in vitro testing
http://www.nature.com/nrd/journal/v12/n12/pdf/nrd4128.pdf
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
How to find a suitable case study
&
How to design an electronic report
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
Literature search – where and how?
Search by topic (keywords) = Molecular Modeling in Drug Design :)
●
try advanced queries – combine more specialized fields (keyword in the title, abstract, year)...
●
ScienceDirect (http://www.sciencedirect.com/)
●
PubMed (https://www.ncbi.nlm.nih.gov/pubmed/)
●
General purpose search engines like Google, Yahoo, Bing, Wikipedia...
Search by journal/editing house (e.g. latest/most cited papers)
American Chemical Society Journals (pubs.acs.org):
●
Journal of Chemical Information and Modeling (http://pubs.acs.org/journal/jcisd8)
●
Journal of Chemical Theory and Computation (http://pubs.acs.org/journal/jctcce)
●
Journal of Medicinal Chemistry (http://pubs.acs.org/journal/jmcmar)
●
Journal of the American Chemical Society (http://pubs.acs.org/journal/jacsat)
Nature (http://www.nature.com/siteindex/index.html)
●
Nature Reviews Drug Discovery (http://www.nature.com/nrd/)
Wiley (http://onlinelibrary.wiley.com/subject/code/000131) e.g. Angewandte Chemie int. Ed.
Tailor and Francis (http://www.tandfonline.com/), e.g. Expert Opinion on Drug Discovery
SpringerOpen (https://www.springeropen.com/journals), e.g. Journal of Cheminformatics
Royal Society of Chemistry (http://www.rsc.org/)
Elsevier (https://www.elsevier.com/books-and-journals)
...
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017
Molecular Modeling in Drug Design
No exam, but electronic report (PDF) of a case study
Papers selected should contain some point of novelty and be Pharma relevant
●
published preferably in the last 5 years (= 2012 – 2017), if justified then at max. 7 years
●
in a good (CADD) journal → good impact factor is important, but not critical
What to search for
●
completely new topic, which has not yet been addressed in EiMM, CMoAE or MMiDD
●
topic already reported (e.g. MD, docking), but specially tailored, extended, combined...
NO DUPLICATES, please!
What to do
●
carefully read and understand
●
write a short report 2-3 pages (12 font, 1.5 line spacing) with own words (do not copy the abstract!)
●
may include some pictures/graphs/tables/outlines/schemes (own contribution appreciated)
●
comment/mention motivation, main goal, method used, result, conclusion
●
+ answers to the following questions:
- Why did you choose this particular publication?
- What did you like?
- What did you don’t like?
Idea behind:
●
expose you to the literature, terminology, multitude of computational (supported) methods
●
in depth reading of a topic that might be of interest for you
M. Smieško – Departement Pharmazeutische Wissenschaften, Universität Basel, 2017