Abstract
Computer-aided drug design and discovery methods have been essential in developing small molecules with therapeutic properties over the last decades. Application of computational resources includes drug target identification, hit discovery, and lead optimization. Accordingly, with tremendous research efforts and the availability of financial support from government agencies across the world, and multinational drug companies, the overall research level in this area will continue to advance. The methodology used in this review paper entailed a thorough examination of research studies on relevant literature on drug design and development using computational resources. Extensive searches using Scopus, International Pharmaceutical Abstracts (OvidSp, WHO Global Health Library, Cochrane, Google Scholar, Web of Science, Science Direct, ProQuest dissertation & theses, Worldwide Political Science Abstracts (CSA), and PubMed was carried out. A standardized template was used to ensure that the selected papers met the inclusion criteria, and relevant to the review. Ultimately, there are robust technologies developed to enhance the drug discovery process. Therefore, this review provides insights into computational resources in Silico and ab initio methods and algorithms, not restricted to drug metabolism predictions for drug design, and the practical applications of artificial intelligence (AI) in drug discovery. Computational tools and methods for drug design and development such as molecular dynamics (MD), molecular docking, quantum mechanics (QM), hybrid quantum mechanics/molecular mechanics (QM/MM), and Density functional theory (DFT) have been reviewed. Accordingly, the emerging technique of synergistically employing these techniques influences the fundamental challenges of conventional medicines for complex diseases. Herein, we discuss ligand-based and structure-based drug discoveries, force field models in MD simulations, docking algorithms, subtractive and additive QM/MM coupling. Nonetheless, as computer-aided drug (CADD) approaches continue to evolve with significant improvements, the focus areas will be on docking and virtual screening, scoring functions, optimization of hits, and assessment of adsorption, distribution, metabolism, excretion, and toxicity (ADMET) properties. With the current success, the present computational resources will aid in the future discovery of novel compounds with high therapeutic performance. The ongoing oncology research efforts will also significantly contribute to UN sustainable development goals – good health and well-being, sustainable innovation and industrialization.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
For a long time, plants have been known to have therapeutic properties, and most plant-derived medicines have been used to manage various pathological conditions [1, 2]. While these medicines have been effective when used as concentrated plant extracts or concoctions, modern medicine requires the isolation and purification of active compounds [3]. Despite the identification of drugs for treating hypertension, malaria, HIV/AIDS, cancer, and diabetes, these diseases continue to pose global health concerns with modern medicine struggling to find sustainable cures [4]. Research activities on medicinal chemistry show that antimalarial drugs such as Artemisinin (Artemisia annua), quinine (Cinchona spp.), and anticancer drugs such as Vinblastine (Catharanthus roseus) and Taxol (Taxus brevifolia) were discovered from natural products and were effective in managing and treating serious health problems and diseases [5, 6]. Similar reports show that natural products are relevant in drug discovery for communicable and non-communicable diseases [1]. By leveraging the recent advances in technology, the success rate for new therapeutic moieties will be increased, hence solving global health challenges. Modern drug discovery methods in the realm of natural products stand as a major discovery in achieving the United Nation sustainable development goals (SDGs) on health [7]. In particular, aspects of drug discovery and development are important translational science activities that contribute to SDG number 3 (good health and well-being) and number 9 (sustainable innovation and industrialization) [8]. The urgency of the provision of affordable essential vaccines and medicine in accordance with the Doha Declaration on the Trade-Related Aspects of Intellectual Property Rights (TRIPS) Agreement and Public Health cannot be overstated within the context of sustainable healthcare [9].
Conventional drug design and discovery methods are well known to be cost-intensive and time-consuming, taking an average of 10 to 15 years and cobbling more than US$2 billion before they reach the market [10]. The long-standing impediment in drug design and discovery is the reproducibility of the experimental designs [11,12,13]. With the rapid development of computer algorithms, hardware, and software, drug development has significantly benefited from vast computational methods that have drastically reduced drug discovery time. Studying biological systems requires techniques that provide substantial information within a limited period of time. More prominently, computational tools have come into the limelight to represent essential and valuable tools in the discovery of chemical compounds with potential therapeutic effects [14,15,16]. Several computational tools have tremendously helped researchers identify and optimize new drug candidates and predict their potential interactions. They have allowed for the investigations of thermodynamic and structural features of target proteins on different levels, which helps study drug action mechanisms and drug binding sites [17].
The high failure rate of the traditional pathways for drug discovery and design has prompted the scientific community to use computer-aided drug discovery (CADD), which involves system-based, structure-based and ligand-based drug strategies [18, 19]. The scientific progress involving CADD begins with wet-lab experiments, which comprise the identification of the hit or target compound and subsequently through high-throughput screening (HTS). Typically, the side effects associated with the use of drugs necessitate the screening of adsorption, distribution, metabolism, excretion, and toxicity (ADMET) properties at early stages [20,21,22]. Accordingly, the CADD screening of library compounds narrows drug candidates into a smaller cluster, thereby increasing the success rates and reducing the time used in screening potential drugs with therapeutic impacts. However, CADD screening is associated with high-resource requirements and extensive costs [23], prompting the use of virtual screening methods, including molecular docking, whereby a known target is screened against a virtual library of compounds [24, 25]. The first step in drug design usually identifies the target molecule (a protein of a biochemical pathway associated with the disease). The lead compounds are then designed or identified to promote or inhibit the biochemical pathway [26]. Another crucial step in drug design and discovery is lead optimization so as to achieve maximum interaction between the target molecules. Consequently, CADD plays a central role in lead optimization [27].
As noted in Fig. 1, CADD is broadly classified into ligand-based and system-based approaches. The ligand-based approach exploits the experimental nature of inactive/active molecules. This strategy needs only bioactivity data and the structural information of small molecules rather than prior knowledge of mechanisms of action. In a ligand-based approach, similar molecules likely have similar properties [28]. Numerous studies have reported that in the absence of the experimental three-dimensional (3D) structure, a ligand-based approach is used to understand the physicochemical and structural properties of the ligand associated with the desired pharmacological activity [29, 30]. Besides the known ligand molecules, ligand-based methods are also applied to substrate analogues or natural products that interact with target molecules to yield the required pharmacological effects [30]. Similarly, structure-based methods such as in silico chemical alteration or molecular docking are used in the presence of 3D structures for lead optimization [29]. Structure-based methods help identify the structural requirements of the ligands and the nature of target-ligand interactions to optimize the interaction [30]. Figure 1 presents a summary of drug discovery processes and the corresponding computation tools used.
Multiscale simulations of biomolecular systems used in drug design depend on the biological problem being investigated. For instance, a combination of molecular mechanics (MM) and quantum mechanics (QM), (QM/MM), have been more prominently used to simulate chemical reactions such as enzyme catalysis, calculate spectra, and study electronic properties in a single simulation [31, 32], which can be used to elucidate drug action mechanisms [33, 34]. Another popular biomolecular simulation technique is molecular dynamics (MD), which applies molecular mechanics force fields and is based on classical Newtonian physics. Typically, MD simulations have been used to calculate the binding free energy between the drug molecules and target proteins to identify the potential drug binding sites and drug molecule action mechanisms [35, 36].
Further to this, depending on different accuracy requirements, coarse-grained (CG), united-atom (UA), and all-atom (AA) MD simulations, as well as implicit/explicit solvent models, have been used in simulations of spatial and temporal scales in drug design [38, 39]. Various biomolecular simulation methods are required to work on a specific problem. Accordingly, each of these methods is associated with practical limitations, disadvantages, and advantages in terms of the type of phenomena that can be modelled, the size of the biomolecular system that can be modelled, and the length of simulation required [38, 40]. In drug design and discovery, the scientific understanding of the action mechanism of a drug, the dynamics of drug targets, and the elucidation of binding sites on the target protein require multiple simulation methods for the analysis of cross-scale connections [38]. Figure 2 shows computational strategies, including MADE prediction, QM/MM, structure and ligand-based structure designs, protein structure design, and MD.
The computation resources used in predicting molecules with potential therapeutic effects [37]
A significant thrust in most preclinical drug discovery research activities is the identification of orally efficacious and safe drug candidates amenable to low daily dosing. In this pursuit, drug metabolism has been critical in identifying chemical compounds with optimal ADMET characteristics [41, 42]. Research efforts to reduce drug metabolism are not limited to introducing conformational constraints, altering stereochemistry, modulating electronic and steric factors, and reducing lipophilicity [43]. Computational resources are doubtlessly pertinent to drug discovery, and researchers are always excited to study new kinds of interactions [44]. In view of this, docking using software such as AutoDock, Glide, and GOLD–is first conducted, and then MD simulations are performed to gain insights into the effects of interactions at a temporal scale [45,46,47]. Despite the wide range of computational methods, a free energy perturbation (FEP), in combination with Monte Carlo (MC) with MD, is an attractive option because it offers a rigorous theoretical means to calculate changes in binding free energy [48, 49].
These computational tools can be synergistically employed in pre-clinical drug design and discovery. For instance, Amsterdam density functional (ADF) and DFT are used to study reactivity, molecular properties, and electronic structures of molecules [50, 51]. Conversely, MM simulations are pivotal in studying the dynamics involved in protein–ligand interactions [52]. MD simulations help to predict the binding modes of ligands to target proteins [53]. On the other hand, hybrid methods like QM/MM combine efficiency and accuracy in studying enzyme-catalyzed reactions [54]. The objective of this review is to evaluate the current trends in using robust computational tools in pre-clinical drug discovery. This review also aims to explore the current use of ADF and DFT, MD, MM (Monte Carlo Simulations and MD), and hybrid methods (QM/MM and Quantum MC) which are remarkable computational tools in drug design and discovery. Also, the review provides an overview on the use of artificial intelligence in drug design and discovery. We believe this contribution will provide valuable insights into the use of computational resources in drug design and development.
2 Methodology
The methodology used in this review paper entails a thorough examination of research studies on relevant literature on drug design and development using computational resources. By embracing diverse viewpoints, this work provides an in-depth summary of computational resources that have been used in the identification of potential compounds with therapeutic properties. In addition, this paper offers a concise synthesis of numerous research findings in this field, showcasing the progress that has been made in leveraging emerging technologies such as AI, machine learning, and deep learning in medicinal chemistry. A systematic literature was conducted on research reports and empirical and theoretical studies that focused on drug discovery and development. Extensive searches using Scopus, International Pharmaceutical Abstracts (OvidSp, WHO Global Health Library, Cochrane, Google Scholar, Web of Science, ScienceDirect, ProQuest dissertation & thesis, Worldwide Political Science Abstracts (CSA), and PubMed was carried out. The identified studies were cross-checked to ensure that they were peer-reviewed, written in English, and published in scientific journals. Due to the long history of drug design and development, search strategies combining keywords “drug discovery and development” with concepts of “drug metabolism”, “artificial intelligence”, “machine learning”, “computational”, “in silico methods” or “theoretical” were used; nevertheless, elaborate details of the search strategies were available on request. The search keywords and terms were also not restricted to “docking”, “therapeutic agents”, and “virtual screening”.
Over 800 scientific papers in the medical field, published between 1990 and 2024 were identified for this review paper but only 263 were relevant. Papers were selected in two phases using pre-defined inclusion and exclusion criteria. Oftentimes, the titles of literature studies are vague, thus, in the first phase we selected studies by reading abstracts and titles. Abstracts and titles that mentioned drug design and development, in silico methods, metabolism, and pharmacokinetics, were included. Studies that combined studies on drug discovery, computational drug design, drug metabolism, AI-assisted drug design, and drug design were included. If the abstract was not available electronically, the articles were passed along to the second phase of full-text review. A standardized template was used to ensure that the selected papers met the inclusion criteria. Owing to the long-history of medicinal chemistry and computational tools, year of publication was not strictly adhered to. The template collected data on drug discovery studied using theoretical/computational methods, years of publication, and drug classes. Studies that only focused on protein structure prediction or traditional medicines without drug development projects or those that examined wet laboratory approaches describing drug discovery projects also met the inclusion criteria. The search results were then imported into EndNote Version X8, Clarivate Analytics, USA.
2.1 Systematic approach to data synthesis
Two authors extracted data from the publications that met the inclusion criteria. The data included first author, year of publication, computational method used in drug discovery, molecular docking tool, and metabolism in drug design, potential therapeutic candidates, the visualization tool used, in silico approach employed in pre-clinical drug design trials, drug pharmacokinetics, and AI in drug discovery. In case of any discrepancies or disagreements, the two authors discussed and sought consensus.
3 Fundamentals of drug discovery and development using computational tools and software
There are two drug discovery strategies–drug re-positioning and de novo drug discovery strategies. Drug repurposing refers to an approach of identifying new uses of the already existing products [55]. Drug repurposing (also called drug repositioning) has various inherent advantages including cost efficient and faster drug development time due to prior knowledge toxicity profiles, dosage, and safety of existing medications [56]. The interest of drug repositioning has risen in recent times. For instance, thalidomide which was formally known to cause birth defects has been repurposed to treat multiple myeloma and leprosy [57, 58]. Similarly, bufulfone and chlorambucil which were initially developed as alkylating agents based on toxic chemical warfare agent mustard gas, have been repurposed for treating leukemia [55]. Thus, drug repositioning may be a viable and compelling strategy for enhancing treatment options for various diseases. conversely, de novo drug design involves creating a pool of drug candidates for sequential analysis in order to create “anew”, “from the beginning” or “afresh” molecules with desirable therapeutic properties from scratch [59].
Similarly, drug research and development follow three experimentation pathways: in silico, in vivo and in vitro [60]. In silico refers to the studies performed via computer simulation while in vitro refers to the studies performed with biological molecules, cells or microorganisms in flasks and test tubes [61]. These experimentations are performed outside living organisms. On the other hand in vivo experimentations are mainly involved clinical trials and animal studies [62]. Accordingly, in silico study is easy occasioned with the rapid development of numerical algorithms, ability of pharmaceutical companies to reproduce human environments, the advancement of computational power and medical imaging.
Novel computational drug design methods, better machine algorithms, and faster architectures of Graphic Processing Unit (GPU)-based clusters that have been developed until recently have significantly helped researchers perform high-level computations, aiding in the discovery of promising drug candidates by improving arears such as scoring functions [63, 64]. Undoubtedly, computational strategies in drug discovery have uncovered numerous therapeutic undertakings. In particular, advanced in nuclear magnetic resonance spectroscopy, advances in protein purification and structural analysis, and crystallography stand out as successes that have scaled the drug-discovery process. Despite the tremendous research undertakings in the academic environment for concept validation, there are a variety of obstacles that limit their application. Accordingly, this work compiles succinct discussion on developments and challenges on in silico approaches in drug discovery and discovery, hybrid methods based on QM/MM, drug metabolism, machine learning, and AI in pre-clinical drug research.
3.1 In Silico approaches in pre-clinical drug design
In Silico approaches have been successfully applied to the in vivo prediction of ADMET and have been developed to replace in vivo models in the study of pharmacokinetics [65]. Two widely used in Silico methods for ADMET prediction are molecular modelling and data modelling [65]. Molecular modelling is based on 3D protein structures and includes techniques such as QM simulations, MD simulations, pharmacophore modelling, and molecular docking calculations [66,67,68]. There is some evidence that the computational ability of molecular modelling can complement or even surpass the quantitative structure–activity relationship (QSAR) as more 3D proteins become available [69]. It is still a scientific challenge to predict ADMET properties using molecular modelling because of the large binding cavities and the flexible nature of proteins. Nonetheless, other studies have successfully used molecular modelling to predict metabolism [70].
3.2 Applications of molecular dynamics in drug design
Molecular dynamics (MD) is a computational resource that employs Newtonian second law of motion to evaluate motions of small molecules, ions, water, complex systems or macromolecules to represent the biological environment, including lipid membranes and water molecules [35, 71]. Therefore, MD simulations have been widely used to study drugs against specific proteins, considering the movement of proteins in a solvated environment [35]. These simulations generate data on conformational, energetics, and binding free energy changes, proving useful in drug design [71].
The MD simulations have been partnered with experiments to understand the underlying behaviour of biological and chemical systems in their ever-growing role in drug discovery [49]. Notably, MD simulations appear destined to impact drug discovery significantly [35, 52, 72]. MD simulations follow virtual screening simulations. It is considered an advanced technique that complements docking, and most research scholars have used it to complement virtual screening. MD employs Newtonian mechanics in improving the efficacy and the binding properties of lead compounds [44]. For this reason, researchers have popularly used MD to confirm the validity of the docked results. Figure 3 presents the steps involved in MD simulations.
Schematic representation of molecular dynamics simulations [44]
The computational chemistry methods based on MD are used to study various types of macromolecules – carbohydrates, nucleic acids, proteins – of medicinal or biological interest [73]. They are well-suited to the study of membrane proteins and their behaviour. In MD simulations, dihedral angles are depicted using a sinusoidal function, while bond angles and chemical bonds are handled using simple virtual springs. MD applications, including free energy perturbation methods (FEP), linear interaction energy (LIE), and molecular mechanics Poisson-Boltzmann surface area (MM/PBSA), are used to calculate free energy to correlate calculated and experimental binding affinities of small molecules of proteins [74, 75].
MD simulations employ force fields like CHARMM, GROMOS, Amber, OPLS, and coarse-grained (CG) [28]. These force field models include Assisted Model Building with Energy Refinement (AMBER) which was developed in the 1970s by a group of scholars led by Peter Kollman at the University of California, San Francisco [76]. This toolkit has been commonly used to explore the dynamic behaviour of carbohydrates, organic molecules, lipids, RNA, DNA, and proteins, among other macromolecules [77]. AMBER has the computational ability to compute the potential energy of a molecular system. It uses water models such as OPC, PIP4PEW, SPC/E, and TIP3P to represent the solvent environment when performing biomolecular simulations [28]. Another pivotal program in MD simulations is CHARMM (Chemistry at Harvard Macromolecular Mechanics), which Martin Karplus and colleagues at Harvard University developed. CHARMM accurately describes molecular interactions of complex biomolecular systems [78]. Another force field model is optimized potentials for liquid simulations (OPLS), which offers accurate prediction of thermodynamic and structural properties obtained by MC computations compared to experimental results [79]. OPLS-AA was developed to reproduce QM conformational energy profiles of small molecules, whereas Groningen Molecular Simulation (GROMOS) is a multipurpose computational resource used to study biomolecular systems, including sugars, nucleotides, and proteins [73]. GROMOS has been commonly used by researchers to model polymers, liquid crystals, solutions of biomolecules, and glasses. Coarse-grained (CG) is a popular computation approach that simplifies the cost of computational calculations by reducing the number of degrees of freedom in the model. Similarly, CG Martini has the computational ability of simulating large-scale systems [28, 73]. These force fields have been summarized in Table 1.
3.3 Application of molecular docking in drug design
Molecular docking encompasses three key objectives: binding affinity estimation, pose prediction, and virtual prediction [103]. Reliable molecular docking techniques must differentiate non-binding sites from binding sites and their molecular interactions [47]. Besides, these techniques must also classify molecules as non-binding or binding and rate binding molecules among the best compounds when dealing with large compound libraries. The success of virtual screening relies on the accuracy and the amount of structural information known about the ligand and the target protein being docked. The first step in molecular docking is the analysis of protein–ligand to identify novel binding pockets using in silico methods such as PATCH-SURFER [104], AFT, POCKET SURFER [105], catalytic site Atlas [106], and SURFACE. Molecular docking based on virtual screening is the most preferred, and a helpful technique if the binding site of the target biomolecule is known [107, 108]. Therefore, ligands with inhibitory potential against the target protein could be docked against it in search of novel therapeutic candidates. Further to this, blind docking techniques could also be employed when the binding site in a protein is not known [109]. However, blind docking is associated with downsides such as low success rates and high time requirements compared to docking into a known binding pocket [47].
Accuracy and speed are key features required for a successful molecular docking simulation. Typically, docking algorithms are developed to speed up the discovery of novel lead compounds in virtual screening or to validate experimental data at higher accuracies [107, 110, 111]. There are various docking programs: Zdock [112], GOLD [113], MSDOCK [114], FLExX [115], AUTODOCK [116], MOE-DOCK [117], FRED [118] Surflex [119] among others. These computational packages have specific search algorithms such as MC, genetic algorithm, incremental construction, and cavity detection algorithm [47]. These algorithms have their specific search method and specific parameters. Docking has computational capabilities of searching for the best fit between molecules, considering various parameters obtained from ligand and receptor input coordinates as: ligand or receptor structure flexibility, interatomic interactions such as hydrophobic contacts and hydrogen bonds, and geometric complementary [47, 120, 121]. Docking applications usually return the pose (predicted predictions) of a ligand in the binding site of the target. Figure 4 represents the scheme of molecular docking techniques.
Active site determination of 3D-receptors, protein–ligand docking simulation (blind and site-specific docking), and analysis of docking complex [47]
Docking applications can be classified according to defined parameters and rules applied to predict the conformations. For instance, the flexibility of receptor and/or ligand docking algorithm can be classified as flexible or rigid-body docking [122, 123]. Rigid-body docking does not account for the flexibility of either the receptor or ligand, limiting the accuracy and specificity of results, considering geometrical complementarities between two molecules [123]. However, rigid-body docking is still capable of predicting the correct position of ligands when compared with crystallographic structures. It has also been used to perform initial screening of a small molecule database in a rapid manner. Conversely, flexible docking considers several possible conformations of the receptor or ligand for both molecules simultaneously, although with higher computation time [120].
Rigid-body, also called the "lock-and-key model", emphasizes the importance of geometric complementary, and in flexible docking, the process has to be flexible that ligands and receptors change their conformations to fit each other well, and thus, an "induced-fit model" (c.f. Fig. 5) [124].
Two models of molecular docking—(a) rigid-body and (b) flexible (induced-fit model) [124]
As ascribed in Fig. 5, rigid-body docking is a traditional docking technique that assumes that the ligand and the receptor are rigid and, thus, relies on the scoring function to obtain the optimal binding orientation of the ligand to the receptor pockets [125]. Based on their geometric complementarity, the scoring function considers the electrostatic and steric interactions between the ligand and the receptor [124, 125]. This technique is simpler and more efficient when generating multiple docking poses. The drawbacks of rigid-body docking is its inability to account for the conformational changes that may arise when the ligand and the receptor bind [124]. It is known that several protein receptors undergo conformational changes that promote the binding of ligands, thus lacking the computational ability to predict the binding affinity of ligands [124]. These limitations has prompted the research community to use flexible-rigid docking, which allows for the flexibility of the receptor structure [126]. This docking technique considers specific predefined conformational changes in the receptors. The specific rotatable bonds in the receptor structure may be redefined, and the algorithm defines various conformational changes of these bonds to enhance docking prediction. It offers realistic representations of the receptor-ligand interactions. Also, it provides an accurate prediction of the binding affinity [122].
Flexible (soft) is also an advanced docking technique that explores the full flexibility of the ligand and the receptor structures. Its dynamic nature accounts for the receptor and docking flexibility to investigate possible conformations, making it more accurate than flexible-rigid and rigid-docking methods [124]. Docking programs are known to use one or a combination of the following ways: MC, distance geometry, incremental construction, simulated annealing, geometric algorithms, etc. [103]. Figure 6 demonstrates the three types of docking techniques; rigid-body, flexible-rigid, and flexible docking techniques.
Rigid-body, flexible-rigid, and flexible docking methods [124]
3.3.1 Docking algorithms
Various docking algorithms are used depending on the features of the optimization problem and the problem domain. For instance, the genetic algorithm (GA) uses a population-based approach inspired by biological evolution, where chromosomes evolve over generations [127]. This method is influenced by the Darwin's Theory of Evolution [128]. The solutions are chosen and combined with others to develop new solutions and are mainly implemented in AUTODOCK 4.0 and GOLD 3.1 [28]. Stochastic, or MC, is another essential algorithm that employs random sampling to obtain solutions to numerical problems [129]. Its working principle is based on probability distribution to solve numerical integrals. MC algorithm uses probabilistic selection to obtain global optima [130]. Incremental construction (IC) is also an important algorithm that obtains new solutions by adding small modifications iteratively to a solution and evaluating it [131].
The conformation selection algorithm also finds valuable applications in biomolecular simulations. It works by exploring the conformations of biomolecules with the lowest energy defined by the degrees of freedom of a system [28]. It applies a combination of energy minimization and conformational sampling to obtain the global minimum energy conformation of biomolecules. Besides, the non-stochastic methods do not apply probability or randomness but employ mathematical tools such as gradients and derivatives in the search process [132]. Cavity detection is another algorithm that employs a random initial solution and iteratively uses local search techniques to obtain new solutions. The various docking algorithms used by docking software are reported in Table 2.
The over 40 years evolution of molecular docking algorithms, together with the advancement of computing power and architecture, has led to more sophisticated applications in docking nucleic acids (NA) as potential therapeutic targets [157]. The druggability of particular ribonucleic acid (RNA) has piqued research activities on RNA-ligand binding with a specific focus on docking-and-scoring methods. With the ever-increasing findings on powerful methods of RNA structure determination, RNA-based therapeutics has become a prospective approach to treating human diseases such as cancer and human immunodeficiency virus (HIV) [158, 159]. Typically, there are two types of RNA-based therapeutics: (1) RNA molecule serves as the target for the drug binding, which is analogous to protein-targeted drug discovery, and (2) the therapeutic RNAs (i.e. guide RNAs, small interfering RNAs, RNA aptamers, antisense oligonucleotides) bind to target (protein targets, DNA targets, and RNA transcripts) to induce or inhibit targeted biochemical reactions [157]. Studies show that genes that are difficult to drug or undruggable by targeting their associated proteins may be inhibited by drugs targeting the corresponding messenger (coding) RNA sequence [160, 161]. Unlike proteins, RNAs show broader druggability, garnering tremendous interest in gene therapy [157].
Computational approaches based on small-molecule docking are aimed at predicting the binding poses of biomacromolecules and small molecules such as NA or protein targets such as RNA or DNA [162, 163]. Various docking algorithms help in predicting and understanding binding modes (molecular recognition) and predicting scoring (estimation of binding energy). While the molecular methods for proteins are well-developed, docking methods for NA are comparatively underdeveloped [164]. Researchers in medicinal chemistry have used docking programs such as GRAMM, ZDOCK, HDOCK, and FTDOCK, initially developed for protein–ligand docking to RNA-ligand docking and developed new scoring functions and methods for NA-docking [144]. Emerging transcriptomics studies, particularly studies on non-coding RNA such as small interfering RNA (siRNA) and microRNAs (miRNA) in the development of anti-cancer drugs [158]. Machine learning (ML) and algorithms such as HNADOCK for RNA-ligand docking, though still in their infancy, have shown promising performance, and the improved in vivo efficacy would accelerate RNA-targeted drug discovery [165]. A noteworthy interdisciplinary collaboration and cutting-edge scientific knowledge in medicinal chemistry is the development of these powerful docking software currently in use [166]. However no sole docking algorithms fits in every system and thus the use of more than one software increases the quality of the output [166].
While molecular docking was used in the study of interactions between targets and the ligands, its application scope currently is wider including drug repurposing, polypharmacology, virtual screening drug side effect prediction, target profiling and discovery (c.f Fig. 7) [167, 168]. Remarkably, there exists vast opportunities provided by molecular docking in drug discovery process. It has been typically integrated with workflow that includes a variety of experimental and in silico techniques. Wet lab experiments and computational methods in conjunction with docking may potentially be utilized in elucidating drug metabolism to gain meaningful insights from cytochrome P450 system, development of new antibacterial agents such as DNA gyrase [169]. Typically, the protein–ligand docking algorithm is twofold; conformation generation and scoring [170]. During conformation generation, different ligand orientations are generated at different positions inside the protein binding pocket [171]. Besides the various applications of docking in drug discovery, they have also been used in target fishing and target (identifying a series of targets for which the ligands exhibit desirable complementarity, drug repositioning (identifying uses of novel pharmaceutical products with already optimized profiles), and in polypharmacology (identification of ligands that simultaneously bind to a pool of selected targets of interest [168]. Consequently, these conformations are evaluated by a scoring function, as illustrated in Fig. 7.
Applications of molecular docking in drug discovery and development [168]
Docking involves a combination of a search algorithm and scoring function. Molecular docking is a method that employs various docking algorithms based on molecular dynamics, Monte Carlo, fragment-based and genetic algorithm [171]. The methodology employed in molecular docking is as presented in Fig. 8.
Nevertheless, the shortcomings of docking are well-documented in the literature [172]. Docking is associated with challenges such as receptor flexibility because typically a protein/biomolecule adopts a variety of conformations depending on the ligand with to which it binds [168, 173]. Conformational states of proteins accounts for better affinity to be attained between the target and the drug [174]. Another scientific challenge directly linked to docking is imperfect scoring function because electrostatic interactions and entropy among other physical phenomenon are disregard by scoring schemes [172]. Other studies have also reported that the success of molecular docking is impeded by incomplete understanding of the underlying molecular protein–ligand binding mechanism involving various classes of ligands and modeling receptor flexibility [168]. Most docking software employs force field calculations that estimate binding energy guided by experimental data and quantum mechanics. Molecular dynamics (MD) and MM simulate motions of molecules and atoms in a complex [45]. The binding affinity is estimated by computing the energy of the systems using force fields like GROMOS, CHARMM, or AMBER [175, 176]. However, it is known that accurate binding energies can only be obtained from ab initio methods such as MM simulations and DFT [177]. Most docking programs remove protons (hydrogens) of inhibitors or enzymes under study, potentially excluding important information and leading to inaccuracies [178]. MD simulations have been coupled with docking to obtain important information on solvated and protonation on docking preliminary results [179]. Remarkably, the cutting-edge advancement in force fields and MD simulations has improved the accuracy of these simulations.
The significance of bioinformatics within the context of drug discovery cannot be overstated in the context of drug discovery and development. Bioinformatics entails the study of proteomic and genomic data that in analyzing vast amount of biological information generated through various experimental and sequencing techniques [180], whereas docking predicts how small molecules interact with biological macromolecules. The integration of docking and bioinformatics has proven to be a powerful approach in understanding structural biology and computational drug design [181]. These techniques have been successfully integrated in network pharmacology where multi-target drug candidates have been identified, analysis of protein–ligand interactions, and combined with AI and machine learning to improve accuracy hits identification and enhance the accuracy of docking results [182].
However, bioinformatics and docking are not without challenges which require considerations during drug development process. These drawbacks are not limited to conformation changes (docking typically assumes rigid protein structures), achieving high scoring function accuracy is challenging due to dynamic protein ligand interactions, and these simulations require high computational demands which hinder large-scale virtual screening undertakings [182]. The frequently cited mentionable shortcomings of docking techniques include assumptions and simplifications in scoring functions that lead to inaccuracies; the programs fail to account for entropic, solvent, directional interactions, and hydrogen bonding interactions [168]. Other limitations include flexibility issues related to conformation and structural flexibility, and inadequate resolution of crystallography targets [168].
4 Hybrid quantum mechanical/molecular mechanics in drug design
QM method is a computational technique that models covalent and non-covalent intermolecular interactions [183]. Computer experiments based on QM methods, such as ab initio molecular orbital or DFT, have been used to explore interactions of molecular systems of up to hundreds of atoms [184]. They have also been used to relate the structure of proteins and enzymes to their biological function. On the other hand, MM methods have been used in modelling substrate-protein binding [185]. MM simulations have been reported to lack accuracy and precision in describing bond formation and breaking, which is essential in drug discovery [186]. Modelling large systems using QM methods has been an enduring scientific challenge, prompting the need to combine MM and QM methods in the QM/MM model [187,188,189,190]. Accordingly, MM and QM calculations are invaluable in studying long-range electrostatic interactions affecting biological macromolecules' electronic structures [190].
Several programs are available for QM/MM calculations. For example, Gaussian is used for QM calculations, while CHARMM and AMBER are widely used for MM calculations [85]. However, an effective way to perform QM/MM calculations is to jointly use the existing MM and QM packages with an interface program such as PUPIL, QoMMM, and ChemShell [191]. The hybrid QM/MM method allows a combination of computational abilities that each package has (e.g., algorithms to obtain stationary points and optimal reaction paths on potential energy surfaces implemented in QM packages and algorithms for expanding conformational spaces sampled in MD simulations executed in MM packages. The hybrid QM/MM approach is suitable for biophysical and biochemical mechanisms by combining the low computation cost of MM (with additive classical force fields) and the high accuracy obtained from the QM part (with first principles) [190].
Hybrid methods based on QM/MM have allowed researchers to study enzyme catalysis in drug discovery [54, 192]. They have also been used to gain meaningful insights into the binding between drug candidates and target proteins. Hybrid QM/MM calculations have helped obtain detailed information on binding pathways, identifying specific interactions, and studying ligand binding [193]. Furthermore, they have also been used to account for the impact of protein and solvent dynamics on drug behaviour. Previous authors, for instance, Alonso-Cotchico et al. [194], studied the metalloenzymes involved in drug research using the hybrid QM/MM method. QM/MM calculations can be used to explore the influence of metal ions in enzymes, which offers crucial information on metal-related toxicity [191]. This method has also been used to describe phenomena such as tautomerization, electron transfer, and proton design during drug design [191].
With the demand to enhance the speed of the drug discovery process, QM/MM has been used to describe the binding ability of the ligand with the receptor and the ligand-receptor interactions (c.f. Fig. 9).
The partitioning of the protein-ligand complex into the MM applied region, QM applied region, and QQ/MM applied regions [195]
The Hamiltonian operator is essential for quantum calculations and corresponds to the total energy of the system [195, 196]. Equation 1 is the Hamiltonian of the system.
Here, \({\text{H}}_{\text{QM}}\), \({\text{H}}_{\text{MM}}\), and \({\text{H}}_{\text{QM}/\text{MM}}\) are the Hamiltonians accounting for all QM particles of the ligand, MM particles of the protein, and the interactions between QM and MM particles within the system, respectively. Simple functions like Leonard-Jones potential describe the van der Waal's forces at the MM level. Conversely, the electrostatic term enters Fock Matrices as a self-consistent field method. The hybrid QM/MM calculations are divided into additive QM/MM coupling and substantive QM/MM coupling [195].
4.1 Subtractive and additive QM/MM coupling
Subtractive QM/MM coupling includes three steps in calculating the energy of systems: calculation of the total energy at the MM level, the addition of QM energy of the isolated system, and calculating the energy value of the MM system and subtracting this value [197, 198]. It is a simple method which mainly uses the ONION method in calculations. However, it is limited by the fact that it requires a flexible force field to describe the effect of chemical changes during reactions. Moreover, modelling biological charge transfer is challenging because of the absence of polarization by the MM environment on QM electron density [195].
The QM system is embedded within the MM system, and therefore, the sum of MM, QM, and QM/MM energy terms gives the energy of the system. Equation 2 gives the mathematical expression of additive QM/MM coupling.
Various approaches have been used to describe these interactions – mechanical embedding, polarization embedding, and electrostatic embedding [199]. Mechanical embedding extends the computational ability of QM/MM by accounting for mechanical degrees of freedom, treating electronic interactions quantum mechanically, and describing molecular deformations and vibrations [199]. However, mechanical embedding has limitations: it requires accurate MM parameters for MM and QM systems and ignores the perturbation of the electronic structure of the MM system [195]. Polarization embedding accounts for the influence of the surrounding polarizable environments on the quantum regions. Electrostatic embedding is used within the ONIOM framework [200]. It describes the electrostatic interactions between MM and QM regions within a system. It does not require MM or QM electrostatic parameters as in mechanical embedding and offers a more sophisticated treatment of electrostatics [200]. The hybrid QM/MM methods have been employed in drug design. For instance, Tuttle [201] used this approach to explore the binding modes of Latrunculin A, Latrunculin B (naturally occurring analogue), and synthetic l32 to G actin. The research group concluded that L32 possess biological activity like naturally occurring latrunculins.
5 Drug metabolism predictions for drug design
Drug metabolism scientists have been able to control the pharmacokinetic profiles of drugs such as half-life through metabolism-guided drug design, and subsequently in reducing the attrition rates [202,203,204]. The well-established in vivo and in vitro methods, along with the in silico computational predictions, have been fundamental in reducing toxicological factors at early stages. Studies have shown that drug toxicity is mainly due to their metabolites, either as a result of free radicals or reactive electrophiles [205]. Therefore, the identification of potential liabilities in new chemical series is arguably one of the critical roles of preclinical drug metabolism prediction. To date, many chemical compounds with therapeutic effects have been identified from quantum chemical, docking and MD chemical calculations. Pharmaceutical companies have leveraged in silico tools such as Autodock combined with other methods to introduce drugs in the market [206]. Lead optimization for optimal pharmacodynamics (PD) and pharmacokinetics (PK) and comparison of the preclinical metabolism in in animals with humans for supporting human dose prediction are the main breakthroughs in medicinal chemistry to date [207].
The two phases of drug metabolism–phase I and phase II metabolism are well documented in the literature [209,210,211,212,213]. Typically, phase I metabolism involves hydrolysis, reduction, and oxidation of the drug, which produces metabolites that can be toxic, active or inactive [212]. On the other hand, phase II metabolism involves the conjugation of the modified drug with another molecule, such as amino acid, sulfate, and glucuronic acid [213]. Phase II metabolism increases the water solubility of the drug, easing its excretion. Various enzymes carry out these phases of metabolism. Cytochrome P450 (CPY) family of enzymes carry out phase I metabolism while a variety of enzymes such as glutathione S-transferases (GSTs), sulfotransferases, and UDP-glucuronosyltransferases (UGTs) carry out phase II metabolism (c.f Fig. 10) [208]. Various factors, including disease status and individual characteristics such as age, genetics, and sex, influence drug metabolism [214].
Phase I and Phase II Drug Metabolism [208]
Figure 10 depicts CYP-mediated metabolism, which accounts for 75% of the overall metabolism. The human CYP family has 57 isoenzymes, and the CYP-mediated metabolism plays a significant role in drug discovery and development because it significantly affects the safety profile, desired activity, and bioavailability [208, 215]. These enzymes are highly concentrated in the liver, and this is where the majority of the drug metabolism takes place [215, 216]. Evidence in the literature suggests that the drug-drug interaction due to the uptake of two drugs, whereby one is an inducer and the other is an inducer of drug metabolism, potentially has adverse pharmacological effects on the body [217, 218]. For this reason, enzymatic metabolic studies are critical to identifying and quantifying main metabolites, assessing potential drug-drug interactions and resolving metabolic stability [219]. The prediction of sites of metabolism and metabolite structures in CYP-mediated reactions is mainly employed in silico, and AI approaches are the starting point of the metabolic research pathway towards lead optimization [208]. There exist vast software tools to predict sites of metabolism, including BioTransformer, FAME 3, GLORYx, CypReact, PreMetabo, virtual Rat, FP-ADMET, and Cyproduct [208]. These software employ various methods include various methods including rule-based, machine learning (ML), knowledge-based, and Random Forest, among others. Table 3 summarises the public metabolism tools and their corresponding methods [208].
5.1 The role of machine learning algorithms in drug metabolism prediction
The metabolic reactions that are mediated by the enzymes in the human body may transform the drug administered into metabolites that show different biological activity [216]. The effect of metabolic reactions in deactivating the administered drug cannot be overlooked [226]. At the same time, metabolism is essential for the formation of active substances in prodrugs (biologically inactive compounds). To this end, the metabolic fate of drug candidates requires thorough investigation. The growing body of literature on drug metabolism has made data available, and machine learning (ML) models such as Random Forests and Support Vector machines to become the main choices that offer faster inference [226]. The rule-based methods have been used alongside ML models to predict drug metabolism to reduce false positives and filter unlikely predictions. The recent ML models are trained on data to cover the entire human metabolism, including endogenous compounds. For instance, researchers have used Graph Convolutional Neural Networks to provide insights on molecule interactions with the enzyme [227]. These models also postulate reaction outcomes as well as reveal valuable information on reaction mechanisms [228].
However, Litsa and others [226] reported that the current ML approaches used in metabolism prediction of drug candidates are based on shallow ML models and mainly classification models for distinguishing sites of metabolism and non-sites of metabolism and enzyme binders from non-binders. Besides, the existing ML models also fail to provide valuable insights on why and how a prediction is made like in computational approaches. ML models also suffer limitations related to inconsistent labelling of the sites of metabolism due to the different limitations. Also, these models fail to classify metabolites as primary or secondary [226]. These issues often avert medicinal chemists from not only taking an active role in developing novel models for predicting drug metabolism but also the comparative assessment and evaluation of methods.
6 Case example of drug pharmacokinetics with cancer cells
Cancer is the most challenging and devastating disease, threatening the health and life of millions of people [229]. It is the prominent cause of high mortality rates in the world. Cancer develops when abnormal cells proliferate, invading the surrounding tissues, which then spread to body organs and other parts of the body through lymphatic and circulatory systems [230]. Several strategies, including hormonal therapy, radiation therapy, chemotherapy, immunotherapy, and surgery, have been used in fighting against cancer [230]. Despite these strategies being effective in fighting cancer cells, they are known to have potential toxicity, accompanied by adverse drug reactions, pain, and cost, limiting their application in clinical settings and public health in general [231]. Further to this, the risk of tumours acquiring multidrug resistance prompts medicinal chemists to develop efficient, safer, and novel antitumor agents for fighting cancer [232]. Pharmacokinetic (PK) studies are often conducted to determine the schedule and dose of treatment of the drug relations between drug concentrations and functional or biochemical effects, route of administration, and drug interaction when more than one drug is administered [3].
In the current drug development paradigm, animal studies provide a framework for human clinical trials [233]; however, a drug that works in animals may be ineffective in humans due to inappropriate translation of a drug dose from animals to humans [3]. Scientific experts in the pharmaceutical industry employ pharmacodynamics (PD, effect vs. time) and pharmacokinetics (PK, concentration vs. time) for optimal design and the discovery of new drugs to combat cancer-cell growth [3]. While PD has been defined as “how the drug affects the body”, PK has been defined as “how the body handles the drug”. Accordingly, PK/PD data is becoming more available in literature and consequently has piqued significant scholarly attention in industry, academia, and regulatory authorities as an advanced method for exposure–response analysis [234]. Often time, a combination of therapies is widely used in treating cardiovascular diseases, cancer, and infectious diseases.
Research activities in developing anticancer treatments have yielded desirable outcomes, but many of these drugs have been reported to have significant systemic toxicity [235]. This challenge is attributed to the drug’s lack of selectivity, having healthy tissues and cells that undergo fast turnover, resulting in toxicity. For instance, approximately 30% of colorectal cancer patients develop tumour metastasis, therefore posing a great challenge in diagnosis and treatment [3]. Whereas chemotherapy is known to be a conventional treatment for patients with this type of cancer, the therapeutic benefits are limited by toxicity to normal drugs and drug-drug resistance [236,237,238]. Capecitabine (CAP) is a tumour-selective pro-drug which has been approved by the Food and Drug Administration (FDA) for the management of pancreatic cancer, breast cancer, gastric cancer, and colorectal cancer, tumours known to be resistant to 5-fluorouracil (5-FU) among other malignancies [3, 239]. CAP is an oral chemotherapy tumor-selective pro-drug which is preferentially converted to 5-FU (most active compound) in targeting tumour tissues via three metabolic steps [240]. After oral administration, CAP is absorbed in the intestine and metabolized to 5’-deoxy-5-fluorocytidine (5’-5’-DFCR) by carboxylesterase (CES) enzyme located in the liver [3]. The ubiquitous enzyme cytidine deaminase (CyD) then converts 5’-DFCR to 5’deoxy-5-fluorouridine (5’-DFUR). Finally, 5’-DFUR is then converted to toxic and active metabolite 5-FU by thymidine phosphorylase (TP) (Fig. 11), which is more concentrated in solid tumours than normal adjacent tissues, decreasing the effect of 5-FU on normal cells.
The structure of CAP and metabolic conversion of CAP to 5-FU [3]
Previous scholars performed a pharmacology study on CAP to assess clinical pharmacokinetics and disposition of CAP and its metabolites 5’-DFUR, 5’-5’-DFCR and 5-FU. Their study aimed to determine the initial and steady-state PK parameters, including area under the curve, maximum plasma concentrations (Cmax), volume of distribution (Vd), and clearance (Cl) of CAP and its metabolites [241]. The authors reported that CAP and its metabolites PK were linear and consistent over time when studied in a dosage range of 500–3500 mg/m2/day among cancer patients. The study also found that lower CAP depicted an acceptable response rate which may also suggest that higher CAP doses are not always required to achieve the required therapeutic effects [241].
Accumulating evidence from oncology research shows that PK-PD studies have been pivotal in tailoring drug dosage schedules to minimize toxicity and achieve desirable pharmacodynamic effects [242,243,244]. They also guide in selection of drug candidates for further pre-clinical studies. Medicinal chemists have used PK-PD data to identify and prioritize compounds with favourable profiles, and streamlining the drug discovery process. However, there are scientific challenges associated with PK-PD studies including complexity of PK-PD data, variability in drug metabolism, response, and distribution [245, 246]. These impede the extrapolation of laboratory findings to real-life applications. Further to this, analysis of anticancer drugs using PK-PD data is sophisticated and cost-intensive, limiting the scalability of PK-PD studies in preclinical drug research.
7 The accuracy of DFT in drug design
The Hartree–Fock (HF) methods exclude electron correlations and thus may not describe some properties of drug candidates precisely [247]. Similarly, the post-HF methods, configuration interaction (CI) and Møller-Plesset perturbation (MPn) theory consider electron correlations flawed by high computational costs. DFT is cost-effective, with high computational accuracy and less computational time than MPn, HF, and CI methods [247]. In view of this, DFT has inspired intense scholarly interest in studying molecular properties [248, 249]. DFT has demonstrated its efficacy in studying the geometries of smaller drug molecules, demonstrating its suitability in drug design [247].
A crucial step in drug design using DFT is the study of energetic properties such as metal–ligand bond strengths, relative energies, ionization energies, and electron affinities [247, 250]. The properties and the interactions of drug molecules with their receptors are governed by their molecular structures. Therefore, the DFT technique has been conveniently used in computational drug design to predict relative conformational energies [251]. Numerous studies have reported that possible interactions between a drug and a receptor include charge transfer, dipole–dipole interactions, ionic interactions, ion–dipole interactions, covalent bonds, hydrophobic interactions, and hydrogen bonding [252,253,254]. DFT method (c.f Fig. 12) provides reasonable predictions for covalent, ionic, and hydrogen bonding; however, interactions for weaker bonds have been difficult to predict [255].
The DFT method (B3LYP and CAM-B3YLP) in Drug Design [256]
Quantum chemistry software packages such as ADF employ DFT in electronic structure calculations and elucidate spectroscopic properties such as nuclear magnetic resonance (NMR), infrared (IR), and vibrational frequency spectra [50, 51]. Further, ADF can also be used for molecular geometry optimization, to explore the role of catalysis in reactions, and to study reaction channels and activation energies of systems being studied [257]. There are a significant number of research reports that have shown that DFT has been used for structure elucidation and modelling the interactions between the drug and the receptor [44, 258,259,260]. This computational tool has also shown proficiency in modelling metal-containing biological systems in the study of inorganic therapeutics.
Figure 12 depicts a graphical representation of ligands’ structural accuracy and binding affinity. The ligands are predicted using two DFT-based energy functionals, namely B3LYP and CAM-B3LYP [256]. The predictions were then compared to experimental data. The x-axis represents the experimental binding affinity value, while the y-axis represents the calculated binding affinity value using the two DFT methods [256]. Data points closer to the diagonal line indicate better structural accuracy and binding affinity prediction as they are more closely aligned with the experimental data. The graph demonstrates that both methods exhibit strong performance in predicting structural accuracy and binding affinity [256]. However, B3LYP yields slightly superior results compared to the CAM-B3LYP method. The effectiveness and usefulness of DFT methods in drug design are demonstrated by their ability to predict the binding affinity of ligands accurately [186, 247]. This prediction is crucial for the development of new and effective drugs. The image also demonstrates the potential of DFT in reducing the time and cost associated with traditional drug design methods. This is achieved through the provision of accurate computational predictions [256].
7.1 Deficiencies of computational methods in pre-clinical drug design
It is well-established in the literature that each in silico method has its application scope and features [261, 262]. Therefore, in drug design and development, appropriate methods must be selected for accurate prediction; however, some methods have deficiencies that may affect the prediction of results. Theoretical limitations associated with different approaches such as ADMET, QSAR, cavity prediction, pharmacophore building, modeling, and structure optimization have been reported [263]. Molecular modeling has been widely used in identifying possible interactions between metabolic enzymes and compounds, as well as predicting metabolism [264]. However, it is known that the scoring function in molecular docking affects the accuracy of ADMET prediction [265]. On the other hand, the accuracy of data modeling methods such as QSAR depends on the quantity and quality of data [263]. These models cannot predict chemicals outside the chemical space where the models were developed. Furthermore, the simplistic nature of data models limits their ability to predict drug interaction and behavior in a whole system. QSAR prediction methods assume that similar molecules have similar properties, but in other instances such as CYP metabolism, similar molecules exhibit different activities [266]. Although ADMET software has been widely used in predicting multiple properties and qualitative analysis of compounds [267], it cannot give an accurate prediction of the quantitative values [268].
DFT, on the other hand, has been important in drug design and development but has its limitations. The DFT computation approach introduces errors for systems with dispersion interactions and electron correlation [269]. DFT, in particular, diffusion quantum Monte Carlo (DQMC) is also associated with high computational requirements in terms of time and cost, thus medicinal chemists often prefer lower-level computational methods [270, 271]. Incorrect choices of binding sites for ligands give false DFT results, affecting the accuracy of the approach. Another notable challenge is that for a drug candidate to be licensed as an effecting drug or lead, it has to satisfy various therapeutic necessities. CADD relies on predefined codes, algorithms, and theoretical parameters/principles [272]. Therefore, all CADD tools that are used for pharmacophore modeling, QSAR, virtual screening, molecular modeling, and MD have limitations and merits [272]. The accuracy of these models relies on experimental data and knowledge to make them accurate [273]. Hybrid QM/MM methods are limited lack of new methodologies for virtual screening and high cost of computational resources [274]. These methods have shown to be more efficient for medium-sized systems unlike complex drug-protein interactions and large biomolecules. These hybrid methods are also known to have deficiencies in capturing conformational changes and binding modes [275]. Whereas is known to be accurate, QM computations involving extensive conformational sampling or large systems require high expensive computational resources [189].
8 Use of artificial intelligence in drug design
Applying Artificial Intelligence (AI) and computational methods in drug design has fundamentally changed the scientific landscape in drug design. With the help of new technology, drug discovery is now more efficient, accurate, and cost-effective than before. By using AI, scientists can design and verify new medicines without first making synthesizing them in the laboratory. Conventional drug discovery approaches are labor-intensive and expensive [12, 36, 276]. However, AI is a game changer since it can quickly screen a range of molecules and determine their characteristics in interacting with specific targets by analyzing their chemical and structural features [12]. This helps scientists to reduce the list of possible medications and focus on the ones most crucial to work. The use of AI in analyzing chemical compounds is done mainly through employing algorithms and utilizing computer simulation capabilities. By employing these techniques, researchers can screen many potential drugs more accurately than in traditional approaches [276]. This helps determine the effectiveness of drug designs and make better predictions regarding the outcome, saving resources and avoiding potential risks. The use of AI and computational methods in drug design allows for the examination of considerable chemical space with different compounds and elements. In the past, drug discovery programs could only examine a limited number of substances because of the high cost and time it took to create and evaluate each drug. But, with the incorporation of AI and computational technologies, it is now possible to scrutinize countless compounds, thus enhancing the chances of detecting a powerful and harmless drug applicant [277].
Predicting a compound's physical and chemical properties using AI has been critical in drug design and research. Computational methods suggest the various properties of a chemical substance like whether it will dissolve in water or not, if it is safe to use, and how it might interact with living cells [277]. Scientists can leverage this data to adjust the compound's formulation to perform better and increase the potential of achieving the desired outcomes. With the help of AI, researchers can design and optimize drug delivery systems based on factors such as particle size, surface charge, and drug release kinetics [276]. Another benefit of using AI and computational techniques is the ability to individualize drug treatment by tailoring them to a particular disease condition, and the causing agent. Pharmacogenomics is a technique that uses an individual's genetic information to predict the efficacy and side effects of a particular drug for a known group of patients [277]. This is important in the medicinal treatment of complex diseases, where a general solution may need to be revised. Applying AI and computational approaches can help detect disease-specific biomarkers, leading to the development of targeted treatment strategies [276,277,278].
Whereas AI and computational methods have many advantages, they also have some drawbacks that cannot be overlooked. These techniques are dependent on the accuracy of the data used in programming the AI. Data bias or incomplete data can lead to incorrect model predictions [278, 279]. Hence, researchers must guarantee that the data used is genuine and uninfluenced [278]. Moreover, AI and computational methods cannot account for unique chemical reactions and biological pathways. While these methods can predict the behavior of a compound under certain conditions, they may need to reflect the complex interactions that occur in living systems accurately. To overcome this limitation, experimental validation of the predictions is crucial, and a combination of both computational and experimental methods is often used for the most accurate results [279].
There have been numerous successful applications of AI and computational methods in drug design. For example, in developing the HIV protease inhibitor, darunavir, computational methods were used to design and screen a library of compounds for potential inhibitors [280]. Chuntakaruk and colleagues [281] investigated the ligand–protein interaction of the darunavir/HIV-1 protease using computational methods and noted that while darunavir and its three analogues 19–0–14–3, 19–8–10–0 and 19–8–14–3 are potential protease inhibitors. This led to the identification of darunavir, which is now widely used in the treatment of HIV. Similarly, the development of the influenza drug, Relenza was aided by computational methods that predicted the binding affinity of the drug with the viral protein (HIV-1 protease) [280].
Figure 13 illustrates the integration of AI into drug discovery and development processes and the various novel applications of AI in the pharmaceutical industry. The drug discovery process can be divided into four main components: drug design, polypharmacology, drug repurposing, and drug screening [282]. The main application of AI is in predicting drug properties. This can potentially decrease the necessity for clinical trials and the involvement of live study participants. Such a development would bring benefits from both financial and ethical perspectives. This section discusses the studies identified in this review that support the integration of AI into the drug discovery procedure. These studies highlight how AI can improve efficiency, accuracy, and productivity [282].
Application of AI in various aspect of pharmaceutical industry in the context of drug design [282]
8.1 Limitations of AI approach in pre-clinical drug research
Accumulated evidence suggest that the use of AI in pre-clinical drug research is slowly living up to the expectations of the scientific community, with noteworthy advances in de novo molecular design, QSAR modeling, and synthesis planning, among other crucial pre-clinical research activities [283, 284]. Nevertheless, the efficacy of AI in accelerating drug research is yet to be demonstrated. There exist challenges that impede the integration and implementation of these technologies in search of compounds with therapeutic properties. One key challenge is inefficient data integration as a result of diversity between datasets, which include raw data, metadata, candidate data, race data or processed data [282]. For efficient analysis, these datasets must be collated or collected, and there exists no method of doing so as it currently standards. Without appropriately formatted data, the current AI systems often yield erroneous results [277]. The second impediment is the immobility of experts and occupational experts as many people in the realm of medicinal chemistry lack the required skills and qualifications required to operate AI systems [282]. Few experts are competent in a combination of molecular biology and chemistry, as well as data science to generate AI algorithms in a pharmaceutical context. Another notable problem is the lack of adequate financing for the development of novel AI systems in drug discovery and development [285]. The hesitancy to invest in AI is attributed to the skepticism surrounding the results of machine learning and AI in drug research, slowing AI-relevant advances [282]. Towards this end, lack of understanding of the methodology in AI systems causes distrust and researchers will be skeptical about using this technology, which may hold the industry back for the next few years [282, 286].
9 Conclusions and outlook
Over the past decade, the rates of identifying disease-associated targets have been higher than identifying novel compounds with promising therapeutic effects. There is a drastic increase in computational methods, such as docking or virtual screening, and MD that have accelerated drug design and discovery methods in the pharmaceutical industry. Well-developed computational tools have been used in the pharmaceutic industry and academia and have shown success and remarkably huge prospects for providing even a faster and cheaper approach in the drug discovery landscape. The computational capability of predicting molecules to be optimized and filtering a large molecular database has reduced the amount of time and challenges involved. Nowadays, structure-based methods have been popularly used in studying protein targets. Molecular docking and MD simulations are prominent examples of these methods, and they have been applied in the characterization of binding sites, determining the thermodynamics and kinetics involved in ligand-target recognition. Conversely, ligand-based methods such as QSAR have been popularly used to determine and improve the activity of active molecules. They have also been used to determine bioactive molecules in the drug discovery journey. CADD has significant applications, including the identification of hit compounds, lead optimization, and assessment of the ADMET profile of bioactive compounds. Considering drug discovery as a succession of decisions made in identifying the right target and dosing regimen, in silico and ab initio methods can support these decisions. Computational chemistry will undoubtedly yield an unprecedented drug design and development revolution, making the complex and time-consuming drug discovery more effective and affordable. There is an exponential growth in the number of pharmaceutical companies adopting these approaches, and numerous opportunities exist for further growth. Notably, drug discovery and design rely on human expertise and the synergy between CADD and human intelligence for success. CADD approaches are known to have vast computational capabilities in conducting unsupervised analysis of databases; human judgement and input are necessary to consider ethical concerns related to drug research. Human expertise is also required to validate real-world clinical studies and wet-lab experiments. All the classical and new methods jointly used with the existing computational disciplines are given an advantage at nearly every drug discovery and development stage. With the required expertise in biophysics, biochemistry and biology, significant investment in time and money, and the availability of the necessary computational software and hardware, drug discovery has become an affordable task in the pharmaceutic industry and research institutions. Accordingly, we foresee the impact of computational chemistry in the form of accelerated drug discovery and the reduction of attrition rates due to negative ADMET results.
With the entry of AI in the field of drug design and development, the drug industry will be inspired to develop drugs not only in a faster manner but also drugs that can target complex diseases with unrivaled accuracy. The synergistic application of computation tools and AI technologies has contributed to public health improvement and aligning the United Nation’s SDG number 3 on good health and well-being. Innovative computational resources have accelerated the drug research pipeline, and subsequently, cancer patients have benefited from innovative cancer therapies. The AI-driven predictions have enabled medicinal chemists to tailor cancer management regimens to patient characteristics such as drug response patterns and tumour heterogeneity, reducing any potential adverse reactions and promoting healthy access and equity globally. The emerging cutting-edge approaches in oncology research have also contributed to the growth of the biotechnology and pharmaceutical sectors, which aligns with UN SDG innovation and sustainable industrialization. By promoting oncology research, medicinal chemists can address the gaps in medicinal research, improve accessibility and affordability of healthcare services and contribute towards the achievement of SDGs related to public health and industrialization.
Data availability
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
References
Alotaibi BS, Ijaz M, Buabeid M, Kharaba ZJ, Yaseen HS, Murtaza G. Therapeutic effects and safe uses of plant-derived polyphenolic compounds in cardiovascular diseases: a review. Drug Des Devel Ther. 2021;31:4713–32.
Ebrahimi F, Farzaei MH, Bahramsoltani R, Heydari M, Naderinia K, Rahimi R. Plant-derived medicines for neuropathies: a comprehensive review of clinical evidence. Rev Neurosci. 2019;30(6):671–84.
Ge C, Huang X, Zhang S, Yuan M, Tan Z, Xu C, Jie Q, Zhang J, Zou J, Zhu Y. In vitro co-culture systems of hepatic and intestinal cells for cellular pharmacokinetic and pharmacodynamic studies of capecitabine against colorectal cancer. Cancer Cell Int. 2023;23(1):14.
Sanders D. The struggle for health: medicine and the politics of underdevelopment. Oxford University Press. 2023.
Moraes DF, de Mesquita LS, do Amaral FM, de Sousa Ribeiro MN, Malik S. Anticancer drugs from plants. Biotechnology and production of anti-cancer compounds. 2017: 121–142.
Gusain P, Uniyal DP, Joga R. Conservation and sustainable use of medicinal plants. In: preparation of phytopharmaceuticals for the management of disorders. Elsevier. 2021: pp. 409–427
Nedungadi P, Salethoor SN, Puthiyedath R, Nair VK, Kessler C, Raman R. Ayurveda research: emerging trends and mapping to sustainable development goals. J Ayurveda Integr Med. 2023;14(6): 100809.
Sorooshian S. The sustainable development goals of the United Nations: a comparative midterm research review. J Cleaner Product. 2024: 142272.
Solovy EM. The doha declaration at twenty: interpretation, implementation, and lessons learned on the relationship between the TRIPS agreement and global health. Nw J Int’l L Bus. 2021;42:253.
Berdigaliyev N, Aljofan M. An overview of drug discovery and development. Future Med Chem. 2020;12(10):939–47.
Schaduangrat N, Lampa S, Simeon S, Gleeson MP, Spjuth O, Nantasenamat C. Towards reproducible computational drug discovery. J Cheminform. 2020;12:1–30.
Brown N, Ertl P, Lewis R, Luksch T, Reker D, Schneider N. Artificial intelligence in chemistry and drug design. J Comput Aided Mol Des. 2020;34:709–15.
Korshunova M, Ginsburg B, Tropsha A, Isayev O. OpenChem: a deep learning toolkit for computational chemistry and drug design. J Chem Inf Model. 2021;61(1):7–13.
Hasan MR, Alsaiari AA, Fakhurji BZ, Molla MHR, Asseri AH, Sumon MAA, Park MN, Ahammad F, Kim B. Application of mathematical modeling and computational tools in the modern drug design and development process. Molecules. 2022;27(13):4169.
Prieto-Martínez FD, López-López E, Juárez-Mercado KE, Medina-Franco JL. Computational drug design methods—current and future perspectives. In silico drug design. 2019: 19–44.
Garg A, Dewangan HK. Recent advances in drug design and delivery across biological barriers using computational models. Lett Drug Des Discovery. 2022;19(10):865–76.
Decherchi S, Cavalli A. Thermodynamics and kinetics of drug-target binding by molecular simulation. Chem Rev. 2020;120(23):12788–833.
Niu Y, Lin P. Advances of computer-aided drug design (CADD) in the development of anti-Azheimer’s-disease drugs. Drug Discovery Today. 2023;28:103665.
del CarmenQuintalBojórquez N, Campos MR. Traditional and novel computer-aided drug design (CADD) approaches in the anticancer drug discovery process. Curr Cancer Drug Targets. 2023;23(5):333–45.
Kommalapati HS, Pilli P, Golla VM, Bhatt N, Samanthula G. In silico tools to thaw the complexity of the data: revolutionizing drug research in drug metabolism, pharmacokinetics and toxicity prediction. Curr Drug Metab. 2023;24:735–55.
Komura H, Watanabe R, Mizuguchi K. The trends and future prospective of in silico models from the viewpoint of ADME evaluation in drug discovery. Pharmaceutics. 2023;15(11):2619.
Hussein D, Saka M, Baeesa S, Bangash M, Alghamdi F, Al Zughaibi T, AlAjmi MF, Haque S, Rehman MT. Structure-based virtual screening and molecular docking approaches to identify potential inhibitors against KIF2C to combat glioma. J Biomol Struct Dynam. 2023;708:1–14.
Moinul M, Khatun S, Amin SA, Jha T, Gayen S. Recent trends in fragment-based anticancer drug design strategies against different targets: a mini-review. Biochem Pharmacol. 2022;206:115301.
Yadav R, Imran M, Dhamija P, Chaurasia DK, Handu S. Virtual screening, ADMET prediction and dynamics simulation of potential compounds targeting the main protease of SARS-CoV-2. J Biomol Struct Dyn. 2021;39(17):6617–32.
Barge S, Jade D, Ayyamperumal S, Manna P, Borah J, Nanjan CMJ, Nanjan MJ, Talukdar NC. Potential inhibitors for FKBP51: an in silico study using virtual screening, molecular docking and molecular dynamics simulation. J Biomol Struct Dyn. 2022;40(24):13799–811.
Deore AB, Dhumane JR, Wagh R, Sonawane R. The stages of drug discovery and development process. Asian J Pharmaceut Res Develop. 2019;7(6):62–7.
Verma S, Pathak RK. Discovery and optimization of lead molecules in drug designing. In: Bioinformatics. Elsevier. 2022. pp. 253–267
Vemula D, Jayasurya P, Sushmitha V, Kumar YN, Bhandari V. CADD, AI and ML in drug discovery: a comprehensive review. Eur J Pharm Sci. 2023;181: 106324.
Fadaka AO, Aruleba RT, Sibuyi NRS, Klein A, Madiehe AM, Meyer M. Inhibitory potential of repurposed drugs against the SARS-CoV-2 main protease: a computational-aided approach. J Biomol Struct Dyn. 2022;40(8):3416–27.
Acharya C, Coop A, Polli J, MacKerell A. Recent advances in ligand-based drug design: relevance and utility of the conformationally sampled pharmacophore approach. Curr Comput Aided Drug Des. 2011;7(1):10–22.
Saura P, Röpke M, Gamiz-Hernandez AP, Kaila VR. Quantum chemical and QM/MM models in biochemistry. biomolecular simulations: methods and protocols. 2019: 75–104.
Demapan D, Kussmann JR, Ochsenfeld C, Cui Q. Factors that determine the variation of equilibrium and kinetic properties of QM/MM enzyme simulations: QM region, conformation, and boundary condition. J Chem Theory Comput. 2022;18(4):2530–42.
Raghavan B, Paulikat M, Ahmad K, Callea L, Rizzi A, Ippoliti E, Mandelli D, Bonati L, De Vivo M, Carloni P. Drug design in the exascale era: a perspective from massively parallel QM/MM simulations. J Chem Inf Model. 2023;63:3647–58.
Kulkarni PU, Shah H, Vyas VK. Hybrid quantum mechanics/molecular mechanics (QM/MM) simulation: a tool for structure-based drug design and discovery. Mini Rev Med Chem. 2022;22(8):1096–107.
Salo-Ahen OM, Alanko I, Bhadane R, Bonvin AM, Honorato RV, Hossain S, Juffer AH, Kabedev A, Lahtela-Kakkonen M, Larsen AS. Molecular dynamics simulations in drug discovery and pharmaceutical development. Processes. 2020;9(1):71.
Lin X, Li X, Lin X. A review on applications of computational methods in drug screening and design. Molecules. 2020;25(6):1375.
Chang Y, Hawkins BA, Du JJ, Groundwater PW, Hibbs DE, Lai F. A guide to in silico drug design. Pharmaceutics. 2023;15(1):49.
Lin X, Li X, Lin X. A review on applications of computational methods in drug screening and design. Molecules. 2020;25(6):1375.
Ma M, Song J, Dong Y, Fang W, Gao L. Structural and thermodynamic properties of bulk triglycerides and triglyceride/water mixtures reproduced using a polarizable coarse-grained model. Phys Chem Chem Phys. 2023;25(33):22232–43.
Aminpour M, Montemagno C, Tuszynski JA. An overview of molecular modeling for drug discovery with specific illustrative examples of applications. Molecules. 2019;24(9):1693.
Sohlenius-Sternbeck A-K, Terelius Y. Evaluation of ADMET predictor in early discovery drug metabolism and pharmacokinetics project work. Drug Metab Disposition. 2022;50(2):95–104.
Borah P, Hazarika S, Deka S, Venugopala KN, Nair AB, Attimarad M, Sreeharsha N, Mailavaram RP. Application of advanced technologies in natural product research: a review with special emphasis on ADMET profiling. Curr Drug Metab. 2020;21(10):751–67.
van der Kolk MR, Janssen MA, Rutjes FP, Blanco-Ania D. Cyclobutanes in small-molecule drug candidates. ChemMedChem. 2022;17(9): e202200020.
Sabe VT, Ntombela T, Jhamba LA, Maguire GE, Govender T, Naicker T, Kruger HG. Current trends in computer aided drug design and a highlight of drugs discovered via computational techniques: a review. Eur J Med Chem. 2021;224: 113705.
Badar MS, Shamsi S, Ahmed J, Alam MA. Molecular dynamics simulations: concept, methods, and applications. In: Transdisciplinarity. Springer. 2022: pp. 131–151
Taldaev A, Terekhov R, Nikitin I, Zhevlakova A, Selivanova I. Insights into the pharmacological effects of flavonoids: the systematic review of computer modeling. Int J Mol Sci. 2022;23(11):6023.
Adelusi TI, Oyedele A-QK, Boyenle ID, Ogunlana AT, Adeyemi RO, Ukachi CD, Idris MO, Olaoba OT, Adedotun IO, Kolawole OE. Molecular modeling in drug discovery. Informat Med Unlocked. 2022;29: 100880.
Wang L, Chambers J, Abel R. Protein–ligand binding free energy calculations with FEP+. Biomol Simulat Methods Proto. 2019: 201–232.
Cournia Z, Chipot C, Roux B, York DM, Sherman W. Free energy methods in drug discovery—introduction. In: Free energy methods in drug discovery: current state and future directions. ACS Publications. 2021: pp. 1–38.
Ji Y, Yang X, Ji Z, Zhu L, Ma N, Chen D, Jia X, Tang J, Cao Y. DFT-calculated IR spectrum amide I, II, and III band contributions of N-methylacetamide fine components. ACS Omega. 2020;5(15):8572–8.
Sajid H, Addicoat MA. Computational insights of dimensional organic materials. 2023.
Shukla R, Tripathi T. Molecular dynamics simulation in drug discovery: opportunities and challenges. Innovations and Implementations of Computer Aided Drug Discovery Strategies in Rational Drug Design. 2021: 295–316.
Guterres H, Im W. Improving protein-ligand docking results with high-throughput molecular dynamics simulations. J Chem Inf Model. 2020;60(4):2189–98.
Sharma H, Raju B, Narendra G, Motiwale M, Sharma B, Verma H, Silakari O. QM/MM studies on enzyme catalysis and insight into designing of new inhibitors by ONIOM approach: recent update. ChemistrySelect. 2023;8(1): e202203319.
Xia Y, Sun M, Huang H, Jin W-L. Drug repurposing for cancer therapy. Signal Transduct Target Ther. 2024;9(1):92.
Pinzi L, Bisi N, Rastelli G. How drug repurposing can advance drug discovery: challenges and opportunities. Front Drug Discov. 2024;4:1460100.
Fadnis JA, Sawale AV, Padmawar SS. Thalidomide: the journey from curse to boon. World J Bio Pharm Health Sci. 2023;14(3):149–59.
Wimmelbücker L, Kar A. A history of thalidomide in India. Med Hist. 2023;67(3):228–46.
Devi RV, Sathya SS, Coumar MS. Evolutionary algorithms for de novo drug design–A survey. Appl Soft Comput. 2015;27:543–52.
Jean-Quartier C, Jeanquartier F, Jurisica I, Holzinger A. In silico cancer research towards 3R. BMC Cancer. 2018;18:1–12.
Kashkooli FM, Soltani M, Souri M, Meaney C, Kohandel M. Nexus between in silico and in vivo models to enhance clinical translation of nanomedicine. Nano Today. 2021;36: 101057.
Mukherjee P, Roy S, Ghosh D, Nandi S. Role of animal models in biomedical research: a review. Laborat Anim Res. 2022;38(1):18.
Banegas-Luna AJ, Imbernon B, Llanes Castro A, Perez-Garrido A, Ceron-Carrasco JP, Gesing S, Merelli I, D’Agostino D, Perez-Sanchez H. Advances in distributed computing with modern drug discovery. Expert Opin Drug Discov. 2019;14(1):9–22.
Vitali E, Ficarelli F, Bisson M, Gadioli D, Accordi G, Fatica M, Beccari AR, Palermo G. GPU-optimized approaches to molecular docking-based virtual screening in drug discovery: a comparative analysis. JPDC. 2024;186: 104819.
Wu F, Zhou Y, Li L, Shen X, Chen G, Wang X, Liang X, Tan M, Huang Z. Computational approaches in preclinical studies on drug discovery and development. Front Chem. 2020;8:726.
Hu Y, Zhou L, Zhu X, Dai D, Bao Y, Qiu Y. Pharmacophore modeling, multiple docking, and molecular dynamics studies on Wee1 kinase inhibitors. J Biomol Struct Dyn. 2019;37(10):2703–15.
Jayaraj JM, Krishnasamy G, Lee J-K, Muthusamy K. In silico identification and screening of CYP24A1 inhibitors: 3D QSAR pharmacophore mapping and molecular dynamics analysis. J Biomol Struct Dyn. 2019;37(7):1700–14.
Panwar U, Singh SK. Atom-based 3D-QSAR, molecular docking, DFT, and simulation studies of acylhydrazone, hydrazine, and diazene derivatives as IN-LEDGF/p75 inhibitors. Struct Chem. 2021;32:337–52.
Belfield SJ, Firman JW, Enoch SJ, Madden JC, Tollefsen KE, Cronin MT. A review of quantitative structure-activity relationship modelling approaches to predict the toxicity of mixtures. Computat Toxicol. 2023;25: 100251.
Ferreira LL, Andricopulo AD. ADMET modeling approaches in drug discovery. Drug Discovery Today. 2019;24(5):1157–65.
Huggins DJ, Biggin PC, Dämgen MA, Essex JW, Harris SA, Henchman RH, Khalid S, Kuzmanic A, Laughton CA, Michel J. Biomolecular simulations: From dynamics and mechanisms to computational assays of biological activity. Wiley Interdisciplin Rev. 2019;9(3): e1393.
Bera I, Payghan PV. Use of molecular dynamics simulations in structure-based drug discovery. Curr Pharm Des. 2019;25(31):3339–49.
Nian B, Xu Y-J, Liu Y. Molecular dynamics simulation for mechanism revelation of the safety and nutrition of lipids and derivatives in food: State of the art. Food Res Int. 2021;145: 110399.
Wang E, Sun H, Wang J, Wang Z, Liu H, Zhang JZ, Hou T. End-point binding free energy calculation with MM/PBSA and MM/GBSA: strategies and applications in drug design. Chem Rev. 2019;119(16):9478–508.
King E, Aitchison E, Li H, Luo R. Recent developments in free energy calculations for drug discovery. Front Mol Biosci. 2021;8: 712085.
Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general amber force field. J Comput Chem. 2004;25(9):1157–74.
Kumari I, Sandhu P, Ahmed M, Akhter Y. Molecular dynamics simulations, challenges and opportunities: a biologist’s prospective. Curr Protein Pept Sci. 2017;18(11):1163–79.
Leonard AN, Wang E, Monje-Galvan V, Klauda JB. Developing and testing of lipid force fields with applications to modeling cellular membranes. Chem Rev. 2019;119(9):6227–69.
Ghahremanpour MM, Tirado-Rives J, Jorgensen WL. Refinement of the optimized potentials for liquid simulations force field for thermodynamics and dynamics of liquid alkanes. J Phys Chem B. 2022;126(31):5896–907.
Hart K, Foloppe N, Baker CM, Denning EJ, Nilsson L, MacKerell AD Jr. Optimization of the CHARMM additive force field for DNA: Improved treatment of the BI/BII conformational equilibrium. J Chem Theory Comput. 2012;8(1):348–62.
Denning EJ, Priyakumar UD, Nilsson L, Mackerell AD Jr. Impact of 2′-hydroxyl sampling on the conformational properties of RNA: update of the CHARMM all-atom additive force field for RNA. J Comput Chem. 2011;32(9):1929–43.
Foloppe N, MacKerell J, Alexander D. Alexander D: All-atom empirical force field for nucleic acids: I. Parameter optimization based on small molecule and condensed phase macromolecular target data. J Computat Chem. 2000;21(2):86–104.
Schlenkrich M, Brickmann J, MacKerell Jr AD, Karplus M. An empirical potential energy function for phospholipids: criteria for parameter optimization and applications. In: Biological membranes: a molecular perspective from computation and experiment. Springer. 1996: pp. 31–81
Feller SE, Yin D, Pastor RW, MacKerell A. Molecular dynamics simulation of unsaturated lipid bilayers at low hydration: parameterization and comparison with diffraction studies. Biophys J. 1997;73(5):2269–79.
Tian C, Kasavajhala K, Belfon KA, Raguette L, Huang H, Migues AN, Bickel J, Wang Y, Pincay J, Wu Q. ff19SB: Amino-acid-specific protein backbone parameters trained against quantum mechanics energy surfaces in solution. J Chem Theory Comput. 2019;16(1):528–52.
Kirschner KN, Yongye AB, Tschampel SM, González-Outeiriño J, Daniels CR, Foley BL, Woods RJ. GLYCAM06: a generalizable biomolecular force field. Carbohydrates. J Computat Chem. 2008;29(4):622–55.
Galindo-Murillo R, Robertson JC, Zgarbova M, Sponer J, Otyepka M, Jurecka P, Cheatham TE III. Assessing the current state of amber force field modifications for DNA. J Chem Theory Comput. 2016;12(8):4114–27.
Bergonzo C, Cheatham TE III. Improved force field parameters lead to a better description of RNA structure. J Chem Theory Comput. 2015;11(9):3969–72.
Pol-Fachin L, Rusu VH, Verli H, Lins RD. GROMOS 53A6GLYC, an improved GROMOS force field for hexopyranose-based carbohydrates. J Chem Theory Comput. 2012;8(11):4681–90.
Marzuoli I, Margreitter C, Fraternali F. Lipid head group parameterization for GROMOS 54A8: a consistent approach with protein force field description. J Chem Theory Comput. 2019;15(10):5175–93.
Oostenbrink C, Soares TA, Van Der Vegt NF, Van Gunsteren WF. Validation of the 53A6 GROMOS force field. Eur Biophys J. 2005;34:273–84.
Kony D, Damm W, Stoll S, Van Gunsteren WF. An improved OPLS–AA force field for carbohydrates. J Comput Chem. 2002;23(15):1416–29.
Robertson MJ, Qian Y, Robinson MC, Tirado-Rives J, Jorgensen WL. Development and testing of the OPLS-AA/M Force Field for RNA. J Chem Theory Comput. 2019;15(4):2734–42.
Maciejewski A, Pasenkiewicz-Gierula M, Cramariuc O, Vattulainen I, Rog T. Refined OPLS all-atom force field for saturated phosphatidylcholine bilayers at full hydration. J Phys Chem B. 2014;118(17):4571–81.
Dodda LS, Cabeza de Vaca I, Tirado-Rives J, Jorgensen WL. LigParGen web server: an automatic OPLS-AA parameter generator for organic ligands. Nucleic Acids Res. 2017;45(1):W331–6.
Kalimeri M, Derreumaux P, Sterpone F. Are coarse-grained models apt to detect protein thermal stability? The case of OPEP force field. J Non-Cryst Solids. 2015;407:494–501.
Liwo A, Ołdziej S, Pincus MR, Wawak RJ, Rackovsky S, Scheraga HA. A united-residue force field for off-lattice protein-structure simulations. I Functional forms and parameters of long-range side-chain interaction potentials from protein crystal data. J Computat Chem. 1997;18(7):849–73.
Pasi M, Lavery R, Ceres N. PaLaCe: a coarse-grain protein model for studying mechanical properties. J Chem Theory Comput. 2013;9(1):785–93.
Gautieri A, Russo A, Vesentini S, Redaelli A, Buehler MJ. Coarse-grained model of collagen molecules using an extended MARTINI force field. J Chem Theory Comput. 2010;6(4):1210–8.
Uusitalo JJ, Ingólfsson HI, Akhshi P, Tieleman DP, Marrink SJ. Martini coarse-grained force field: extension to DNA. J Chem Theory Comput. 2015;11(8):3932–45.
Siani P, de Souza R, Dias L, Itri R, Khandelia H. An overview of molecular dynamics simulations of oxidized lipid systems, with a comparison of ELBA and MARTINI force fields for coarse grained lipid simulations. Biochim Biophys Acta. 2016;1858(10):2498–511.
Alessandri R, Barnoud J, Gertsen AS, Patmanidis I, de Vries AH, Souza PC, Marrink SJ. Martini 3 coarse-grained force field: small molecules. Adv Theory Simulat. 2022;5(1):2100391.
Singh S, Baker QB, Singh DB. Molecular docking and molecular dynamics simulation. In: Bioinformatics. Elsevier. 2022: pp. 291–304.
Sael L, Kihara D. Detecting local ligand-binding site similarity in nonhomologous proteins by surface patch comparison. Prot Struct Funct Bioinformat. 2012;80(4):1177–95.
Shin W-H, Bures MG, Kihara D. PatchSurfers: two methods for local molecular property-based binding ligand prediction. Methods. 2016;93:41–50.
Kumar R, Kumar S, Sangwan S, Yadav IS, Yadav R. Protein modeling and active site binding mode interactions of myrosinase–sinigrin in Brassica juncea—An in silico approach. J Mol Graph Model. 2011;29(5):740–6.
Patel AR, Patel HB, Mody SK, Singh RD, Sarvaiya VN, Vaghela SH, Tukra S. Virtual screening in drug discovery. J Veterin Pharmacol Toxicol. 2021;20(2):1–9.
Mohammad T, Mathur Y, Hassan MI. InstaDock: a single-click graphical user interface for molecular docking-based virtual high-throughput screening. Brief Bioinform. 2021;22(4):bbaa279.
Grasso G, Di Gregorio A, Mavkov B, Piga D, Labate GFDU, Danani A, Deriu MA. Fragmented blind docking: a novel protein–ligand binding prediction protocol. J Biomol Struct Dyn. 2022;40(24):13472–81.
Jin Z, Du X, Xu Y, Deng Y, Liu M, Zhao Y, Zhang B, Li X, Zhang L, Peng C. Structure of Mpro from SARS-CoV-2 and discovery of its inhibitors. Nature. 2020;582(7811):289–93.
Hu X, Shrimp JH, Guo H, Xu M, Chen CZ, Zhu W, Zakharov AV, Jain S, Shinn P, Simeonov A. Discovery of TMPRSS2 inhibitors from virtual screening as a potential treatment of COVID-19. ACS Pharmacol Translat Sci. 2021;4(3):1124–35.
Pierce BG, Wiehe K, Hwang H, Kim B-H, Vreven T, Weng Z. ZDOCK server: interactive docking prediction of protein–protein complexes and symmetric multimers. Bioinformatics. 2014;30(12):1771–3.
Wang Z, Sun H, Yao X, Li D, Xu L, Li Y, Tian S, Hou T. Comprehensive evaluation of ten docking programs on a diverse set of protein–ligand complexes: the prediction accuracy of sampling power and scoring power. Phys Chem Chem Phys. 2016;18(18):12964–75.
Sauton N, Lagorce D, Villoutreix BO, Miteva MA. MS-DOCK: accurate multiple conformation generator and rigid docking protocol for multi-step virtual ligand screening. BMC Bioinformat. 2008;9(1):1–12.
Taylor RD, Jewsbury PJ, Essex JW. FDS: flexible ligand and receptor docking with a continuum solvent model and soft-core energy function. J Comput Chem. 2003;24(13):1637–56.
Goodsell DS, Olson AJ. Automated docking of substrates to proteins by simulated annealing. Proteins Struct Funct Bioinformat. 1990;8(3):195–202.
Corbeil CR, Williams CI, Labute P. Variability in docking success rates due to dataset preparation. J Comput Aided Mol Des. 2012;26(6):775–86.
Mcgann MR, Almond HR, Nicholls A, Grant JA, Brown FK. Gaussian docking functions. Biopolym Origin Res Biomol. 2003;68(1):76–90.
Jain AN. Surflex: fully automatic flexible molecular docking using a molecular similarity-based search engine. J Med Chem. 2003;46(4):499–511.
Stanzione F, Giangreco I, Cole JC. Use of molecular docking computational tools in drug discovery. Prog Med Chem. 2021;60:273–343.
Singh DB, Pathak RK. Computational approaches in drug designing and their applications. Exp Protoc Biotechnol. 2020: 95–117.
Agnihotry S, Pathak RK, Srivastav A, Shukla PK, Gautam B. Molecular docking and structure-based drug design. Computer-Aided Drug Design. 2020: 115–131.
Kumar S, Kumar S. Molecular docking: a structure-based approach for drug repurposing. In: In Silico Drug Design. Elsevier. 2019: pp. 161–189.
Fan J, Fu A, Zhang L. Progress in molecular docking. Quantitat Bio. 2019;7:83–9.
Chen R, Li L, Weng Z. ZDOCK: an initial-stage protein-docking algorithm. Prote Struct Funct Bioinformat. 2003;52(1):80–7.
Surana KR, Ahire ED, Sonawane VN, Talele SG. Biomolecular and molecular docking: a modern tool in drug discovery and virtual screening of natural products. In: Applied Pharmaceutical Practice and Nutraceuticals. Apple Academic Press. 2021: pp. 209–223.
Duela S, Umamageswari A, Prabavathi R, Umapathy P, Raja K. Quantum assisted genetic algorithm for sequencing compatible amino acids in drug design. In: 2023 Third international conference on advances in electrical, computing, communication and sustainable technologies (ICAECT) 2023, pp. 1–7. IEEE
Steinmann C, Jensen JH. Using a genetic algorithm to find molecules with good docking scores. PeerJ Phys Chem. 2021;3: e18.
Luengo D, Martino L, Bugallo M, Elvira V, Särkkä S. A survey of Monte Carlo methods for parameter estimation. EURASIP J Adv Signal Process. 2020;2020(1):1–62.
Balandat M, Karrer B, Jiang D, Daulton S, Letham B, Wilson AG, Bakshy E. BoTorch: a framework for efficient Monte-Carlo Bayesian optimization. Adv Neural Inf Process Syst. 2020;33:21524–38.
Torres PH, Sodero AC, Jofily P, Silva-Jr FP. Key topics in molecular docking for drug design. Int J Mol Sci. 2019;20(18):4574.
Ghosh A, Panda P, Halder AK, Cordeiro MND. In silico characterization of aryl benzoyl hydrazide derivatives as potential inhibitors of RdRp enzyme of H5N1 influenza virus. Front Pharmacol. 2022;13:1004255.
Morris GM, Goodsell DS, Halliday RS, Huey R, Hart WE, Belew RK, Olson AJ. Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. J Comput Chem. 1998;19(14):1639–62.
Meiler J, Baker D. ROSETTALIGAND: protein–small molecule docking with full side-chain flexibility. Prot Struct Funct Bioinformat. 2006;65(3):538–48.
Totrov M, Abagyan R. Flexible protein–ligand docking by global energy optimization in internal coordinates. Prot Struct Funct Bioinformat. 1997;29(1):215–20.
Taylor JS, Burnett RM. DARWIN: a program for docking flexible molecules. Prot Struct Funct Bioinformat. 2000;41(2):173–91.
Jones G, Willett P, Glen RC, Leach AR, Taylor R. Development and validation of a genetic algorithm for flexible docking. J Mol Biol. 1997;267(3):727–48.
Choi V. YUCCA: an efficient algorithm for small-molecule docking. Chem Biodivers. 2005;2(11):1517–24.
Trosset JY, Scheraga HA. PRODOCK: software package for protein modeling and docking. J Comput Chem. 1999;20(4):412–27.
Hart TN, Read RJ. A multiple-start Monte Carlo docking method. Prot Struct Funct Bioinformat. 1992;13(3):206–22.
Liu M, Wang S. MCDOCK: a Monte Carlo simulation approach to the molecular docking problem. J Comput Aided Mol Des. 1999;13:435–51.
Rarey M, Kramer B, Lengauer T, Klebe G. A fast flexible docking method using an incremental construction algorithm. J Mol Biol. 1996;261(3):470–89.
Venkatachalam CM, Jiang X, Oldfield T, Waldman M. LigandFit: a novel method for the shape-directed rapid docking of ligands to protein active sites. J Mol Graph Model. 2003;21(4):289–307.
He J, Wang J, Tao H, Xiao Y, Huang S-Y. HNADOCK: a nucleic acid docking server for modeling RNA/DNA–RNA/DNA 3D complex structures. Nucleic Acids Res. 2019;47(W1):W35–42.
Tietze S, Apostolakis J. GlamDock: development and validation of a new docking tool on several thousand protein−ligand complexes. J Chem Inf Model. 2007;47(4):1657–72.
Miller MD, Kearsley SK, Underwood DJ, Sheridan RP. FLOG: a system to select ‘quasi-flexible’ligands complementary to a receptor of known three-dimensional structure. J Comput Aided Mol Des. 1994;8:153–74.
Pang YP, Perola E, Xu K, Prendergast FG. EUDOC: a computer program for identification of drug interaction sites in macromolecules and drug leads from chemical databases. J Comput Chem. 2001;22(15):1750–71.
Clark KP. Ajay: Flexible ligand docking without parameter adjustment across four ligand–receptor complexes. J Comput Chem. 1995;16(10):1210–26.
Pei J, Wang Q, Liu Z, Li Q, Yang K, Lai L. PSI-DOCK: Towards highly efficient and accurate flexible ligand docking. Prot Struct Funct Bioinformat. 2006;62(4):934–46.
Corbeil CR, Englebienne P, Moitessier N. Docking ligands into flexible and solvated macromolecules. 1. Development and validation of FITTED 10. J Chem Inf Model. 2007;47(2):435–49.
Charifson PS, Corkery JJ, Murcko MA, Walters WP. Consensus scoring: a method for obtaining improved hit rates from docking databases of three-dimensional structures into proteins. J Med Chem. 1999;42(25):5100–9.
Schnecke V, Kuhn LA. Database screening for HIV protease ligands: the influence of binding-site conformation and representation on ligand selectivity. In: ISMB 1999: pp. 242–251
Zsoldos Z, Reid D, Simon A, Sadjad BS, Peter Johnson A. eHiTS: an innovative approach to the docking and scoring function problems. Curr Protein Pept Sci. 2006;7(5):421–35.
Welch W, Ruppert J, Jain AN. Hammerhead: fast, fully automated docking of flexible ligands to protein binding sites. Chem Biol. 1996;3(6):449–62.
Seifert MH, Schmitt F, Herz T, Kramer B. ProPose: a docking engine based on a fully configurable protein–ligand interaction model. J Mol Model. 2004;10:342–57.
Friesner RA, Banks JL, Murphy RB, Halgren TA, Klicic JJ, Mainz DT, Repasky MP, Knoll EH, Shelley M, Perry JK. Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J Med Chem. 2004;47(7):1739–49.
Tessaro F, Scapozza L. How ‘protein-docking’translates into the new emerging field of docking small molecules to nucleic acids? Molecules. 2020;25(12):2749.
Parikesit AA, Ansori ANM, Kharisma VD. A computational design of siRNA in SARS-CoV-2 spike glycoprotein gene and its binding capability toward mRNA. Indonesian J Chem. 2022;22(5):1163–76.
Valeska MD, Parikesit AA. Determination of 3D structure and molecular interaction for mir-135b and its silencer as Triple Negative Breast Cancer (TNBC) biomarkers. Berkala Penelitian Hayati. 2022;28(1):62–6.
Sztuba-Solinska J, Chavez-Calvillo G, Cline SE. Unveiling the druggable RNA targets and small molecule therapeutics. Biorg Med Chem. 2019;27(10):2149–65.
Berdasco M, Esteller M. Towards a druggable epitranscriptome: Compounds that target RNA modifications in cancer. Br J Pharmacol. 2022;179(12):2868–89.
Nithin C, Ghosh P, Bujnicki JM. Bioinformatics tools and benchmarks for computational docking and 3D structure prediction of RNA-protein complexes. Genes. 2018;9(9):432.
Umare M, Alkathiri FA, Chikhale R. Development of nucleic acid targeting molecules: Molecular docking approaches and recent advances. Mol Dock Recent Adv. 2022. https://doi.org/10.5772/intechopen.107349.
Luo J, Wei W, Waldispühl J, Moitessier N. Challenges and current status of computational methods for docking small molecules to nucleic acids. Eur J Med Chem. 2019;168:414–25.
Sato K, Hamada M. Recent trends in RNA informatics: a review of machine learning and deep learning for RNA secondary structure prediction and RNA drug discovery. Brief Bioinform. 2023;24(4):bbad186.
Muhammed MT, Aki-Yalcin E. Molecular docking: principles, advances, and its applications in drug discovery. Lett Drug Des Discover. 2024;21(3):480–95.
Mullins MR. How Yasukuni shrine survived the occupation: a critical examination of popular claims. Monum Nippon. 2010;65(1):89–136.
Mathur N, Sai S, Shandily S, Santoki KM, Vadhavana NN, Shah S, Chandra M. In silico docking: protocols for computational exploration of molecular interactions. 2024.
Kumar S, Kumar Y. Innovations in molecular docking: a detailed analysis of methodological developments and their applications in drug discovery. Int J Pharma Profession Res. 2024;15(3):52–67.
Zhu J, Xia Y, Liu C, Wu L, Xie S, Wang Y, Wang T, Qin T, Zhou W, Li H. Direct molecular conformation generation. arXiv preprint arXiv:2202.01356. 2022.
Amit R. 7 limitations of molecular docking & computer aided drug design. https://amitray.com/7-limitations-of-molecular-docking-computer-aided-drug-design-and-discovery/. Accessed 27 August 2024
Jakhar R, Dangi M, Khichi A, Chhillar AK. Relevance of molecular docking studies in drug designing. Curr Bioinform. 2020;15(4):270–8.
Zhang X, Shen C, Zhang H, Kang Y, Hsieh C-Y, Hou T. Advancing ligand docking through deep learning: challenges and prospects in virtual screening. Acc Chem Res. 2024;57(10):1500–9.
Friedman R. Computational studies of protein–drug binding affinity changes upon mutations in the drug target. Wiley Interdisciplin Rev. 2022;12(1): e1563.
Çınaroğlu SIS, Biggin PC. Evaluating the performance of water models with host–guest force fields in binding enthalpy calculations for cucurbit [7] uril–guest systems. J Phys Chem B. 2021;125(6):1558–67.
Plazinska A, Plazinski W. Comparison of carbohydrate force fields in molecular dynamics simulations of protein–carbohydrate complexes. J Chem Theory Comput. 2021;17(4):2575–85.
Zhang H, Wang Z, Ren J, Liu J, Li J. Ultra-fast and accurate binding energy prediction of shuttle effect-suppressive sulfur hosts for lithium-sulfur batteries using machine learning. Energy Storage Materials. 2021;35:88–98.
Kaya ED, Türkhan A, Gür F, Gür B. A novel method for explaining the product inhibition mechanisms via molecular docking: inhibition studies for tyrosinase from Agaricus bisporus. J Biomol Struct Dyn. 2022;40(17):7926–39.
Santos LH, Ferreira RS, Caffarena ER. Integrating molecular docking and molecular dynamics simulations. Docking Screens For Drug Discovery. 2019: 13–34
Rastogi S, Rastogi P, Mendiratta N. Bioinformatics: methods and applications-genomics, proteomics and drug discovery. PHI Learning Pvt. Ltd.. 2022
Noor F, Tahir ul Qamar M, Ashfaq UA, Albutti A, Alwashmi AS, Aljasir MA. Network pharmacology approach for medicinal plants: review and assessment. Pharmaceuticals. 2022;15(5):572.
Okpo E, Agboke A, Udobi C, John G, Andy I. The synergy of molecular docking and bioinformatics: an in depth review in drug discovery. Biotechnol J Int. 2024;28(4):119–36.
Aucar MG, Cavasotto CN. Molecular docking using quantum mechanical-based methods. Quantum Mechan Drug Discov. 2020;18:269–84.
Dawson W, Degomme A, Stella M, Nakajima T, Ratcliff LE, Genovese L. Density functional theory calculations of large systems: Interplay between fragments, observables, and computational complexity. Wiley Interdisciplin Rev. 2022;12(3): e1574.
Chen B, Mansour B, Zheng E, Liu Y, Gauld JW, Wang Q. Fundamentals behind the specificity of Cysteinyl-tRNA synthetase: MD and QM/MM joint investigations. Prot Struct Funct Bioinformat. 2023;91(3):354–62.
Ye N, Yang Z, Liu Y. Applications of density functional theory in COVID-19 drug modeling. Drug Discovery Today. 2022;27(5):1411–9.
Magalhães RP, Fernandes HS, Sousa SF. Modelling enzymatic mechanisms with QM/MM approaches: current status and future challenges. Isr J Chem. 2020;60(7):655–66.
Giese TJ, Zeng J, Ekesan S, York DM. Combined QM/MM, machine learning path integral approach to compute free energy profiles and kinetic isotope effects in RNA cleavage reactions. J Chem Theory Comput. 2022;18(7):4304–17.
Vennelakanti V, Nazemi A, Mehmood R, Steeves AH, Kulik HJ. Harder, better, faster, stronger: large-scale QM and QM/MM for predictive modeling in enzymes and proteins. Curr Opin Struct Biol. 2022;72:9–17.
Kar RK. Benefits of hybrid QM/MM over traditional classical mechanics in pharmaceutical systems. Drug Discovery Today. 2023;28(1): 103374.
Kang J, Tateno M. Recent applications of hybrid Ab initio quantum mechanics–molecular mechanics simulations to biological macromolecules. In: Some Applications of Quantum Mechanics. IntechOpen. 2012.
Sousa SF, Ribeiro AJ, Neves RP, Brás NF, Cerqueira NM, Fernandes PA, Ramos MJ. Application of quantum mechanics/molecular mechanics methods in the study of enzymatic reaction mechanisms. Wiley Interdisciplin Rev. 2017;7(2): e1281.
Kumar S, Rao NS, Reddy KP, Padole MC, Deshpande PA. Enzyme–substrate interactions in orotate-mimetic OPRT inhibitor complexes: a QM/MM analysis. Phys Chem Chem Phys. 2023;25(4):3472–84.
Alonso-Cotchico L, Rodrı́guez-Guerra J, Lledos A, Marechal JD. Molecular modeling for artificial metalloenzyme design and optimization. Acc Chem Res. 2020: 53(4); 896–905.
Omer A, Suryanarayanan V, Selvaraj C, Singh SK, Singh P. Explicit drug re-positioning: predicting novel drug–target interactions of the shelved molecules with qm/mm based approaches. Adv Protein Chem Struct Biol. 2015;100:89–112.
Schwinn K, Ferré N, Huix-Rotllant M. Efficient analytic second derivative of electrostatic embedding QM/MM energy: normal mode analysis of plant cryptochrome. J Chem Theory Comput. 2020;16(6):3816–24.
Sauer S. Implementation and Application of QM/MM Hybrid Methods. Universität Würzburg. 2021
Yusef Buey M, Mineva T, Rapacioli M. Coupling density functional based tight binding with class 1 force fields in a hybrid QM/MM scheme. Theor Chem Acc. 2022;141(3):16.
Pérez-Barcia Á, Cárdenas G, Nogueira JJ, Mandado M. Effect of the QM size, basis set, and polarization on QM/MM interaction energy decomposition analysis. J Chem Inf Model. 2023;63(3):882–97.
Rivera M, Dommett M, Crespo-Otero R. ONIOM (QM: QM′) electrostatic embedding schemes for photochemistry in molecular crystals. J Chem Theory Comput. 2019;15(4):2504–16.
Tuttle T. Quantum mechanical/molecular mechanical approaches in drug design. drug design strategies computational techniques and applications. 2012: 1–26.
Kramlinger VM, Dalvie D, Heck CJ, Kalgutkar AS, O’Neill J, Su D, Teitelbaum AM, Totah RA. Future of biotransformation science in the pharmaceutical industry. Drug Metab Disposition. 2022;50(3):258–67.
Wang S, Ballard TE, Christopher LJ, Foti RS, Gu C, Khojasteh SC, Liu J, Ma S, Ma B, Obach RS. The importance of tracking “missing” metabolites: how and why? J Med Chem. 2023;66(23):15586–612.
Hwang D-J, He Y, Ponnusamy S, Thiyagarajan T, Mohler ML, Narayanan R, Miller DD. Metabolism-guided selective androgen receptor antagonists: design, synthesis, and biological evaluation for activity against enzalutamide-resistant prostate cancer. J Med Chem. 2023;66(5):3372–92.
Nassar AF. Chemical structural alert and reactive metabolite concept as applied in medicinal chemistry to minimize the toxicity of drug candidates. Drug Metabol Handbook. 2022;1:345–72.
Agamah FE, Mazandu GK, Hassan R, Bope CD, Thomford NE, Ghansah A, Chimusa ER. Computational/in silico methods in drug target and lead prediction. Brief Bioinform. 2020;21(5):1663–75.
Yadav J, El Hassani M, Sodhi J, Lauschke VM, Hartman JH, Russell LE. Recent developments in in vitro and in vivo models for improved translation of preclinical pharmacokinetics and pharmacodynamics data. Drug Metab Rev. 2021;53(2):207–33.
Tran TTV, Tayara H, Chong KT. Artificial intelligence in drug metabolism and excretion prediction: recent advances, challenges, and future perspectives. Pharmaceutics. 2023;15(4):1260.
de Bruyn Kops C, Šícho M, Mazzolari A, Kirchmair J. GLORYx: prediction of the metabolites resulting from phase 1 and phase 2 biotransformations of xenobiotics. Chem Res Toxicol. 2020;34(2):286–99.
Hassenberg C, Clausen F, Hoffmann G, Studer A, Schürenkamp J. Investigation of phase II metabolism of 11-hydroxy-Δ-9-tetrahydrocannabinol and metabolite verification by chemical synthesis of 11-hydroxy-Δ-9-tetrahydrocannabinol-glucuronide. Int J Legal Med. 2020;134:2105–19.
Guo J, Zhu X, Badawy S, Ihsan A, Liu Z, Xie C, Wang X. Metabolism and mechanism of human cytochrome P450 enzyme 1A2. Curr Drug Metab. 2021;22(1):40–9.
Eddershaw P, Dickins M. Phase I metabolism. In: A handbook of bioanalysis and drug metabolism. CRC Press. 2021: pp. 208–221
Farrukh M, Shahzadi S, Irfan M. Drug metabolism: phase I and phase II metabolic pathways. In: drug metabolism and pharmacokinetics. IntechOpen. 2024. p. 382–437.
Valodara AM, Kaid Sr J. Sexual dimorphism in drug metabolism and pharmacokinetics. Curr Drug Metab. 2019;20(14):1154–66.
Jamwal R, Barlock BJ. Nonalcoholic fatty liver disease (NAFLD) and hepatic cytochrome P450 (CYP) enzymes. Pharmaceuticals. 2020;13(9):222.
Zhao M, Ma J, Li M, Zhang Y, Jiang B, Zhao X, Huai C, Shen L, Zhang N, He L. Cytochrome P450 enzymes and drug metabolism in humans. Int J Mol Sci. 2021;22(23):12808.
Malki MA, Pearson ER. Drug–drug–gene interactions and adverse drug reactions. Pharmacogenomics J. 2020;20(3):355–66.
Bettonte S, Berton M, Marzolini C. Magnitude of drug-drug interactions in special populations. Pharmaceutics. 2022;14(4):789.
Krishna MV, Padmalatha K, Madhavi G. In vitro metabolic stability of drugs and applications of LC-MS in metabolite profiling. Drug Metab. 2021: 77.
Djoumbou-Feunang Y, Fiamoncini J, Gil-de-la-Fuente A, Greiner R, Manach C, Wishart DS. BioTransformer: a comprehensive computational tool for small molecule metabolism prediction and metabolite identification. J Cheminform. 2019;11:1–25.
Tian S, Cao X, Greiner R, Li C, Guo A, Wishart DS. CyProduct: a software tool for accurately predicting the byproducts of human cytochrome P450 metabolism. J Chem Inf Model. 2021;61(6):3128–40.
Hwang S, Shin HK, Shin SE, Seo M, Jeon H-N, Yim D-E, Kim D-H, No KT. PreMetabo: an in silico phase I and II drug metabolism prediction platform. Drug Metab Pharmacokinet. 2020;35(4):361–7.
Šícho M, Stork C, Mazzolari A, de Bruyn Kops C, Pedretti A, Testa B, Vistoli G, Svozil D, Kirchmair J. FAME 3: predicting the sites of metabolism in synthetic compounds and natural products for phase 1 and phase 2 metabolic enzymes. J Chem Inf Model. 2019;59(8):3400–12.
Hsiao Y, Su B-H, Tseng YJ. Current development of integrated web servers for preclinical safety and pharmacokinetics assessments in drug development. Brief Bioinform. 2021;22(3):bbaa160.
Venkatraman V. FP-ADMET: a compendium of fingerprint-based ADMET prediction models. J Cheminform. 2021;13:1–12.
Litsa EE, Das P, Kavraki LE. Machine learning models in the prediction of drug metabolism: challenges and future perspectives. Expert Opin Drug Metab Toxicol. 2021;17(11):1245–7.
Jiménez-Luna J, Grisoni F, Schneider G. Drug discovery with explainable artificial intelligence. Nat Mach Intellig. 2020;2(10):573–84.
Schwaller P, Hoover B, Reymond J-L, Strobelt H, Laino T. Extraction of organic chemistry grammar from unsupervised learning of chemical reactions. Sci Adv. 2021;7(15):eabe4166.
Banerjee D. Enduring cancer: life, death, and diagnosis in Delhi. Duke University Press. 2020.
Sriharikrishnaa S, Suresh PS, Prasada K S. An introduction to fundamentals of cancer biology. In: optical polarimetric modalities for biomedical research. Springer. 2023: 307–330
Anand U, Dey A, Chandel AKS, Sanyal R, Mishra A, Pandey DK, De Falco V, Upadhyay A, Kandimalla R, Chaudhary A. Cancer chemotherapy and beyond: current status, drug candidates, associated risks and progress in targeted therapeutics. Genes & Diseases. 2023;10(4):1367–401.
Nussinov R, Tsai C-J, Jang H. Anticancer drug resistance: an update and perspective. Drug Resist Updates. 2021;59: 100796.
Ma C, Peng Y, Li H, Chen W. Organ-on-a-chip: a new paradigm for drug development. Trends Pharmacol Sci. 2021;42(2):119–33.
Zhu R, Vora B, Menon S, Younis I, Dwivedi G, Meng Z, Datta-Mannan A, Manchandani P, Nayak S, Tammara BK. Clinical Pharmacology applications of real-world data and real-world evidence in drug development and approval–an industry perspective. Clin Pharmacol Ther. 2023;114(4):751–67.
Aggarwal D, Yang J, Salam MA, Sengupta S, Al-Amin MY, Mustafa S, Khan MA, Huang X, Pawar JS. Antibody-drug conjugates: the paradigm shifts in the targeted cancer therapy. Front Immunol. 2023;14:1203073.
Sharma M, Bakshi AK, Mittapelly N, Gautam S, Marwaha D, Rai N, Singh N, Tiwari P, Agarwal N, Kumar A. Recent updates on innovative approaches to overcome drug resistance for better outcomes in cancer. J Control Release. 2022;346:43–70.
Roy S, Kumar A, Islam MS, Rabbi FA, Paul P, Mia MM, Islam A, Ray AK. Drug resistance and its future perspectives in cancer treatment. Asian Oncol Res J. 2020;3:26–46.
Knezevic CE, Clarke W. Cancer chemotherapy: the case for therapeutic drug monitoring. Ther Drug Monit. 2020;42(1):6–19.
Courtin A, Richards FM, Bapiro TE, Bramhall JL, Neesse A, Cook N, Krippendorff B-F, Tuveson DA, Jodrell DI. Anti-tumour efficacy of capecitabine in a genetically engineered mouse model of pancreatic cancer. PLoS ONE. 2013;8(6): e67330.
Zhang Y, Cui H, Zhang R, Zhang H, Huang W. Nanoparticulation of prodrug into medicines for cancer therapy. Adv Sci. 2021;8(18):2101454.
Alqahtani S, Alzaidi R, Alsultan A, Asiri A, Asiri Y, Alsaleh K. Clinical pharmacokinetics of capecitabine and its metabolites in colorectal cancer patients. Saudi Pharmaceut J. 2022;30(5):527–31.
Ioele G, Chieffallo M, Occhiuzzi MA, De Luca M, Garofalo A, Ragno G, Grande F. Anticancer drugs: recent strategies to improve stability profile, pharmacokinetic and pharmacodynamic properties. Molecules. 2022;27(17):5436.
Crombag M-RB, Joerger M, Thürlimann B, Schellens JH, Beijnen JH, Huitema AD. Pharmacokinetics of selected anticancer drugs in elderly cancer patients: focus on breast cancer. Cancers. 2016;8(1):6.
Cardoso E, Csajka C, Schneider MP, Widmer N. Effect of adherence on pharmacokinetic/pharmacodynamic relationships of oral targeted anticancer drugs. Clin Pharmacokinet. 2018;57(1):1–6.
Yu Y, Rüppel D, Weber W, Derendorf H. PK/PD approaches. Drug Discover Evaluat Methods Clin Pharmacol. 2020;57:1047–69.
Rodríguez-Gascón A, Solinís MÁ, Isla A. The role of PK/PD analysis in the development and evaluation of antimicrobials. Pharmaceutics. 2021;13(6):833.
Tandon H, Chakraborty T, Suhag V. A brief review on importance of DFT in drug design. Res Med Eng Stud. 2019;39:46.
Noureddine O, Gatfaoui S, Brandan SA, Sagaama A, Marouani H, Issaoui N. Experimental and DFT studies on the molecular structure, spectroscopic properties, and molecular docking of 4-phenylpiperazine-1-ium dihydrogen phosphate. J Mol Struct. 2020;1207: 127762.
Bakheit AH, Abuelizz HA, Al-Salahi R. A DFT study and Hirshfeld surface analysis of the molecular structures, radical scavenging abilities and ADMET properties of 2-Methylthio (methylsulfonyl)-[1, 2, 4] triazolo [1, 5-a] quinazolines: guidance for antioxidant drug design. Crystals. 2023;13(7):1086.
Mollaamin F, Monajjemi M. Application of DFT/TD-DFT frameworks in the drug delivery mechanism: investigation of chelated bisphosphonate with transition metal cations in bone treatment. Chemistry. 2023;5(1):365–80.
Bursch M, Hansen A, Pracht P, Kohn JT, Grimme S. Theoretical study on conformational energies of transition metal complexes. Phys Chem Chem Phys. 2021;23(1):287–99.
Chandrasekaran B, Al-Joubi H, Samarneh S, Kassab G, Deb PK, Kumar P, Al-Jaidi BA, Al-Thaher Y, Bataineh YA. Drug-Receptor Interactions. Front Pharmacol Neurotransmitt. 2020;721:31–68.
Jayashankar J, Hema M, Mahmoudi G, Masoudiasl A, Dušek M, Montazerozohori M, Karthik C, Lokanath N. N, N’-bis (2-bromobenzylidene)-2, 2’-diaminodiphenyldisulfide (BBDD): insights of crystal structure, DFT, QTAIM, PASS, ADMET and molecular docking studies. J Mol Struct. 2022;1268: 133657.
Srivastava R. Theoretical studies on the molecular properties, toxicity, and biological efficacy of 21 new chemical entities. ACS Omega. 2021;6(38):24891–901.
Huang Y, Ouyang D, Ji Y. The role of hydrogen-bond in solubilizing drugs by ionic liquids: a molecular dynamics and density functional theory study. AIChE J. 2022;68(6): e17672.
Huo C-M, Chen L, Wang H-Y, Luo S-M, Wang X, Shi Y-F, Zhu J-Y, Xue W. Density functional theory-guided drug loading strategy for sensitized tumor-homing thermotherapy. Chem Eng J. 2021;423: 130146.
Vermeeren P, van der Lubbe SC, Fonseca Guerra C, Bickelhaupt FM, Hamlin TA. Understanding chemical reactivity using the activation strain model. Nat Protoc. 2020;15(2):649–67.
Rajee AO, Obaleye JA, Louis H, Aliyu AA, Lawal A, Chima CM, Ekereke EE, Manicum A-LE. Structural elucidation, DFT study, molecular docking, and biological studies of ruthenium polypyridyl mercaptopurine complexes. J Iran Chem Soc. 2023;20(9):2383–97.
Akkoc S, Karatas H, Muhammed MT, Kökbudak Z, Ceylan A, Almalki F, Laaroussi H, Ben Hadda T. Drug design of new therapeutic agents: molecular docking, molecular dynamics simulation, DFT and POM analyses of new Schiff base ligands and impact of substituents on bioactivity of their potential antifungal pharmacophore site. J Biomol Struct Dyn. 2023;41(14):6695–708.
Islam M, Khan IM, Shakya S, Alam N. Design, synthesis, characterizing and DFT calculations of a binary CT complex co-crystal of bioactive moieties in different polar solvents to investigate its pharmacological activity. J Biomol Struct Dyn. 2023;41(20):10813–29.
Brogi S, Ramalho TC, Kuca K, Medina-Franco JL, Valko M. In silico methods for drug design and discovery. Frontiers Media SA. 2020;8:612.
Shaker B, Ahmad S, Lee J, Jung C, Na D. In silico methods and tools for drug discovery. Comput Biol Med. 2021;137: 104851.
Gramatica P. Principles of QSAR modeling: comments and suggestions from personal experience. Int J Quantitat Struct Property Relationsh. 2020;5(3):61–97.
Raunio H, Kuusisto M, Juvonen RO, Pentikäinen OT. Modeling of interactions between xenobiotics and cytochrome P450 (CYP) enzymes. Front Pharmacol. 2015;6:123.
Hasan AH, Murugesan S, Amran SI, Chander S, Alanazi MM, Hadda TB, Shakya S, Pratama MRF, Das B, Biswas S. Novel thiophene Chalcones-Coumarin as acetylcholinesterase inhibitors: Design, synthesis, biological evaluation, molecular docking, ADMET prediction and molecular dynamics simulation. Bioorg Chem. 2022;119: 105572.
Sheridan RP, Feuston BP, Maiorov VN, Kearsley SK. Similarity to molecules in the training set is a good discriminator for prediction accuracy in QSAR. J Chem Inf Comput Sci. 2004;44(6):1912–28.
Cheng F, Li W, Liu G, Tang Y. In silico ADMET prediction: recent advances, current challenges and future trends. Curr Top Med Chem. 2013;13(11):1273–89.
Gola J, Obrezanova O, Champness E, Segall M. ADMET property prediction: the state of the art and current challenges. QSAR Comb Sci. 2006;25(12):1172–80.
Gräfenstein J, Cremer D. The self-interaction error and the description of non-dynamic electron correlation in density functional theory. Theor Chem Acc. 2009;123:171–82.
Kent PR, Annaberdiyev A, Benali A, Bennett MC, Landinez Borda EJ, Doak P, Hao H, Jordan KD, Krogel JT, Kylänpää I. QMCPACK: Advances in the development, efficiency, and application of auxiliary field and real-space variational and diffusion quantum Monte Carlo. J Chem Phys. 2020;152(17): 174105.
Malone FD, Benali A, Morales MA, Caffarel M, Kent PR, Shulenburger L. Systematic comparison and cross-validation of fixed-node diffusion Monte Carlo and phaseless auxiliary-field quantum Monte Carlo in solids. PhRvB. 2020;102(16): 161104.
Chandershekar A, Bhaskar A, Mekkanti MR, Rinku M. A review on computer aided drug design (CAAD) and it’s implications in drug discovery and development process. Int J Health Care Bio Sci. 2020;8(1):27–33. https://doi.org/10.20959/wjpps20177-9450.
Rajkishan T, Rachana A, Shruti S, Bhumi P, Patel D. Computer-aided drug designing. Adv Bioinformat. 2021;1168:151–82.
Khandelwal A, Lukacova V, Comez D, Kroll DM, Raha S, Balaz S. A combination of docking, QM/MM methods, and MD simulation for binding affinity estimation of metalloprotein ligands. J Med Chem. 2005;48(17):5437–47.
Ahmadi S, Barrios Herrera L, Chehelamirani M, Hostaš J, Jalife S, Salahub DR. Multiscale modeling of enzymes: QM-cluster, QM/MM, and QM/MM/MD: a tutorial review. Int J Quantum Chem. 2018;118(9): e25558.
Jiménez-Luna J, Grisoni F, Weskamp N, Schneider G. Artificial intelligence in drug discovery: recent advances and future perspectives. Expert Opin Drug Discov. 2021;16(9):949–59.
Schneider P, Walters WP, Plowright AT, Sieroka N, Listgarten J, Goodnow RA Jr, Fisher J, Jansen JM, Duca JS, Rush TS. Rethinking drug design in the artificial intelligence era. Nat Rev Drug Discovery. 2020;19(5):353–64.
Patel L, Shukla T, Huang X, Ussery DW, Wang S. Machine learning methods in drug discovery. Molecules. 2020;25(22):5277.
Panwar U, Chandra I, Selvaraj C, Singh SK. Current computational approaches for the development of anti-HIV inhibitors: an overview. Curr Pharm Des. 2019;25(31):3390–405.
Batool A, Bibi N, Amin F, Kamal MA. Drug designing against NSP15 of SARS-COV2 via high throughput computational screening and structural dynamics approach. Eur J Pharmacol. 2021;892: 173779.
Chuntakaruk H, Hengphasatporn K, Shigeta Y, Aonbangkhen C, Lee VS, Khotavivattana T, Rungrotmongkol T, Hannongbua S. FMO-guided design of darunavir analogs as HIV-1 protease inhibitors. Sci Rep. 2024;14(1):3639.
Patel V, Shah M. Artificial intelligence and machine learning in drug discovery and development. Intellig Med. 2022;2(3):134–40.
Vijayan R, Kihlberg J, Cross JB, Poongavanam V. Enhancing preclinical drug discovery with artificial intelligence. Drug Discovery Today. 2022;27(4):967–84.
Gupta R, Srivastava D, Sahu M, Tiwari S, Ambasta RK, Kumar P. Artificial intelligence to deep learning: machine intelligence approach for drug discovery. Mol Divers. 2021;25:1315–60.
Belousova OA, Groen AJ, Ouendag AM. Opportunities and barriers for innovation and entrepreneurship in orphan drug development. Technol Forecast Soc Change. 2020;161: 120333.
Starke G, Ienca M. Misplaced trust and distrust: how not to engage with medical artificial intelligence. Camb Q Healthcare Ethics. 2022. https://doi.org/10.1017/S0963180122000445.
Acknowledgements
The Directorate of Research and Extension of Egerton University is appreciated for supporting this study.
Funding
This study received no specific grants from any funding agency.
Author information
Authors and Affiliations
Contributions
RBOO: Method development, writing & editing, JKK: Conceptualization, Editing & Supervision, SMN: Visualization, Editing, & supervision. All authors have read and approved the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
This article has the consent of all the authors.
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Ouma, R.B.O., Ngari, S.M. & Kibet, J.K. A review of the current trends in computational approaches in drug design and metabolism. Discov Public Health 21, 108 (2024). https://doi.org/10.1186/s12982-024-00229-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12982-024-00229-3