Publications Search
Explore how scientists all over the world use DrugBank in their research.
Published on January 4, 2018
READ PUBLICATION →

The IUPHAR/BPS Guide to PHARMACOLOGY in 2018: updates and expansion to encompass the new guide to IMMUNOPHARMACOLOGY.

Authors: Harding SD, Sharman JL, Faccenda E, Southan C, Pawson AJ, Ireland S, Gray AJG, Bruce L, Alexander SPH, Anderton S, Bryant C, Davenport AP, Doerig C, Fabbro D, Levi-Schaffer F, Spedding M, Davies JA

Abstract: The IUPHAR/BPS Guide to PHARMACOLOGY (GtoPdb, www.guidetopharmacology.org) and its precursor IUPHAR-DB, have captured expert-curated interactions between targets and ligands from selected papers in pharmacology and drug discovery since 2003. This resource continues to be developed in conjunction with the International Union of Basic and Clinical Pharmacology (IUPHAR) and the British Pharmacological Society (BPS). As previously described, our unique model of content selection and quality control is based on 96 target-class subcommittees comprising 512 scientists collaborating with in-house curators. This update describes content expansion, new features and interoperability improvements introduced in the 10 releases since August 2015. Our relationship matrix now describes approximately 9000 ligands, approximately 15 000 binding constants, approximately 6000 papers and approximately 1700 human proteins. As an important addition, we also introduce our newly funded project for the Guide to IMMUNOPHARMACOLOGY (GtoImmuPdb, www.guidetoimmunopharmacology.org). This has been 'forked' from the well-established GtoPdb data model and expanded into new types of data related to the immune system and inflammatory processes. This includes new ligands, targets, pathways, cell types and diseases for which we are recruiting new IUPHAR expert committees. Designed as an immunopharmacological gateway, it also has an emphasis on potential therapeutic interventions.
Published on January 4, 2018
READ PUBLICATION →

ActiveDriverDB: human disease mutations and genome variation in post-translational modification sites of proteins.

Authors: Krassowski M, Paczkowska M, Cullion K, Huang T, Dzneladze I, Ouellette BFF, Yamada JT, Fradet-Turcotte A, Reimand J

Abstract: Interpretation of genetic variation is needed for deciphering genotype-phenotype associations, mechanisms of inherited disease, and cancer driver mutations. Millions of single nucleotide variants (SNVs) in human genomes are known and thousands are associated with disease. An estimated 21% of disease-associated amino acid substitutions corresponding to missense SNVs are located in protein sites of post-translational modifications (PTMs), chemical modifications of amino acids that extend protein function. ActiveDriverDB is a comprehensive human proteo-genomics database that annotates disease mutations and population variants through the lens of PTMs. We integrated >385,000 published PTM sites with approximately 3.6 million substitutions from The Cancer Genome Atlas (TCGA), the ClinVar database of disease genes, and human genome sequencing projects. The database includes site-specific interaction networks of proteins, upstream enzymes such as kinases, and drugs targeting these enzymes. We also predicted network-rewiring impact of mutations by analyzing gains and losses of kinase-bound sequence motifs. ActiveDriverDB provides detailed visualization, filtering, browsing and searching options for studying PTM-associated mutations. Users can upload mutation datasets interactively and use our application programming interface in pipelines. Integrative analysis of mutations and PTMs may help decipher molecular mechanisms of phenotypes and disease, as exemplified by case studies of TP53, BRCA2 and VHL. The open-source database is available at https://www.ActiveDriverDB.org.
Published on January 4, 2018
READ PUBLICATION →

CR2Cancer: a database for chromatin regulators in human cancer.

Authors: Ru B, Sun J, Tong Y, Wong CN, Chandra A, Tang ATS, Chow LKY, Wun WL, Levitskaya Z, Zhang J

Abstract: Chromatin regulators (CRs) can dynamically modulate chromatin architecture to epigenetically regulate gene expression in response to intrinsic and extrinsic signalling cues. Somatic alterations or misexpression of CRs might reprogram the epigenomic landscape of chromatin, which in turn lead to a wide range of common diseases, notably cancer. Here, we present CR2Cancer, a comprehensive annotation and visualization database for CRs in human cancer constructed by high throughput data analysis and literature mining. We collected and integrated genomic, transcriptomic, proteomic, clinical and functional information for over 400 CRs across multiple cancer types. We also built diverse types of CR-associated relations, including cancer type dependent (CR-target and miRNA-CR) and independent (protein-protein interaction and drug-target) ones. Furthermore, we manually curated around 6000 items of aberrant molecular alterations and interactions of CRs in cancer development from 5007 publications. CR2Cancer provides a user-friendly web interface to conveniently browse, search and download data of interest. We believe that this database would become a valuable resource for cancer epigenetics investigation and potential clinical application. CR2Cancer is freely available at http://cis.hku.hk/CR2Cancer.
Published on January 4, 2018
READ PUBLICATION →

MultitaskProtDB-II: an update of a database of multitasking/moonlighting proteins.

Authors: Franco-Serrano L, Hernandez S, Calvo A, Severi MA, Ferragut G, Perez-Pons J, Pinol J, Pich O, Mozo-Villarias A, Amela I, Querol E, Cedano J

Abstract: Multitasking, or moonlighting, is the capability of some proteins to execute two or more biological functions. MultitaskProtDB-II is a database of multifunctional proteins that has been updated. In the previous version, the information contained was: NCBI and UniProt accession numbers, canonical and additional biological functions, organism, monomeric/oligomeric states, PDB codes and bibliographic references. In the present update, the number of entries has been increased from 288 to 694 moonlighting proteins. MultitaskProtDB-II is continually being curated and updated. The new database also contains the following information: GO descriptors for the canonical and moonlighting functions, three-dimensional structure (for those proteins lacking PDB structure, a model was made using Itasser and Phyre), the involvement of the proteins in human diseases (78% of human moonlighting proteins) and whether the protein is a target of a current drug (48% of human moonlighting proteins). These numbers highlight the importance of these proteins for the analysis and explanation of human diseases and target-directed drug design. Moreover, 25% of the proteins of the database are involved in virulence of pathogenic microorganisms, largely in the mechanism of adhesion to the host. This highlights their importance for the mechanism of microorganism infection and vaccine design. MultitaskProtDB-II is available at http://wallace.uab.es/multitaskII.
Published on January 4, 2018
READ PUBLICATION →

ADReCS-Target: target profiles for aiding drug safety research and application.

Authors: Huang LH, He QS, Liu K, Cheng J, Zhong MD, Chen LS, Yao LX, Ji ZL

Abstract: Delivering safe and effective therapeutic treatment to patients is one of the grand challenges in modern medicine. However, drug safety research has been progressing slowly in recent years, compared to other fields such as biotechnologies and precision medicine, due to the mechanistic complexity of adverse drug reactions (ADRs). To fill up this gap, we develop a new database, the Adverse Drug Reaction Classification System-Target Profile (ADReCS-Target, http://bioinf.xmu.edu.cn/ADReCS-Target), which provides comprehensive information about ADRs caused by drug interaction with protein, gene and genetic variation. In total, ADReCS-Target includes 66,573 pairwise relations, among which 1710 are protein-ADR associations, 2613 are genetic variation-ADR associations, and 63,298 are gene-ADR associations. In a case study of exploring the mechanism of rash, we find that HLAs, C1QA and APOA1 are the key gene players and thus can be potential targets (or biomarkers) in monitoring or countermining rashes. In summary, ADReCS-Target can be a useful resource for the biomedical scientific community by serving researchers in the fields of drug development, clinical pharmacology, precision medicine, and from web lab to high-throughput computational platform. Particularly, it helps to identify drug with better ADR profile and design safer drug therapy regimen.
Published on January 4, 2018
READ PUBLICATION →

TCMID 2.0: a comprehensive resource for TCM.

Authors: Huang L, Xie D, Yu Y, Liu H, Shi Y, Shi T, Wen C

Abstract: As a traditional medical intervention in Asia and a complementary and alternative medicine in western countries, Traditional Chinese Medicine (TCM) is capturing worldwide attention in life science field. Traditional Chinese Medicine Integrated Database (TCMID), which was originally launched in 2013, was a comprehensive database aiming at TCM's modernization and standardization. It has been highly recognized among pharmacologists and scholars in TCM researches. The latest release, TCMID 2.0 (http://www.megabionet.org/tcmid/), replenished the preceding database with 18 203 herbal ingredients, 15 prescriptions, 82 related targets, 1356 drugs, 842 diseases and numerous new connections between them. Considering that chemical changes might take place in decocting process of prescriptions, which may result in new ingredients, new data containing the prescription ingredients was collected in current version. In addition, 778 herbal mass spectrometry (MS) spectra related to 170 herbs were appended to show the variation of herbal quality in different origin and distinguish genuine medicinal materials from common ones while 3895 MS spectra of 729 ingredients were added as the supplementary materials of component identification. With the significant increase of data, TCMID 2.0 will further facilitate TCM's modernization and enhance the exploration of underlying biological processes that are response to the diverse pharmacologic actions of TCM.
Published on January 4, 2018
READ PUBLICATION →

NPASS: natural product activity and species source database for natural product research, discovery and tool development.

Authors: Zeng X, Zhang P, He W, Qin C, Chen S, Tao L, Wang Y, Tan Y, Gao D, Wang B, Chen Z, Chen W, Jiang YY, Chen YZ

Abstract: There has been renewed interests in the exploration of natural products (NPs) for drug discovery, and continuous investigations of the therapeutic claims and mechanisms of traditional and herbal medicines. In-silico methods have been employed for facilitating these studies. These studies and the optimization of in-silico algorithms for NP applications can be facilitated by the quantitative activity and species source data of the NPs. A number of databases collectively provide the structural and other information of approximately 470 000 NPs, including qualitative activity information for many NPs, but only approximately 4000 NPs are with the experimental activity values. There is a need for the activity and species source data of more NPs. We therefore developed a new database, NPASS (Natural Product Activity and Species Source) to complement other databases by providing the experimental activity values and species sources of 35 032 NPs from 25 041 species targeting 5863 targets (2946 proteins, 1352 microbial species and 1227 cell-lines). NPASS contains 446 552 quantitative activity records (e.g. IC50, Ki, EC50, GI50 or MIC mainly in units of nM) of 222 092 NP-target pairs and 288 002 NP-species pairs. NPASS, http://bidd2.nus.edu.sg/NPASS/, is freely accessible with its contents searchable by keywords, physicochemical property range, structural similarity, species and target search facilities.
Published on January 1, 2018
READ PUBLICATION →

Potent pairing: ensemble of long short-term memory networks and support vector machine for chemical-protein relation extraction.

Authors: Mehryary F, Bjorne J, Salakoski T, Ginter F

Abstract: Biomedical researchers regularly discover new interactions between chemical compounds/drugs and genes/proteins, and report them in research literature. Having knowledge about these interactions is crucially important in many research areas such as precision medicine and drug discovery. The BioCreative VI Task 5 (CHEMPROT) challenge promotes the development and evaluation of computer systems that can automatically recognize and extract statements of such interactions from biomedical literature. We participated in this challenge with a Support Vector Machine (SVM) system and a deep learning-based system (ST-ANN), and achieved an F-score of 60.99 for the task. After the shared task, we have significantly improved the performance of the ST-ANN system. Additionally, we have developed a new deep learning-based system (I-ANN) that considerably outperforms the ST-ANN system. Both ST-ANN and I-ANN systems are centered around training an ensemble of artificial neural networks and utilizing different bidirectional Long Short-Term Memory (LSTM) chains for representing the shortest dependency path and/or the full sentence. By combining the predictions of the SVM and the I-ANN systems, we achieved an F-score of 63.10 for the task, improving our previous F-score by 2.11 percentage points. Our systems are fully open-source and publicly available. We highlight that the systems we present in this study are not applicable only to the BioCreative VI Task 5, but can be effortlessly re-trained to extract any types of relations of interest, with no modifications of the source code required, if a manually annotated corpus is provided as training data in a specific file format.
Published on January 1, 2018
READ PUBLICATION →

Drug Target Commons 2.0: a community platform for systematic analysis of drug-target interaction profiles.

Authors: Tanoli Z, Alam Z, Vaha-Koskela M, Ravikumar B, Malyutina A, Jaiswal A, Tang J, Wennerberg K, Aittokallio T

Abstract: Drug Target Commons (DTC) is a web platform (database with user interface) for community-driven bioactivity data integration and standardization for comprehensive mapping, reuse and analysis of compound-target interaction profiles. End users can search, upload, edit, annotate and export expert-curated bioactivity data for further analysis, using an application programmable interface, database dump or tab-delimited text download options. To guide chemical biology and drug-repurposing applications, DTC version 2.0 includes updated clinical development information for the compounds and target gene-disease associations, as well as cancer-type indications for mutant protein targets, which are critical for precision oncology developments.
Published on January 1, 2018
READ PUBLICATION →

ANCO-GeneDB: annotations and comprehensive analysis of candidate genes for alcohol, nicotine, cocaine and opioid dependence.

Authors: Hu R, Dai Y, Jia P, Zhao Z

Abstract: Studies have shown that genetic factors play an important role in the risk to substance addiction and abuse. So far, various genetic and genomic studies have reported the related evidence. These rich, but highly heterogeneous, data provide us an unprecedented opportunity to systematically collect, curate and assess the genetic and genomic signals from published studies and to perform a comprehensive analysis of their features, functional roles and druggability. Such genetic data resources have been made available for other disease or phenotypes but not for major substance dependence yet. Here, we report comprehensive data collection and secondary analyses of four phenotypes of dependence: alcohol dependence, nicotine dependence, cocaine dependence and opioid dependence, collectively named as Alcohol, Nicotine, Cocaine and Opioid (ANCO) dependence. We built the ANCO-GeneDB, an ANCO-dependence-associated gene resource database. ANCO-GeneDB includes resources from genome-wide association studies and candidate gene-based studies, transcriptomic studies, methylation studies, literature mining and drug-target data, as well as the derived data such as spatial-temporal gene expression, promoters, enhancers and expression quantitative trait loci. All associated genes and genetic variants are well annotated by using the collected evidence. Based on the collected data, we performed integrative, secondary analyses to prioritize genes, pathways, eQTLs and tissues that are significantly enriched in ANCO-related phenotypes.