Zur Kurzanzeige

Bridging in-vitro, in-silico and corporate realms for pharmaceutical drug discovery

dc.contributor.advisorHofmann-Apitius, Martin
dc.contributor.authorGadiya, Yojana
dc.date.accessioned2025-11-14T09:04:42Z
dc.date.available2025-11-14T09:04:42Z
dc.date.issued14.11.2025
dc.identifier.urihttps://hdl.handle.net/20.500.11811/13685
dc.description.abstractThe onset of the COVID-19 pandemic in 2020 highlighted the urgent need for efficient navigation of big data. In response, numerous workflows and algorithms for data processing, aggregation, and analysis were developed and widely shared within the scientific community to address these emerging health challenges. However, these workflows have limitations, which can compromise their effectiveness and reliability. First, many workflows suffer from insufficient documentation and poor version control, making them difficult to reproduce, validate, and adapt for broader use. Second, they are often tailored to specific communities or domains, limiting their robustness and generalizability across diverse applications. Third, these workflows prioritize decisions on scientific data while overlooking critical aspects such as market applicability. Fourth, integration challenges stem from an overreliance on single-modality data analysis, neglecting the incorporation of heterogeneous data types. This limitation undermines the ability to make comprehensive go/no-go decisions based on a more holistic understanding of the problem. Finally, most existing workflows remain predominantly in silico, with minimal or no in vitro validation. This lack of experimental translation raises concerns about their applicability in real-world scenarios. In our work, we developed reproducible and well-documented pipelines to integrate and consolidate knowledge across various domains to enhance pandemic preparedness. These pipelines contextualize knowledge through graphs constructed from data extracted via manual curation of scientific publications and experimental results. Designed with flexibility in mind, the pipelines are agnostic, allowing their application across multiple domains, including diverse therapeutic indication areas. Next, to expand the scope of the underlying data beyond research, we mined patent literature, a valuable resource capturing the marketing and commercial landscape of drug discovery. Using a tool we developed called PEMT, we identified patterns in compound and target-agnostic strategies employed in the commercial sector. Moreover, to address integration challenges between knowledge graphs and omics-based technologies, we merged knowledge graphs with transcriptomics data. This enabled us to decipher the mechanisms of action of key drug and drug-like candidates. Furthermore, we demonstrated the successful translation of in silico work into biological experiments. Specifically, we built a machine learning model to predict the antibacterial activity of compounds and validated it by testing an external library for antibacterial activity in both in silico and in vitro. Our work highlights the importance of combining diverse data modalities with biological networks to gain profound insights into the mechanisms driving drug discovery. These workflows and approaches lay a strong foundation for identifying and prioritizing optimal drug candidates, facilitating their transition from preclinical studies to clinical trials and, ultimately, to the market.en
dc.language.isoeng
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectKnowledge graphs
dc.subjectPatent landscaping
dc.subjectAntimicrobial ML models
dc.subjectTranscriptomics data
dc.subject.ddc004 Informatik
dc.titleBridging in-vitro, in-silico and corporate realms for pharmaceutical drug discovery
dc.typeDissertation oder Habilitation
dc.identifier.doihttps://doi.org/10.48565/bonndoc-708
dc.publisher.nameUniversitäts- und Landesbibliothek Bonn
dc.publisher.locationBonn
dc.rights.accessRightsopenAccess
dc.identifier.urnhttps://nbn-resolving.org/urn:nbn:de:hbz:5-84972
dc.relation.doihttps://doi.org/10.1093/bioinformatics/btaa834
dc.relation.doihttps://doi.org/10.1093/bioadv/vbad045
dc.relation.doihttps://doi.org/10.1038/s41597-024-03371-4
dc.relation.doihttps://doi.org/10.1093/bioinformatics/btac716
dc.relation.doihttps://doi.org/10.1016/j.ailsci.2023.100069
dc.relation.doihttps://doi.org/10.1371/journal.pcbi.1009909
dc.relation.doihttps://doi.org/10.1021/acs.jcim.4c02347
ulbbn.pubtypeErstveröffentlichung
ulbbnediss.affiliation.nameRheinische Friedrich-Wilhelms-Universität Bonn
ulbbnediss.affiliation.locationBonn
ulbbnediss.thesis.levelDissertation
ulbbnediss.dissID8497
ulbbnediss.date.accepted09.09.2025
ulbbnediss.instituteZentrale wissenschaftliche Einrichtungen : Bonn-Aachen International Center for Information Technology (b-it)
ulbbnediss.fakultaetMathematisch-Naturwissenschaftliche Fakultät
dc.contributor.coRefereeImhof, Diana
ulbbnediss.contributor.orcidhttps://orcid.org/0000-0002-7683-0452


Dateien zu dieser Ressource

Thumbnail

Das Dokument erscheint in:

Zur Kurzanzeige

Die folgenden Nutzungsbestimmungen sind mit dieser Ressource verbunden:

Attribution-NonCommercial-NoDerivatives 4.0 International