Bridging <em>in-vitro</em>, <em>in-silico</em> and corporate realms for pharmaceutical drug discovery

Gadiya, Yojana

dc.contributor.advisor	Hofmann-Apitius, Martin
dc.contributor.author	Gadiya, Yojana
dc.date.accessioned	2025-11-14T09:04:42Z
dc.date.available	2025-11-14T09:04:42Z
dc.date.issued	14.11.2025
dc.identifier.uri	https://hdl.handle.net/20.500.11811/13685
dc.description.abstract	The onset of the COVID-19 pandemic in 2020 highlighted the urgent need for efficient navigation of big data. In response, numerous workflows and algorithms for data processing, aggregation, and analysis were developed and widely shared within the scientific community to address these emerging health challenges. However, these workflows have limitations, which can compromise their effectiveness and reliability. First, many workflows suffer from insufficient documentation and poor version control, making them difficult to reproduce, validate, and adapt for broader use. Second, they are often tailored to specific communities or domains, limiting their robustness and generalizability across diverse applications. Third, these workflows prioritize decisions on scientific data while overlooking critical aspects such as market applicability. Fourth, integration challenges stem from an overreliance on single-modality data analysis, neglecting the incorporation of heterogeneous data types. This limitation undermines the ability to make comprehensive go/no-go decisions based on a more holistic understanding of the problem. Finally, most existing workflows remain predominantly in silico, with minimal or no in vitro validation. This lack of experimental translation raises concerns about their applicability in real-world scenarios. In our work, we developed reproducible and well-documented pipelines to integrate and consolidate knowledge across various domains to enhance pandemic preparedness. These pipelines contextualize knowledge through graphs constructed from data extracted via manual curation of scientific publications and experimental results. Designed with flexibility in mind, the pipelines are agnostic, allowing their application across multiple domains, including diverse therapeutic indication areas. Next, to expand the scope of the underlying data beyond research, we mined patent literature, a valuable resource capturing the marketing and commercial landscape of drug discovery. Using a tool we developed called PEMT, we identified patterns in compound and target-agnostic strategies employed in the commercial sector. Moreover, to address integration challenges between knowledge graphs and omics-based technologies, we merged knowledge graphs with transcriptomics data. This enabled us to decipher the mechanisms of action of key drug and drug-like candidates. Furthermore, we demonstrated the successful translation of in silico work into biological experiments. Specifically, we built a machine learning model to predict the antibacterial activity of compounds and validated it by testing an external library for antibacterial activity in both in silico and in vitro. Our work highlights the importance of combining diverse data modalities with biological networks to gain profound insights into the mechanisms driving drug discovery. These workflows and approaches lay a strong foundation for identifying and prioritizing optimal drug candidates, facilitating their transition from preclinical studies to clinical trials and, ultimately, to the market.	en
dc.language.iso	eng
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject	Knowledge graphs
dc.subject	Patent landscaping
dc.subject	Antimicrobial ML models
dc.subject	Transcriptomics data
dc.subject.ddc	004 Informatik
dc.title	Bridging in-vitro, in-silico and corporate realms for pharmaceutical drug discovery
dc.type	Dissertation oder Habilitation
dc.identifier.doi	https://doi.org/10.48565/bonndoc-708
dc.publisher.name	Universitäts- und Landesbibliothek Bonn
dc.publisher.location	Bonn
dc.rights.accessRights	openAccess
dc.identifier.urn	https://nbn-resolving.org/urn:nbn:de:hbz:5-84972
dc.relation.doi	https://doi.org/10.1093/bioinformatics/btaa834
dc.relation.doi	https://doi.org/10.1093/bioadv/vbad045
dc.relation.doi	https://doi.org/10.1038/s41597-024-03371-4
dc.relation.doi	https://doi.org/10.1093/bioinformatics/btac716
dc.relation.doi	https://doi.org/10.1016/j.ailsci.2023.100069
dc.relation.doi	https://doi.org/10.1371/journal.pcbi.1009909
dc.relation.doi	https://doi.org/10.1021/acs.jcim.4c02347
ulbbn.pubtype	Erstveröffentlichung
ulbbnediss.affiliation.name	Rheinische Friedrich-Wilhelms-Universität Bonn
ulbbnediss.affiliation.location	Bonn
ulbbnediss.thesis.level	Dissertation
ulbbnediss.dissID	8497
ulbbnediss.date.accepted	09.09.2025
ulbbnediss.institute	Zentrale wissenschaftliche Einrichtungen : Bonn-Aachen International Center for Information Technology (b-it)
ulbbnediss.fakultaet	Mathematisch-Naturwissenschaftliche Fakultät
dc.contributor.coReferee	Imhof, Diana
ulbbnediss.contributor.orcid	https://orcid.org/0000-0002-7683-0452

Files in this item

Name:: 8497.pdf
Size:: 17.6MB
Format:: PDF

View/Open

This item appears in the following Collection(s)

E-Dissertationen (4470)

Show simple item record

The following license files are associated with this item:

Bridging in-vitro, in-silico and corporate realms for pharmaceutical drug discovery

Files in this item

This item appears in the following Collection(s)