Intuitive Explainable Artificial Intelligence for Molecular Design

Lamens, Alec

dc.contributor.advisor	Bajorath, Jürgen
dc.contributor.author	Lamens, Alec
dc.date.accessioned	2026-06-09T07:57:26Z
dc.date.available	2026-06-09T07:57:26Z
dc.date.issued	09.06.2026
dc.identifier.uri	https://hdl.handle.net/20.500.11811/14192
dc.description.abstract	With the onset of artificial intelligence (AI), machine learning (ML) approaches have become increasingly embedded into the drug discovery pipeline with the aim of accelerating research and reducing cost. Exemplary applications include quantitative structure-property relationship modeling and de novo design of candidate molecules. However, the practical impact of these ML models has been limited by the opaque nature of their predictions, termed the black-box problem. The proliferation of large-scale public chemical datasets combined with advances in high-performance computing has driven a shift from traditional ML approaches toward more complex deep learning (DL) methods. While powerful, these models further exacerbate the inability to rationalize model outcomes. Therefore, the field of explainable AI (XAI) has attracted substantial interest to improve the transparency of model decisions. Given the wide range of available XAI approaches and lack of established practices, these methods are often applied without careful consideration for their suitability to the task at hand. This represents a problem in drug discovery where interpretable explanations are critical to guide experimental follow-up. To address this issue, this dissertation proposes the adaptation of established XAI approaches to operate on the level of chemical structure and facilitate interpretable ML for molecular design. The first study demonstrates the use of Shapley additive explanations (SHAP) to analyze feature sets driving the correct prediction of compound promiscuity. This is followed by a benchmarking study comparing several methods for the calculation of feature importance values, revealing inconsistencies across approaches that complicate the interpretation of feature-importance explanations. To support chemically meaningful interpretability, SHAP values are then combined with counterfactual reasoning, yielding the SHAP-CF methodology. Thereafter, a novel counterfactual method is introduced that generates candidate counterfactuals through the iterative recombination of molecular cores and substituents. This method is evaluated on a challenging multi-class kinase inhibitor prediction task, demonstrating its capacity to generate large numbers of counterfactuals, enabling in-depth analysis. Next, a closely related XAI concept, contrastive explanations, is adapted for molecular design by introducing rational chemical modifications to test compounds through the exchange of scaffolds or substituents with structural analogues, termed MolCE. This approach is shown to provide chemically relevant contrastive explanations that enable direct causal insights into the structural factors that drive model predictions. Finally, MolAnchor is introduced, a domain-adapted variant of the rule-based Anchors method. By conforming the principles underlying the Anchors framework to operate on retrosynthetically meaningful molecular fragments, MolAnchor produces interpretable fragment-level decision rules. A follow-up study showcases how the if-then formulation of the fragment rules is particularly complementary to causal reasoning. Taken together, the studies presented hereindemonstrate the utility of XAI methods that have been augmented with domain-specific information to provide chemically intuitive explanations for predictive models applied to molecular design.	en
dc.language.iso	eng
dc.rights	In Copyright
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/
dc.subject	XAI
dc.subject	Machine learning
dc.subject	Molecular Design
dc.subject	CADD
dc.subject	Interpretability
dc.subject.ddc	004 Informatik
dc.subject.ddc	540 Chemie
dc.title	Intuitive Explainable Artificial Intelligence for Molecular Design
dc.type	Dissertation oder Habilitation
dc.publisher.name	Universitäts- und Landesbibliothek Bonn
dc.publisher.location	Bonn
dc.rights.accessRights	openAccess
dc.identifier.urn	https://nbn-resolving.org/urn:nbn:de:hbz:5-90419
dc.relation.doi	https://doi.org/10.3390/molecules28020825
dc.relation.doi	https://doi.org/10.1002/minf.202500067
dc.relation.doi	https://doi.org/10.3390/molecules28145601
dc.relation.doi	https://doi.org/10.1002/cmdc.202300586
dc.relation.doi	https://doi.org/10.1039/D4MD00128A
dc.relation.doi	https://doi.org/10.1186/s13321-025-01100-6
dc.relation.doi	https://doi.org/10.1016/j.ejmcr.2024.100230
dc.relation.doi	https://doi.org/10.1021/acs.jcim.4c02153
dc.relation.doi	https://doi.org/10.1039/d5sc08461j
ulbbn.pubtype	Erstveröffentlichung
ulbbnediss.affiliation.name	Rheinische Friedrich-Wilhelms-Universität Bonn
ulbbnediss.affiliation.location	Bonn
ulbbnediss.thesis.level	Dissertation
ulbbnediss.dissID	9041
ulbbnediss.date.accepted	27.05.2026
ulbbnediss.institute	Zentrale wissenschaftliche Einrichtungen : Bonn-Aachen International Center for Information Technology (b-it)
ulbbnediss.fakultaet	Mathematisch-Naturwissenschaftliche Fakultät
dc.contributor.coReferee	Vogt, Martin
ulbbnediss.contributor.orcid	https://orcid.org/0000-0001-6409-3216
ulbbnediss.contributor.gnd	1402513941

Dateien zu dieser Ressource

Name:: 9041.pdf
Größe:: 34.1MB
Format:: PDF

Dokument öffnen

Das Dokument erscheint in:

E-Dissertationen (4581)

Zur Kurzanzeige

Die folgenden Nutzungsbestimmungen sind mit dieser Ressource verbunden: