WBN0Mz3VAC@OpenReview

Total: 1

#1 KinDEL: DNA-Encoded Library Dataset for Kinase Inhibitors [PDF] [Copy] [Kimi1] [REL]

Authors: Benson Chen, Tomasz Danel, Gabriel Dreiman, Patrick McEnaney, Nikhil Jain, Kirill Novikov, Spurti Akki, Joshua L. Turnbull, Virja Pandya, Boris Belotserkovskii, Jared Weaver, Ankita Biswas, Dat Nguyen, Kent Gorday, Mohammad M Sultan, Nathaniel Stanley, Daniel Whalen, Divya Kanichar, Christoph Klein, Emily Fox, R. Watts

DNA-Encoded Libraries (DELs) represent a transformative technology in drug discovery, facilitating the high-throughput exploration of vast chemical spaces. Despite their potential, the scarcity of publicly available DEL datasets presents a bottleneck for the advancement of machine learning methodologies in this domain. To address this gap, we introduce KinDEL, one of the largest publicly accessible DEL datasets and the first one that includes binding poses from molecular docking experiments. Focused on two kinases, Mitogen-Activated Protein Kinase 14 (MAPK14) and Discoidin Domain Receptor Tyrosine Kinase 1 (DDR1), KinDEL includes 81 million compounds, offering a rich resource for computational exploration. Additionally, we provide comprehensive biophysical assay validation data, encompassing both on-DNA and off-DNA measurements, which we use to evaluate a suite of machine learning techniques, including novel structure-based probabilistic models. We hope that our benchmark, encompassing both 2D and 3D structures, will help advance the development of machine learning models for data-driven hit identification using DELs.

Subject: ICML.2025 - Poster