Name: Poster 080: Classifying Enzyme Substrates Using Machine Learning
Start: 2025-04-24T14:00:00-0500
End: 2025-04-24T16:00:00-0500

Thursday April 24, 2025 2:00pm - 4:00pm CDT

Davies Center: Ojibwe Ballroom (330)

Knowing what types of enzymes a molecule will interact with can aid drug development by minimizing side effects due to unwanted interactions. In this project, we built and interpreted models for classifying enzyme substrates. We utilized the machine learning technique XGBoost in Python to build a predictive model for each enzyme class using the original molecular data as well as top linear combinations of the data obtained using Principal Components Analysis. We will discuss the process of developing code to automatically tune the parameters of XGBoost to optimize the model. We will also present examples of how to interpret these models by writing code to visualize the impact of variables in each model and identifying common factors in the top contributing variables of significant principal components to characterize each enzyme class. For example, we found that the probability of a molecule interacting with oxidoreductase enzymes is positively associated with the number of nonpolar regions. A particular descriptor is NOCount, the number of (polar) NO groups in the molecule, which was negatively associated with the probability of interacting with oxidoreductases.

Presenters

Kyle He

University of Wisconsin - Eau Claire

Faculty Mentor

Abra Brisbin

Mathematics, University of Wisconsin - Eau Claire

Thursday April 24, 2025 2:00pm - 4:00pm CDT
Davies Center: Ojibwe Ballroom (330) 77 Roosevelt Ave, Eau Claire, WI 54701, USA

CERCA Posters, 2 Thursday

Department Mathematics
College CAS

UWEC CERCA 2025

Kyle He

Abra Brisbin

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!