Events

Events Calendar

ChE Seminar Series - Accelerating Protein Engineering with Artificial Intelligence

Tuesday, January 20, 2026
3:30 pm - 4:30 pm

Location: GLT 5.104

Proteins are central to health, sustainability, and chemical synthesis, yet engineering them for specific functions is challenging due to the vastness of the sequence design space. Directed evolution, inspired by natural evolution, has enabled remarkable advances but is slow and limited to local optimization. In this talk, I present my research on integrating machine learning with experimental workflows to overcome key bottlenecks of directed evolution. First, I introduce Active Learning-Assisted Directed Evolution (ALDE), which leverages Bayesian optimization to enable efficient and rapid protein fitness optimization. Next, I discuss Contrastive Reaction-Enzyme Pretraining (CREEP), a retrieval model for discovering enzymes with new-to-nature functions. Finally, I present emerging generative approaches to unify these two perspectives, to enable holistic data-driven protein engineering. Together, these innovations point toward a future where AI-driven strategies can automate biomolecular engineering: unlocking sustainable synthesis, novel therapeutics, and programmable biology at the molecular scale.

 

Jason Yang is a final-year Chemical Engineering PhD candidate at Caltech, co-advised by Professor Frances Arnold and Professor Yisong Yue, where he is supported by the NSF graduate research fellowship program and the Google PhD Fellowship. He received dual BS degrees in Chemical Engineering and Applied Math at Yale University and was awarded the Barry Goldwater Scholarship. Overall, Jason is interested in developing machine learning-assisted workflows for protein discovery, design, and optimization. Additionally, he has applied machine learning, computational modeling, and experimentation to real-world applications through internships at Profluent Bio, Genentech, and NREL.

Speaker: Jason Yang, Caltech