Deciphering regulatory landscapes using interpretable deep learning models

Anshul Kundaje - Dept. of Genetics and Dept. of Computer Science, Stanford University

Sept. 28, 2018, 2:30 p.m. - Sept. 28, 2018, 3:30 p.m.

Trottier 2100

Hosted by: Prof. Mathieu Blanchette

Functional genomics experiments profiling genome-wide regulatory state have revealed millions of putative regulatory elements in diverse cell states. These massive datasets have spurred the development of Deep Neural Networks (DNNs) that can accurate map DNA sequence to associated cell-type specific molecular phenotypes such as TF binding, chromatin accessibility and gene expression. Beyond high prediction accuracy, the primary appeal of DNNs is that they are capable of learning predictive sequence features and modeling non-linear feature interactions directly from raw DNA sequence without any prior assumptions. Hence, interpreting these purported black box models could reveal novel insights into the combinatorial regulatory code. I will present efficient interpretation engines for extracting predictive and biological meaningful patterns from integrative deep learning models of regulatory DNA. I will show how we can use interpretable deep learning models to obtain new insights into protein-DNA sequence affinity models, infer high-resolution point binding events of TFs, reveal epistatic motif interactions in cis-regulatory sequence grammars, unravel dynamic regulatory drivers of cellular differentiation and interpret non-coding regulatory genetic variants associated with disease.