Research

My research falls into two main areas: developing new theory and methods in network data science, and leveraging statistics and network science to tackle complex biological problems using modern omics data.

My theoretical work includes studying the optimal phase transition for global testing in the mixed membership stochastic block model and developing an optimally adaptive global detection test. I also developed a new rate optimal algorithm for mixed membership estimation in dynamic networks under severe degree heterogeneity and temporal smoothness. The methods I design come with strong statistical guarantees and are applicable to, e.g., social networks and biological networks.

A large part of my methods work focuses on developing frameworks in computational biology. I developed a Python package to find combinatorial markers for cell populations identified with single-cell RNA-seq data. In another work, I conceived a drug repurposing pipeline to identify candidate drugs for repurposing against SARS-CoV-2. I also contributed to an experimental design framework to better design in vitro cell reprogramming experiments.

My applied work leverages statistics and network science to analyze modern genomic data sets in the context of cellular aging and nuclear mechanogenomics. I developed a novel time-course approach to identify age-associated transcription factors and investigated the coupling between gene transcription and spatial gene clustering in aging. I also characterized the map between adhesome protein assemblies on the cell membrane and 3D organization of the corresponding genes in nucleus of IMR90 fibroblasts.