Jian Cao(曹健)

Jian Cao(曹健)

Assistant Professor in Statistics

University of Houston

About

I am an assistant professor specialized in Computational and Spatial Statistics in the Department of Mathematics at The University of Houston. I worked as a postdoc with Dr. Matthias Katzfuss at Texas A&M University. I obtained my Ph.D. degree in Statistics at King Abdullah University of Science and Technology (KAUST), advised by Dr. Marc Genton. Prior to that, I obtained a B.S. in Mathematics from University of Science and Technology of China.

My research focuses on scalable Gaussian Process (GP) regression, including truncated GP, latent GP, multivariate GP, and high-dimensional GP. During my postdoc, I worked on scalable Gaussian process regression and variable selection, transport maps, and variational Bayes, most of which are based on the Vecchia approximation of Gaussian processes. During my PhD, I studied scalable evaluations of multivariate normal probabilities, mainly exploiting low-rank matrices and efficient quasi-Monte Carlo sampling rules. My research is mostly related to applications in spatial statistics, climate science, and argricultural science.

Interests
  • Computational statistics
  • Conditional independence approximation
  • Gaussian-process-related machine learning
  • Multivariate normal probability
  • Truncated multivariate normal
Education
  • Bachelor in Applied Mathematics, 2014

    University of Science and Technology of China

  • Master in Finance, 2016

    Shanghai Jiaotong University

  • PhD in Statistics, 2020

    King Abdullah University of Science and Technology

Awards and Honors

  • Al-Kindi Statistics Student Research Award, 2020
  • Best Student Paper, 2019

Publications

Teaching

  • Statistics for the Sciences, 2023 Fall, University of Houston
  • Statistics for the Sciences, 2024 Spring, University of Houston
  • Inferential Statistics, 2024 Fall, University of Houston
  • Statistics for the Sciences, 2025 Spring, University of Houston

Software

  • tlrmvnmvt: Estimating multivariate normal (MVN) probabilities using tile-low-rank (TLR) matrix representation.

  • VeccTMVN: Estimating multivariate MVN probabilities and sampling from truncated MVN (TMVN) distributions using Vecchia approximation and exponential tilted importance sampling.

  • nntmvn: Using the sequential nearest neighbor (SNN) method to draw samples from the truncated multivariate normal (TMVN) distributions.