Jian Cao (曹健)

Jian Cao (曹健)

Postdoctoral Researcher in Statistics

Texas A&M University

About

I am a postdoctoral researcher in the Department of Statistics and the Institute of Data Science at Texas A&M University, supervised by Dr. Matthias Katzfuss. I obtained my Ph.D. degree in Statistics at King Abdullah University of Science and Technology (KAUST), advised by Dr. Marc Genton. Prior to that, I obtained a B.S. in Mathematics from University of Science and Technology of China.

I have broad research interests in scalable computational methods and theories behind those. Currently in my postdoc, I am working on scalable Gaussian process regression and variable selection, transport maps, and variational Bayes, most of which are based on the Vecchia approximation of Gaussian processes. During my PhD, I studied scalable evaluations of multivariate normal probabilities, mainly exploiting low-rank matrices and efficient quasi-Monte Carlo sampling rules. My research is mostly related to applications in spatial statistics, climate science, and argricultural science. My goal is to make scalable Gaussian-based methods a ready-to-use tool for various machine-learning tasks.

Download my CV. Contact me: jian.cao@tamu.edu

Interests
  • Gaussian processes
  • Machine learning
  • Scalable statistical computing
  • Spatial statistics
  • Dimension reduction
Education
  • PhD in Statistics, 2020

    King Abdullah University of Science and Technology

  • B.S. in Mathematics, 2014

    University of Science and Technology of China

Awards and Honors

  • Al-Kindi Statistics Student Research Award, King Abdullah University of Science and Technology, 2020
  • Distinguished Student Paper Award, Section on Statistical Computing and the Section on Statistical Graphics of ASA, 2019

Publications

Published/Accepted

Preprints/Under Review

Teaching

Teaching Assistant

  • Teaching Assistant for MS level Probability and Statistics (Fall 2018)
  • Teaching Assistant for MS level Probability and Statistics (Fall 2017)

Webinar

  • TAMIDS Webinar: Scalable Gaussian Process Approximation and Optimization (April 2022)

Software

  • tlrmvnmvt: an R package for fast computing multivariate normal probabilities. Available on GitHub

  • ExaGeoStat: a parallel high performance unified framework for computational geostatistics on many-core systems. Available on GitHub

Talks & Posters

  • Scalable Gaussian Process Regression and Variable Selection under Automatic Relevance Determination Kernels. ENVR 2022 Workshop, Provo, UT, USA. Poster
  • Scalable Gaussian Process Regression and Variable Selection under Automatic Relevance Determination Kernels. IMSI Gaussian Processes Workshop, Chicago, IL, USA. Poster
  • Scalable Gaussian Process Regression and Variable Selection under Automatic Relevance Determination Kernels. Joint Statistical Meetings, Washington D.C., USA. Contributed Session
  • Scalable Gaussian Process Regression and Variable Selection under Automatic Relevance Determination Kernels. ISBA 2022 World Meeting, June 2022, Montreal, Quebec, Canada. Contributed Talk
  • Scalable Gaussian Process Regression and Variable Selection under Automatic Relevance Determination Kernels. SETCASA Poster Competition, May 2022, College Station, TX, USA. Poster
  • Scalable Gaussian Process Regression and Variable Selection under Automatic Relevance Determination Kernels. Texas A&M Statistics Cafe, May 2022, College Station, TX, USA. Talk
  • Scalable Gaussian Process Regression and Variable Selection under Automatic Relevance Determination Kernels. TAMIDS Research Conference, Dec 2021, College Station, TX, USA. Talk
  • Sum of Kronecker Products Representation for Spatial Covariance Matrices and Its Factorization. Joint Statistical Meetings, Aug 2020, Virtual Conference. Contributed Session
  • Exploiting Low Rank Covariance Structures for Computing High-Dimensional Normal and Student-t Probabilities. Joint Statistical Meetings, Aug 2019, Denver, CO, USA. Topic-Contributed Session
  • Exploiting Low Rank Covariance Structures for Computing High-Dimensional Normal and Student-t Probabilities. Big Data Meets Large-Scale Computing, Sep 2018, Los Angeles, CA, USA. Poster
  • Hierarchical-block Conditioning Approximations for High-dimensional Multivariate Normal Probabilities. Joint Statistical Meetings, Aug 2018, Vancouver, BC, Canada. Poster
  • Hierarchical-block Conditioning Approximations for High-dimensional Multivariate Normal Probabilities. Joint Statistical Meetings, Aug 2017, Baltimore, MD, USA. Contributed Session

Contact