StatAI Lab

About us

StatAI Lab, led by Dr. Fan Zhou at Shanghai University of Finance and Economics, focuses on the deep integration of artificial intelligence and statistics. Our research explores how statistical principles can enhance the interpretability and uncertainty quantification of AI systems, while also investigating how modern AI can advance statistical reasoning and data analysis.

Our Research Focus

Our research integrates statistical principles with deep learning and reinforcement learning to build theoretical foundations for AI models and to develop practical AI algorithms. We focus on methods that support robust decision-making, interpretability, and reliable uncertainty quantification in modern AI systems.

AI for Statistics: We connect foundation models with statistical inference by developing benchmark datasets to evaluate and enhance the statistical reasoning capabilities of large language models, and by designing agentic systems for automated statistical reasoning, including AI agents that assist in theorem proving. More broadly, we use statistical principles to improve the reliability, security, and interpretability of LLMs.
Methodology and Theory of Reinforcement Learning: We develop both theoretical foundations and practical methodologies for RL, focusing on error analysis and variance control in deep reinforcement learning, as well as principled distributional modeling of rewards. Our work addresses challenges such as data dependence in offline settings and the validity of quantile-based representations, enabling statistically robust policy evaluation, consistent value estimation, and reliable decision-making under distribution shift.
Two-Sided Markets and Spatio-temporal Systems: We develop data-driven models and decision frameworks for dynamic two-sided markets, including supply–demand equilibrium metrics, reinforcement-learning-based matching policies, and a suite of A/B testing methods. These approaches enable efficient matching, robust policy evaluation, and improved system performance in complex, time-evolving environments.
Uncertainty Quantification: Reliable AI requires principled uncertainty assessment. We develop statistically grounded methods for understanding uncertainty, reliability, and distributional behavior in modern learning systems, with applications ranging from robust statistical prediction and influence-based model assessment to trustworthy large language models and quantile-based learning for sequential decision-making.
Graph Representation Learning: Because network data is ubiquitous, we develop novel statistical methods for complex graph structures. Our work includes diffusion-based representation learning and semi-supervised learning with non-ignorable missing data.

Latest News

2026-07-21 We are excited to announce the release of DataSciEval, a unified benchmark for evaluating the data science capabilities of large language models and AI agents, jointly developed with HK PolyU CMFAI.
2026-06-22 Two papers on RLHF and A/B testing have been accepted to STAl-X 2026, with one paper selected for the Paper Award.
2026-05-19 Professor Fan Zhou has been honored with the James E. Grizzle Distinguished Alumni Award by the University of North Carolina at Chapel Hill.
2026-05-02 Professor Fan Zhou has accepted the invitation to serve as Area Chair for NeurIPS 2026.
2026-04-23 We are excited to announce the release of StatProver, a brand-new agentic statistical proof assistant. StatProver helps users clarify the problem, find references, outline the skeleton steps, and write the proof.

Join Our Team

We enthusiastically welcome prospective PhD and Master’s students from diverse universities who are passionate about research at the intersection of LLMs and statistics. Whether you are from a local or international institution, we value the unique perspective you bring to our team. We also welcome collaborations with, and inquiries from, researchers and practitioners at universities, institutions, and companies.

Prospective Students

PhD Candidates

Strong mathematical and statistical foundations, self-motivated, and passionate about research
Familiar with Python and GPU computing; strong programming skills are a plus

Master’s Students

Interested in pursuing PhD studies or working in industry roles related to algorithms and research
Master’s students participate in systematic research training led by PhD students and may be recommended for PhD positions at top domestic or international universities, or for internships at AI research divisions

If you are interested in our projects and would like to join StatAI Lab as a full-time member, please send your CV along with a brief statement of your research interests to zhoufan@mail.shufe.edu.cn.

Collaboration & Partnerships

We are also open to collaborations with enterprises, institutions, schools, and individuals who share an interest in LLMs and statistical research. If you are interested in working with us, please send a brief introduction of yourself or your organization, along with a statement describing the proposed collaboration (e.g., joint academic research, remote collaboration, data sharing, or compute resource sponsorship), to zhoufan@mail.shufe.edu.cn.
