StatAI Lab

News

Latest news and updates from StatAI Lab.

StatEval — Benchmarking Statistical Reasoning in Large Language Models

We are excited to announce **[StatEval](https://stateval.github.io/)**, the first benchmark for evaluating the statistical reasoning of large language models that is systematically organized along both difficulty and disciplinary axes. StatEval includes a Foundational Knowledge Dataset of over 13,000 problems drawn from 50+ textbooks and a Statistical Research Dataset of over 2,000 proof-based questions sourced from 18 top-tier journals. Both datasets are publicly available on **[Hugging Face](https://huggingface.co/datasets/0v01111/StatEval-Foundational-knowledge)**.

StatProver — Agentic Statistical Proof Assistant

We are excited to announce the release of **[StatProver](https://statprover.com)**, a new agentic statistical proof assistant. StatProver helps users clarify the problem, find references, outline the skeleton steps of a proof, and write the proof itself.

New paper accepted at JCGS

Our paper, "[Spatio-Temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications to Ridesharing](/assets/publications/JCGS接收.pdf)" (Run Yang, Runpeng Dai, Siran Gao, Xiaocheng Tang, Fan Zhou, Hongtu Zhu), has been accepted for publication in the Journal of Computational and Graphical Statistics (JCGS).

New paper accepted at EACL 2026

Our paper, "[Breach in the Shield: Unveiling the Vulnerabilities of Large Language Models](/assets/publications/2026.eacl-long.161.pdf)" (Runpeng Dai, Run Yang, Fan Zhou, Hongtu Zhu), has been accepted at the Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026).