News
Latest news and updates from StatAI Lab.
We are excited to announce **[StatEval](https://stateval.github.io/)**, the first benchmark for evaluating large language models' statistical reasoning that is systematically organized along both difficulty and disciplinary axes. StatEval includes a Foundational Knowledge Dataset of over 13,000 problems from 50+ textbooks and a Statistical Research Dataset of over 2,000 proof-based questions sourced from 18 top-tier journals. Both datasets are publicly available on **[Hugging Face](https://huggingface.co/datasets/0v01111/StatEval-Foundational-knowledge)**.
We are excited to announce the release of **[StatProver](https://statprover.com)**, a brand-new agentic statistical proof assistant. StatProver helps users clarify the problem, find references, outline the skeleton of the proof, and write the proof itself.
Our paper, "[Spatio-Temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications to Ridesharing](/assets/publications/JCGS接收.pdf)" (Run Yang, Runpeng Dai, Siran Gao, Xiaocheng Tang, Fan Zhou, Hongtu Zhu), has been accepted for publication in the Journal of Computational and Graphical Statistics (JCGS).
Our paper, "[Breach in the Shield: Unveiling the Vulnerabilities of Large Language Models](/assets/publications/2026.eacl-long.161.pdf)" (Runpeng Dai, Run Yang, Fan Zhou, Hongtu Zhu), has been accepted at the Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026).