<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.4.1">Jekyll</generator><link href="/feed.xml" rel="self" type="application/atom+xml" /><link href="/" rel="alternate" type="text/html" /><updated>2026-04-21T13:16:09+00:00</updated><id>/feed.xml</id><title type="html">StatAI Lab</title><subtitle>StatAI Lab is dedicated to exploring the deep integration of large language models (LLMs) and statistics.</subtitle><entry><title type="html">StatEval — Benchmarking Statistical Reasoning in Large Language Models</title><link href="/2026/02/15/stateval.html" rel="alternate" type="text/html" title="StatEval — Benchmarking Statistical Reasoning in Large Language Models" /><published>2026-02-15T00:00:00+00:00</published><updated>2026-04-21T13:15:29+00:00</updated><id>/2026/02/15/stateval</id><content type="html" xml:base="/2026/02/15/stateval.html"><![CDATA[<p>We are excited to announce <strong><a href="https://stateval.github.io/">StatEval</a></strong>, the first benchmark systematically organized along both difficulty and disciplinary axes to evaluate large language models’ statistical reasoning, developed by the team of Professor Fan Zhou at Shanghai University of Finance and Economics.</p>

<p>StatEval includes two carefully curated datasets:</p>

<ul>
  <li><strong>Foundational Knowledge Dataset</strong> — over 13,000 problems sourced from 50+ textbooks, covering the full spectrum of foundational statistical knowledge.</li>
  <li><strong>Statistical Research Dataset</strong> — over 2,000 proof-based questions collected from 18 top-tier journals in statistics, probability, econometrics, and machine learning.</li>
</ul>

<p>Both test sets are publicly available and can be accessed on Hugging Face:</p>

<ul>
  <li><a href="https://huggingface.co/datasets/0v01111/StatEval-Foundational-knowledge">StatEval-Foundational-Knowledge</a></li>
  <li><a href="https://huggingface.co/datasets/0v01111/StatEval-Statistical-Research">StatEval-Statistical-Research</a></li>
  <li><a href="https://stateval.github.io/">StatEval Website</a></li>
</ul>]]></content><author><name></name></author><summary type="html"><![CDATA[We are excited to announce **[StatEval](https://stateval.github.io/)**, the first benchmark systematically organized along both difficulty and disciplinary axes to evaluate large language models' statistical reasoning. StatEval includes a Foundational Knowledge Dataset of over 13,000 problems from 50+ textbooks and a Statistical Research Dataset of over 2,000 proof-based questions sourced from 18 top-tier journals. Both datasets are publicly available on **[Hugging Face](https://huggingface.co/datasets/0v01111/StatEval-Foundational-knowledge)**.]]></summary></entry><entry><title type="html">New paper accepted at JCGS</title><link href="/2026/01/15/jcgs2026.html" rel="alternate" type="text/html" title="New paper accepted at JCGS" /><published>2026-01-15T00:00:00+00:00</published><updated>2026-04-21T13:15:29+00:00</updated><id>/2026/01/15/jcgs2026</id><content type="html" xml:base="/2026/01/15/jcgs2026.html"><![CDATA[<p>Our paper, “<a href="/assets/publications/JCGS接收.pdf">Spatio-Temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications to Ridesharing</a>” (Authors: Run Yang, Runpeng Dai, Siran Gao, Xiaocheng Tang, Fan Zhou, Hongtu Zhu), has been accepted in the <em>Journal of Computational and Graphical Statistics (JCGS)</em>. This work develops novel spatiotemporal prediction methods for fine-grained origin-destination matrices, with applications to ridesharing platform optimization. Congratulations to all authors!</p>]]></content><author><name></name></author><summary type="html"><![CDATA[Our paper, "[Spatio-Temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications to Ridesharing](/assets/publications/JCGS接收.pdf)" (Run Yang, Runpeng Dai, Siran Gao, Xiaocheng Tang, Fan Zhou, Hongtu Zhu), has been accepted in the Journal of Computational and Graphical Statistics (JCGS).]]></summary></entry><entry><title type="html">New paper accepted at EACL 2026</title><link href="/2026/01/10/eacl2026.html" rel="alternate" type="text/html" title="New paper accepted at EACL 2026" /><published>2026-01-10T00:00:00+00:00</published><updated>2026-04-21T13:15:29+00:00</updated><id>/2026/01/10/eacl2026</id><content type="html" xml:base="/2026/01/10/eacl2026.html"><![CDATA[<p>Our paper, “<a href="/assets/publications/2026.eacl-long.161.pdf">Breach in the Shield: Unveiling the Vulnerabilities of Large Language Models</a>” (Authors: Runpeng Dai, Run Yang, Fan Zhou, Hongtu Zhu), has been accepted at the <em>European Chapter of the Association for Computational Linguistics (EACL 2026)</em>. This work investigates the security vulnerabilities of large language models and paves the way for more robust LLM deployments. Congratulations to all authors!</p>]]></content><author><name></name></author><summary type="html"><![CDATA[Our paper, "[Breach in the Shield: Unveiling the Vulnerabilities of Large Language Models](/assets/publications/2026.eacl-long.161.pdf)" (Runpeng Dai, Run Yang, Fan Zhou, Hongtu Zhu), has been accepted at the European Chapter of the Association for Computational Linguistics (EACL 2026).]]></summary></entry></feed>