:pencil: [NeurIPS 2024] Two papers, “Are Large Language Models Good Statisticians?” and “CRAG - Comprehensive RAG Benchmark”, were accepted to the NeurIPS 2024 Datasets and Benchmarks Track.