Publications

2025

  • Establishing best practices for building rigorous agentic benchmarks. Yuxuan Zhu, Tengjun Jin, Yada Pruksachatkun, Andy Zhang, Shu Liu, Sasha Cui, Sayash Kapoor, Shayne Longpre, Kevin Meng, Rebecca Weiss, Fazl Barez, Rahul Gupta, Jwala Dhamala, Jacob Merizian, Mario Giulianelli, Harry Coppock, Cozmin Ududec, Jasjeet Sekhon, Jacob Steinhardt, Antony Kellerman, Sarah Schwettmann, Matei Zaharia, Ion Stoica, Percy Liang, Daniel Kang. arXiv, 2025. [bib] [paper]
  • Understanding in-context learning of addition via activation subspaces. Xinyan Hu, Kayo Yin, Michael I. Jordan, Jacob Steinhardt, Lijie Chen. arXiv, 2025. [bib] [paper]
  • Uncovering gaps in how humans and LLMs interpret subjective language. Erik Jones, Arjun Patrawala, Jacob Steinhardt. International Conference on Learning Representations (ICLR), 2025. Spotlight presentation. [bib] [paper]
  • Eliciting language model behaviors with investigator agents. Xiang Lisa Li, Neil Chowdhury, Daniel D. Johnson, Tatsunori Hashimoto, Percy Liang, Sarah Schwettmann, Jacob Steinhardt. arXiv, 2025. [bib] [paper]
  • Iterative label refinement matters more than preference optimization under weak supervision. Yaowen Ye, Cassidy Laidlaw, Jacob Steinhardt. arXiv, 2025. [bib] [paper]

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2012

2011

2010

2009

  • On coloring the odd-distance graph. Jacob Steinhardt. Electronic Journal of Combinatorics, 2009. [bib] [paper]

2007