Publications
Filter by Year
Show Only
Mitigating Memorization in Language Models
Mansi Sakarvadia, Aswathy Ajith, Arham Khan, Nathaniel Hudson, Caleb Geniesse, Kyle Chard, Yaoqing Yang, Ian Foster, Michael Mahoney.
ICLR 2025.
Spotlight
MUSE: Machine Unlearning Six-Way Evaluation for Language Models
Weijia Shi, Jaechan Lee, Yangsibo Huang, Sadhika Malladi, Jieyu Zhao, Ari Holtzman, Daogao Liu, Luke Zettlemoyer, Noah A. Smith, Chiyuan Zhang.
ICLR 2025.
Forking Paths in Neural Text Generation
Eric Bigelow, Ari Holtzman, Hidenori Tanaka, Tomer Ullman.
ICLR 2025.
CaseSumm: A Large-Scale Dataset for Long-Context Summarization from U.S. Supreme Court Opinions
Mourad Heddaya, Kyle MacMillan, Anup Malani, Hongyuan Mei, Chenhao Tan.
NAACL 2025.
Building machines that learn and think with people
Katherine M Collins, Ilia Sucholutsky, Umang Bhatt, Kartik Chandra, Lionel Wong, Mina Lee, Cedegao E Zhang, Tan Zhi-Xuan, Mark Ho, Vikash Mansinghka, Adrian Weller, Joshua B Tenenbaum, Thomas L Griffiths.
Nature Human Behavior 2024.
Byte Latent Transformer: Patches Scale Better Than Tokens
Artidoro Pagnoni, Ram Pasunuru, Pedro Rodriguez, John Nguyen, Benjamin Muller, Margaret Li, Chunting Zhou, Lili Yu, Jason Weston, Luke Zettlemoyer, Gargi Ghosh, Mike Lewis, Ari Holtzman, Srinivasan Iyer.
arXiv 2024.
Benchmarks as Microscopes: A Call for Model Metrology
Michael Saxon, Ari Holtzman, Peter West, William Yang Wang, Naomi Saphra.
COLM 2024.
Human-Centered Evaluation and Auditing of Language Models
Ziang Xiao, Wesley Hanwen Deng, Michelle S Lam, Motahhare Eslami, Juho Kim, Mina Lee, Q Vera Liao.
CHI EA'24 2024.
A Design Space for Intelligent and Interactive Writing Assistants
Mina Lee, et al. (+35 authors).
CHI 2024.
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution
Minghan Li, Xilun Chen, Ari Holtzman, Beidi Chen, Jimmy Lin, Wen-tau Yih, Xi Victoria Lin.
NeurIPS 2024.
Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass
Ethan Shen, Alan Fan, Sarah M. Pratt, Jae Sung Park, Matthew Wallingford, Sham M. Kakade, Ari Holtzman, Ranjay Krishna, Ali Farhadi, Aditya Kusupati.
NeurIPS 2024.
When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models
Yanhong Li*, Chenghao Yang*, Allyson Ettinger.
NAACL Findings 2024.
Identifying Self-Disclosures of Use, Misuse and Addiction in Community-based Social Media Posts
Chenghao Yang, Tuhin Chakrabarty, Karli R Hochstatter, Melissa N Slavin, Nabila El-Bassel, Smaranda Muresan.
NAACL Findings 2024.
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints
Chaoqi Wang, Yibo Jiang, Chenghao Yang, Han Liu, Yuxin Chen.
ICLR 2024.
Spotlight
Causal Micro-Narratives
Mourad Heddaya, Qingcheng Zeng, Alexander Zentefis, Rob Voigt, Chenhao Tan.
EMNLP Workshop on Narrative Understanding (WNU) 2024.
Causal Reasoning and Large Language Models: Opening a New Frontier for Causality
Emre Kıcıman, Robert Ness, Amit Sharma, Chenhao Tan.
arXiv 2023.
Machine Explanations and Human Understanding
Chacha Chen, Shi Feng, Amit Sharma, Chenhao Tan.
TMLR; FAccT 2023.
Selective Explanations: Leveraging Human Input to Align Explainable AI
Vivian Lai, Yiming Zhang, Chacha Chen, Q. Vera Liao, Chenhao Tan.
CSCW 2023.
Learning Human-Compatible Representations for Case-Based Decision Support
Han Liu, Yizhou Tian, Chacha Chen, Shi Feng, Yuxin Chen, Chenhao Tan.
ICLR 2023.
Language of Bargaining
Mourad Heddaya, Solomon Dworkin, Chenhao Tan, Rob Voigt, Alexander Zentefis.
ACL 2023.
FLamE: Few-shot Learning from Natural Language Explanations
Yangqiaoyu Zhou, Yiming Zhang, Chenhao Tan.
ACL 2023.
Generative Models as a Complex Systems Science: How can we make sense of language model behavior?
Ari Holtzman, Peter West, Luke Zettlemoyer.
arXiv 2023.
QLoRA: Efficient Finetuning of Quantized LLMs
Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer.
NeurIPS 2023.
Contrastive Decoding: Open-ended Text Generation as Optimization
Xiang Lisa Li, Ari Holtzman, Daniel Fried, Percy Liang, Jason Eisner, Tatsunori Hashimoto, Luke Zettlemoyer, Mike Lewis.
ACL 2023.
Evaluating Human-Language Model Interaction
Mina Lee, Megha Srivastava, Amelia Hardy, John Tickstun, Esin Durmus, Ashwin Paranjape, Ines Gerard-Ursin, Xiang Lisa Li, Faisal Ladhak, Frieda Rong, Rose E. Wang, Minae Kwon, Joon Sung Park, Hancheng Cao, Tony Lee, Rishi Bommasani, Michael Bernstein, Percy Liang.
Transcations of Machine Learning Research (TMLR) 2023.
Towards Explainable AI Writing Assistants for Non-native English Speakers
Yewon Kim, Mina Lee, Donghwi Kim, Sung-Ju Lee.
Workshop on Intelligent and Interactive Writing Assistants (In2Writing) 2023.
Efficient Shapley Values Estimation by Amortization for Text Classification
Chenghao Yang, Fan Yin, He He, Kai-Wei Chang, Xiaofei Ma, Bing Xiang.
ACL 2023.
ReCode: Robustness Evaluation of Code Generation Models
Shiqi Wang*, Zheng Li*, Haifeng Qian, Chenghao Yang, Zijian Wang, Mingyue Shang, Varun Kumar, Samson Tan, Baishakhi Ray, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Dan Roth, Bing Xiang.
ACL 2023.
Best Paper Nomination
Can You Follow Me? Testing Situational Understanding in ChatGPT
Chenghao Yang, Allyson Ettinger.
EMNLP 2023.
Award Nomination
CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities
Mina Lee, Percy Liang, Qian Yang.
CHI 2022.
Honorable mention award
Human-AI Collaboration via Conditional Delegation: A Case Study of Content Moderation
Vivian Lai, Samuel Carton, Rajat Bhatnagar, Q. Vera Liao, Yunfeng Zhang, Chenhao Tan.
CHI 2022.
Improving Stability of Fine-Tuning Pretrained Language Models via Component-Wise Gradient Norm Clipping
Chenghao Yang, Xuezhe Ma.
EMNLP 2022.
Explaining Machine Learning Classifiers through Diverse Counterfactual Explanations
Ramaravind Kommiya Mothilal, Amit Sharma, Chenhao Tan.
FAccT 2020.
On Human Predictions with Explanations and Predictions of Machine Learning Models: A Case Study on Deception Detection
Vivian Lai, Chenhao Tan.
FAccT 2019.
Creative Writing with a Machine in the Loop: Case Studies on Slogans and Stories
Elizabeth Clark, Anne Ross, Chenhao Tan, Yangfeng Ji, Noah A. Smith.
IUI 2018.
Friendships, Rivalries, and Trysts: Characterizing Relations between Ideas in Texts
Chenhao Tan, Dallas Card, Noah A. Smith.
ACL 2017.
Winning Arguments: Interaction Dynamics and Persuasion Strategies in Good-faith Online Discussions
Chenhao Tan, Vlad Niculae, Cristian Danescu-Niculescu-Mizil, Lillian Lee.
WWW 2016.