Publications
My work is regularly published at leading AI, ML, and NLP conferences (NeurIPS, ICLR, ICML, ACL, EMNLP, NAACL). The full list of publications is available on my Google Scholar and Semantic Scholar.
2025
-
Tulu 3: Pushing Frontiers in Open Language Model Post-Training
Nathan Lambert, Jacob Morrison, Valentina Pyatkin, Shengyi Huang, Hamish Ivison, Faeze Brahman, Lester James Validad Miranda, Alisa Liu, Nouha Dziri, Xinxi Lyu, Yuling Gu, Saumya Malik, Victoria Graf, Jena D. Hwang, Jiangjiang Yang, Ronan Le Bras, Oyvind Tafjord, Christopher Wilhelm, Luca Soldaini, Noah A. Smith, Yizhong Wang, Pradeep Dasigi, Hannaneh Hajishirzi
COLM, 2025
-
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees
Zhiyuan Zeng, Yizhong Wang, Hannaneh Hajishirzi, Pang Wei Koh
COLM, 2025
-
Establishing Task Scaling Laws via Compute-Efficient Model Ladders
Akshita Bhagia, Jiacheng Liu, Alexander Wettig, David Heineman, Oyvind Tafjord, Ananya Harsh Jha, Luca Soldaini, Noah A. Smith, Dirk Groeneveld, Pang Wei Koh, Jesse Dodge, Hannaneh Hajishirzi
COLM, 2025
-
ParaPO: Aligning Language Models to Reduce Verbatim Reproduction of Pre-training Data
Tong Chen, Faeze Brahman, Jiacheng Liu, Niloofar Mireshghallah, Weijia Shi, Pang Wei Koh, Luke Zettlemoyer, Hannaneh Hajishirzi
COLM, 2025
-
2 OLMo 2 Furious
OLMo Team, Evan Pete Walsh, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Shane Arora, Akshita Bhagia, Yuling Gu, Shengyi Huang, Matt Jordan, Nathan Lambert, Dustin Schwenk, Oyvind Tafjord, Taira Anderson, David Atkinson, Faeze Brahman, Christopher Clark, Pradeep Dasigi, Nouha Dziri, Allyson Ettinger, Michal Guerquin, David Heineman, Hamish Ivison, Pang Wei Koh, Jiacheng Liu, Saumya Malik, William Merrill, Lester James Validad Miranda, Jacob Morrison, Tyler Murray, Crystal Nam, Jake Poznanski, Valentina Pyatkin, Aman Rangapur, Michael Schmitz, Sam Skjonsberg, David Wadden, Christopher Wilhelm, Michael Wilson, Luke Zettlemoyer, Ali Farhadi, Noah A. Smith, Hannaneh Hajishirzi
COLM, 2025
-
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, YenSung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross Girshick, Ali Farhadi, Aniruddha Kembhavi
CVPR, 2025 — Best paper honorable mention
-
Fluid Language Model Benchmarking
Valentin Hofmann, David Heineman, Ian Magnusson, Kyle Lo, Jesse Dodge, Maarten Sap, Pang Wei Koh, Chun Wang, Hannaneh Hajishirzi, Noah A. Smith
COLM, 2025
-
OLMES: A Standard for Language Model Evaluations
Yuling Gu, Oyvind Tafjord, Bailey Kuehl, Dany Haddad, Jesse Dodge, Hannaneh Hajishirzi
Findings of NAACL, 2025
-
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
David Wadden, Kejian Shi, Jacob Morrison, Aakanksha Naik, Shruti Singh, Nitzan Barzilay, Kyle Lo, Tom Hope, Luca Soldaini, Zejiang Shen, Doug Downey, Hannaneh Hajishirzi, Arman Cohan
ACL, 2025
-
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback
Lester James V. Miranda, Yizhong Wang, Yanai Elazar, Sachin Kumar, Valentina Pyatkin, Faeze Brahman, Noah A. Smith, Hannaneh Hajishirzi, Pradeep Dasigi
ACL, 2025
-
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens
Jiacheng Liu, Taylor Blanton, Yanai Elazar, Sewon Min, YenSung Chen, Arnavi Chheda-Kothary, Huy Tran, Byron Bischoff, Eric Marsh, Michael Schmitz, Cassidy Trier, Aaron Sarnat, Jenna James, Jon Borchardt, Bailey Kuehl, Evie Cheng, Karen Farley, Sruthi Sreeram, Taira Anderson, David Albright, Carissa Schoenick, Luca Soldaini, Dirk Groeneveld, Rock Yuren Pang, Pang Wei Koh, Noah A. Smith, Sophie Lebrecht, Yejin Choi, Hannaneh Hajishirzi, Ali Farhadi, Jesse Dodge
ACL Demo Track, 2025 Best Demo Paper Award
-
RewardBench: Evaluating Reward Models for Language Modeling
Nathan Lambert, Valentina Pyatkin, Jacob Morrison, Lester James Validad Miranda, Bill Yuchen Lin, Khyathi Chandu, Nouha Dziri, Sachin Kumar, Tom Zick, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi
Findings of NAACL, 2025
-
ComPO: Community Preferences for Language Model Personalization
Sachin Kumar, Chan Young Park, Yulia Tsvetkov, Noah A. Smith, Hannaneh Hajishirzi
NAACL, 2025
-
A Systematic Examination of Preference Learning through the Lens of Instruction-Following
Joongwon Kim, Anirudh Goyal, Aston Zhang, Bo Xiong, Rui Hou, Melanie Kambadur, Dhruv Mahajan, Hannaneh Hajishirzi, Liang Tan
NAACL, 2025
-
OLMoE: Open Mixture-of-Experts Language Models
Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Jacob Morrison, Sewon Min, Weijia Shi, Evan Pete Walsh, Oyvind Tafjord, Nathan Lambert, Yuling Gu, Shane Arora, Akshita Bhagia, Dustin Schwenk, David Wadden, Alexander Wettig, Binyuan Hui, Tim Dettmers, Douwe Kiela, Ali Farhadi, Noah A. Smith, Pang Wei Koh, Amanpreet Singh, Hannaneh Hajishirzi
ICLR, Oral, 2025
-
Answer, Assemble, Ace: Understanding How LMs Answer Multiple Choice Questions
Sarah Wiegreffe, Oyvind Tafjord, Yonatan Belinkov, Hannaneh Hajishirzi, Ashish Sabharwal
ICLR, Spotlight, 2025
2024
-
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
Hamish Ivison, Yizhong Wang, Jiacheng Liu, Zeqiu Wu, Valentina Pyatkin, Nathan Lambert, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi
NeurIPS, 2024
-
MatFormer: Nested Transformer for Elastic Inference
Fnu Devvrit, Sneha Kudugunta, Aditya Kusupati, Tim Dettmers, Kaifeng Chen, Inderjit S Dhillon, Yulia Tsvetkov, Hannaneh Hajishirzi, Sham M. Kakade, Ali Farhadi, Prateek Jain
NeurIPS, 2024
-
Decoding-Time Language Model Alignment with Multiple Objectives
Ruizhe Shi, Yifang Chen, Yushi Hu, Alisa Liu, Hannaneh Hajishirzi, Noah A. Smith, Simon Shaolei Du
NeurIPS, 2024
-
The Art of Saying No: Contextual Noncompliance in Language Models
Faeze Brahman, Sachin Kumar, Vidhisha Balachandran, Pradeep Dasigi, Valentina Pyatkin, Abhilasha Ravichander, Sarah Wiegreffe, Nouha Dziri, Khyathi Chandu, Jack Hessel, Yulia Tsvetkov, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi
NeurIPS D&B, 2024
-
Paloma: A Benchmark for Evaluating Language Model Fit
Ian Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, Ananya Harsh Jha, Oyvind Tafjord, Dustin Schwenk, Evan Pete Walsh, Yanai Elazar, Kyle Lo, Dirk Groeneveld, Iz Beltagy, Hannaneh Hajishirzi, Noah A. Smith, Kyle Richardson, Jesse Dodge
NeurIPS D&B, 2024
-
ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition
Mohammadreza Salehi, Jae Sung Park, Aditya Kusupati, Ranjay Krishna, Yejin Choi, Hannaneh Hajishirzi, Ali Farhadi
NeurIPS D&B, 2024
-
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens
Gary (Jiacheng) Liu, Sewon Min, Luke Zettlemoyer, Yejin Choi, Hannaneh Hajishirzi
COLM, 2024
-
Fine-grained Hallucination Detection and Editing for Language Models
Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, Hannaneh Hajishirzi
COLM, 2024
-
Do Membership Inference Attacks Work on Large Language Models?
Michael Duan, Anshuman Suri, Niloofar Mireshghallah, Sewon Min, Weijia Shi, Luke Zettlemoyer, Yulia Tsvetkov, Yejin Choi, David Evans, Hannaneh Hajishirzi
COLM, 2024
-
Don't throw away your value model! Generating more preferable text with Value-Guided Monte-Carlo Tree Search decoding
Gary (Jiacheng) Liu, Andrew Cohen, Ramakanth Pasunuru, Yejin Choi, Hannaneh Hajishirzi, Asli Celikyilmaz
COLM, 2024
-
OLMo: Accelerating the Science of Language Models
Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi
ACL, 2024 — Best paper award
-
Dolma: An Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo
ACL, 2024 — Best Resource Paper Award
-
Set the Clock: Temporal Alignment of Pretrained Language Models
Bowen Zhao, Zander Brumbaugh, Yizhong Wang, Hannaneh Hajishirzi, Noah A Smith
ACL, 2024
-
APT: Adaptive pruning and tuning pretrained language models for efficient training and inference
Bowen Zhao, Hannaneh Hajishirzi, Qingqing Cao
ICLR, 2024
-
Data Engineering for Scaling Language Models to 128K Context
Yao Fu, Rameswar Panda, Xinyao Niu, Xiang Yue, Hannaneh Hajishirzi, Yoon Kim, Hao Peng
ICML, 2024
-
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, Hannaneh Hajishirzi
ICLR, 2024
-
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Context
Pan Lu, Hritik Bansal, Tony Xia, Jiacheng Liu, Chunyuan Li, Hannaneh Hajishirzi, Hao Cheng, Kai-Wei Chang, Michel Galley, Jianfeng Gao
ICLR, 2024
-
What's In My Big Data?
Yanai Elazar, Akshita Bhagia, Ian Helgi Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Evan Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hannaneh Hajishirzi, Noah A. Smith, Jesse Dodge
ICLR, 2024
-
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Sewon Min, Suchin Gururangan, Eric Wallace, Weijia Shi, Hannaneh Hajishirzi, Noah A. Smith, Luke Zettlemoyer
ICLR, 2024
-
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models
Qingqing Cao, Sewon Min, Yizhong Wang, Hannaneh Hajishirzi
ICLR, 2024
2023
-
Self-Instruct: Aligning Language Model with Self-Generated Instructions
Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A Smith, Daniel Khashabi, Hannaneh Hajishirzi
ACL, 2023
-
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories
Alex Mallen, Akari Asai, Victor Zhong, Rajarshi Das, Daniel Khashabi, Hannaneh Hajishirzi
ACL, 2023
-
Task-Aware Retrieval with Instructions
Akari Asai, Timo Schick, Patrick Lewis, Xilun Chen, Gautier Izacard, Sebastian Riedel, Hannaneh Hajishirzi, Wen-tau Yih
Findings of ACL, 2023
-
PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Qingqing Cao, Bhargavi Paranjape, Hannaneh Hajishirzi
ACL, 2023
-
CREPE: Open-Domain Question Answering with False Presuppositions
Xinyan Velocity Yu, Sewon Min, Luke Zettlemoyer, Hannaneh Hajishirzi
ACL, 2023
-
Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations
Xinxi Lyu, Sewon Min, Iz Beltagy, Luke Zettlemoyer, Hannaneh Hajishirzi
ACL, 2023
-
Nonparametric Masked Language Modeling
Sewon Min, Weijia Shi, Mike Lewis, Xilun Chen, Wen-tau Yih, Hannaneh Hajishirzi, Luke Zettlemoyer
Findings of ACL, 2023
-
HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation
Hamish Ivison, Akshita Bhagia, Yizhong Wang, Hannaneh Hajishirzi, Matthew Peters
ACL, 2023
-
Elaboration-Generating Commonsense Question Answering at Scale
Wenya Wang, Vivek Srikumar, Hanna Hajishirzi, Noah A. Smith
ACL, 2023
-
Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Hamish Ivison, Noah A. Smith, Hannaneh Hajishirzi, Pradeep Dasigi
Findings of ACL, 2023
-
FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning
Qinyuan Ye, Iz Beltagy, Matthew E. Peters, Xiang Ren, Hannaneh Hajishirzi
ACL, 2023
-
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy*, Prithviraj Ammanabrolu*, Kianté Brantley, Jack Hessel, Rafet Sifa, Christian Bauckhage, Hannaneh Hajishirzi, Yejin Choi
ICLR, 2023
-
Editing Models with Task Arithmetic
Gabriel Ilharco, Marco Tulio Ribeiro, Mitchell Wortsman, Suchin Gururangan, Ludwig Schmidt, Hannaneh Hajishirzi, Ali Farhadi
ICLR, 2023
-
INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions
Zeqiu Wu, Ryu Parish, Hao Cheng, Sewon Min, Prithviraj Ammanabrolu, Mari Ostendorf, Hannaneh Hajishirzi
TACL, 2023
2022
-
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Yizhong Wang, Swaroop Mishra, ..., Hannaneh Hajishirzi, Daniel Khashabi
EMNLP, 2022
[Project Page]
-
Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
Jiacheng Liu, Skyler Hallinan, Ximing Lu, Pengfei He, Sean Welleck, Hannaneh Hajishirzi, Yejin Choi
EMNLP, 2022
-
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts
Akari Asai, Mohammadreza Salehi, Matthew Peters, Hannaneh Hajishirzi
EMNLP, 2022
-
Rethinking the Role of Demonstrations: What makes In-context Learning Work?
Sewon Min, Xinxi Lyu, Ari Holtzman, Mikel Artetxe, Mike Lewis, Hannaneh Hajishirzi, Luke Zettlemoyer
EMNLP, 2022
-
CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning
Zeqiu Wu, Yi Luan, Hannah Rashkin, David Reitter, Hannaneh Hajishirzi, Mari Ostendorf, Gaurav Singh Tomar
EMNLP, 2022
-
SciFact-Open: Towards open-domain scientific claim verification
David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Iz Beltagy, Lucy Lu Wang, Hannaneh Hajishirzi
EMNLP Findings, 2022
-
Exploring The Landscape of Distributional Robustness for Question Answering Models
Anas Awadalla, Mitchell Wortsman, Gabriel Ilharco, Sewon Min, Hannaneh Hajishirzi, Ludwig Schmidt
EMNLP Findings, 2022
-
NaturalProver: Grounded Mathematical Proof Generation with Language Models
Sean Welleck, Jiacheng Liu, Ximing Lu, Hannaneh Hajishirzi, Yejin Choi
NeurIPS, 2022
-
Patching open-vocabulary models by interpolating weights
Gabriel Ilharco, Mitchell Wortsman, Samir Yitzhak Gadre, Shuran Song, Hannaneh Hajishirzi, Simon Kornblith, Ali Farhadi, Ludwig Schmidt
NeurIPS, 2022
-
MetaICL: Learning to Learn In Context
Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
NAACL, 2022
-
Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts
Daniel Khashabi, Xinxi Lyu, Sewon Min, Lianhui Qin, Kyle Richardson, Sameer Singh, Sean Welleck, Hannaneh Hajishirzi, Tushar Khot, Ashish Sabharwal, Yejin Choi
NAACL, 2022
-
Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks
Akari Asai, Matt Gardner, Hannaneh Hajishirzi
NAACL, 2022
-
Aligning to Social Norms and Values in Interactive Narratives
Prithviraj Ammanabrolu, Liwei Jiang, Maarten Sap, Hannaneh Hajishirzi, Yejin Choi
NAACL, 2022
-
MultiVerS: Improving scientific claim verification with weak supervision and full-document context
David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi
NAACL Findings, 2022
-
Robust fine-tuning of zero-shot models
Mitchell Wortsman, Gabriel Ilharco, Jong Wook Kim, Mike Li, Simon Kornblith, Rebecca Roelofs, Raphael Gontijo Lopes, Hannaneh Hajishirzi, Ali Farhadi, Hongseok Namkoong, Ludwig Schmidt
CVPR, 2022 Best Paper Award Finalist
-
Cross-Task Generalization via Natural Language Crowdsourcing Instructions
Swaroop Mishra, Daniel Khashabi, Chitta Baral, Hannaneh Hajishirzi
ACL, 2022
-
Noisy Channel Language Model Prompting for Few-Shot Text Classification
Sewon Min, Mike Lewis, Hannaneh Hajishirzi, Luke Zettlemoyer
ACL, 2022
-
FaVIQ: FAct Verification from Information-seeking Questions
Jungsoo Park, Sewon Min, Jaewoo Jang, Luke Zettlemoyer, Hannaneh Hajishirzi
ACL, 2022
-
Generated Knowledge Prompting for Commonsense Reasoning
Jiacheng Liu, Alisa Liu, Ximing Lu, Sean Welleck, Peter West, Ronan Le Bras, Yejin Choi, Hannaneh Hajishirzi
ACL, 2022
-
Reframing Instructional Prompts to GPTk's Language
Swaroop Mishra, Daniel Khashabi, Chitta Baral, Yejin Choi, Hannaneh Hajishirzi
ACL, 2022
2021
-
One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval
Akari Asai, Xinyan Yu, Jungo Kasai, Hannaneh Hajishirzi
NeurIPS, 2021
-
DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization
Zeqiu Wu*, Bo-Ru Lu*, Hannaneh Hajishirzi, Mari Ostendorf
EMNLP, 2021
-
Joint Passage Ranking for Diverse Multi-Answer Retrieval
Sewon Min, Kenton Lee, Ming-Wei Chang, Kristina Toutanova, Hannaneh Hajishirzi
EMNLP, 2021
-
Probing Across Time: What Does RoBERTa Know and When?
Leo Z. Liu*, Yizhong Wang*, Jungo Kasai, Hannaneh Hajishirzi, Noah A. Smith
EMNLP Findings, 2021
-
GooAQ: Open Question Answering with Diverse Answer Types
Daniel Khashabi, Amos Ng, Tushar Khot, Ashish Sabharwal, Hannaneh Hajishirzi, Chris Callison-Burch
EMNLP Findings, 2021
-
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text
Christopher Clark, Jordi Salvador, Dustin Schwenk, Derrick Bonafilia, Mark Yatskar, Eric Kolve, Alvaro Herrasti, Jonghyun Choi, Sachin Mehta, Sam Skjonsberg, Carissa Schoenick, Aaron Sarnat, Hannaneh Hajishirzi, Aniruddha Kembhavi, Oren Etzioni, Ali Farhadi
EMNLP, 2021
-
Scientific Language Models for Biomedical Knowledge Base Completion: An Empirical Study
Rahul Nadkarni, David Wadden, Iz Beltagy, Noah A. Smith, Hannaneh Hajishirzi, Tom Hope
AKBC, 2021
-
Efficient Passage Retrieval with Hashing for Open-domain Question Answering
Ikuya Yamada, Akari Asai, Hannaneh Hajishirzi
ACL, 2021
-
Prompting Contrastive Explanations for Commonsense Reasoning Tasks
Bhargavi Paranjape, Julian Michael, Marjan Ghazvininejad, Luke Zettlemoyer, Hannaneh Hajishirzi
Findings of ACL, 2021
-
XOR QA: Cross-lingual Open-Retrieval Question Answering
Akari Asai, Jungo Kasai, Jonathan H. Clark, Kenton Lee, Eunsol Choi, Hannaneh Hajishirzi
NAACL, 2021
[project page]
-
Probing Contextual Language Models for Common Ground with Visual Representations
Gabriel Ilharco, Rowan Zellers, Ali Farhadi, Hannaneh Hajishirzi
NAACL, 2021
-
Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
Aida Amini*, Tom Hope*, David Wadden, Madeleine van Zuylen, Sravanthi Parasa, Eric Horvitz, Daniel Weld, Roy Schwartz, Hannaneh Hajishirzi
NAACL, 2021
-
DeLighT: Deep and Light-weight Transformer
Sachin Mehta, Marjan Ghazvininejad, Srinivasan Iyer, Luke Zettlemoyer, Hannaneh Hajishirzi
ICLR, 2021 [code]
-
MultiModalQA: Complex Question Answering over Text, Tables and Images
Alon Talmor, Ori Yoran, Amnon Catav, Dan Lahav, Yizhong Wang, Akari Asai, Gabriel Ilharco, Hannaneh Hajishirzi, Jonathan Berant
ICLR, 2021 [code]
-
A Controllable Model of Grounded Response Generation
Zeqiu Wu, Michel Galley, Chris Brockett, Yizhe Zhang, Xiang Gao, Chris Quirk, Rik Koncel-Kedziorski, Jianfeng Gao, Hannaneh Hajishirzi, Mari Ostendorf, Bill Dolan
AAAI, 2021 [code]
-
DiCENet: Dimension-wise convolutions for efficient networks
Sachin Mehta, Hannaneh Hajishirzi, Mohammad Rastegari
PAMI Journal [code]
2020
-
An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction
Bhargavi Paranjape, Mandar Joshi, John Thickstun, Hannaneh Hajishirzi, Luke Zettlemoyer
EMNLP, 2020 [code]
-
Fact or Fiction: Verifying Scientific Claims
David Wadden, Shanchuan Lin, Kyle Lo, Lucy Lu Wang, Madeleine van Zuylen, Arman Cohan, Hannaneh Hajishirzi
EMNLP, 2020 [code]
-
AmbigQA: Answering Ambiguous Open-domain Questions
Sewon Min, Julian Michael, Hannaneh Hajishirzi, Luke Zettlemoyer
EMNLP, 2020 [code]
-
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
Swabha Swayamdipta, Roy Schwartz, Nicholas Lourie, Yizhong Wang, Hannaneh Hajishirzi, Noah A. Smith, Yejin Choi
EMNLP, 2020 [code]
-
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
Jaemin Cho, Jiasen Lu, Hannaneh Hajishirzi, Ani Kembhavi
EMNLP, 2020 [code]
-
IIRC: A Dataset of Incomplete Information / Reading Comprehension Questions
James Ferguson, Matt Gardner, Hannaneh Hajishirzi, Tushar Khot, Pradeep Dasigi
EMNLP, 2020 [code]
-
UnifiedQA: Crossing Format Boundaries With a Single QA System
Daniel Khashabi, Sewon Min, Tushar Khot, Ashish Sabharwal, Oyvind Tafjord, Peter Clark, Hannaneh Hajishirzi
Findings of EMNLP, 2020 [code]
-
Evaluating NLP models via contrast sets
Matt Gardner, Yoav Artzi, Victoria Basmova, Jonathan Berant, Ben Bogin, Sihao Chen, Pradeep Dasigi, Dheeru Dua, Yanai Elazar, Ananth Gottumukkala, Nitish Gupta, Hanna Hajishirzi, Gabriel Ilharco, Daniel Khashabi, Kevin Lin, Jiangming Liu, Nelson F. Liu, Phoebe Mulcaire, Qiang Ning, Sameer Singh, Noah A. Smith, Sanjay Subramanian, Reut Tsarfaty, Eric Wallace, Ally Zhang, Ben Zhou
Findings of EMNLP, 2020 [code]
-
MedICaT: A Dataset of Medical Images, Captions, and Textual References
Sanjay Subramanian, Lucy Lu Wang, Ben Bogin, Sachin Mehta, Madeleine van Zuylen, Sravanthi Parasa, Sameer Singh, Matt Gardner, Hannaneh Hajishirzi
Findings of EMNLP, 2020 [code]
-
Logic-Guided Data Augmentation and Regularization for Consistent Question Answering
Akari Asai, Hannaneh Hajishirzi
ACL, 2020 [code]
-
ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured Webpages
Colin Lockard, Prashant Shiralkar, Xin Luna Dong, Hannaneh Hajishirzi
ACL, 2020
-
SciREX: A Challenge Dataset for Document-Level Information Extraction
Sarthak Jain, Madeleine van Zuylen, Hannaneh Hajishirzi, Iz Beltagy
ACL, 2020
[dataset: SciREX] [code]
-
Contextualized Sparse Representation with Rectified N-Gram Attention for Open-Domain Question Answering
Jinhyuk Lee, Minjoon Seo, Hannaneh Hajishirzi, Jaewoo Kang
ACL, 2020 [code: SPARC]
-
Procedural Reading Comprehension with Attribute-Aware Context Flow
Aida Amini, Antoine Bosselut, Bhavana Dalvi Mishra, Yejin Choi, Hannaneh Hajishirzi
AKBC, 2020 — Best paper honorable mention
-
DeFINE: DEep Factorized INput Token Embeddings for Neural Sequence Modeling
Sachin Mehta, Rik Koncel-Kedziorski, Mohammad Rastegari, Hannaneh Hajishirzi
ICLR, 2020
-
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering
Akari Asai, Kazuma Hashimoto, Hannaneh Hajishirzi, Richard Socher, Caiming Xiong
ICLR, 2020
[code]
2019
-
Entity, Relation, and Event Extraction with Contextualized Span Representations
David Wadden, Ulme Wennberg, Yi Luan, Hannaneh Hajishirzi
EMNLP, 2019 [code: DyGIE++]
-
Mixture Content Selection for Diverse Sequence Generation
Jaemin Cho, Minjoon Seo, Hannaneh Hajishirzi
EMNLP, 2019 [code]
-
A Discrete Hard EM Approach for Weakly Supervised Question Answering
Sewon Min, Danqi Chen, Hannaneh Hajishirzi, Luke Zettlemoyer
EMNLP, 2019 [code]
-
On making reading comprehension more comprehensive
Matt Gardner, Jonathan Berant, Hannaneh Hajishirzi, Alon Talmor, Sewon Min
Machine Reading and Question Answering Workshop (MRQA), 2019
-
Question Answering is a Format; When is it Useful?
Matt Gardner, Jonathan Berant, Hannaneh Hajishirzi, Alon Talmor, Sewon Min
arXiv preprint, 2019
-
Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index
Minjoon Seo, Jinhyuk Lee, Tom Kwiatkowski, Ankur P. Parikh, Ali Farhadi, Hannaneh Hajishirzi
ACL, 2019 [Demo]
-
Multi-hop Reading Comprehension through Question Decomposition and Rescoring
Sewon Min, Victor Zhong, Luke Zettlemoyer, Hannaneh Hajishirzi
ACL, 2019 [code]
-
Compositional Questions Do Not Necessitate Multi-hop Reasoning
Sewon Min, Eric Wallace, Sameer Singh, Matt Gardner, Hannaneh Hajishirzi, Luke Zettlemoyer
ACL, 2019 [code]
-
A General Framework for Information Extraction using Dynamic Span Graphs
Yi Luan, David Wadden, Amy Shah, Mari Ostendorf, Hannaneh Hajishirzi
NAACL, 2019 [code]
-
Text Generation from Knowledge Graphs with Graph Transformers
Rik Koncel-Kedziorski, Dhanush Bekal, Yi Luan, Mirella Lapata, Hannaneh Hajishirzi
NAACL, 2019 [code]
-
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
Aida Amini, Saadia Gabriel, Shanchuan Lin, Rik Koncel-Kedziorski, Yejin Choi, Hannaneh Hajishirzi
NAACL, 2019 [code]
-
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network
Sachin Mehta, Mohammad Rastegari, Linda Shapiro, Hannaneh Hajishirzi
CVPR, 2019 [code]
2018
-
Pyramidal Recurrent Unit for Language Modeling
Sachin Mehta, Rik Koncel-Kedziorski, Mohammad Rastegari, Hannaneh Hajishirzi
EMNLP, 2018 [code]
-
Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction
Yi Luan, Luheng He, Mari Ostendorf, Hannaneh Hajishirzi
EMNLP, 2018
[Dataset and code]
-
Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension
Minjoon Seo, Tom Kwiatkowski, Ankur P. Parikh, Ali Farhadi, Hannaneh Hajishirzi
EMNLP, 2018
[PiQA Task]
-
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
Sachin Mehta, Mohammad Rastegari, Anat Caspi, Linda Shapiro, Hannaneh Hajishirzi
ECCV, 2018 [code]
-
Semi-Supervised Event Extraction with Paraphrase Clusters
James Ferguson, Colin Lockard, Daniel S. Weld, Hannaneh Hajishirzi
NAACL, 2018
[code] [Data]
-
UW system at SemEval-2018 Task 7: Neural Relation Extraction Model with Selectively Incorporated Concept Embeddings
Yi Luan, Mari Ostendorf, Hannaneh Hajishirzi
SemEval, 2018
-
Neural Speed Reading via Skim-RNN
Sewon Min, Minjoon Seo, Ali Farhadi, Hannaneh Hajishirzi
ICLR, 2018 [code]
2017
-
Scientific Information Extraction with Semi-supervised Neural Tagging
Yi Luan, Mari Ostendorf, Hannaneh Hajishirzi
EMNLP, 2017 [code]
-
Bidirectional Attention Flow for Machine Comprehension
Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi
ICLR, 2017
[code] [Website] [Demo]
-
Question Answering through Transfer Learning from Large Fine-grained Supervision Data
Sewon Min, Minjoon Seo, Hannaneh Hajishirzi
ACL, 2017 [code]
-
University of Washington TAC-KBP 2016 System Description
James Ferguson, Colin Lockard, Natalie Hawkins, Stephen Soderland, Hannaneh Hajishirzi, Daniel S. Weld
TAC-KBP, 2017 [code]
-
Query-Reduction Networks for Question Answering
Minjoon Seo, Sewon Min, Ali Farhadi, Hannaneh Hajishirzi
ICLR, 2017 [code]
-
Are You Smarter Than A Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension
Aniruddha Kembhavi, Minjoon Seo, Eric Kolve, Dustin Schwenk, Hannaneh Hajishirzi, Ali Farhadi
CVPR, 2017
[Data]
2016
-
A Theme-Rewriting Approach for Generating Algebra Word Problems
Rik Koncel-Kedziorski, Ioannis Konstas, Luke Zettlemoyer, Hannaneh Hajishirzi
EMNLP, 2016 [code and data]
-
A Diagram is Worth a Dozen Images
Aniruddha Kembhavi, Mike Salvato, Eric Kolve, Minjoon Seo, Hannaneh Hajishirzi, Ali Farhadi
ECCV, 2016
[Data]
-
Learning Prototypical Event Structure from Photo Albums
Antoine Bosselut, Jianfu Chen, David Warren, Hannaneh Hajishirzi, Yejin Choi
ACL, 2016
[project website] [Data]
-
Multiplicative Representations for Unsupervised Semantic Role Induction
Yi Luan, Yangfeng Ji, Hannaneh Hajishirzi, Boyang Li
ACL, 2016
-
Disfluency Detection using a Bidirectional LSTM
Vicky Zayats, Mari Ostendorf, Hannaneh Hajishirzi
Interspeech, 2016
-
MAWPS: A Math Word Problem Repository
Rik Koncel-Kedziorski, Subhro Roy, Aida Amini, Nate Kushman, Hannaneh Hajishirzi
NAACL, 2016
-
A Task-Oriented Approach for Cost-sensitive Recognition
Roozbeh Mottaghi, Hannaneh Hajishirzi, Ali Farhadi
CVPR, 2016
-
Parsing Algebraic Word Problems into Equations
Rik Koncel-Kedziorski, Hannaneh Hajishirzi, Ashish Sabharwal, Oren Etzioni, Siena Dumas Ang
TACL, 2016
-
Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects
Hessam Bagherinezhad, Hannaneh Hajishirzi, Yejin Choi, Ali Farhadi
AAAI, 2016
2015
-
Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing
Hamid Izadinia, Fereshteh Sadeghi, Santosh Kumar Divvala, Hannaneh Hajishirzi, Yejin Choi, Ali Farhadi
ICCV, 2015
-
Solving Geometry Problems: Combining Text and Diagram Interpretation
Minjoon Seo, Hannaneh Hajishirzi, Ali Farhadi, Oren Etzioni, Clint Malcolm
EMNLP, 2015
-
Talking to the crowd: What do people react to in online discussions?
Aaron Jaech, Vicky Zayats, Hao Fang, Mari Ostendorf, Hannaneh Hajishirzi
EMNLP, 2015
-
Discriminative and Consistent Similarities in Instance-Level Multiple Instance Learning
Mohammad Rastegari, Hannaneh Hajishirzi, Ali Farhadi
CVPR, 2015
-
Learning Knowledge Graphs for Question Answering through Conversational Dialog
Ben Hixon, Peter Clark, Hannaneh Hajishirzi
NAACL HLT, 2015
[code and data]
-
Aligning Sentences from Standard Wikipedia to Simple Wikipedia
William Hwang, Hannaneh Hajishirzi, Mari Ostendorf, Wei Wu
NAACL HLT, 2015
[code and data]
-
Unediting: Detecting Disfluencies Without Careful Transcripts
Victoria Zayats, Mari Ostendorf, Hannaneh Hajishirzi
NAACL HLT, 2015
[code and data]
2014
-
Multi Resolution Language Grounding with Weak Supervision
Rik Koncel-Kedziorski, Hannaneh Hajishirzi, Ali Farhadi
EMNLP, 2014
[code and data]
-
Learning to Solve Arithmetic Word Problems with Verb Categorization
Mohammad Javad Hosseini, Hannaneh Hajishirzi, Oren Etzioni, Nate Kushman
EMNLP, 2014
[code and data]
-
Multi-Domain Disfluency and Repair Detection
Victoria Zayats, Mari Ostendorf, Hannaneh Hajishirzi
Interspeech, 2014
[data]
-
Diagram Understanding in Geometry Questions
Min Joon Seo, Hannaneh Hajishirzi, Ali Farhadi, Oren Etzioni
AAAI, 2014
2013
-
Joint Coreference Resolution and Named-Entity Linking with Multi-pass Sieves
Hannaneh Hajishirzi, Leila Zilles, Daniel S. Weld, Luke Zettlemoyer
EMNLP, 2013
[code and data]
-
Managing Chaos: Models of Turn-taking in Character-multichild Interactions
Iolanda Leite, Hannaneh Hajishirzi, Sean Andrist, Jill F. Lehman
ICMI, 2013
-
Take or Wait? Learning Turn-Taking from Multiparty Data
Iolanda Leite, Hannaneh Hajishirzi, Sean Andrist, Jill F. Lehman
AAAI (Late Breaking Report), 2013
2012
-
Semantic Understanding of Professional Soccer Commentaries
Hannaneh Hajishirzi, Mohammad Rastegari, Ali Farhadi, Jessica Hodgins
UAI, 2012
[data] [press]
-
Using Group History to Identify Character-directed Utterances in Multi-child Interactions
Hannaneh Hajishirzi, Jill F. Lehman, Jessica Hodgins
SIGDIAL, 2012 — Best paper award [patent]
-
Question Answering in Natural Language Narratives Using Symbolic Probabilistic Reasoning
Hannaneh Hajishirzi, Erik T. Mueller
FLAIRS, 2012
-
Recognizing Character-directed Utterances in Multi-child Interactions
Hannaneh Hajishirzi, Jill Lehman, Kenichi Kumatani, Leonid Sigal, Jessica Hodgins
HRI (Late-breaking report), 2012
2007–2010
-
Adaptive Near-Duplicate Detection via Similarity Learning
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
SIGIR, 2010 [patent]
-
Reasoning about Deterministic Actions with Probabilistic Prior and Application to Stochastic Filtering
Hannaneh Hajishirzi, Eyal Amir
KR, 2010
-
Greedy Algorithms for Sequential Sensing Decisions
Hannaneh Hajishirzi, Afsaneh Shirazi, Jaesik Choi, Eyal Amir
IJCAI, 2009
-
Sampling First Order Logical Particles
Hannaneh Hajishirzi, Eyal Amir
UAI, 2008
-
Stochastic Filtering in a Probabilistic Action Model
Hannaneh Hajishirzi, Eyal Amir
AAAI, 2007